Monitor global infrastructure using Datadog and SolarWinds, triage and resolve L1 incidents, escalate complex issues, participate in incident response and post-incident reviews, maintain SOPs and ServiceNow tickets, support automation with basic scripting, and provide weekend on-call coverage.
JOB DESCRIPTION
- Monitor Sysco’s global infrastructure and systems using tools such as Datadog, SolarWinds, and other enterprise monitoring platforms.
- Detect, triage, and respond to incidents proactively before customer or business impact.
- Independently resolve:
- Server performance issues
- Monitoring agent issues
- Basic infrastructure and system alerts
- Escalate major incidents, complex infrastructure issues, and application-related incidents to L2/L3 teams in line with SOPs and SLAs.
- Ensure initial response and resolution targets are met for all priority levels.
- Participate in incident bridge calls and coordinate with internal and external stakeholders.
- Perform initial investigations and document findings to support faster resolution.
- Contribute to post-incident reviews and root cause analysis, including analysis via Datadog Watchdog.
- Follow and execute Standard Operating Procedures (SOPs) for known incidents.
- Maintain accurate documentation and ticket updates in ServiceNow.
- Support initiatives to improve First-Time Resolution (FTR) and reduce MTTR.
- Contribute to project-level operational improvements and initiatives tracked in Jira.
- Apply basic scripting or automation knowledge where applicable to support monitoring improvements and operational efficiency.
- Actively participate in knowledge sharing and continuous learning initiatives.
- Standard shift: Monday to Friday, from 10:30 AM to 7:30 PM CST
- Weekend on-call coverage required (one day per weekend, 10:30 AM – 7:30 PM CST; monthly shift rotation defined based on business needs, with prior notification provided by the team manager).
- Bachelor’s degree in Information Technology or equivalent experience.
- 2 years of experience in Operations Engineering, NOC, SRE, or similar roles.
- Strong understanding of:
- Windows Server and/or UNIX/Linux environments
- Networking fundamentals (LAN/WAN, TCP/IP, DHCP, firewalls, routing)
- Experience with an enterprise ticketing tool (e.g., ServiceNow,Jira).
- Strong communication skills in English and ability to work under pressure.
- Willingness to work in a Weekend on-call coverage required
- Excellent communication skills in English (B2+ or higher) and ability to collaborate across functions and geographies.
- Experience with Datadog, SolarWinds, or similar monitoring platforms.
- Exposure to AWS, Azure, or GCP.
- Familiarity with Jira for tracking initiatives and projects.
- ITIL certification or hands-on experience with ITIL practices.
- Basic scripting or automation knowledge (e.g., PowerShell, Bash, Python).
Benefits:
- This is a hybrid position based in Ultra Park II, Lagunilla (Heredia). On-site presence is required only when necessary, such as for meetings, trainings, or collaborative activities, in alignment with the company’s telework agreement, which currently requires employees to work on-site three (3) days per week)
- Private Medical Insurance
- Asociacion Solidarista
- Life Insurance
- Personal Day Off
Note: Only candidates with Costa Rican nationality or valid immigration status will be considered; applicants residing outside Costa Rica will not be considered, and relocation is not available
Similar Jobs
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead large Oracle Field Service implementation projects as a Solutions Architect senior manager. Design and deploy Oracle Fusion Service and Field Service Cloud solutions, guide assessment and future-state planning, interact with senior clients, coach teams, apply delivery methodologies and accelerators, and drive continuous improvement.
Top Skills:
Oracle Customer ExperienceOracle EpmOracle Field Service CloudOracle FinOracle Fusion ServiceOracle HcmOracle Lead ManagementOracle Marketing AutomationOracle Sales AutomationOracle Scm
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead end-to-end product lifecycle for internal and commercial software products, drive product strategy and commercialization, collaborate with cross-functional teams, apply Agile methods, coach teams, validate client outcomes, and execute roadmaps and go-to-market plans.
Top Skills:
Agile MethodologyProduct Lifecycle Management
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead product management and program delivery for enterprise software implementations. Manage product lifecycle, stakeholder relationships, reporting, risk, and financial tracking; mentor juniors and align project objectives with strategic goals.
Top Skills:
AgileErpExcelPowerPointSaaSWaterfall
What you need to know about the Mumbai Tech Scene
From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

