The Lead SRE oversees reliability across production systems, mentors teams, and drives automation, incident management, and cloud optimization efforts.
The Lead Site Reliability Engineer (Lead SRE) is responsible for driving reliability, scalability, and performance across Honeywell’s production systems. This role bridges software engineering and operations, ensuring that cloud‑native platforms and AI‑enabled services are resilient, secure, and cost‑optimized. The Lead SRE will mentor engineers, establish reliability best practices, and partner with product and engineering teams to embed observability, automation, and intelligent validation into every stage of the lifecycle.
Responsibilities- Reliability Strategy & Leadership: Define and enforce SRE standards, SLIs/SLOs, and error budgets across critical systems.
- Automation & Tooling: Build and scale automation frameworks for deployment, monitoring, and incident response.
- Cloud & Infrastructure: Lead design and optimization of hybrid cloud infrastructure (Azure, GCP) with a focus on resilience and cost efficiency.
- AI/ML Readiness: Partner with engineering teams to operationalize ML workloads, strengthen MLOps pipelines, and ensure reliability of AI‑driven services.
- Incident Management: Drive root cause analysis, postmortems, and continuous improvement for production incidents.
- Mentorship & Collaboration: Guide SRE and engineering teams, fostering a culture of ownership, learning, and proactive reliability practices.
- Governance & Security: Ensure compliance, observability, and responsible use of automation and AI in production systems.
- Education: Bachelor’s or Master’s in Computer Science, Engineering, or related field.
- Experience: 12+ years in software engineering or operations, with 3–5 years in SRE leadership. Proven experience managing large‑scale distributed systems and cloud infrastructure.
- Technical Skills:
- Expertise in cloud architecture, containers, Kubernetes, serverless patterns.
- Strong knowledge of observability stacks (Prometheus, Grafana, ELK, OpenTelemetry).
- Proficiency in automation and CI/CD tools (Terraform, Ansible, Jenkins, GitHub Actions).
- Familiarity with ML pipelines and MLOps tools (Azure ML, MLflow, Databricks).
- Programming skills in Python, or Go
- Leadership Skills: Ability to mentor engineers, influence cross‑functional partners, and drive reliability culture. Strong communicator with executive presence.
Similar Jobs
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
As a Lead Customer Success Manager, you will drive customer adoption of Dynatrace, build executive relationships, manage risks, and ensure long-term value for enterprise clients in India.
Top Skills:
AWSAzureGCPKubernetes
Artificial Intelligence • Enterprise Web • Information Technology • Productivity • Sales • Software • Database
As a Senior Backend Engineer, you'll design scalable backends, mentor team members, and lead software development lifecycle activities while improving quality and performance.
Top Skills:
AnsibleDockerElasticsearchKubernetesMongoDBNode.jsReactRedisReduxRubyRuby On RailsTerraform
Artificial Intelligence • Enterprise Web • Information Technology • Productivity • Sales • Software • Database
As a Senior Backend Engineer at Apollo.io, you'll design scalable backend solutions, mentor teammates, and work cross-functionally to enhance product quality and performance.
Top Skills:
AIAnsibleDockerElasticsearchKubernetesMongoDBNode.jsReactRedisReduxRubyRuby On RailsTerraform
What you need to know about the Mumbai Tech Scene
From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.


