SupplyHouse.com Logo

SupplyHouse.com

Site Reliability Engineer

Posted 8 Days Ago
Easy Apply
Remote
Hiring Remotely in India
Mid level
Easy Apply
Remote
Hiring Remotely in India
Mid level
The Site Reliability Engineer ensures scalability and reliability of infrastructure and applications through automation and incident response while collaborating with DevOps teams.
The summary above was generated by AI

Real people. Real service.

At SupplyHouse.com, we value every individual team member and cultivate a community where people come first. Led by our core values of Generosity, Respect, Innovation, Teamwork, and GRIT, we’re dedicated to maintaining a supportive work environment that celebrates diversity and empowers everyone to reach their full potential. As an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004, we strive to foster growth while providing the best possible experience for our customers.

Through an Employer of Record (EOR), we are looking for a new Site Reliability Engineer in India to join our growing IT Team. This individual will report into our Director of IT and ensure the scalability, reliability, and performance of our infrastructure and applications with a focus on automation, monitoring, and incident response. If you enjoy bridging software engineering with IT operations, we’d love to hear from you!  

Role Type: Full-Time

Location: Remote from India

Schedule: Monday through Friday with a minimum schedule overlap of 4-5 hours per day with 8:00 a.m. to 5:00 p.m. U.S. Eastern Time to ensure effective collaboration

Base Salary: $29,000 – $36,000 USD per year

Responsibilities:

  • High-level proficiency of written and verbal communication in English
  • Design, build, and maintain scalable, reliable systems on GCP (Compute Engine, GKE, Cloud Storage, Cloud SQL)
  • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager
  • Build and maintain observability platforms (monitoring, logging, tracing) using tools such as Stackdriver (Cloud Monitoring), Prometheus, or Grafana
  • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence
  • Partner with DevOps and engineering teams to enhance CI/CD pipelines for resilient deployments
  • Define and monitor SLAs, SLOs, and SLIs to ensure application availability and performance
  • ​Implement disaster recovery (DR) and backup strategies across cloud services
  • Continuously optimize performance, capacity, and cost-efficiency of GCP resources

Requirements: 

  • Bachelors degree in Computer Science, Engineering, or a related field
  • 3+ years of hands-on experience as a Site Reliability Engineer, DevOps Engineer, Systems Engineer, or Cloud Infrastructure Engineer. Proven track record managing production-grade systems on Google Cloud Platform (GCP) or other cloud providers
  • Strong understanding of Linux/Unix system administration, networking, and troubleshooting. Experience implementing Infrastructure as Code (IaC) using tools like Terraform, Ansible, or Deployment Manager.Familiarity with containerization and orchestration technologies such as Docker and Kubernetes (GKE)
  • Experience with monitoring and observability tools (Google Cloud Operations Suite, Prometheus, Grafana, Datadog, ELK). Experience defining and monitoring SLAs, SLOs, and SLIs to ensure application uptime and performance. Proven ability to handle incident response, conduct postmortems, and drive root cause analysis
  • Proficiency in at least one scripting language (Python, Bash, or Go) for automation and tooling.Hands-on experience building or managing CI/CD pipelines (Jenkins, GitLab CI, Cloud Build).Strong background in configuration management and release automation
  • Knowledge of IAM (Identity and Access Management), network security, and cloud compliance controls.Familiarity with disaster recovery (DR), backups, and high-availability design

Preferred Qualifications:

  • Proven ability to optimize infrastructure performance and cost, particularly within GCP (FinOps experience a plus). Background in capacity planning, load testing, and horizontal scaling of distributed systems
  • Certification(s) as a Google Cloud Professional Cloud DevOps Engineer (strongly preferred), Google Cloud Professional Cloud Architect or Associate Cloud Engineer, Kubernetes CKA/CKAD, etc.
  • Experience implementing blue-green deployments, canary rollouts, and progressive delivery strategies
  • Experience working cross-functionally with software development, QA, and security teams.
  • Ability to mentor junior engineers and establish best practices for monitoring, deployment, and incident response

Why work with us: 

  • We have awesome benefits – We offer a wide variety of benefits to help support you and your loved ones. These include:
    • Comprehensive and affordable medical, dental, vision, and life insurance options
    • Competitive Provident Fund contributions
    • Paid time off and holidays
    • Mental health support and wellbeing program
    • Company-provided equipment and one-time $250 USD work from home stipend
    • $750 USD annual professional development budget
    • Company rewards and recognition program
    • And more!
  • We empower ownership – We all contribute to our success and we all share in it. Our Ownership for All program ensures each SupplyHouse team member will benefit financially from the company’s growth and accomplishments.
  • We promote work-life balance – We value your time and encourage a healthy separation between your professional and personal life to feel refreshed and recharged. Look out for our wellness initiatives!
  • We support growth – We strive to innovate every day. In an exciting and evolving industry, we provide potential for career growth through our hands-on training, diversity and inclusion initiatives, opportunities for internal mobility, and professional development budget.
  • We give back We live and breathe our core value, Generosity, by giving back to the trades and organizations around the world. We make a difference through donation drives, employee-nominated contributions, support for DE&I organizations, and more.
  • We listen –We value hearing from our employees. Everyone has a voice, and we encourage you to use it! We actively elicit feedback through our monthly town halls, regular 1:1 check-ins, and company-wide ideas form to incorporate suggestions and ensure our team enjoys coming to work every day.

Check us out and learn more at https://www.supplyhouse.com/our-company!

Additional Details: 

  • Remote employees are expected to work in a distraction-free environment. Personal devices, background noise, and other distractions should be kept to a minimum to avoid disrupting virtual meetings or business operations.
  • SupplyHouse.com is an Equal Opportunity Employer, strongly values inclusion, and encourages individuals of all backgrounds and experiences to apply for this position.
  • To ensure fairness, all application materials, assessments, and interview responses must reflect your own original work. The use of AI tools, plagiarism, or any uncredited assistance is not permitted at any stage of the hiring process and may result in disqualification. We appreciate your honesty and look forward to seeing your skills.
  • We are committed to providing a safe and secure work environment and conduct thorough background checks on all potential employees in accordance with applicable laws and regulations.
  • All emails from the SupplyHouse team will only be sent from an @supplyhouse.com email address. Please exercise caution if you receive an email from an alternate domain.

Top Skills

Ansible
Bash
Ci/Cd
Docker
GCP
Google Cloud Operations Suite
Google Cloud Platform
Grafana
Jenkins
Kubernetes
Prometheus
Python
Terraform

Similar Jobs

3 Days Ago
Remote
India
Mid level
Mid level
Cloud • Information Technology • Productivity • Software • Automation
As a Site Reliability Engineer, you'll develop advanced systems and software, manage cloud infrastructure, improve operational processes, and mentor engineers.
Top Skills: AnsibleAWSAzureBashCloudFormationDatadogDockerGCPGoKubernetesNewrelicPythonTerraform
6 Hours Ago
In-Office or Remote
5 Locations
Senior level
Senior level
Fintech • Financial Services
The Principal SRE Engineer will architect, build, and scale Sleek's infrastructure, integrating AI systems, ensuring security, and enhancing automation for reliability and performance.
Top Skills: ArgocdAWSAzureCloudflareCloudFormationCloudwatchEcsEksElkFluxGCPGitopsKongKubernetesNestjsNode.jsOpensearchOpentelemetryPrometheusPulumiPythonTerraformTraefik
11 Hours Ago
Remote
India
Senior level
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
As a Senior Manager of Site Reliability Engineering, you will lead a team of SREs in driving system reliability, automation, and incident management for cloud services.
Top Skills: AnsibleAWSAzureChefCloud ServicesElk StackGCPGoGrafanaJaegerKubernetesPrometheusPuppetPythonSplunkTerraform

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account