GitLab

Intermediate Site Reliability Engineer, Durability

Posted 22 Days Ago
Remote
28 Locations
Mid level
As an SRE at GitLab, you will ensure the smooth operation of user-facing services and production systems by designing scalable infrastructure, responding to incidents, and automating operational tasks, while collaborating with various teams.
The summary above was generated by AI

GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab.

An overview of this role

As a Site Reliability Engineer (SRE) at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.

GitLab SREs specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.

What you’ll do  

  • Design and implement highly scalable infrastructure to support the current and future needs of GitLab.com.
  • Collaborate closely with cross-functional teams and other teams throughout Infrastructure on projects to drive GitLab’s future.
  • Respond to incidents on an on-call rotation (our team is distributed globally, so you are only on call during your daytime hours!) and participate in incident reviews.
  • Act as a subject matter expert within the GitLab Infrastructure department, specializing in our edge services and Kubernetes workloads.
  • Automate every operational task.

What you’ll bring 

  • Experience with the Kubernetes ecosystem including Helm.
  • Google Cloud Platform expertise, specifically around networking, GKE configuration, and scaling.
  • Experience with Terraform infrastructure as code.
  • Experience with configuration management tools such as Ansible and Chef.
  • Programming skills in Go or Ruby.
  • Ability to clearly define problems and think beyond initial solutions, looking at how to make things better in the future.
  • A drive for automating everything.
  • Ability to be a manager of one and have a strong bias for action.
  • An independent, proactive and self-organized mindset.
  • An ability to clearly communicate asynchronously.
  • Excitement to be doing something different every day, from project work to production change requests to emergency response.

About the team

Durability is responsible for safeguarding and securing customer data stored by the GitLab application, and it sets guidelines for data access. Running the largest GitLab instance in existence (and, in fact, one of the largest single-tenancy open-source SaaS sites on the Internet) means we constantly face unique and rewarding challenges that directly impact our users every day. Our future is all about increasing automation so we can continue to scale with enterprise-level expectations around reliability and availability. Thanks to our Transparency value, you can see how we work on our team page or even see what we’re working on.

Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote; however, some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.

Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab’s policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.

Top Skills

Go
Kubernetes
Ruby
