Session AI Logo

Session AI

Staff Site Reliability Engineer

Sorry, this job was removed at 12:11 p.m. (IST) on Friday, Oct 10, 2025
In-Office or Remote
Hiring Remotely in Mumbai, Maharashtra
In-Office or Remote
Hiring Remotely in Mumbai, Maharashtra

Similar Jobs

21 Days Ago
Easy Apply
In-Office or Remote
47 Locations
Easy Apply
Senior level
Senior level
Artificial Intelligence • Blockchain • Internet of Things • Machine Learning • Software • App development • Automation
As a Staff SRE, you will ensure the reliability, scalability, and performance of systems, lead incident management, and drive automation efforts.
Top Skills: AnsibleAWSAzureBashDockerElk StackGCPGitlab CiGoGrafanaJavaJenkinsKubernetesPrometheusPythonTerraform
35 Minutes Ago
Easy Apply
Remote
India
Easy Apply
Mid level
Mid level
Artificial Intelligence • Edtech • Mobile • Natural Language Processing • Productivity • Software
The role involves designing, developing, and maintaining full-stack applications focused on user retention and growth, collaborating with cross-functional teams, and optimizing systems to impact millions of users.
Top Skills: AmplitudeGa4JavaScriptMixpanelNode.jsNoSQLReactSQLTypescript
37 Minutes Ago
Remote
India
Entry level
Entry level
Machine Learning • Natural Language Processing
Join a community of linguists and contribute to AI through annotation, evaluation, and prompt creation in a flexible, remote role.
Top Skills: Digital Tools
Description

Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of , the pioneer of in-session marketing, is looking to add talented team members to help us grow into the premier revenue tool for e-commerce. We work with some of the leading brands nationwide and we innovate how brands connect with and convert customers.

Job Description

This position offers a hands-on, technical opportunity as a vital member of the Site Reliability Engineering Group. Our SRE team is dedicated to ensuring that our Cloud platform operates seamlessly, efficiently, and reliably at scale. The ideal candidate will bring over five years of experience managing cloud-based Big Data solutions, with a strong commitment to resolving operational challenges through automation and sophisticated software tools.

Candidates must uphold a high standard of excellence and possess robust communication skills, both written and verbal. A strong customer focus and deep technical expertise in areas such as Linux, automation, application performance, databases, load balancers, networks, and storage systems are essential.

Key Responsibilities:

As a Session AI SRE, you will:

  • Design and implement solutions that enhance the availability, performance, and stability of our systems, services, and products.
  • Develop, automate, and maintain infrastructure as code for provisioning environments in AWS, Azure, and GCP.
  • Deploy modern automated solutions that enable automatic scaling of the core platform and features in the cloud.
  • Apply cybersecurity best practices to safeguard our production infrastructure.
  • Collaborate on DevOps automation, continuous integration, test automation, and continuous delivery for the Session AI platform and its new features.
  • Manage data engineering tasks to ensure accurate and efficient data integration into our platform and outbound systems.
  • Utilize expertise in DevOps best practices, shell scripting, Python, Java, and other programming languages, while continually exploring new technologies for automation solutions.
  • Design and implement monitoring tools for service health, including fault detection, alerting, and recovery systems.
  • Oversee business continuity and disaster recovery operations.
  • Create and maintain operational documentation, focusing on reducing operational costs and enhancing procedures.
  • Demonstrate a continuous learning attitude with a commitment to exploring emerging technologies.
Preferred Skills:
  • Experience with cloud platforms like AWS, Azure, and GCP, including their management consoles and CLI.
  • Proficiency in building and maintaining infrastructure on:
    • AWS using services such as EC2, S3, ELB, VPC, CloudFront, Glue, Athena, etc.
    • Azure using services such as Azure VMs, Blob Storage, Azure Functions, Virtual Networks, Azure Active Directory, Azure SQL Database, etc.
    • GCP using services such as Compute Engine, Cloud Storage, Cloud Functions, VPC, Cloud IAM, BigQuery, etc.
  • Expertise in Linux system administration and performance tuning.
  • Strong programming skills in Python, Bash, and NodeJS.
  • In-depth knowledge of container technologies like Docker and Kubernetes.
  • Experience with real-time, big data platforms including architectures like HDFS/Hbase, Zookeeper, and Kafka.
  • Familiarity with central logging systems such as ELK (Elasticsearch, LogStash, Kibana).
  • Competence in implementing monitoring solutions using tools like Grafana, Telegraf, and Influx.
Benefits
  • Comparable salary package and stock options
  • Opportunity for continuous learning
  • Fully sponsored EAP services
  • Excellent work culture
  • Opportunity to be an integral part of our growth story and grow with our company
  • Health insurance for employees and dependents
  • Flexible work hours
  • Remote-friendly company

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account