Pod Network Logo

Pod Network

Site Reliability Engineer (APAC)

Posted 6 Hours Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Japan
Mid level
In-Office or Remote
Hiring Remotely in Japan
Mid level
Operate and improve the Pod platform: respond to incidents, investigate root causes, build automation and observability, design monitoring/alerting, reduce alert fatigue, and drive reliability improvements across production systems.
The summary above was generated by AI

Pod is building a next-generation decentralized exchange focused on fairness, performance, and user experience. We believe traders shouldn't have to choose between speed, simplicity, and fair treatment, so we're building an exchange that delivers all three while enabling entirely new kinds of financial markets.

Under the hood, Pod is powered by low-latency systems designed for fast settlement and strong guarantees around ordering, timing, and execution. These are challenging engineering problems, and the reliability of the platform depends on operating those systems safely and effectively at scale.

About the Role:

We're looking for our first Site Reliability Engineer to help operate, improve, and scale the reliability of the Pod platform.

You'll join a team of engineers who already share responsibility for production systems and participate in an established on-call rotation. From day one, you'll work closely with the broader engineering team while taking ownership of the tooling, processes, and operational practices that keep the platform running smoothly.

This is a hands-on role for someone who enjoys operating complex systems, investigating difficult production issues, and building the automation and infrastructure that turn reliability into a competitive advantage.

On Call:

You'll be responsible for platform health during Asian business hours as part of our existing engineering on-call rotation. There are no permanent overnight shifts, and you'll never be the sole person responsible for the platform—the rest of the rotation is covered by the wider team. Occasionally, you may flex outside your normal hours to help cover the schedule, but that's the exception rather than the rule.

What You’ll Do:

Respond to and resolve incidents:

  • Monitor the health and performance of the platform

  • Respond to production incidents and drive them through to resolution

  • Investigate failures, identify root causes, and coordinate fixes

  • Ensure issues are detected, understood, and addressed quickly

Improve platform reliability:

  • Identify recurring operational pain points and eliminate them

  • Improve software, deployment processes, and operational workflows

  • Participate in incident reviews and help drive preventative improvements

  • Contribute reliability-focused changes directly to production systems

Build observability and operational tooling:

  • Design and maintain dashboards, metrics, alerting, and monitoring systems

  • Improve signal quality while reducing alert fatigue

  • Build automation and internal tools that make the platform easier to operate

  • Help establish reliability best practices across the engineering organization

Qualifications:
  • Strong experience with Linux and cloud infrastructure

  • Experience operating and supporting production systems

  • Experience with Docker and containerized environments

  • Experience with observability and incident-management tools such as Grafana, Prometheus, PagerDuty, or similar

  • Ability to automate workflows using Rust, Python, Bash, or similar languages

  • Strong troubleshooting and debugging skills

  • A high degree of ownership and the ability to make sound decisions independently

Nice to Have:
  • Experience with distributed systems

  • Experience operating high-availability, low-latency services

  • Experience with CI/CD systems and deployment automation

  • Experience designing secure operational workflows and access controls

  • No prior blockchain or cryptocurrency experience is required.

What we offer:
  • Competitive compensation (~$100k USD/year), plus a meaningful token/equity allocation

  • Real ownership and responsibility from day one as part of a small team

  • Work from wherever you are within the target timezone range (UTC+7 to UTC+1)

  • Occasional travel to Europe and elsewhere for team meetups

 

Similar Jobs

11 Days Ago
Remote
Senior level
Senior level
Computer Vision • Machine Learning • Software
Lead observability, incident management, and reliability for Ditto's edge-to-cloud infrastructure. Build monitoring (Prometheus, Grafana, Datadog), define SLOs, automate recovery and tooling, author runbooks, collaborate with product teams, and participate in on-call rotations to ensure scalable, enterprise-grade system resilience.
Top Skills: AWSAzureC/C++DatadogGCPGoGrafanaHelmJavaPrometheusPythonRustTerraform
An Hour Ago
Remote
Junior
Junior
Artificial Intelligence • Productivity • Software • Automation
Provide frontline IT support for Zapier employees in AU/NZ: troubleshoot Mac hardware/software, manage user accounts and Jamf tasks, escalate complex issues, create runbooks and self-service resources, assist with SSO/SAML integrations and build simple automations in Okta Workflows and Zapier.
Top Skills: 1PasswordGoogle WorkspaceJAMFJIRAmacOSOktaOkta WorkflowsSAMLSlackSsoZapierZoom
2 Hours Ago
Remote
Senior level
Senior level
Artificial Intelligence • Hardware • Information Technology • Machine Learning
The role involves providing engineering support for the planning, development, and maintenance of mechanical systems in semiconductor facilities, enhancing system performance and sustainability.
Top Skills: Building SystemsChilled WaterCleanroom HvacCompressed AirFire Protection SystemsGeneral HvacHeating WaterMakeup AirProcess Cooling WaterProcess ExhaustProcess VacuumRecirculation Air

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account