Roche Logo

Roche

Principal Data Engineer

Posted Yesterday
Be an Early Applicant
In-Office
Pune, Mahārāshtra
Expert/Leader
In-Office
Pune, Mahārāshtra
Expert/Leader
The Principal Data Engineer will design and optimize data architectures on AWS, lead teams, ensure data accuracy, and drive GenAI strategies for data solutions in healthcare.
The summary above was generated by AI

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections,  where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The PositionDESCRIPTION

We are looking for a Principal Data Engineer to join our growing team of Advanced Data Analytics experts. This role calls for a seasoned expert with a robust mix of technical prowess, comprehensive domain knowledge, and strong technical leadership abilities.

The ideal candidate is a hands-on technical leader who thrives on meeting data requirements across different products while driving the long-term technological vision of the department. You will be responsible for ensuring data accuracy and availability while spearheading future-ready strategies for GenAI and advanced analytics. You will work closely with System Architects, Product Owners, and Engineers to deliver scalable, cost-efficient, and high-impact data initiatives.

LOCATION

Pune, India

JOB TYPE

Full Time

KEY RESPONSIBILITIES
  • Define and drive the future strategy for GenAI and emerging technologies to uncover hidden patterns and drive decision-making across diagnostic products.

  • Design, implement, and optimize data architectures using AWS services, ensuring seamless integration and high performance.

  • Drive the design and development of efficient data processing workflows using PySpark, SparkSQL, SQL, and modern formats like Iceberg and Parquet.

  • Optimize the performance, scalability, and cost efficiency of data infrastructure and AWS service consumption.

  • Provide technical direction and mentorship to a team of developers, enforcing code quality standards and best practices for testing and CI/CD.

  • Collaborate closely with engineering and business stakeholders to translate complex requirements into impactful data-driven solutions.

  • Advocate for data security, governance, and compliance, ensuring all products are compliant with HIPPA, GDPR etc.

REQUIRED EXPERIENCE, SKILLS & QUALIFICATIONS
  • Minimum 12-15 years of hands-on data engineering experience, including significant leadership in technical projects.

  • Expert-level proficiency in AWS services including S3, Redshift, Glue, Athena, EMR, Step Functions, and MWAA.

  • Mastery of Python, SQL, and data processing frameworks such as Apache Spark.

  • Proven experience with Dimensional Modeling, Data Partitioning, and Infrastructure as Code using Terraform.

  • Demonstrated ability to provide technical direction, guide architectural decisions, and mentor cross-functional teams.

  • Excellent skills in bridging the gap between business needs and technical implementation for both technical and non-technical audiences.

DESIRED EXPERIENCE, SKILLS & QUALIFICATIONS
  • Experience in the Healthcare Laboratory domain and familiarity with regulations like HIPAA, HL7, and FHIR is a significant plus.

  • Hands-on experience implementing GenAI and Machine Learning technologies within a SaaS or Cloud application environment.

  • Experience in designing and implementing large-scale Data Lakes and Data Warehouses with a focus on long-term scalability.

WHY JOIN US?
  • Collaborative Culture: Engage with a diverse team of talented professionals.

  • Innovative Environment: Work on cutting-edge SaaS products and define the future of diagnostic data.

  • Growth Opportunities: Take on challenging projects that offer continuous learning and career development.

EDUCATION
  • Bachelor’s or Master’s degree in Engineering.

 

 

Who we are

A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.


Let’s build a healthier future, together.

Roche is an Equal Opportunity Employer.

Top Skills

Spark
Athena
AWS
Emr
Glue
Iceberg
Mwaa
Parquet
Pyspark
Python
Redshift
S3
Sparksql
SQL
Step Functions
Terraform

Roche Mumbai, Maharashtra, IND Office

5th & 6th floor, Silver Utopia, B 501 / 601 B, Cardinal Gracious Rd, Chakala, Andheri East, Mumbai, Maharashtra, India, 400069

Similar Jobs

6 Days Ago
In-Office
Mumbai, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Software
The Principal AI Engineer will design and deploy end-to-end AI products, manage customer engineering, and translate complex requirements into technical solutions while ensuring high standards of code quality and deployment efficiency.
Top Skills: Anthropic ApisAWSAzureGCPHuggingfaceKubernetesLangchainLlamaindexOpenai ApisPineconePythonPyTorchTerraformWeaviate
Yesterday
Hybrid
Expert/Leader
Expert/Leader
Software • Hospitality
The Principal Data Engineer will architect and implement data pipelines, optimize data models, mentor the engineering team, and collaborate with executives on data strategies.
Top Skills: DatabricksDbtDelta LakePhotonPythonSparkSQLUnity Catalog
8 Days Ago
Hybrid
Expert/Leader
Expert/Leader
Artificial Intelligence • Information Technology • Software
Lead architecture and design of data platforms, build data pipelines, implement data models, support ML workflows, mentor engineers, and drive data platform best practices.
Top Skills: Ai-Assisted ToolsAWSAzureBigQueryData LakesData ModelingData WarehousesDataflowDistributed Data ProcessingEltETLGCPHadoopKafkaLakehouse ArchitecturesLlmsPythonRedshiftS3SparkSQL

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account