Takeda

Data Engineering - Gen AI Product Manager

Posted Yesterday

Hybrid

Bengaluru, Karnataka

Senior level

Job Description
The Future Begins Here
At Takeda, we are leading digital evolution and global transformation. By building innovative solutions and future-ready capabilities, we are meeting the needs of patients, our people, and the planet.
Bengaluru, India's epicenter of innovation, has been selected to be home to Takeda's recently launched Innovation Capability Center (ICC). We invite you to join our digital transformation journey. In this role, you will have the opportunity to boost your skills and become the heart of an innovative engine that is contributing to global impact and improvement.
At Takeda's ICC we Unite in Diversity
Takeda is committed to creating an inclusive and collaborative workplace, where individuals are recognized for the backgrounds and abilities they bring to our company. We are continuously improving our collaborators' journey at Takeda, and we welcome applications from all qualified candidates. Here, you will feel welcomed, respected, and valued as an important contributor to our diverse team.
Takeda is a global, values-based, R&D-driven biopharmaceutical leader committed to bringing better health and a brighter future to people worldwide. Our passion and pursuit of potentially life-changing treatments for patients are deeply rooted in over 240 years of heritage.
This position focuses on overseeing enterprise data engineering strategies, including advanced analytics and GenAI industrialization. The role involves managing the end-to-end lifecycle of enterprise data across diverse platforms while ensuring the integrity, availability, scalability, and performance of data pipelines and platforms.
The ideal candidate will possess a deep understanding of modern data engineering practices, Databricks capabilities for GenAI workflows, proactive monitoring and observability, and a proven track record in leading cross-functional teams to deliver impactful AI-driven data solutions.
Accountabilities & Responsibilities
Data Pipeline Development and Delivery:

Design and implement scalable data pipelines on Databricks/Informatica to enable seamless ingestion, transformation, and delivery of high-quality data across enterprise platforms.

Leverage Databricks' capabilities, including Delta Lake and workflows, to build robust pipelines that support data solutions, GenAI vector stores, and inference workflows.

Establish CI/CD pipelines for automating deployment and monitoring of data pipelines.

GenAI and Vector Database Integration:

Develop and optimize data workflows on Databricks for GenAI applications, including embeddings generation and fine-tuning datasets.

Design and manage vector databases like Milvus, Pinecone, or Weaviate for efficient storage and retrieval of embeddings for semantic search and AI-driven insights.

Collaborate with AI/ML teams to integrate Databricks/AWS/Azure with vector databases for end-to-end GenAI solutions, ensuring low-latency performance and scalability.

Incident Management:

Lead triaging, troubleshooting, and root cause analysis to resolve pipeline and data flow issues promptly.

Develop and execute action plans to prevent recurring issues within environments and related integrations.

Data Governance:

Drive the adoption of federated governance practices and implement Databricks Unity Catalog/Immuta for secure and compliant data access management.

Manage and maintain an enterprise-level Informatica Data Catalog and Data Lineage system for transparency and traceability of workflows.

Performance and Scalability:

Utilize performance tuning features to ensure optimal operation of data pipelines and vector database integrations.

Collaborate with infrastructure teams to scale clusters and vector databases in alignment with business demands.

Self-Guided Data Enablement:

Build self-service data tools, enabling business teams to access and process data for analytics and data & AI experimentation.

Foster a culture of self-guided data management while reducing dependency on centralized data engineering resources.

Cost & Consumption Management:

Monitor usage and optimize costs using cluster configurations, autoscaling, and workload distribution.

Provide insights into resource consumption to improve planning and allocation.

Key Qualifications

7+ years in data engineering roles, with 3+ years in leadership positions.

Expertise in implementing and managing Databricks and Informatica, with hands-on experience in Delta Lake and in integrating Databricks Unity Catalog with Informatica IDMC.

Proven experience with GenAI data preparation, embeddings generation, and vector database management (e.g., Milvus, Pinecone).

Hands-on experience with cloud platforms like AWS and orchestration tools such as Airflow or Tidal.

Proficiency in building and managing ETL/ELT workflows, Data Warehousing, and Business Intelligence tools.

Deep knowledge of federated governance models, secure access management, and observability practices.

Demonstrated expertise in optimizing GenAI workflows on Databricks and integrating vector databases for AI/ML solutions.

Familiarity with cost and consumption optimization strategies for Databricks/Informatica/AWS.

Strong problem-solving, analytical, and communication skills with a collaborative leadership style.

Preferred Skills

Advanced knowledge of Databricks/Informatica IDMC features for GenAI workloads, including feature engineering and large-scale data processing.

Background in building self-service tools and promoting their adoption in AI-driven organizations.

What Takeda Can Offer You

Takeda is certified as a Top Employer, not only in India, but also globally. No investment we make pays greater dividends than taking good care of our people.
At Takeda, you take the lead on building and shaping your own career.
Joining the ICC in Bangalore will give you access to high-end technology, continuous training and a diverse and inclusive network of colleagues who will support your career growth.

Benefits
It is our priority to provide competitive compensation and a benefit package that bridges your personal life with your professional career. Amongst our benefits are:
Competitive Salary + Performance Annual Bonus
Flexible work environment, including hybrid working
Comprehensive Healthcare Insurance Plans for self, spouse, and children
Group Term Life Insurance and Group Accident Insurance programs
Employee Assistance Program
Broad Variety of learning platforms
Diversity, Equity, and Inclusion Programs
Reimbursements - Home Internet & Mobile Phone
Employee Referral Program
Leaves - Paternity Leave (4 weeks), Maternity Leave (up to 26 weeks), Bereavement Leave (5 days)

About ICC in Takeda

Takeda is leading a digital revolution. We're not just transforming our company; we're improving the lives of millions of patients who rely on our medicines every day.
As an organization, we are committed to our cloud-driven business transformation and believe the ICCs are the catalysts of change for our global organization.

Locations: IND - Bengaluru
Worker Type: Employee
Worker Sub-Type: Regular
Time Type: Full time

Top Skills

Airflow

AWS

Azure

Databricks

Delta Lake

Informatica

Milvus

Pinecone

Tidal

Weaviate
