Develop robust data pipelines, optimize performance, and troubleshoot data issues, utilizing technologies like Spark, SQL, and Hadoop.
Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Data Engineer II
Data Engineer II
Overview:
Our Purpose
We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation, and delivers better business results.
Job Title
Data Engineer II
Who is Mastercard?
Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential.
Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. With connections across more than 210 countries and territories, we are building a sustainable world that unlocks priceless possibilities for all.
Our Team:
Product Data Analytics has pioneered a transaction-based revenue methodology to inform and guide Mastercard's Strategy, Product Design, and Innovation. We build internal analytics partnerships, strengthening their focus on the health of the business, portfolio and revenue optimization opportunities, initiative tracking, new product development and Go-To Market strategies.
• Are you excited about Data Assets and the value they bring to an organization? • Are you an evangelist for data driven decision making? • Are you motivated to be part of a Global Analytics team that builds large scale Analytical Capabilities supporting end users across 6 continents? • Do you want to be the go-to resource for data science & analytics in the company?
The Role:
Work with Global pricing and interchange strategy team to understand requirement and can design, build, and optimize robust data pipelines at scale. You'll work across Python/Scala/Spark, modern cloud platforms, and big data ecosystems to enable reliable data movement, transformation, and analysis for high impact business use cases. If you enjoy solving complex data problems, tuning performance, and modernizing legacy data flows
Key Responsibilities:• Design & Build Pipelines: Develop resilient, scalable data pipelines using Spark (PySpark), Hive/Impala, SQL, and orchestration tools (e.g., Airflow, NiFi).
.Orchestration & Scheduling: Implement and maintain job schedules, SLAs, retries, alerts, and dependencies using Airflow/NiFi or similar schedulers.• Data Integration: Create robust ingestion patterns for batch and streaming from diverse sources-files, APIs, RDBMS, NoSQL.• Data Modeling & Flow Analysis: Analyze existing pipelines and underlying data models to understand lineage, dependencies, and flow; document and improve them.• Troubleshooting & Root Cause: Independently investigate and resolve data pipeline failures, data quality issues, and performance bottlenecks.• Performance Optimization: Identify Spark performance bottlenecks and tune (e.g., partitioning, caching, join strategy, shuffle optimization, broadcast hints).• Code Modernization: Understand and refactor legacy pipelines; propose alternatives aligned with best practices and modern platforms.• Data Formats & Transformation: Handle CSV, JSON, XML, and multi layered transformation approaches (bronze/silver/gold).• Analytics Enablement: Build Spark/Python/Impala applications for ingestion and data analysis; enable downstream consumption in SQL/NoSQL and BI tools.• Collaboration: Partner with data scientists, analysts, and platform teams to operationalize models and enable self service analytics.
Required Qualifications:• 4-6 years of data engineering experience.• Strong programming skills in Python with production-grade Spark (RDD/DataFrame) development.• Solid experience with Hadoop ecosystem (HDFS, YARN), Hive, Impala.• Advanced SQL skills with query optimization for large datasets (e.g., window functions, CTEs, partition pruning).• Hands-on with job scheduling/orchestration tools such as Apache Airflow and/or Apache NiFi.• Experience building pipelines via Alteryx and/or SSIS (or willingness to modernize from these).• Exposure to cloud platforms (Databricks) and modern data warehouses.• Proven ability to analyze data pipelines and debug data quality/lineage issues end to end.• Experience working with structured, semi structured, and unstructured data.
Familiarity with NoSQL databases and building Spark apps on top of Hadoop/SQL/NoSQL stores.• Data visualization using Power BI/Tableau
Preferred / Good to Have:• AI/ML Knowledge
Tools & Environment :• Big Data: Hadoop, Spark, Hive, Impala• Scheduling: Apache Airflow/NiFi• ETL: Alteryx, SSIS• Data Stores: SQL Server, other SQL/NoSQL• Cloud: Databricks• Languages: Python, SQL• Visualization: Power BI/Tableau• Formats: CSV, JSON, XML, Parquet/ORC
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Data Engineer II
Data Engineer II
Overview:
Our Purpose
We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation, and delivers better business results.
Job Title
Data Engineer II
Who is Mastercard?
Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential.
Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. With connections across more than 210 countries and territories, we are building a sustainable world that unlocks priceless possibilities for all.
Our Team:
Product Data Analytics has pioneered a transaction-based revenue methodology to inform and guide Mastercard's Strategy, Product Design, and Innovation. We build internal analytics partnerships, strengthening their focus on the health of the business, portfolio and revenue optimization opportunities, initiative tracking, new product development and Go-To Market strategies.
• Are you excited about Data Assets and the value they bring to an organization? • Are you an evangelist for data driven decision making? • Are you motivated to be part of a Global Analytics team that builds large scale Analytical Capabilities supporting end users across 6 continents? • Do you want to be the go-to resource for data science & analytics in the company?
The Role:
Work with Global pricing and interchange strategy team to understand requirement and can design, build, and optimize robust data pipelines at scale. You'll work across Python/Scala/Spark, modern cloud platforms, and big data ecosystems to enable reliable data movement, transformation, and analysis for high impact business use cases. If you enjoy solving complex data problems, tuning performance, and modernizing legacy data flows
Key Responsibilities:• Design & Build Pipelines: Develop resilient, scalable data pipelines using Spark (PySpark), Hive/Impala, SQL, and orchestration tools (e.g., Airflow, NiFi).
.Orchestration & Scheduling: Implement and maintain job schedules, SLAs, retries, alerts, and dependencies using Airflow/NiFi or similar schedulers.• Data Integration: Create robust ingestion patterns for batch and streaming from diverse sources-files, APIs, RDBMS, NoSQL.• Data Modeling & Flow Analysis: Analyze existing pipelines and underlying data models to understand lineage, dependencies, and flow; document and improve them.• Troubleshooting & Root Cause: Independently investigate and resolve data pipeline failures, data quality issues, and performance bottlenecks.• Performance Optimization: Identify Spark performance bottlenecks and tune (e.g., partitioning, caching, join strategy, shuffle optimization, broadcast hints).• Code Modernization: Understand and refactor legacy pipelines; propose alternatives aligned with best practices and modern platforms.• Data Formats & Transformation: Handle CSV, JSON, XML, and multi layered transformation approaches (bronze/silver/gold).• Analytics Enablement: Build Spark/Python/Impala applications for ingestion and data analysis; enable downstream consumption in SQL/NoSQL and BI tools.• Collaboration: Partner with data scientists, analysts, and platform teams to operationalize models and enable self service analytics.
Required Qualifications:• 4-6 years of data engineering experience.• Strong programming skills in Python with production-grade Spark (RDD/DataFrame) development.• Solid experience with Hadoop ecosystem (HDFS, YARN), Hive, Impala.• Advanced SQL skills with query optimization for large datasets (e.g., window functions, CTEs, partition pruning).• Hands-on with job scheduling/orchestration tools such as Apache Airflow and/or Apache NiFi.• Experience building pipelines via Alteryx and/or SSIS (or willingness to modernize from these).• Exposure to cloud platforms (Databricks) and modern data warehouses.• Proven ability to analyze data pipelines and debug data quality/lineage issues end to end.• Experience working with structured, semi structured, and unstructured data.
Familiarity with NoSQL databases and building Spark apps on top of Hadoop/SQL/NoSQL stores.• Data visualization using Power BI/Tableau
Preferred / Good to Have:• AI/ML Knowledge
Tools & Environment :• Big Data: Hadoop, Spark, Hive, Impala• Scheduling: Apache Airflow/NiFi• ETL: Alteryx, SSIS• Data Stores: SQL Server, other SQL/NoSQL• Cloud: Databricks• Languages: Python, SQL• Visualization: Power BI/Tableau• Formats: CSV, JSON, XML, Parquet/ORC
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard's security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach, and
- Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
Top Skills
Alteryx
Apache Airflow
Apache Nifi
Databricks
Hadoop
Hive
Impala
Power BI
Python
Scala
Spark
SQL
Ssis
Tableau
Mastercard Mumbai, Maharashtra, IND Office
Bandra Kurla Complex Road, Mumbai, Maharashtra, India, 400051
Similar Jobs at Mastercard
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Manage regulatory readiness and responses for Treasury settlement across APAC. Monitor developments, coordinate regulatory inquiries and reports, maintain controls/documentation, support Legal and internal stakeholders, and lead governance and settlement-related projects.
Top Skills:
ExcelPowerPoint
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Work on small teams to solve clients' strategic problems using data-driven solutions; build client relationships; perform quantitative analysis; collaborate across functions; receive mentorship and training; support business development, account and product management; deliver high-quality recommendations and contribute from day one.
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Lead preparation and execution of Southeast Europe contra revenue, manage product revenue streams and deal workbooks, support month-end close, develop forecasts/budgets in Hyperion/CRS, provide advanced analytics and insights, partner with regional stakeholders, identify risks/opportunities, and drive process improvements and automation.
Top Skills:
Excel,Alteryx,Hyperion,Crs Budgeting And Forecasting Tool,Salesforce
What you need to know about the Mumbai Tech Scene
From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

