Design, build, and maintain scalable data pipelines and ETL processes using PySpark/Apache Spark. Ensure data quality, optimize workflows, troubleshoot performance, document pipelines, and support/junior team members while collaborating with cross-functional teams.
Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : Apache Spark
Minimum 3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. This role requires creating efficient data pipelines to facilitate smooth data flow and ensuring the integrity and quality of data throughout its lifecycle. The position also involves implementing extract, transform, and load processes to enable seamless migration and deployment of data across various systems, contributing to the overall data infrastructure and operational efficiency within the organization.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable solutions.
- Monitor and optimize data workflows to improve performance and reliability.
- Document processes and maintain clear communication regarding data pipeline status and issues.
- Support junior team members by sharing knowledge and assisting with technical challenges.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in PySpark, Apache Spark.
- Experience in building and managing large-scale data processing systems.
- Strong knowledge of data pipeline architecture and ETL frameworks.
- Ability to troubleshoot and resolve data quality and performance issues.
- Familiarity with distributed computing concepts and big data technologies.
Additional Information:
- The candidate should have minimum 3 years of experience in PySpark.
- This position is based at our Mumbai office.
- A 15 years full time education is required.
15 years full time education
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : Apache Spark
Minimum 3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. This role requires creating efficient data pipelines to facilitate smooth data flow and ensuring the integrity and quality of data throughout its lifecycle. The position also involves implementing extract, transform, and load processes to enable seamless migration and deployment of data across various systems, contributing to the overall data infrastructure and operational efficiency within the organization.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable solutions.
- Monitor and optimize data workflows to improve performance and reliability.
- Document processes and maintain clear communication regarding data pipeline status and issues.
- Support junior team members by sharing knowledge and assisting with technical challenges.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in PySpark, Apache Spark.
- Experience in building and managing large-scale data processing systems.
- Strong knowledge of data pipeline architecture and ETL frameworks.
- Ability to troubleshoot and resolve data quality and performance issues.
- Familiarity with distributed computing concepts and big data technologies.
Additional Information:
- The candidate should have minimum 3 years of experience in PySpark.
- This position is based at our Mumbai office.
- A 15 years full time education is required.
15 years full time education
About Accenture
Accenture is a leading global professional services company that helps the world’s leading businesses, governments and other organizations build their digital core, optimize their operations, accelerate revenue growth and enhance citizen services—creating tangible value at speed and scale. We are a talent- and innovation-led company with approximately 791,000 people serving clients in more than 120 countries. Technology is at the core of change today, and we are one of the world’s leaders in helping drive that change, with strong ecosystem relationships. We combine our strength in technology and leadership in cloud, data and AI with unmatched industry experience, functional expertise and global delivery capability. Our broad range of services, solutions and assets across Strategy & Consulting, Technology, Operations, Industry X and Song, together with our culture of shared success and commitment to creating 360° value, enable us to help our clients reinvent and build trusted, lasting relationships. We measure our success by the 360° value we create for our clients, each other, our shareholders, partners and communities.Visit us at www.accenture.com
Equal Employment Opportunity Statement
We believe that no one should be discriminated against because of their differences. All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, military veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by applicable law. Our rich diversity makes us more innovative, more competitive, and more creative, which helps us better serve our clients and our communities.
Accenture Mumbai, Maharashtra, IND Office
Ganpatrao Kadam Marg, off Senapati Bapat Marg, Lower Parel West, Lower Parel, Mumbai, Maharashtra, India, 400013
Similar Jobs
Artificial Intelligence • Automotive • Computer Vision • Information Technology • Internet of Things • Logistics • Software
Senior GIS Data Engineer responsible for developing and maintaining geospatial datasets and data pipelines. Build, run, and optimize ETL workflows (batch to flow), transform and integrate spatial data using FME/ArcGIS/Python/SQL, ensure data quality, provide GIS consultancy, and collaborate cross-functionally to deliver scalable mapping solutions and improve processes.
Top Skills:
ArcgisData ScienceFmeGenerative AiLlmMachine LearningPythonSQL
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Build, test, and maintain GCP-based data pipelines and ETL/ELT processes using Python and SQL. Optimize BigQuery queries, design data models, ensure data quality, troubleshoot pipeline issues, document solutions, and collaborate with analysts and engineers to deliver scalable data warehouse solutions.
Top Skills:
BigQueryGoogle Cloud PlatformPub/SubPythonSQL
Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
Design and implement data solutions by translating business problems into technical designs; lead modules, mentor junior team members, validate architectures, develop ETL/warehouse/reporting solutions, and support project delivery including planning, testing, deployment and client collaboration.
Top Skills:
Big DataCloud PlatformsData AnalyticsData ManagementData WarehousingPythonRdbmsReportingSQL
What you need to know about the Mumbai Tech Scene
From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.



