GroundTruth

Engineering Manager - Data Engineer

Posted 12 Days Ago
Remote
Hiring Remotely in India
Senior level
The Engineering Manager leads the Data Engineering team, overseeing the design of scalable data pipelines using AWS technologies, mentoring engineers, and collaborating with stakeholders on data-first initiatives.
The summary above was generated by AI

GroundTruth is an advertising platform that turns real-world behavior into marketing that drives in-store visits and other real business results. We use observed real-world consumer behavior, including location and purchase data, to create targeted advertising campaigns across all screens, measure how consumers respond, and uncover unique insights to help optimize ongoing and future marketing efforts.

With this focus on media, measurement, and insights, we provide marketers with tools to deliver media campaigns that drive measurable impact, such as in-store visits, sales, and more.

Learn more at groundtruth.com.

We believe that innovative technology starts with the best talent and have been ranked one of Ad Age’s Best Places to Work in 2021, 2022, 2023 & 2025! Learn more about the perks of joining our team here.

About Us

GroundTruth is looking for a Data Engineering Manager with strong expertise in designing and building scalable data platforms and pipelines to join our team. The Data Engineering Team is responsible for the core data infrastructure that powers our audience platform.
As an Engineering Manager on our Audience Engineering team, you will build solutions that add new data capabilities and analytical depth to our platform while managing sophisticated AWS-native data services.

You will:

  • Architect Scalable Pipelines: Oversee the design and deployment of large-scale distributed data processing jobs using PySpark on Amazon EMR clusters and serverless AWS Glue ETL jobs.
  • Coach and mentor engineers—supporting growth in technical skills (particularly Python and Spark optimization), data modeling best practices, and career progression.
  • Partner with stakeholders and engineering leadership to evaluate, plan, and deliver data-first projects across advertising systems, analytics services, and reporting features.
  • Lead by example: Write production-ready Python and PySpark code, perform code reviews, and optimize Spark configurations to improve performance and reduce costs.
  • Apply Agile methodologies such as Scrum to drive iterative development, foster team collaboration, and ensure continuous delivery of high-quality data solutions.
  • Support engineers through regular 1:1s, feedback, quarterly reviews, recognition, and performance management.

You have:

  • Bachelor’s degree in Computer Engineering, Data Science, or equivalent practical experience.
  • 8+ years of experience in technology, specifically focused on data engineering, data warehousing, or big data architecture.
  • 2+ years of experience leading a data engineering team.
  • Expertise in Python & PySpark: Deep experience writing and tuning distributed processing applications, handling data skew, and optimizing Spark memory management.
  • Advanced AWS Expertise: Proven track record of managing Amazon EMR for heavy-duty processing.
  • Experience with Big Data Infrastructure: Building the systems required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies (S3, EMR, Glue, Athena, and Lambda).
  • Expert SQL skills for complex transformations, performance tuning, and deep-dive analytics.
  • Experience with Orchestration: Advanced proficiency with Airflow and Git.
  • AI-Driven Engineering: Proven track record of leveraging AI across the data engineering process to drive modernization, automate data quality checks, and enhance delivery outcomes.
  • Hands-on familiarity with AI-native tools such as Cursor, Claude, or GitHub Copilot to scale data development.

How you can impress us:

  • Performance Tuning Specialist: Ability to debug complex PySpark and/or Scala jobs and optimize EMR Instance Fleets/Spot Instances to balance performance with infrastructure costs.
  • Nice to have: Experience with event-driven architecture and hands-on experience using AWS SQS for scalable, reliable event processing.
  • AWS certification is preferred, demonstrating expertise in designing and building scalable cloud-based data solutions.
  • Organized and collaborative—comfortable in a fast-moving, data-intensive environment.
  • Detail-oriented: Catches data quality issues early and implements automated course-corrections.
  • Strong communicator who aligns business needs with technical data constraints through clear trade-offs.
  • Deep problem solver who diagnoses pipeline bottlenecks and partners across teams to drive durable data solutions.

Benefits

At GroundTruth, we want our employees to be comfortable with their benefits so they can focus on doing the work they love.

  • Parental leave: maternity and paternity
  • Flexible time off (earned leave, sick leave, birthday leave, bereavement leave, and company holidays)
  • In-office daily catered breakfast, lunch, snacks, and beverages
  • Health cover for any hospitalization. Covers both nuclear family and parents
  • Tele-med for free doctor consultation, discounts on health checkups and medicines
  • Wellness/Gym Reimbursement
  • Pet Expense Reimbursement
  • Childcare Expenses and reimbursements
  • Employee referral program
  • Education reimbursement program
  • Skill development program
  • Cell phone reimbursement (Mobile Subsidy program)
  • Internet reimbursement, postpaid cell phone bill reimbursement, or both
  • Birthday treat reimbursement
  • Employee Provident Fund Scheme offering different tax-saving options, such as Voluntary Provident Fund, with employee and employer contributions of up to 12% of basic salary
  • Creche reimbursement
  • Co-working space reimbursement
  • National Pension System employer match
  • Meal card for tax benefit
  • Special benefits on salary account

Top Skills

Airflow
Athena
AWS
AWS Glue
EMR
Git
Lambda
PySpark
Python
S3
SQL
