Code and Theory Logo

Code and Theory

Senior Data Quality Engineer, ML (India)

Posted 5 Days Ago
Be an Early Applicant
In-Office
Bengaluru, Bengaluru Urban, Karnataka
Senior level
In-Office
Bengaluru, Bengaluru Urban, Karnataka
Senior level
As a Senior Data Quality Engineer for ML, you will evaluate outputs from large language models using Python and SQL, implement quality metrics, and automate evaluation pipelines, while collaborating with multidisciplinary teams to ensure software quality.
The summary above was generated by AI

Our AI/ML engineering team ensures Code and Theory delivers innovative, immersive web experiences that delight our clients and their customers. We are always striving to balance the demanding nature of working on cutting-edge technologies with the real-world demands of high performance, high security, and accessibility. Working in collaboration with our multi-disciplinary engineering, design, and quality assurance teams, you will build software that solves real-world problems for incredible clients. 

WHAT YOU’LL DO

  • Write Python and SQL scripts to evaluate outputs from large language models (LLMs)
  • Design and implement LLM-as-Judge evaluations with clear scoring rubrics (faithfulness, relevance, completeness, correctness)
  • Define and calculate quality metrics such as exact match, token-level F1, ROUGE, and subjective rubric scores
  • Build and maintain ground-truth datasets for benchmarking and regression testing
  • Automate evaluation pipelines and integrate them into CI/CD workflows
  • Conduct in-depth analysis of large unstructured datasets to identify inconsistencies, anomalies, missing values, and potential biases
  • Diagnose and report failure modes (hallucinations, irrelevant answers, formatting errors)
  • Collaborate and serve as a crucial link between AI engineers, QA, data scientists and product managers to set quality standards and release criteria
  • Document processes and maintain reproducibility of evaluation runs
  • Create comprehensive technical documentation, including design specifications, architecture diagrams, and code comments

WHAT YOU’LL NEED

  • 6-8 years of experience in data quality engineering, with hands-on expertise in Python, SQL, automated evaluation pipelines, LLM quality metrics, and end-to-end data validation across complex datasets and AI/ML systems
  • Strong proficiency in Python and SQL (data handling, scripting, test automation)
  • Experience with data cleaning and standardization techniques to facilitate ingestion and analysis by various teams
  • Understanding of generative AI concepts (prompts, hallucinations, grounding)
  • Experience designing structured LLM prompts for evaluations
  • Familiarity with at least one evaluation framework (RAGAS, DeepEval, TruLens, LangSmith) or ability to learn quickly
  • Familiarity with cloud runs and automation (GCP preferred) or ability to learn quickly
  • Ability to translate ambiguous quality expectations into measurable metrics
  • Excellent problem-solving abilities and analytical thinking
  • Effective communication skills to collaborate with cross-functional teams and present technical concepts to both technical and non-technical stakeholders

ABOUT US

Born in 2001, Code and Theory is a digital-first creative agency that sits at the center of creativity and technology. We pride ourselves on not only solving consumer and business problems, but also helping to establish new capabilities for our clients. With a global client roster of Fortune 100s and start-ups alike, we crave the hardest problems to solve. We have teams distributed across North America, South America, Europe, and Asia. The Code and Theory global network of agencies is growing and includes Kettle, Instrument, Left Field Labs, Create Group, Mediacurrent, Rhythm, and TrueLogic.

Striving never to be pigeonholed, we work across every major category: from tech to CPG, financial services to travel & hospitality, government and education to media and publishing. We value the collaboration with our client partners, including but not limited to Adidas, Amazon, Con Edison, Diageo, EY, J.P. Morgan Chase, Lenovo, Marriott, Mars, Microsoft, Thomson Reuters, and TikTok.

The Code and Theory network is comprised of nearly 2,000 people with 50% engineers and 50% creative talent. We’re always on the lookout for smart, driven, and forward-thinking people to join our team.

Top Skills

GCP
Python
SQL

Similar Jobs

47 Minutes Ago
Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Fintech • Financial Services
The Lead Digital Product Manager will develop and execute digital strategies, collaborate with teams to enhance product experiences, and lead initiatives impacting liquidity solutions.
Top Skills: AIMl
48 Minutes Ago
Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
Senior level
Senior level
Fintech • Financial Services
The Lead Operational Risk Officer will assess, mitigate operational risks, develop training, and consult on risk management across business units.
Top Skills: Artificial IntelligenceMS OfficeTableau
49 Minutes Ago
Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
Entry level
Entry level
Fintech • Financial Services
The Associate Fraud & Claims Operations Representative conducts risk reviews, validates fraud and operational risks, and supports customer claims while providing exceptional service in a call center environment.
Top Skills: Internal SystemsRisk Assessment Tools

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account