Apiphany

Associate Data Scientist

Posted 6 Days Ago
Remote
Hiring Remotely in India
Junior
Prepare, clean, and validate structured and unstructured data for LLM-driven systems; build training datasets, support RAG and NL→SQL pipelines, perform data quality checks, and assist with data pipelines, APIs, and model evaluation.
The summary above was generated by AI
Role Overview

We are seeking an Associate Data Scientist to support AI/ML engineering efforts by preparing, validating, and structuring data for LLM-driven systems. This is a hands-on role focused on real-world data processing, pipeline support, and model evaluation.

Key Responsibilities
  • Process and clean structured and unstructured data for AI/ML pipelines.

  • Prepare training-ready datasets for LLM fine-tuning and evaluation workflows.

  • Support RAG and NL→SQL systems through data preparation and validation.

  • Perform data quality checks and ensure completeness and consistency.

  • Assist in building and maintaining data pipelines and APIs (e.g., FastAPI).

  • Collaborate with engineering teams to troubleshoot and optimize data workflows.

Required Skills
  • 1–3 years of experience in data processing or data-focused roles.

  • Strong Python skills with experience in data libraries (Pandas, NumPy, Scikit-learn).

  • Experience supporting LLM workflows (fine-tuning, prompt engineering, evaluation).

  • Familiarity with structured (SQL) and unstructured text data.

  • Understanding of data preparation for AI/ML systems.

Nice to Have
  • Exposure to RAG pipelines, embeddings, or evaluation metrics.

  • Experience with ML frameworks (PyTorch/TensorFlow) and Docker-based workflows.

  • Experience with CI/CD pipelines for ML systems.

  • Familiarity with vector databases (e.g., Chroma) and reranking techniques.

  • Research exposure to Transformer-based architectures.

Top Skills

Python, Pandas, NumPy, Scikit-learn, SQL, FastAPI, LLMs
