phData

Lead Data Engineer

Reposted 4 Days Ago
Remote
Hiring Remotely in India
Senior level
The Lead Data Engineer role involves designing and implementing data solutions, mentoring team engineers, developing end-to-end technical solutions, and ensuring performance and security. Experience with cloud platforms and programming is required, along with SQL expertise and client-facing communication skills.
The summary above was generated by AI

Join phData, a remote-first data and AI consultancy with employees across the United States, Latin America, and India. We partner with industry leaders, including Snowflake, AWS, Anthropic, Azure, GCP, Fivetran, Pinecone, Glean, and dbt, to solve the complex data and AI challenges that slow large enterprises.

We're growing fast, and we give our people real ownership over their work. We hire top performers and trust them to deliver results.

Why phData?

  • Snowflake Implementation Partner of the Year — 7 consecutive years, and 2026 Snowflake AI Partner of the Year
  • AWS Premier Tier Services Partner — the highest tier of recognition in the AWS Partner Network
  • 2025 Fivetran Partner of the Year (4th consecutive year)
  • 2025 dbt Labs Partner of the Year (3x winner) with Visionary partner status
  • 2026 KNIME Customer Excellence Partner of the Year
  • Preferred Partner in the Anthropic Claude Partner Network
  • #1 Partner in Snowflake Advanced Certifications
  • 600+ Expert Cloud Certifications (Sigma, AWS, Azure, Dataiku, and more)
  • Recognized as an award-winning workplace in the US, India and LATAM

Required Experience:

  • 8+ years as a hands-on Data Engineer designing and implementing data solutions
  • Experience as a team lead and/or mentoring other engineers
  • Ability to take end-to-end technical solutions into production and to help ensure performance, security, scalability, and robust data integration
  • Programming expertise in Java, Python, and/or Scala
  • Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
  • SQL and the ability to write, debug, and optimize SQL queries
  • Client-facing written and verbal communication skills and experience
  • Ability to create and deliver detailed presentations
  • Ability to produce detailed solution documentation (e.g., POCs and roadmaps, sequence diagrams, class hierarchies, logical system views)
  • 4-year Bachelor's degree in Computer Science or a related field

Preferred Experience (any of the following):

  • Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
  • Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, Elasticsearch/Solr, Cassandra or other NoSQL storage systems
  • Data integration technologies: Spark, Kafka, event/streaming, StreamSets, Matillion, Fivetran, NiFi, AWS Database Migration Service, Azure Data Factory, Informatica Intelligent Cloud Services (IICS), Google Dataproc or similar tools
  • Multiple data sources (e.g. queues, relational databases, files, search, API)
  • Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
  • Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines
  • Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi

Why phData? We Offer:

  • Remote-First Workplace
  • Medical Insurance for Self & Family
  • Medical Insurance for Parents
  • Term Life & Personal Accident
  • Wellness Allowance
  • Broadband Reimbursement
  • Continuous learning and growth opportunities to enhance your skills and expertise
  • Other benefits include paid certifications, professional development allowance, and bonuses for creating company-approved content

phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.
