Lead training and fine-tuning workflows for Diffusion Transformer models, focusing on both image and video generation capabilities.
We are looking for an Engineer to build the training infrastructure, data pipelines, and inference optimization systems for state-of-the-art Diffusion Transformer (DiT) models. This role focuses on scaling the fine-tuning and deployment of models like Qwen, Wan, and LTX-2.
Key Responsibilities
- Training Infrastructure: Design and maintain scalable pipelines for training and fine-tuning Diffusion Transformer models on large-scale GPU clusters.
- Model Optimization: Optimize the inference performance of Wan, LTX-2, and Qwen (Vision) using quantization, pruning, and hardware-aware tuning (e.g., TensorRT, FlashAttention).
- Data Engineering: Develop efficient ingestion and preprocessing pipelines for high-resolution image and video datasets used in generative tasks.
- Capability Expansion: Implement engineering workflows that allow researchers to rapidly fine-tune and expand the capabilities of open-weights diffusion models.
- Production Deployment: Transition experimental fine-tuned models into reliable, low-latency production services.
- Resource Management: optimize distributed training jobs (FSDP, DeepSpeed) to maximize GPU utilization and minimize costs.
Required Qualifications
- Min 2 years of experience in Machine Learning Engineering with a focus on generative models.
- Core Tech: Strong proficiency in PyTorch, JAX, and distributed training frameworks.
- Model Expertise: Hands-on experience deploying or fine-tuning Diffusion Transformers (DiT) and specifically Qwen (Image), Wan, or LTX-2.
- Architecture: Deep understanding of Transformer-based diffusion backbones and flow matching (removing legacy reliance on CNNs/RNNs).
- Tooling: Proficiency in Python and modern ML ecosystem tools (e.g., Hugging Face, Diffusers, FFmpeg for video processing).
- Compute: Experience debugging and optimizing workloads in multi-node GPU environments.
Preferred Qualifications
- Inference Optimization: Experience with techniques like KV-caching, compile-time optimizations, or kernel fusion for transformers.
- MLOps: Familiarity with experiment tracking (W&B) and model versioning tools in a generative media context.
- Streaming: Experience handling real-time video generation or streaming inference pipelines.
- Open Source: Contributions to libraries like diffusers or active experimentation with the latest open-source DiT implementations.
Top Skills
Deepspeed
Diffusion Transformers
Fsdp
Jax
Ltx-2
PyTorch
Qwen
Wan
Lexlegis.ai Mumbai, Maharashtra, IND Office
Mumbai, Maharashtra, India, 400021
Similar Jobs
Enterprise Web • Fintech • Financial Services
Lead engineering efforts in enterprise automation for Sales and Finance technologies, build and coach teams, ensuring robust integrations and high-quality performance.
Top Skills:
AWSBillingCatalystCpqDockerErpGainsightGCPJavaKafkaKubernetesOpenapiPythonRestSalesforceSnowflakeSQLWorkato
Enterprise Web • Fintech • Financial Services
The Lead Security Analyst supports application security automation by integrating analysis tools, helping with vulnerability remediation, and training technical staff.
Top Skills:
.NetBrightsecJavaJavaScriptJenkinsPHPRubySemgrepWaf
Fintech • Information Technology • Financial Services
Support investment research by building data workflows and AI-assisted capabilities, collaborating with research teams to deliver functional data solutions.
Top Skills:
AirflowBigQueryGcsKubernetesPower BIPythonSnowflakeSQLTableau
What you need to know about the Mumbai Tech Scene
From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.


