Datavail Logo

Datavail

Databricks-T3

Posted 13 Days Ago
Be an Early Applicant
Hybrid
Mumbai, Maharashtra, IND
Mid level
Hybrid
Mumbai, Maharashtra, IND
Mid level
The Technical Specialist will build ETL/ELT pipelines using Databricks, optimize data workflows, integrate multiple data sources, and maintain data integrity and security.
The summary above was generated by AI

Title: Technical Specialist

Location: Mumbai

Education: Bachelor’s Degree

Job Description:

  • Build scalable ETL/ELT pipelines using Databricks (PySpark, SQL, Spark Streaming).
  • Develop and optimize Delta Lake tables, ACID transactions, schema evolution, and time travel.
  • Implement Unity Catalog, data governance, and access control.Optimize cluster configurations, job workflows, and performance tuning in Databricks.
  • Design and implement batch and streaming pipelines using Spark Structured Streaming.
  • Integrate Databricks with multiple data sources (RDBMS, APIs, cloud storage, message queues).Develop reusable, modular, and automated data processing frameworks.
  • Implement CI/CD pipelines for Databricks using GitHub Actions / Azure DevOps / GitLab.Automate cluster management and job orchestration using Databricks REST APIs.
  • Maintain code quality, unit tests, and documentation. 
  • Write and optimize complex SQL queries and statements to ensure high performance and efficient data retrieval.
  • Strong database design including normalization, data modelling, and relational schema creation.
  • Conduct performance analysis, troubleshoot database issues like slow queries or deadlocks and implement solutions
  • Design and implement database structures, including tables, schemas, views, stored procedures, functions, and triggers.
  • Optimize database performance through query tuning, indexing, and performance analysis.
  • Ensure data integrity, security, and compliance standards
  • Need strong Python skills combined with expertise in Apache Spark for large scale data processing. Core abilities include building efficient ETL pipelines, optimizing distributed jobs, and handling large-scale data transformations
  • Expertise in Python programming, Spark APIs, and parallel processing.
  • Proficiency in Python (including Pandas, NumPy) for data manipulation and scripting
  • Deep knowledge of PySpark APIs like DataFrames, RDDs, Spark SQL for querying and processing.
  • Familiarity with RESTful APIs, batch processing, CI/CD, and monitoring data jobs.
  • Optimize Spark jobs for performance, troubleshoot issues, and ensure data quality across systems.
  • Collaborate with data engineers and scientists to implement workflows, conduct code reviews, and integrate with cloud platforms like AWS or Azure.
  • Design, develop, and maintain scalable data pipelines and ETL processes using Azure Databricks
  • Build data transformation workflows using Python or Scala.
  • Work with data lakes using Delta Lake.
  • Integrate data from multiple sources such as APIs, databases, and cloud storage.
  • Monitor and optimize data workflows for performance and reliability.
  • Collaborate with data scientists, analysts, and business teams

About UsDatavail is a leading provider of data management, application development, analytics, and cloud services, with more than 1,000 professionals helping clients build and manage applications and data via a world-class tech-enabled delivery platform and software solutions across all leading technologies. For more than 17 years, Datavail has worked with thousands of companies spanning different industries and sizes, and is an AWS Advanced Tier Consulting Partner, a Microsoft Solutions Partner for Data & AI and Digital & App Innovation (Azure), an Oracle Partner, and a MySQL Partner.

Top Skills

AWS
Azure
Azure Devops
Databricks
Delta Lake
Github Actions
Gitlab
Numpy
Pandas
Pyspark
Python
Restful Apis
Scala
Spark Apis
Spark Streaming
SQL
Unity Catalog

Similar Jobs

An Hour Ago
Hybrid
Junior
Junior
Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
The Junior PLM Developer will participate in requirement meetings, implement solutions with Teamcenter, configure integrations, and manage access control.
Top Skills: AwcBmideCatiaDevOpsGit HubItk UtilitiesNxSvnTeamcenter
An Hour Ago
Hybrid
Mid level
Mid level
Automotive • Hardware • Robotics • Software • Transportation • Manufacturing
The IT Security Analyst ensures the protection of systems and data from cyber threats through monitoring, incident response, and compliance support.
Top Skills: CrowdstrikeDlpGdprIso 27001It SecurityNistSoc 2SoxTisax
Senior level
Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
Manage Salesforce Health Cloud projects, lead delivery, develop capabilities, mentor teams, and design customized solutions for healthcare organizations.
Top Skills: ApexAppexchangeLightningMulesoftSalesforceSalesforce DxSalesforce Health CloudVisual Force

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account