About Algotale
At Algotale, we believe in the transformative potential of data to reshape industries, drive innovation, and create unparalleled value. Established in 2020 by a team of passionate and visionary professionals, Algotale set out to redefine the landscape of IT services and consulting by integrating a data-centric approach into every solution we design.
Our journey began with a focus on creating customized, data-driven solutions that directly address the unique challenges and ambitions of our clients. Over the years, our dedication to excellence has allowed us to grow into a trusted partner for businesses across industries, supporting them through every stage of digital transformation. Today, Algotale stands as a prominent player in the IT consulting world, known for our agility, innovation, and commitment to delivering results.
With a team of over 500 skilled professionals, we specialize in IT services, staffing, and consulting, empowering our clients to unlock the full potential of their data. Our expertise extends across data analytics, artificial intelligence, machine learning, and cutting-edge cloud solutions. By leveraging these technologies, we help organizations build resilient digital infrastructures, optimize operations, and achieve strategic growth.
As the digital landscape evolves, so does our commitment to pushing boundaries, embracing new challenges, and exceeding client expectations. At Algotale, we don’t just provide solutions; we forge partnerships built on trust, insight, and a shared drive to thrive in an increasingly data-driven world.
Website: www.algotale.com
Industry: IT Services and Consulting
Company Size: 501–1,000 employees
Founded: 2020
Specialties: IT Services, IT Staffing, IT Consulting
Location: Mumbai
Experience: 4+ Years
We are looking for a skilled Data Engineer with 4+ years of experience to design, build, and optimize scalable data pipelines and data architectures. The ideal candidate should have strong expertise in big data technologies, data processing frameworks, and workflow orchestration tools.
Key Responsibilities:- Design, develop, and maintain scalable and efficient data pipelines using PySpark.
- Build and manage workflows using Apache Airflow for scheduling and orchestration.
- Work extensively with SQL for data extraction, transformation, and analysis.
- Develop and maintain data solutions on Cloudera Hadoop ecosystem (HDFS, Hive, Impala).
- Optimize data processing jobs for performance and scalability.
- Collaborate with data scientists, analysts, and business teams to deliver data solutions.
- Ensure data quality, integrity, and security across data pipelines.
- Troubleshoot and resolve data-related issues in a timely manner.
- Implement best practices for data engineering, including version control and CI/CD.
- Strong hands-on experience with PySpark and distributed data processing.
- Proficiency in Apache Airflow for workflow orchestration.
- Advanced knowledge of SQL.
- Experience with Cloudera ecosystem, including:
- Hadoop (HDFS)
- Hive
- Impala
- Understanding of data warehousing concepts and ETL processes.
- Experience working with large-scale structured and unstructured datasets.
- Experience with cloud platforms (AWS/Azure/GCP).
- Knowledge of Kafka or other streaming technologies.
- Familiarity with CI/CD pipelines and DevOps practices.
- Experience with Python for data processing beyond PySpark.
- Bachelor’s/Master’s degree in Computer Science, Engineering, or related field.
- 4+ years of relevant experience in data engineering or big data technologies.
- Strong problem-solving and analytical skills.
- Good communication and collaboration abilities.
- Ability to work in an agile and fast-paced environment.
- Opportunity to work on large-scale data platforms.
- Collaborative and innovation-driven work environment.
- Career growth and learning opportunities in big data technologies.



