📍 Location: Mumbai
🏢 Company: Algotale
🧑💻 Experience: 3+ Years
Algotale is a technology-driven organization focused on building scalable data solutions, cloud-native architectures, and advanced analytics platforms. We help businesses transform data into actionable insights through modern data engineering practices.
Role OverviewWe are looking for a skilled Data Engineer with strong expertise in AWS/GCP, SQL, and PySpark to design, build, and optimize scalable data pipelines and cloud-based data platforms. The ideal candidate should have hands-on experience in distributed data processing and cloud environments.
Key ResponsibilitiesDesign, develop, and maintain scalable ETL/ELT pipelines
Build and optimize data architectures on AWS and/or GCP
Develop data processing frameworks using PySpark
Write complex and optimized SQL queries for large datasets
Work with structured and unstructured data from multiple sources
Implement data quality, validation, and governance frameworks
Collaborate with data analysts, data scientists, and cross-functional teams
Monitor and troubleshoot production data systems
Ensure best practices in performance tuning and cost optimization in cloud environments
3+ years of experience in Data Engineering
Strong hands-on experience with AWS (S3, Redshift, Glue, EMR, Lambda) or GCP (BigQuery, Dataflow, Cloud Storage, Dataproc)
Strong proficiency in SQL
Hands-on experience with PySpark
Experience with data warehousing concepts and dimensional modeling
Familiarity with workflow orchestration tools (e.g., Airflow)
Understanding of distributed data processing systems
Experience with version control tools like Git
Experience with real-time data processing (Kafka or similar tools)
Knowledge of CI/CD pipelines
Exposure to Infrastructure as Code (Terraform, CloudFormation)
Basic understanding of machine learning pipelines



