Bounteous Logo

Bounteous

Databricks Solution Architect

Posted 3 Hours Ago
Be an Early Applicant
Remote
Hiring Remotely in Canada
Senior level
Remote
Hiring Remotely in Canada
Senior level
Lead architecture, build, and scale a Databricks lakehouse: design batch/streaming pipelines, enforce governance with Unity Catalog, optimize Spark workloads, operationalize ML (MLflow), manage cloud/IaC, mentor engineers, and partner with stakeholders on roadmap and security/compliance.
The summary above was generated by AI
Bounteous is a premier end-to-end digital transformation consultancy dedicated to partnering with ambitious brands to create digital solutions for today’s complex challenges and tomorrow’s opportunities. With uncompromising standards for technical and domain expertise, we deliver innovative and strategic solutions in Strategy, Analytics, Digital Engineering, Cloud, Data & AI, Experience Design, and Marketing.

Our Co-Innovation methodology is a unique engagement model designed to align interests and accelerate value creation. Our clients worldwide benefit from the skills and expertise of over 4,000+ expert team members across the Americas, APAC, and EMEA. By partnering with leading technology providers, we craft transformative digital experiences that enhance customer engagement and drive business success.

We are seeking a Lead Databricks Engineer/Architect to design, build, and scale our cloud-based lakehouse platform. In this role, you will own the end-to-end architecture of our data ecosystem on Databricks, partner with data science and analytics teams to productionize ML and analytical workloads, and set the technical direction for ingestion, transformation, governance, and performance optimization across petabyte-scale datasets. You will be a hands-on technical leader: writing production code, mentoring engineers, and shaping standards that the broader data organization will adopt. 

Information Security Responsibilities

  • Promote and enforce awareness of key information security practices, including acceptable use of information assets, malware protection, and password security protocols
  • Identify, assess, and report security risks, focusing on how these risks impact the confidentiality, integrity, and availability of information assets
  • Understand and evaluate how data is stored, processed, or transmitted, ensuring compliance with data privacy and protection standards (GDPR, CCPA, etc.)
  • Ensure data protection measures are integrated throughout the information lifecycle to safeguard sensitive information

Role and Responsibilities

  • Architect and lead the implementation of an enterprise lakehouse on Databricks (Delta Lake, Unity Catalog, Photon, Workflows) across one or more major clouds (AWS, Azure, or GCP).
  • Design scalable batch and streaming data pipelines using PySpark, Spark SQL, Structured Streaming, and Delta Live Tables; establish patterns for ingestion from operational systems, event streams, and third-party APIs.
  • Define and enforce platform standards for data modeling (medallion architecture), CI/CD, code quality, testing, observability, and cost optimization.
  • Lead the governance strategy using Unity Catalog — fine-grained access control, data lineage, audit, and PII handling — in partnership with security and compliance.
  • Optimize Spark workloads for performance and cost: cluster sizing, Photon, autoscaling, file layout, Z-ordering, caching, and query tuning.
  • Partner with ML engineers and data scientists to operationalize models using MLflow, feature stores, and model serving on Databricks.
  • Own the cloud infrastructure footprint for the platform: networking, IAM, secrets, encryption, and Terraform/IaC for Databricks workspaces and supporting services.
  • Mentor a team of data engineers; lead architecture reviews, code reviews, and technical design sessions; raise the bar on engineering practices.
  • Engage with stakeholders across analytics, product, and finance to translate business needs into a roadmap for the data platform. 

Preferred Qualifications

  • 8+ years of data engineering experience, with 4+ years building production workloads on Databricks.
  • Deep expertise in Apache Spark (PySpark and Spark SQL) — including performance tuning, partitioning strategy, and the Catalyst/Photon execution model.
  • Strong hands-on experience with Delta Lake, Unity Catalog, Databricks Workflows, and Delta Live Tables.
  • Production experience on at least one major cloud (AWS, Azure, or GCP), including networking, IAM, storage (S3/ADLS/GCS), and compute primitives.
  • Proficiency in Python and SQL; comfort with Scala is a plus.
  • Experience designing medallion (bronze/silver/gold) architectures and dimensional models for analytics.
  • Strong CI/CD and DevOps practice: Git, Terraform, Databricks Asset Bundles or dbx, automated testing of data pipelines.
  • Track record of leading technical projects end-to-end and mentoring engineers.
  • Excellent written and verbal communication; able to drive alignment with both engineering and business stakeholders. 

We invite you to stay connected with us by subscribing to our monthly job openings alert here.

Bounteous is proud to be an equal opportunity employer. Bounteous does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, physical or mental disability, national origin, veteran status, or any other status protected under federal, state, or local law. Bounteous is willing to sponsor eligible candidates for employment visas. 

For employment opportunities based in Canada:
Bounteous is an equal opportunity employer. In accordance with the Ontario Human Rights Code and Accessibility for Ontarians with Disabilities Act, 2005, accommodation will be provided at any point throughout the hiring process, provided the candidate makes their accommodation needs known to Bounteous. We welcome applications from all qualified candidates. 

*Must be legally eligible to work in Canada. 

#LI-Remote

Similar Jobs at Bounteous

12 Days Ago
In-Office or Remote
Senior level
Senior level
Artificial Intelligence • Information Technology • Professional Services • Software • Analytics • Generative AI • Big Data Analytics
Lead client-facing data solutioning and pre-sales within Consumer verticals, design and translate architectures into delivery, maintain ~50% billable involvement, drive pipeline growth and close complex data opportunities while ensuring information security and privacy compliance.
Top Skills: AWSAzureDatabricksGCPSnowflake
17 Days Ago
Remote
Mid level
Mid level
Artificial Intelligence • Information Technology • Professional Services • Software • Analytics • Generative AI • Big Data Analytics
The Personalization Manager will drive testing and personalization projects, develop strategies aligned with business objectives, oversee CRO platforms, and ensure effective audience management for marketing campaigns.
Top Skills: Adobe Experience PlatformAdobe TargetAPIsCdpsMarketing AutomationOmni-Channel Campaign ManagementTag Management Systems

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account