Databricks Data Engineer: Lakehouse Pipelines & PySpark

Perficient • Remote, Remote, Colombia • Posted May 26, 2026

Location Remote, Remote
Job Type Full-time
Category Bases de datos, analítica y BI
Posted May 26, 2026

Job Description

  • Design, build, and maintain end-to-end data pipelines for ingestion, transformation, and delivery of large‑scale data.
  • Develop and optimize data processing logic using PySpark on Databricks (Apache Spark).
  • Implement ETL/ELT pipelines integrating data from multiple structured and semi‑structured sources.
  • Contribute to the design and implementation of lakehouse architectures (Delta Lake, Medallion architecture).
  • Ensure data quality, reliability, performance, and observability across pipelines.
  • Optimize Spark jobs through partitioning, caching, and performance tuning techniques.
  • Collaborate with data architects, analysts, and business stakeholders to translate requirements into scalable data solutions.
  • Implement best practices in CI/CD, version control, and pipeline automation.
  • Support the evolution of modern data platforms and analytics capabilities.
  • Work with o...

Interested in this role?

Click the button below to start your application.

Apply Now