Senior Data Engineer (Databricks/Spark/PySpark)

Exadel
Warsaw, Masovian Voivodeship
4 weeks ago

We are looking for a Senior Data Engineer with strong experience in building and optimizing data pipelines using Databricks, Apache Spark, and PySpark. The ideal candidate is passionate about data architecture, performance optimization, and working with high-scale distributed data systems.

You will play a key role in designing and developing scalable data ingestion, transformation, and processing pipelines, enabling reliable and timely data for downstream analytics, reporting, and machine learning.

Why Join Exadel

We're an AI-first global tech company with 25+ years of engineering leadership, 2,000+ team members, and 500+ active projects powering Fortune 500 clients, including HBO, Microsoft, Google, and Starbucks.

From AI platforms to digital transformation, we partner with enterprise leaders to build what's next.

What powers it all? Our people: ambitious, collaborative, and constantly evolving.

About the Client

The client is the world's largest human resources consulting firm, headquartered in New York City with main branches in 40+ countries. Its more than 20,500 employees operate in over 130 countries, and 97% of Fortune 500 companies use its services.

What You'll Do

  • Design, develop, and maintain scalable and efficient data pipelines using Databricks, Apache Spark, and PySpark
  • Collaborate with data scientists, analysts, and product teams to understand data requirements and ensure reliable data delivery
  • Implement ETL/ELT workflows to extract, cleanse, transform, and load data from various structured and unstructured sources
  • Optimize Spark jobs and workflows for performance, scalability, and cost-efficiency
  • Develop reusable components, frameworks, and libraries to accelerate pipeline development
  • Monitor data quality and pipeline health; implement data validation and error-handling mechanisms
  • Ensure compliance with security, privacy, and governance policies
  • Contribute to best practices in data engineering and cloud-native data architecture

What You Bring

  • 3–6+ years of experience in data engineering or software engineering with a focus on large-scale data processing
  • Strong hands-on experience with Apache Spark and PySpark
  • Proficiency with the Databricks platform (including notebooks, jobs, clusters, and workspace management)
  • Solid knowledge of data formats (Parquet, Avro, JSON, etc.) and data modeling concepts
  • Experience building and orchestrating ETL/ELT pipelines (e.g., using Airflow, Databricks Workflows, Azure Data Factory, etc.)
  • Familiarity with cloud platforms (Azure, AWS, or GCP) and their data services
  • Strong programming skills in Python; SQL expertise is a must
  • Understanding of CI/CD practices and version control (Git)
  • Ability to work in Agile development environments and collaborate with cross-functional teams

Nice to have

  • Experience with Delta Lake or other transactional data lake technologies
  • Familiarity with data lakehouse architecture
  • Exposure to data warehousing tools and MPP databases (Snowflake, Redshift, BigQuery, etc.)
  • Knowledge of data governance, lineage, and cataloging tools (e.g., Unity Catalog, DataHub, Collibra)
  • Experience with streaming data (Kafka, Spark Structured Streaming)

English level

Upper-Intermediate

Legal & Hiring Information

  • Exadel is proud to be an Equal Opportunity Employer committed to inclusion across minority, gender identity, sexual orientation, disability, age, and more
  • Reasonable accommodations are available to enable individuals with disabilities to perform essential functions
  • Please note: this job description is not exhaustive. Duties and responsibilities may evolve based on business needs