Junior Data Engineer (Databricks)

Addepto
Warsaw, Masovian Voivodeship
Full time
2 days ago

Addepto is a leading consulting and technology company specializing in AI and Big Data, helping clients deliver innovative data projects. We partner with top-tier global enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. Our exclusive focus on AI and Big Data has earned us recognition by Forbes as one of the top 10 AI consulting companies.


As a Junior Data Engineer, you will work with a team of technology experts on challenging projects across various industries, using cutting-edge technologies. Here are some of the projects for which we are seeking talented individuals:

  • Design and development of a universal data platform for global aerospace companies. This Azure- and Databricks-powered initiative combines diverse enterprise and public data sources. The data platform is in the early stages of development, so the work covers the design of architecture and processes and leaves freedom in technology selection.

  • Data platform transformation for an energy management association. This project has addressed critical data management challenges, boosting user adoption, performance, and data integrity. The team is implementing a comprehensive data catalog, leveraging Databricks and Apache Spark/PySpark, for simplified data access and governance. Secure integration solutions and enhanced data quality monitoring, using Delta Live Tables tests, have established trust in the platform. The intermediate result is a user-friendly, secure, and data-driven platform that serves as a basis for further development of ML components.

  • Design of data transformation and the subsequent DataOps pipelines for a global car manufacturer. This project aims to build a data processing system for both real-time streaming and batch data. We’ll handle data for business uses like process monitoring, analysis, and reporting, while also exploring LLMs for chatbots and data analysis. Key tasks include data cleaning, normalization, and optimizing the data model for performance and accuracy.


Your main responsibilities:

  • Design scalable data processing pipelines for streaming and batch processing using Big Data technologies like Databricks, Airflow and/or Dagster.

  • Contribute to the development of CI/CD and MLOps processes.

  • Develop applications to aggregate, process, and analyze data from diverse sources.

  • Collaborate with the Data Science team on Machine Learning projects, including text/image analysis and predictive model building.

  • Develop and organize data transformations using Databricks/DBT and Apache Airflow.

  • Translate business requirements into technical solutions and ensure optimal performance and quality.


What you'll need to succeed in this role:

  • At least 1 year of proven commercial experience developing or maintaining Big Data systems.

  • Hands-on experience with Big Data technologies, including Databricks, Apache Spark, Airflow, and DBT.

  • Strong programming skills in Python: writing clean code and applying OOP design.

  • Experience in designing and implementing data governance and data management processes.

  • Experience implementing and deploying solutions in cloud environments (with a preference for Azure).

  • Practical knowledge of DevOps practices, including designing and maintaining CI/CD pipelines for data and ML workflows, and Terraform for Infrastructure as Code.

  • Knowledge of how to build and deploy Power BI reports and dashboards for data visualization.

  • Excellent understanding of dimensional data and data modeling techniques.

  • Excellent communication skills and consulting experience involving direct interaction with clients.

  • Ability to work independently and take ownership of project deliverables.

  • Bachelor’s or Master’s degree in Computer Science, Data Science, Mathematics, Physics, or a related field.


Discover our perks & benefits:

  • Work in a supportive team of passionate enthusiasts of AI & Big Data.

  • Engage with top-tier global enterprises and cutting-edge startups on international projects.

  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces.

  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications.

  • Choose from various employment options: B2B, employment contracts, or contracts of mandate.

  • Make use of 20 fully paid days off available for B2B contractors and individuals under contracts of mandate.

  • Participate in team-building events and utilize the integration budget.

  • Celebrate work anniversaries, birthdays, and milestones.

  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching.

  • Get full work equipment for optimal productivity, including a laptop and other necessary devices.

  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups.

  • Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture.
