How to avoid professional compromises? Choose once and for all to
ENTER SUPERPOSITION
Form of Employment: Contract of Employment
Location: Warszawa/ {Remote/Hybrid} work model
Have you ever asked yourself whether you'd rather work in a big-tech, software house or startup? At home or in the office? In international teams or polish? At T Hub, there's one answer - YES!
T Hub Poland is a dynamic, forward-thinking division in T-Mobile Poland, where technology matters! It’s a place where best talents converge to ideate, prototype, and implement cutting-edge solutions in international projects held in Deutsche Telekom Group.
Role Overview
We seek an AI Expert with deep expertise in designing, implementing, and optimizing Retrieval Augmented Generation (RAG) systems in on-premises environments. The ideal candidate will have hands-on experience with vLLM, liteLLM, and open-source LLMs like LLAMA 3.2, along with a proven ability to integrate these tools into scalable, secure, and high-performance enterprise workflows.
Network & Services International (NWI) develops, plans, builds and operates the international network infrastructure of DTAG and produces intercarrier and wholesale services for the sales units W-IC, B2B and IoT.
The Squad T-BDA is responsible for providing seamless AI and Automation solutions to DT internal customers which are integrated in a self-hosted environment.
What tasks await you?
Role Overview
We seek an AI Expert with deep expertise in designing, implementing, and optimizing Retrieval Augmented Generation (RAG) systems in on-premises environments. The ideal candidate will have hands-on experience with vLLM, liteLLM, and open-source LLMs like LLAMA 3.2, along with a proven ability to integrate these tools into scalable, secure, and high-performance enterprise workflows.
Network & Services International (NWI) develops, plans, builds and operates the international network infrastructure of DTAG and produces intercarrier and wholesale services for the sales units W-IC, B2B and IoT.
The Squad T-BDA is responsible for providing seamless AI and Automation solutions to DT internal customers which are integrated in a self-hosted environment.
RAG System Development:
- Architect and deploy end-to-end RAG pipelines, combining retrieval mechanisms (e.g., vector databases like Neo4j) with generative models (e.g., LLAMA) for enterprise use cases.
- Fine-tune and optimize retrieval models to ensure high accuracy and low latency in on-prem environments.
Model Integration & Deployment:
- Implement and customize inference servers using vLLM for efficient LLM serving and LiteLLM for lightweight model orchestration.
- Integrate open-source LLMs (e.g., LLAMA, Mistral) with proprietary data sources and APIs.
On-Prem Infrastructure Management:
- Design GPU-optimized, scalable infrastructure for LLM training and inference, ensuring compliance with security and data governance policies.
- Collaborate with DevOps teams to containerize workflows using Docker/Kubernetes and automate MLOps pipelines.
Performance Optimization:
- Apply techniques like quantization, pruning, and dynamic batching to maximize resource efficiency in resource-constrained on-prem setups.
- Monitor system performance, troubleshoot bottlenecks, and ensure high availability.
Cross-Functional Collaboration:
- Partner with data engineers to curate and preprocess domain-specific datasets for retrieval and generation tasks.
- Translate business requirements into technical solutions for stakeholders in telco environments.
What skills will be appreciated?
Education
- Bachelor’s/Master’s/PhD in Computer Science, AI, or related field
Experience:
- 3+ years in ML/AI roles, with 2+ years focused on RAG systems.
- Proven experience deploying LLMs in on-prem or hybrid environments.
- Proficiency with vLLM, LiteLLM, and open-source LLMs (e.g., LLAMA, Deepseek, Mistral).
- Experience in introducing AI Agents/Assistants
Technical Skills:
- Strong Python expertise with frameworks like PyTorch, Hugging Face Transformers, and LangChain.
- Experience with vector/graph databases (e.g. Neo4j).
- Familiarity with Linux-based systems and RedHat OpenShift
Soft Skills:
- Ability to communicate complex AI concepts to non-technical stakeholders.
- Strong problem-solving skills and adaptability in fast-paced environments.
Our offer for you
Working at T Hub will offer you an unique and highly rewarding experience on IT market. As a leader in the telecommunications industry, we do not only provide a platform to hone your technical skills but also empower you to be a catalyst for innovation.
You'll have the opportunity to work at the forefront of modern technologies, from 5G to IoT and AI, shaping the future of connectivity.
Benefits
- No dress code
- you can just be yourself here
- Medical,
sport and life insurance
packages at preferential terms
- Access to: Percipio, Coursera, Rodos learning platforms
- Access to our products and
services at preferential terms
- Employment contract-based cooperation
- Know Talent - receive training
or financial bonus for
recommending new
employees
What will your recruitment process be like?
A fair approach to all people who want to join T Hub means that:
- The recruitment process is transparent;
- Our recruitment decision is based solely on an assessment of your skills (your race, skin color, sexual orientation, gender identity, origin, disability, political view, appearance, or religion will not have any influence on yhe outcome of the process);
- Regardless of the outcome of the process, you will get detailed feedback.
Who are we?
We're one of the four European technological hubs of the Deutsche Telekom group, to which T-Mobile belongs. Thanks to this, you have the opportunity to work on global projects as well as collaborate with business partners from various industries. Whether it's automotive, security, banking, or countless other options - be sure to explore them all!