For more than two decades, Telestream has been at the forefront of innovation in the digital video industry, pioneering file-based video transcoding and high-quality media exchange over IP networks. Telestream provides world-class live and on-demand digital video tools and workflow solutions that allow businesses and consumers to transform video on the desktop and across the enterprise. Many of the world's most demanding media and entertainment companies, such as CBS, BBC, CNN, FOX, CBC, Comcast, DirecTV, Time Warner, MTV, Discovery, and Lifetime, as well as a growing number of users in a broad range of business environments, rely on Telestream products to streamline operations, reach broader audiences, and generate more revenue from their media. If you're looking for an industry leader in the high-growth area of video, Telestream is for you.
Site Reliability Engineer – Krakow, Poland (Hybrid)
Key Responsibilities:
- Design, implement, and maintain infrastructure on AWS and Google Cloud Platform using Terraform and Terragrunt.
- Manage Kubernetes clusters (K8s, Helm) for scalable and reliable service delivery.
- Automate configuration management and provisioning using Ansible and related tools.
- Build and optimize CI/CD pipelines for application deployment and infrastructure updates.
- Implement and maintain secrets management with SOPS and secure credential handling.
- Monitor, troubleshoot, and improve system reliability, performance, and scalability.
- Develop operational tooling to streamline deployments, monitoring, and incident response.
- Collaborate with development teams to ensure reliability and observability are built into applications.
- Implement best practices for infrastructure security, compliance, and cost optimization.
- Participate in the on-call rotation to support critical systems and respond to incidents.
Required Skills & Experience:
- Strong hands-on experience with AWS and/or GCP cloud services.
- Proficiency in Terraform, Terragrunt, and Git-based workflows.
- Experience with Kubernetes (K8s) administration and Helm chart management.
- Solid Linux system administration skills.
- Expertise in automation tools such as Ansible.
- Experience with CI/CD pipelines (GitHub Actions, Bitbucket Pipelines, or similar).
- Knowledge of SOPS or other secrets management solutions.
- Strong understanding of networking, security best practices, and monitoring tools.
- Familiarity with observability stacks (Elasticsearch).
- Strong problem-solving, debugging, and incident response skills.
Nice-to-Have Skills:
- Programming/scripting skills in Python and Bash.
- Experience with multi-cloud architecture and hybrid deployments.
- Background in cost optimization and capacity planning.
- Exposure to compliance frameworks (SOC 2, CIS).