We’re looking for highly motivated, passionate site reliability engineers to join our growing team. At evertz.io, our teams are building services that are used by the biggest names in the exciting broadcast and media industry. Our services are hosted in AWS, with a Serverless First mindset.
As part of this role you will work with our talented teams to help harden our multi-tenant SaaS platform. Using best in class observability tooling, you will be working to debug incidents, while also identifying and implementing improvements to the platform to ensure its continued reliability. Your drive to eliminate toil will see you automating processes and building the tools to do so.
We offer flexible working hours, great benefits, and the freedom to experiment with new technologies and tools to build better products.
Skills and Experience you will bring:
- At least 3 years of hands-on experience managing critical, high-availability production infrastructure, demonstrating success in maintaining reliability and maximizing application uptime.
- Proficient in at least one programming language (such as Python, Java, or Rust), with experience designing and building production-quality automation, tools, or software libraries.
- At least 3 years working with monitoring, log aggregation, and observability platforms such as Datadog, CloudWatch, Honeycomb, Splunk, or New Relic, using data-driven insights to proactively identify and resolve issues.
- Excellent analytical skills with the ability to understand end-to-end use cases, map system flows, debug complex issues, and anticipate potential failure points.
- Proven track record translating SLO’s and SLI’s into actionable improvements. Reliability, monitoring, and observability are not just words to you.
- At least 3 years of experience with cloud technologies, in particular AWS Services and tools such as Cloud Formation, Lambda, DynamoDB, SQS, SNS, EC2, S3, AWS CLI, Boto3.
- Solid foundation in Linux systems administration, networking, and security.
- Familiarity with the use and configuration of CI & CD pipelines such as Jenkins & AWS CodePipeline.
Additional skills and experience that will make you standout:
- Experience architecting and deploying serverless applications in cloud environments.
- Experience with infrastructure-as-code tools like Terraform or CloudFormation, enabling reproducible and scalable environments.
- Previous participation in production on-call rotations, with direct involvement in incident management and post-incident reviews.
- Demonstrated expertise in performance optimization for core AWS services, including Lambda, DynamoDB, API Gateway, SQS, EventBridge, and EC2.
- Experience supporting and improving systems with frequent, high-velocity deployment cycles.
- Familiarity with security compliance frameworks (e.g., OWASP, ISO, CSA, PCI), and hands-on experience conducting threat assessments and implementing remediation plans.
- Background in security practices, including penetration testing, threat modeling, and usage of both open-source and commercial security tools.
- Experience developing and implementing advanced deployment strategies for web application infrastructures—such as canary, A/B testing, blue/green deployments, or red/line patterns.
- Hands-on experience with chaos engineering—intentionally testing systems under extreme conditions to improve reliability and fault tolerance.
- Track record of championing system reliability, continuous improvement, and operational excellence throughout an organization.
Recruitment process:
- Screening with recruiter (45 min)
- Technical interview with Hiring Manager (60 min)
Please note, this email address will only respond to requests regarding privacy concerns. This inbox will not respond to job applications, resumes, or questions regarding an application. When you apply to a job on this site, the personal data contained in your application will be collected by Evertz Microsystems Ltd (“Controller”), which is located at 5292 John Lucas Drive, Burlington, Ontario, Canada and can be contacted by emailing [email protected]. Controller’s data protection officer is Nadiera Toolsieram, who can be contacted at [email protected]. Your personal data will be processed for the purposes of managing Controller’s and its' subsidiaries' and affiliates' recruitment related activities, which include setting up and conducting interviews and tests for applicants, evaluating and assessing the results thereto, and as is otherwise needed in the recruitment and hiring processes. Such processing is legally permissible under Art. 6(1)(f) of Regulation (EU) 2016/679 (General Data Protection Regulation) as necessary for the purposes of the legitimate interests pursued by the Controller, which are the solicitation, evaluation, and selection of applicants for employment.
A complete privacy policy can be found at https://evertz.com/contact/privacy/
Your personal data will be retained by Controller as long as Controller determines it is necessary to evaluate your application for employment. Under the GDPR, you have the right to request access to your personal data, to request that your personal data be rectified or erased, and to request that processing of your personal data be restricted. You also have to right to data portability. In addition, you may lodge a complaint with an EU supervisory authority.
HBEIfAaRY1