Site reliability engineer

1 week ago


Chihuahua City, Mexico RedHolt Full-time

Hiring: On-Site Support Engineer - Linear Video Distribution (AWS Cloud)
Location: Mexico or Colombia
Industry: Next-gen video distribution, cloud streaming, media technology

Are you passionate about live video delivery, cloud workflows, and supporting cutting-edge broadcast operations? A global media technology provider is seeking an On-Site Support Engineer to work directly within a major operator's facility, supporting a state-of-the-art AWS-based Linear Video Distribution Platform.

This role is perfect for someone who thrives in fast-paced operational environments, enjoys solving complex technical problems, and wants to be at the centre of a high-profile video delivery ecosystem.

The Opportunity

You'll be the primary on-site technical expert for a cloud-driven linear video distribution platform handling:

  • SRT ingest
  • Transcoding
  • Packaging (DASH/HLS)
  • DRM
  • Origin and CDN delivery
  • Session Management
  • Manifest Manipulation
  • Dynamic Ad Insertion (DAI)

Your work ensures seamless, high-quality live channel delivery to millions of viewers.

Key Responsibilities

  • Act as the on-site technical contact supporting the cloud-based video distribution platform.
  • Work closely with the operator's engineering and operations teams to maintain platform availability.
  • Validate SRT content feeds and ensure correct service configuration and channel alignment.
  • Support additions, changes, and removals of services, coordinating interfaces with back-office systems.
  • Perform configuration updates following strict change-management processes.
  • Monitor alarms, system health, and service KPIs.
  • Troubleshoot issues across ingest, encoding/transcoding, packaging, DRM, CDN, and DAI workflows, escalating when needed.
  • Document runbooks, operational processes, and lessons learned.
  • Collaborate with remote engineering, cloud ops, and product teams to resolve complex issues.
  • Provide technical insight for upgrades, new features, and ongoing architecture optimisation.
  • Assist with API-related queries for monitoring, dashboards, and integration into NMS systems.

Technical Expertise Needed

  • 5+ years working with video delivery systems in broadcast or OTT environments.
  • Strong experience in DevOps, Cloud Infrastructure, or SRE roles.
  • Knowledge of Docker, Kubernetes, and containerised environments.
  • Deep understanding of video workflows: ingest, encoding, packaging (DASH/HLS), DRM, CDN delivery, DAI.
  • Hands-on experience with AWS services (EC2, S3, CloudFront, CloudWatch, IAM, Lambda, etc.).
  • Proficiency in Linux, shell scripting, system logs, and root-cause analysis.
  • Solid networking fundamentals (TCP/IP, multicast/unicast, DNS, routing, load balancing).
  • Strong documentation and customer-facing communication skills.

Nice-to-Have Skills

  • Experience with CI/CD and observability tools (Grafana, Prometheus, ELK).
  • Exposure to DRM technologies (Widevine, PlayReady, FairPlay).
  • Understanding of DAI workflows.
  • AWS or cloud certifications (e.g., AWS Solutions Architect Associate).



  • Mexico City W3Global Full-time

    Site Reliability Engineer. Required qualifications: AWS experience; GitLab; Terraform or AWS CDK; Python; familiarity with Go; Linux OS administration and advanced scripting (Bash); Windows OS administration and advanced scripting (PowerShell). Seniority level: Entry level. Employment type: Full-time. Job function...


  • Site Reliability Engineer

    3 weeks ago


    Mexico City Royal Caribbean Group Full-time

    Talent Acquisition @Royal Caribbean Group Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to...


  • Chihuahua City, Mexico Sngular Full-time

    At Sngular we are looking for a Site Reliability Engineer with a real focus on operations, reliability, and monitoring of microservices platforms on Azure.


  • Chihuahua City, Mexico ITJ Full-time

    Position Overview: Our customer is revolutionizing the cancer diagnostics space and is now looking for another Site Reliability Engineer (SRE) to join its incredible team. SREs support our mission by pushing out new features and applications every day. The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy...


  • Chihuahua City, Mexico Randstad México Full-time

    About The Company: Randstad is the #1 HR Services Provider in the world, and we are hiring a Site Reliability Engineer to join our Nearshore Center at Randstad Mexico. This is your chance to join a dynamic, collaborative, and fast-paced environment where your expertise will make a real impact!

  • Site Reliability Engineer

    3 weeks ago


    Mexico City Tata Consultancy Services Full-time

    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations. What you’ll work with: Azure DevOps (pipelines, repositories, and automation); ServiceNow (incident, change, and problem management); AppDynamics (application performance monitoring and alerting); Microsoft Azure...


  • Site Reliability Engineer

    2 weeks ago


    Mexico City The Functionary Full-time

    Senior Site Reliability Engineer: We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high-capacity, and high-performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...