Site Reliability Engineer

7 days ago


Mexico City RedHolt Full-time

Hiring: On-Site Support Engineer - Linear Video Distribution (AWS Cloud)
Location: Mexico or Colombia
Industry: Next-gen video distribution, cloud streaming, media technology

Are you passionate about live video delivery, cloud workflows, and supporting cutting-edge broadcast operations? A global media technology provider is seeking an On-Site Support Engineer to work directly within a major operator's facility, supporting a state-of-the-art AWS-based Linear Video Distribution Platform. This role is perfect for someone who thrives in fast-paced operational environments, enjoys solving complex technical problems, and wants to be at the centre of a high-profile video delivery ecosystem.

The Opportunity

You'll be the primary on-site technical expert for a cloud-driven linear video distribution platform handling SRT ingest, transcoding, packaging (DASH/HLS), DRM, origin and CDN delivery, session management, manifest manipulation, and Dynamic Ad Insertion (DAI). Your work ensures seamless, high-quality live channel delivery to millions of viewers.

Key Responsibilities

Act as the on-site technical contact supporting the cloud-based video distribution platform.
Work closely with the operator's engineering and operations teams to maintain platform availability.
Validate SRT content feeds and ensure correct service configuration and channel alignment.
Support additions, changes, and removals of services, coordinating interfaces with back-office systems.
Perform configuration updates following strict change-management processes.
Monitor alarms, system health, and service KPIs.
Troubleshoot issues across ingest, encoding/transcoding, packaging, DRM, CDN, and DAI workflows, escalating when needed.
Document runbooks, operational processes, and lessons learned.
Collaborate with remote engineering, cloud ops, and product teams to resolve complex issues.
Provide technical insight for upgrades, new features, and ongoing architecture optimisation.
Assist with API-related queries for monitoring, dashboards, and integration into NMS systems.

Technical Expertise Needed

5+ years working with video delivery systems in broadcast or OTT environments.
Strong experience in DevOps, Cloud Infrastructure, or SRE roles.
Knowledge of Docker, Kubernetes, and containerised environments.
Deep understanding of video workflows: ingest, encoding, packaging (DASH/HLS), DRM, CDN delivery, DAI.
Hands-on experience with AWS services (EC2, S3, CloudFront, CloudWatch, IAM, Lambda, etc.).
Proficiency in Linux, shell scripting, system logs, and root-cause analysis.
Solid networking fundamentals (TCP/IP, multicast/unicast, DNS, routing, load balancing).
Strong documentation and customer-facing communication skills.

Nice-to-Have Skills

Experience with CI/CD and observability tools (Grafana, Prometheus, ELK).
Exposure to DRM technologies (Widevine, PlayReady, FairPlay).
Understanding of DAI workflows.
AWS or cloud certifications (e.g., AWS Solutions Architect Associate).



  • Mexico City Tata Consultancy Services Full-time

    We are looking for a Site Reliability Engineer (SRE) to join our team and help us ensure seamless, high-performing, and reliable technology operations. What you’ll work with: Azure DevOps (pipelines, repositories, and automation); ServiceNow (incident, change, and problem management); AppDynamics (application performance monitoring and alerting); Microsoft Azure...


  • Mexico City The Functionary Full-time

    Senior Site Reliability Engineer We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high‑capacity, and high‑performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...


  • Mexico City Royal Caribbean Group Full-time

    Senior Site Reliability Engineer at Royal Caribbean Group. Journey with us! Combine your career goals and sense of adventure by joining our incredible team at Royal Caribbean Group. We offer a competitive compensation and benefits package, along with excellent career development...


  • Mexico City BairesDev Full-time

    Site Reliability Engineer - Remote Work | REF#282116, BairesDev. At...


  • Mexico City Sur Global Full-time

    Site Reliability Engineer - 100% Remote in Mexico As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission‑critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud‑native, containerized services) and classic Windows production environments (IIS, SQL...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City HCLTech Full-time

    Role: Site Reliability Engineer (SRE). Location: Remote (Mexico). Key Skills: AWS, Kubernetes. Full-time permanent position with HCLTech. Key Responsibilities: Design, build, and maintain highly available, scalable, and secure infrastructure. Implement monitoring, alerting, and incident response strategies. Optimize system reliability and performance across cloud...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City ITJ Full-time

    Mid-level Site Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City ITJ Full-time

    Mid-level Site Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City ITJ Full-time

    Mid-level Site Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential...

  • Site Reliability Engineer

    2 weeks ago


    Mexico City ITJ Full-time

    Site Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential Duties Include,...