Site Reliability Engineer

hace 3 semanas


Veracruz, México Coderoad Inc A tiempo completo

Overview Senior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation, dedicated IT teams, or general software engineering, we have opportunities for everyone to challenge themselves and take their career to the next level About the role: We are looking for a Senior Site Reliability Engineer (SRE) with strong experience in observability, metrics, logging, and reliability engineering. This role will lead the design and implementation of our monitoring and observability strategy across multiple services, ensuring system performance, resiliency, and operational excellence. The ideal candidate combines deep expertise in SRE practices, strong understanding of software engineering, and hands-on experience with modern observability stacks. Responsibilities Define and implement SLIs, SLOs, and error budgets for critical services. Design and maintain dashboards and alerting systems using tools like Prometheus, Grafana, ELK, OpenTelemetry, or equivalents. Standardize logging, tracing, and metrics across all applications and services. Continuously improve the system's visibility and health tracking to support high availability. Drive incident response, post-mortems, and root-cause analyses. Identify performance bottlenecks and propose architectural improvements. Implement chaos testing and resilience strategies where applicable. Develop CI / CD improvements that support reliability and quality. Automate operational workflows, deployments, and monitoring pipelines. Collaborate with development teams to ensure reliability is built into every service. Work closely with software engineers to establish observability best practices. Create internal standards for logs, metrics, and distributed tracing. Provide technical mentorship and help shape long-term reliability roadmaps. Qualifications 5-7+ years of experience in SRE, DevOps, or Platform Engineering roles. Strong experience with observability tools such as Prometheus, Grafana, ELK Stack, OpenTelemetry, Jaeger, Datadog, New Relic, etc. Solid understanding of Kubernetes, Docker, cloud platforms (AWS / GCP / Azure). Proficiency in at least one programming language (e.g., Java, Go, Python, Node.js). Experience implementing SLIs, SLOs, alerting strategies, and incident response. Ability to work cross-functionally and drive technical decisions. Experience with service mesh technologies (e.g., Istio). Background in performance testing, load testing, or capacity planning. Experience with infrastructure as code (Terraform, Ansible). What you’ll love USA Contractor 100% Remote Holidays Off Paid Time Off Health insurance assistance program Competitive Pay (USD) Excellent teamwork and work environment Training Seniority level: Mid-Senior level Employment type: Contract Job function: Consulting and Business Development Industry: IT Services and IT Consulting #J-18808-Ljbffr



  • Veracruz, México GrainChain Inc A tiempo completo

    Una empresa tecnológica del sector agrícola está buscando un Site Reliability Engineer para integrar y automatizar las áreas de desarrollo y operaciones. El candidato ideal debe tener experiencia en infraestructura, scripting y manejo de contenedores. Las responsabilidades incluyen construir la observabilidad de sistemas, crear pipelines de despliegue y...

  • Site Reliability Engineer

    hace 3 semanas


    Veracruz, México GrainChain Inc A tiempo completo

    Overview Estamos en busca de nuevos talentos! GrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de...


  • Veracruz, México GrainChain Inc A tiempo completo

    Estamos en busca de nuevos talentos! GrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de...


  • Veracruz, México GrainChain Inc A tiempo completo

    Estamos en busca de nuevos talentos! GrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de...

  • Industrial Engineer

    hace 3 semanas


    Veracruz, México Loesencial Business Solutions A tiempo completo

    We are looking for an Industrial Engineer to lead and optimize daily manufacturing operations in a fast-paced production environment. This role is responsible for improving efficiency, standardizing processes, and ensuring quality, safety, and productivity across the plant. The ideal candidate has strong experience in manufacturing operations, team...

  • Test Engineer

    hace 6 días


    Veracruz, México WestWell A tiempo completo

    Position: QA Engineer (System Operations) Company: Westwell Location: Port of Veracruz, Mexico Job Description: Westwell is seeking several Test Engineers responsible for software version testing of automated vehicles within the port, as well as undertaking system operations. As a Test Engineer, you will be a key member of the team, ensuring the stability...

  • Test Engineer

    hace 6 días


    Veracruz, México WestWell A tiempo completo

    Position: QA Engineer (System Operations)Company: Westwell Location: Port of Veracruz, MexicoJob Description: Westwell is seeking several Test Engineers responsible for software version testing of automated vehicles within the port, as well as undertaking system operations. As a Test Engineer, you will be a key member of the team, ensuring the stability and...

  • Commissioning Engineer

    hace 4 semanas


    Veracruz, México Enerflex A tiempo completo

    General Summary- This role is responsible for providing service and support to ITK team in construction, pre-commissioning,- commissioning, startup and performance test, overseeing the installation of equipment, systems or units at site- and in an early stage during engineering. Ensure all is working according to specifications and meet the client’s- needs...

  • Automation Engineer

    hace 3 semanas


    Veracruz, Ver., México Pentangle Tech Services | P5 Group A tiempo completo

    Job Title :- Automation Engineer Location :- USA Duration :- Long Term Job Summary We are seeking a skilled Automation Engineer to design, develop, and support automation systems that improve manufacturing efficiency, quality, and reliability. The ideal candidate has hands-on experience with PLC programming, troubleshooting automation equipment, and...


  • Veracruz, México Arganteal, Corp. A tiempo completo

    Job Title: Cisco Routing & Switching Engineers (Remote - 3 Openings) Contract Duration: 12 Months (Full-Time, 40 Hours / Week) Start Date: May 12th Location: 100% Remote - Work from Home Hourly Rate: $ USD Skill Level: Strong CCNA or CCNP-Level Network Engineer Shift: Working hours are 8am to 5pm CST Overview: Open Positions 1 Staging Engineer 2 Migration /...