Site Reliability Engineer
3 weeks ago
Overview
Senior Site Reliability Engineer / Observability Engineer

At CodeRoad, we're more than just a software development company; we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation, dedicated IT teams, or general software engineering, we have opportunities for everyone to challenge themselves and take their career to the next level.

About the role:
We are looking for a Senior Site Reliability Engineer (SRE) with strong experience in observability, metrics, logging, and reliability engineering. This role will lead the design and implementation of our monitoring and observability strategy across multiple services, ensuring system performance, resiliency, and operational excellence. The ideal candidate combines deep expertise in SRE practices, a strong understanding of software engineering, and hands-on experience with modern observability stacks.

Responsibilities
Define and implement SLIs, SLOs, and error budgets for critical services.
Design and maintain dashboards and alerting systems using tools like Prometheus, Grafana, ELK, OpenTelemetry, or equivalents.
Standardize logging, tracing, and metrics across all applications and services.
Continuously improve the system's visibility and health tracking to support high availability.
Drive incident response, post-mortems, and root-cause analyses.
Identify performance bottlenecks and propose architectural improvements.
Implement chaos testing and resilience strategies where applicable.
Develop CI/CD improvements that support reliability and quality.
Automate operational workflows, deployments, and monitoring pipelines.
Collaborate with development teams to ensure reliability is built into every service.
Work closely with software engineers to establish observability best practices.
Create internal standards for logs, metrics, and distributed tracing.
Provide technical mentorship and help shape long-term reliability roadmaps.

Qualifications
5-7+ years of experience in SRE, DevOps, or Platform Engineering roles.
Strong experience with observability tools such as Prometheus, Grafana, ELK Stack, OpenTelemetry, Jaeger, Datadog, New Relic, etc.
Solid understanding of Kubernetes, Docker, and cloud platforms (AWS / GCP / Azure).
Proficiency in at least one programming language (e.g., Java, Go, Python, Node.js).
Experience implementing SLIs, SLOs, alerting strategies, and incident response.
Ability to work cross-functionally and drive technical decisions.
Experience with service mesh technologies (e.g., Istio).
Background in performance testing, load testing, or capacity planning.
Experience with infrastructure as code (Terraform, Ansible).

What you’ll love
USA Contractor
100% Remote
Holidays Off
Paid Time Off
Health insurance assistance program
Competitive Pay (USD)
Excellent teamwork and work environment
Training

Seniority level: Mid-Senior level
Employment type: Contract
Job function: Consulting and Business Development
Industry: IT Services and IT Consulting
-
Senior Site Reliability Engineer
2 weeks ago
Veracruz, México GrainChain Inc Full-time
A technology company in the agricultural sector is looking for a Site Reliability Engineer to integrate and automate its development and operations areas. The ideal candidate should have experience in infrastructure, scripting, and container management. Responsibilities include building system observability, creating deployment pipelines, and...
-
Site Reliability Engineer
3 weeks ago
Veracruz, México GrainChain Inc Full-time
Overview: We are looking for new talent! GrainChain is a technology company dedicated to closing the digital divide in the agricultural industry. Our platforms make transactions fast, secure, and simple for our users. We are looking for a Site Reliability Engineer able to integrate and automate the areas of...
-
Site Reliability Engineer Senior - DevOps
2 weeks ago
Veracruz, México GrainChain Inc Full-time
We are looking for new talent! GrainChain is a technology company dedicated to closing the digital divide in the agricultural industry. Our platforms make transactions fast, secure, and simple for our users. We are looking for a Site Reliability Engineer able to integrate and automate the areas of...
-
Senior Site Reliability Engineer/DevOps
3 weeks ago
Veracruz, México GrainChain Inc Full-time
We are looking for new talent! GrainChain is a technology company dedicated to closing the digital divide in the agricultural industry. Our platforms make transactions fast, secure, and simple for our users. We are looking for a Site Reliability Engineer able to integrate and automate the areas of...
-
Industrial Engineer
3 weeks ago
Veracruz, México Loesencial Business Solutions Full-time
We are looking for an Industrial Engineer to lead and optimize daily manufacturing operations in a fast-paced production environment. This role is responsible for improving efficiency, standardizing processes, and ensuring quality, safety, and productivity across the plant. The ideal candidate has strong experience in manufacturing operations, team...
-
Test Engineer
6 days ago
Veracruz, México WestWell Full-time
Position: QA Engineer (System Operations). Company: Westwell. Location: Port of Veracruz, Mexico. Job Description: Westwell is seeking several Test Engineers responsible for software version testing of automated vehicles within the port, as well as undertaking system operations. As a Test Engineer, you will be a key member of the team, ensuring the stability...
-
Test Engineer
6 days ago
Veracruz, México WestWell Full-time
Position: QA Engineer (System Operations). Company: Westwell. Location: Port of Veracruz, Mexico. Job Description: Westwell is seeking several Test Engineers responsible for software version testing of automated vehicles within the port, as well as undertaking system operations. As a Test Engineer, you will be a key member of the team, ensuring the stability and...
-
Commissioning Engineer
4 weeks ago
Veracruz, México Enerflex Full-time
General Summary: This role is responsible for providing service and support to the ITK team in construction, pre-commissioning, commissioning, startup, and performance testing, overseeing the installation of equipment, systems, or units on site and at an early stage during engineering. Ensure everything is working according to specifications and meets the client’s needs...
-
Automation Engineer
3 weeks ago
Veracruz, Ver., México Pentangle Tech Services | P5 Group Full-time
Job Title: Automation Engineer. Location: USA. Duration: Long Term. Job Summary: We are seeking a skilled Automation Engineer to design, develop, and support automation systems that improve manufacturing efficiency, quality, and reliability. The ideal candidate has hands-on experience with PLC programming, troubleshooting automation equipment, and...
-
Cisco Catalyst SD-WAN Engineer
3 weeks ago
Veracruz, México Arganteal, Corp. Full-time
Job Title: Cisco Routing & Switching Engineers (Remote - 3 Openings). Contract Duration: 12 Months (Full-Time, 40 Hours / Week). Start Date: May 12th. Location: 100% Remote - Work from Home. Hourly Rate: $ USD. Skill Level: Strong CCNA or CCNP-Level Network Engineer. Shift: Working hours are 8am to 5pm CST. Overview: Open Positions: 1 Staging Engineer, 2 Migration /...