Site Reliability Engineer
hace 1 día
Overview Senior Site Reliability Engineer / Observability Engineer At CodeRoad, we're more than just a software development company—we're your gateway to the global tech world. We offer end-to-end software development services and give you the opportunity to work on exciting, real-world projects in a supportive environment. Whether it's staff augmentation, dedicated IT teams, or general software engineering, we have opportunities for everyone to challenge themselves and take their career to the next level About the role: We are looking for a Senior Site Reliability Engineer (SRE) with strong experience in observability, metrics, logging, and reliability engineering. This role will lead the design and implementation of our monitoring and observability strategy across multiple services, ensuring system performance, resiliency, and operational excellence. The ideal candidate combines deep expertise in SRE practices, strong understanding of software engineering, and hands-on experience with modern observability stacks. Responsibilities Define and implement SLIs, SLOs, and error budgets for critical services. Design and maintain dashboards and alerting systems using tools like Prometheus, Grafana, ELK, OpenTelemetry, or equivalents. Standardize logging, tracing, and metrics across all applications and services. Continuously improve the system's visibility and health tracking to support high availability. Drive incident response, post-mortems, and root-cause analyses. Identify performance bottlenecks and propose architectural improvements. Implement chaos testing and resilience strategies where applicable. Develop CI / CD improvements that support reliability and quality. Automate operational workflows, deployments, and monitoring pipelines. Collaborate with development teams to ensure reliability is built into every service. Work closely with software engineers to establish observability best practices. Create internal standards for logs, metrics, and distributed tracing. Provide technical mentorship and help shape long-term reliability roadmaps. Qualifications 5-7+ years of experience in SRE, DevOps, or Platform Engineering roles. Strong experience with observability tools such as Prometheus, Grafana, ELK Stack, OpenTelemetry, Jaeger, Datadog, New Relic, etc. Solid understanding of Kubernetes, Docker, cloud platforms (AWS / GCP / Azure). Proficiency in at least one programming language (e.g., Java, Go, Python, Node.js). Experience implementing SLIs, SLOs, alerting strategies, and incident response. Ability to work cross-functionally and drive technical decisions. Experience with service mesh technologies (e.g., Istio). Background in performance testing, load testing, or capacity planning. Experience with infrastructure as code (Terraform, Ansible). What you’ll love USA Contractor 100% Remote Holidays Off Paid Time Off Health insurance assistance program Competitive Pay (USD) Excellent teamwork and work environment Training Seniority level: Mid-Senior level Employment type: Contract Job function: Consulting and Business Development Industry: IT Services and IT Consulting #J-18808-Ljbffr
-
Site Reliability Engineer
hace 3 días
Veracruz, México GrainChain Inc A tiempo completoOverview Estamos en busca de nuevos talentos! GrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de...
-
Senior Site Reliability Engineer/Devops
hace 3 días
Veracruz, México GrainChain Inc A tiempo completoEstamos en busca de nuevos talentos! GrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de...
-
Build Engineer
hace 4 semanas
Veracruz, Ver., México S&P Global A tiempo completoSite Reliability Engineer - Data Support | S&P Dow Jones Indices We are seeking an Site Reliability Engineer - Data Support to be a key player in the implementation and support of our Global Index Data Platform that supports our major headline indices like S&P 500, Dow Jones Industrial Averages & also the co-branded indices with our exchange partners such as...
-
Industrial Engineer
hace 3 días
Veracruz, México Loesencial Business Solutions A tiempo completoWe are looking for an Industrial Engineer to lead and optimize daily manufacturing operations in a fast-paced production environment. This role is responsible for improving efficiency, standardizing processes, and ensuring quality, safety, and productivity across the plant. The ideal candidate has strong experience in manufacturing operations, team...
-
Village Engineer
hace 2 semanas
Veracruz, México Rockwell Land Corporation A tiempo completoAbout the RoleThe Village Engineer (Civil & Property Management) is responsible for planning, supervising, and maintaining civil infrastructure, as well as managing village-level properties and community assets.This role combines engineering expertise with property and asset management to ensure the sustainable use, protection, and development of village...
-
Test Engineer
hace 2 semanas
Veracruz, México WestWell A tiempo completoPosition: QA Engineer (System Operations) Company: Westwell Location: Port of Veracruz, Mexico Job Description: Westwell is seeking several Test Engineers responsible for software version testing of automated vehicles within the port, as well as undertaking system operations. As a Test Engineer, you will be a key member of the team, ensuring the stability...
-
Commissioning Engineer
hace 6 días
Veracruz, México Enerflex A tiempo completoGeneral Summary- This role is responsible for providing service and support to ITK team in construction, pre-commissioning,- commissioning, startup and performance test, overseeing the installation of equipment, systems or units at site- and in an early stage during engineering. Ensure all is working according to specifications and meet the client’s- needs...
-
Control Engineer
hace 2 semanas
Veracruz, México Covia A tiempo completo1 day ago Be among the first 25 applicants Covia is a leading supplier of minerals and material solutions to the industrial and energy markets. Covia’s rich legacy includes many achievements across industries and capital success through partnership. Our ability to deliver the right product, to the right place, at the right time, is unmatched. Just as...
-
Customer Engineer-Poza Rica, Veracruz
hace 2 semanas
Veracruz, México Ncr Atleos Corporation A tiempo completoCustomer Engineer-Poza Rica, Veracruz page is loadedCustomer Engineer-Poza Rica, VeracruzApply locations MEXICO VIRTUAL, MEX time type Full time posted on Posted 2 Days Ago job requisition id R*******About NCR AtleosNCR Atleos, headquartered in Atlanta, is a leader in expanding financial access.Our dedicated 20,000 employees optimize the branch, improve...
-
Cisco Catalyst Sd-Wan Engineer
hace 1 día
Veracruz, México Arganteal, Corp. A tiempo completoJob Title: Cisco Routing & Switching Engineers (Remote - 3 Openings) Contract Duration: 12 Months (Full-Time, 40 Hours / Week) Start Date: May 12th Location: 100% Remote - Work from Home Hourly Rate: $ USD Skill Level: Strong CCNA or CCNP-Level Network Engineer Shift: Working hours are 8am to 5pm CST Overview: Open Positions 1 Staging Engineer 2 Migration /...