FBS Site Reliability Engineer
hace 4 semanas
2 days ago Be among the first 25 applicants This range is provided by Capgemini. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range MX$1.00/yr - MX$1.00/yr Client Overview Our Client is one of the United States' largest insurers, providing a wide range of insurance and financial services products with gross written premiums well over US$25 Billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and independent agents and nearly 18,500 employees. Finally, our Client is part of one of the largest Insurance Groups in the world. Job Summary This position will focus on infrastructure & code reviews to ensure solutions built and delivered are Highly Available and to minimize unplanned downtime. Key Responsibilities Expert troubleshooter within IT who has broad technical experience in multiple disciplines of IT and is willing to help our Incident and Problem Management teams Understand root cause and the necessary tasks needed to ensure this incident does not recur Validate root cause of incidents in nonproduction regions, ensuring that the cause is validated and then work with teams to determine the best approach to resolve Participate in chaos testing – where we leverage a third‑party tool to disable functions on a server and we verify that we can alert teams to the failure and then assemble a technical troubleshooting call to identify and restore the service Leverage Observability tools set to define key transactions and observe their performance within systems Create golden signal reporting and error budgets for development teams. Must know the framework Perform failure analysis, leveraging chaos testing practices to break nonproduction systems to find weak points and work with infrastructure and development teams to improve the applications resilience Requirements At least 6 years of experience in a similar role as a Reliability Engineer or Resilience Engineer Full English Fluency BS in Computer Science or similar Very strong experience using Code (writing, testing leveraging observability process) – Ideally JAVA, C++ Hands on approach, troubleshooting, very technical background Technical & Business Skills Site Reliability Engineer - Advanced Trend & Pattern Analysis - Advanced, Optimization Resilience Engineering - Advanced Golden Signal Cyber Reliability (MUST) Dynatrace - Intermediate (4-6 Years) – Desirable, not a must, any other Observability tool Gremlin - Entry Level (1-3 Years) – Chaos testing, Failure modeling experience or similar (Very Desirable) Cloud Infrastructure, Experience: AWS / Azure / GCP - Intermediate (4-6 Years) Strong Coding experience Benefits Competitive salary and performance-based bonuses Comprehensive benefits package Career development and training opportunities Flexible work arrangements (remote and/or office-based) Dynamic and inclusive work culture within a globally renowned group Private Health Insurance Pension Plan Paid Time Off Training & Development About Capgemini Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided everyday by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 340,000 team members in more than 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms. The Group €22.5 billion in revenues in 2023. Seniority level Mid-Senior level Employment type Full-time Job function Information Technology Industries: IT Services and IT Consulting #J-18808-Ljbffr
-
Site Reliability Engineer
hace 2 semanas
WorkFromHome, México KI people A tiempo completo18 hours ago Be among the first 25 applicants Direct message the job poster from KI people In Search of the Best Global IT & Digital Talent We are looking for a Site Reliability Engineer to work on hybrid mode from GDL, MTY o CDMX for a multicultural project with stability and growth in the short, medium and long term. Role Overview: The SRE Operations...
-
Site Reliability Engineer
hace 2 semanas
WorkFromHome, México BairesDev A tiempo completoSite Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# 6 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev At BairesDev, we've been leading the way in...
-
Sr. Site Reliability Engineer
hace 2 semanas
WorkFromHome, México Nova A tiempo completoSr. Site Reliability Engineer (Remote, Mexico) Join to apply for the Sr. Site Reliability Engineer (Remote, Mexico) role at Nova Sr. Site Reliability Engineer (Remote, Mexico) 1 year ago Be among the first 25 applicants Join to apply for the Sr. Site Reliability Engineer (Remote, Mexico) role at Nova Get AI-powered advice on this job and more exclusive...
-
Site Reliability Engineer
hace 2 semanas
WorkFromHome, México BairesDev A tiempo completoSite Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Get AI-powered advice on this job and more exclusive features. At BairesDev, we've been...
-
Remote Site Reliability Engineer
hace 3 semanas
WorkFromHome, México Resend A tiempo completoA modern email platform company is seeking a Site Reliability Engineer for a fully remote position. In this role, you will enhance system reliability and automation, monitor performance parameters, and collaborate with engineering teams. Ideal candidates will have over 5 years in Site Reliability or Infrastructure Engineering, strong skills in Node.js and...
-
Site Reliability Engineer
hace 4 semanas
WorkFromHome, México - A tiempo completoJOB DESCRIPTION Site Reliability Engineer (SRE) - Application Performance Monitoring (APM) Location: Monterrey, Nuevo León, Mexico (Hybrid - candidates must reside in Monterrey or the metropolitan area) Language requirement: Fluent English (spoken and written) About the Role We're looking for a Site Reliability Engineer (SRE) with a passion for Application...
-
Site Reliability Engineer
hace 1 semana
WorkFromHome, México National Oilwell Varco, Inc. A tiempo completoSite Reliability Engineer (SRE) – Application Performance Monitoring (APM) Location: Monterrey, Nuevo León, Mexico (Hybrid – candidates must reside in Monterrey or the metropolitan area) Language requirement: Fluent English (spoken and written) About the Role We’re looking for a Site Reliability Engineer (SRE) with a passion for Application...
-
Site Reliability Engineer
hace 2 semanas
WorkFromHome, México BairesDev A tiempo completoSite Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# 6 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev At BairesDev, we've been leading the way in...
-
Senior Site Reliability Engineer
hace 2 semanas
WorkFromHome, México DuckDuckGo A tiempo completo6 days ago Be among the first 25 applicants Who We AreHi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our browser on Mac, Windows, iOS, and Android, our search engine,...
-
Site Reliability Engineer ID45689
hace 4 semanas
WorkFromHome, México AgileEngine A tiempo completoJoin to apply for the Site Reliability Engineer ID45689 role at AgileEngine . AgileEngine is an Inc. 5000 company that creates award‑winning software for Fortune 500 brands and startups across 17+ industries. We rank among leaders in application development and AI/ML, and our people‑first culture has earned us multiple Best Place to Work awards. Why join...