Site Reliability Engineer

hace 5 días


WorkFromHome, México F5 Networks, Inc. A tiempo completo

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. The Reliability Engineer will be a critical contributor within the Site Reliability Engineering (SRE) and Incident Management team, focusing on ensuring the availability, reliability, and performance of critical systems and services. This role is responsible for managing and facilitating major incident response efforts, ensuring that service disruptions are quickly identified, triaged, and resolved. As an incident facilitator, the Reliability Engineer will take the lead during high-pressure situations, collaborating with cross-functional teams to restore service and drive root cause analysis to prevent future issues. Clear and consistent communication will be critical to the success of the incident management team and processes.In addition to incident management, the Reliability Engineer will apply technical expertise to design, deploy, and manage modern observability tools, including synthetic monitoring and infrastructure monitoring solutions. The ideal candidate will demonstrate a mix of strong technical skills, effective communication, and the ability to remain composed and solutions-oriented under pressure. **Key Responsibilities** Create, document, and improve incident response and management processes, defining clear roles and responsibilities for all participants during incidents. Ensure open lines of communication by ensuring engineering teams engage in communication processes during incidents and have a clear understanding of their responsibilities. Identify and implement opportunities to automate manual operational tasks to further reduce incident response and resolution times.Qualifications Education: Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent professional experience). 3+ years of professional experience in Site Reliability Engineering (SRE), System Engineering, DevOps, or IT Operations roles. Highly experienced as a major incident manager, incident commander, or similar role, with a proven ability to facilitate, communicate, and drive resolution of technical incidents. Experience with hybrid infrastructure environments and understand monitoring signals from static on-premise infrastructure, cloud based ephemeral infrastructure, and SaaS applications.Experience with Python, Go, Bash, or a similar language to develop and maintain monitoring and automation scripts. Proven ability to remain calm and effective during high-pressure situations, facilitating resolution in a methodical, professional manner. d QualificationsExperience with Infrastructure-as-Code (IaC) tools such as Terraform, CloudFormation, or Ansible as part of observability and monitoring pipelines. Experience building tooling using modern infrastructure patterns such as containerization and serverless. Experience implementing SLAs, SLOs, and error budgets in environments operating under Site Reliability Engineering or ITIL frameworks. Knowledge of network and system security, including secure configurations, traffic monitoring, and network observability. It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws. This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination. F5 offers a variety of reasonable accommodations for candidates. Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting **Remote**: Primarily work from designated home location but can come into an F5 office to work or travel to an offsite location as needed.#J-18808-Ljbffr


  • Site Reliability Engineer

    hace 3 semanas


    WorkFromHome, México KI people A tiempo completo

    18 hours ago Be among the first 25 applicants Direct message the job poster from KI people In Search of the Best Global IT & Digital Talent We are looking for a Site Reliability Engineer to work on hybrid mode from GDL, MTY o CDMX for a multicultural project with stability and growth in the short, medium and long term. Role Overview: The SRE Operations...

  • Site Reliability Engineer

    hace 3 semanas


    WorkFromHome, México BairesDev A tiempo completo

    Site Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# 6 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev At BairesDev, we've been leading the way in...


  • WorkFromHome, México Nova A tiempo completo

    Sr. Site Reliability Engineer (Remote, Mexico) Join to apply for the Sr. Site Reliability Engineer (Remote, Mexico) role at Nova Sr. Site Reliability Engineer (Remote, Mexico) 1 year ago Be among the first 25 applicants Join to apply for the Sr. Site Reliability Engineer (Remote, Mexico) role at Nova Get AI-powered advice on this job and more exclusive...

  • Site Reliability Engineer

    hace 3 semanas


    WorkFromHome, México BairesDev A tiempo completo

    Site Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Get AI-powered advice on this job and more exclusive features. At BairesDev, we've been...


  • WorkFromHome, México Resend A tiempo completo

    A modern email platform company is seeking a Site Reliability Engineer for a fully remote position. In this role, you will enhance system reliability and automation, monitor performance parameters, and collaborate with engineering teams. Ideal candidates will have over 5 years in Site Reliability or Infrastructure Engineering, strong skills in Node.js and...

  • Site Reliability Engineer

    hace 4 semanas


    WorkFromHome, México - A tiempo completo

    JOB DESCRIPTION Site Reliability Engineer (SRE) - Application Performance Monitoring (APM) Location: Monterrey, Nuevo León, Mexico (Hybrid - candidates must reside in Monterrey or the metropolitan area) Language requirement: Fluent English (spoken and written) About the Role We're looking for a Site Reliability Engineer (SRE) with a passion for Application...

  • Site Reliability Engineer

    hace 2 semanas


    WorkFromHome, México National Oilwell Varco, Inc. A tiempo completo

    Site Reliability Engineer (SRE) – Application Performance Monitoring (APM) Location: Monterrey, Nuevo León, Mexico (Hybrid – candidates must reside in Monterrey or the metropolitan area) Language requirement: Fluent English (spoken and written) About the Role We’re looking for a Site Reliability Engineer (SRE) with a passion for Application...

  • Site Reliability Engineer

    hace 3 semanas


    WorkFromHome, México BairesDev A tiempo completo

    Site Reliability Engineer - Remote Work | REF# Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev Site Reliability Engineer - Remote Work | REF# 6 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer - Remote Work | REF# role at BairesDev At BairesDev, we've been leading the way in...


  • WorkFromHome, México DuckDuckGo A tiempo completo

    6 days ago Be among the first 25 applicants Who We AreHi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our browser on Mac, Windows, iOS, and Android, our search engine,...

  • Site Reliability Engineer

    hace 2 semanas


    WorkFromHome, México Epam A tiempo completo

    A leading digital services company in Mexico City seeks a Site Reliability Engineer to enhance communication between operational and developmental sides of software. You will guide teams in designing, building, testing, and deploying software changes while maintaining and improving cloud infrastructure. Ideal candidates are proficient in Site Reliability...