Systems Reliability Engineer II

11 hours ago


Mexico City Nutanix Full-time

We are seeking a Technical Support Engineer with a strong focus on hardware support for enterprise-grade servers within hyper-converged infrastructure environments. This role involves providing remote assistance to field engineers during hardware replacement activities for servers managed by a Nutanix cluster stack, as well as handling support tickets related to hardware issues. You will be responsible for accurately framing problems, troubleshooting hardware issues, and engaging additional technical resources as necessary.

About the Team

  • Act as the primary technical escalation point for hardware-related support cases from global customers.
  • Troubleshoot complex issues involving servers, storage, and networking.
  • Collaborate with engineering and vendors for hardware failure analysis and RMA investigations.
  • Perform in-depth log analysis, firmware validation, and system health checks.
  • Guide field teams through hardware replacement procedures and validations.
  • Own and drive critical incidents to resolution with minimal downtime to customer environments.
  • Contribute to and maintain internal knowledge base articles and troubleshooting guides.
  • Identify recurring issues and work with engineering to drive long-term solutions and design improvements.
  • Go above and beyond to support customers' business and their use of the Nutanix stack.

Your Role

  • Provide remote technical support via phone and remote sharing to field engineers performing hardware replacements.
  • Work on support tickets related to hardware issues, ensuring timely resolution.
  • Frame problems clearly and escalate to specialized technical resources when required.
  • Follow documented procedures and ensure compliance during replacement activities.
  • Assist with troubleshooting and resolving hardware-related issues in Nutanix-based hyper-converged systems.
  • Collaborate with customers and field engineers to deliver a seamless support experience.
  • Document cases and maintain accurate records in the support system.

What You Will Bring

  • 2 to 4 years of hands-on experience in enterprise hardware support or systems engineering.
  • Hardware Expertise: Proven experience in server hardware replacement and troubleshooting.
  • Operating Systems: Linux administration skills (user and basic system management).
  • Virtualization: Basic to medium level experience with VMware and/or AHV (or similar KVM-based hypervisors).
  • Storage: Basic to medium understanding of storage concepts.
  • Networking: Basic to medium networking knowledge (IP addressing, VLANs, connectivity troubleshooting).
  • Framing & Troubleshooting: Ability to frame technical problems accurately and perform effective troubleshooting before escalation.
  • Communication: Fluent in English; strong ability to handle calls and video sessions professionally.
  • Ability to follow procedures, guide others through technical tasks, and manage escalations effectively.
  • Experience with Nutanix clusters or similar hyper-converged platforms.
  • Familiarity with enterprise support environments and ticketing systems.
  • Customer-focused mindset with excellent problem-solving skills.

Work Arrangement

Hybrid: This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. For most roles, that will mean coming into an office a minimum of 3 days per week; however, certain roles and/or teams may require more frequent in-office presence. Additional team-specific guidance and norms will be provided by your manager.

--

Nutanix is an equal opportunity employer.

Nutanix is an Equal Employment Opportunity and (in the U.S.) an Affirmative Action employer. Qualified applicants are considered for employment opportunities without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, protected veteran status, disability status or any other category protected by applicable law. We hire and promote individuals solely on the basis of qualifications for the job to be filled.
We strive to foster an inclusive working environment that enables all our Nutants to be themselves and to do great work in a safe and welcoming environment, free of unlawful discrimination, intimidation or harassment. As part of this commitment, we will ensure that persons with disabilities are provided reasonable accommodations. If you need a reasonable accommodation, please let us know by contacting CandidateAccommodationRequests@nutanix.com.



  • Mexico City Kyndryl Full-time

    A leading technology provider in Mexico City is hiring a Site Reliability Engineer to ensure reliability and innovation in information systems. The role involves collaborating with a talented team, driving continuous improvement, and managing operational challenges. Ideal candidates have 10+ years of experience in operational management and a strong...


  • Mexico City Mastercard Full-time

    A leading global payments technology company based in Mexico City is seeking a Platform Engineer II to innovate and enhance customer experiences. In this role, you will be responsible for system reliability, scalability, and performance while working closely with engineering teams. Ideal candidates will have strong automation skills, knowledge of...


  • Mexico City The Functionary Full-time

    Senior Site Reliability Engineer We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high‑capacity, and high‑performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...


  • Site Reliability Engineer

    4 weeks ago


    Mexico City Sur Global Full-time

    Site Reliability Engineer - 100% Remote in Mexico As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission‑critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud‑native, containerized services) and classic Windows production environments (IIS, SQL...

  • Site Reliability Engineer

    3 weeks ago


    Mexico City Royal Caribbean Group Full-time

    Talent Acquisition @Royal Caribbean Group Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to...


  • Mexico City Zipdev Full-time

    A technology company is seeking a System Reliability Engineer in Mexico City. The role involves designing and maintaining resilient systems on Google Cloud Platform, enhancing reliability practices, and automating operational tasks. Ideal candidates have 5+ years of experience, strong skills in GCP and container technologies, and a proactive mindset....


  • Mexico City ITJ Full-time

    Site Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential Duties Include,...


  • Mexico City ITJ Full-time

    Site Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential...