Systems Reliability Engineer II
hace 19 horas
We are seeking aTechnical Support Engineer with a strong focus onhardware support for enterprise-grade servers within hyper-converged infrastructure environments. This role involves providing remote assistance tofield engineers during hardware replacement activities for servers managed by aNutanix cluster stack, as well as handling support tickets related to hardware issues. You will be responsible for accurately framing problems, troubleshooting hardware issues, and engaging additional technical resources as necessary.About the TeamAct as the primary technical escalation point for hardware-related support cases from global customers.Troubleshoot complex issues involving servers, storage and networking.Collaborate with engineering and vendors for hardware failure analysis and RMA investigations.Perform in-depth log analysis, firmware validation, and system health checks.Guide field teams through hardware replacement procedures and validations.Own and drive critical incidents to resolution with minimal downtime to customer environments.Contribute to and maintain internal knowledge-based articles and troubleshooting guides.Identify recurring issues and work with engineering to drive long-term solutions and design improvements.Go above and beyond to support their business and use of the Nutanix stack.Your RoleProvide remote technical support via phone and Remote Sharing to field engineers performing hardware replacements.Work on support tickets related to hardware issues, ensuring timely resolution.Frame problems clearly and escalate to specialized technical resources when required.Follow documented procedures and ensure compliance during replacement activities.Assist with troubleshooting and resolving hardware-related issues in Nutanix-based hyper-converged systems.Collaborate with customers and field engineers to deliver a seamless support experience.Document cases and maintain accurate records in the support system.What You Will Bring2 to 4 years of hands-on experience inenterprise hardware support or systems engineering.Hardware Expertise: Proven experience in server hardware replacement and troubleshooting.Operating Systems: Linux administration skills (user and basic system management).Virtualization: Basic to medium level experience with VMware and/or AHV (or similar KVM-based hypervisors).Storage: Basic to medium understanding of storage concepts.Networking: Basic to medium networking knowledge (IP addressing, VLANs, connectivity troubleshooting).Framing & Troubleshooting: Ability to frame technical problems accurately and perform effective troubleshooting before escalation.Communication: Fluent in English; strong ability to handle calls and video sessions professionally.Ability to follow procedures, guide others through technical tasks, and manage escalations effectively.Experience withNutanix clusters or similar hyper-converged platforms.Familiarity with enterprise support environments and ticketing systems.Customer-focused mindset with excellent problem-solving skills.Work ArrangementHybrid: This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. For most roles, that will mean coming into an office a minimum of 3 days per week, however certain roles and/or teams may require more frequent in-office presence. Additional team-specific guidance and norms will be provided by your manager.--Nutanix is an equal opportunity employer.Nutanix is an Equal Employment Opportunity and (in the U.S.) an Affirmative Action employer. Qualified applicants are considered for employment opportunities without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, protected veteran status, disability status or any other category protected by applicable law. We hire and promote individuals solely on the basis of qualifications for the job to be filled. We strive to foster an inclusive working environment that enables all our Nutants to be themselves and to do great work in a safe and welcoming environment, free of unlawful discrimination, intimidation or harassment. As part of this commitment, we will ensure that persons with disabilities are provided reasonable accommodations. If you need a reasonable accommodation, please let us know by contacting CandidateAccommodationRequests@nutanix.com . #J-18808-Ljbffr
-
Site Reliability Engineer: Mainframe Systems
hace 3 semanas
Mexico City Kyndryl A tiempo completoA leading technology provider in Mexico City is hiring a Site Reliability Engineer to ensure reliability and innovation in information systems. The role involves collaborating with a talented team, driving continuous improvement, and managing operational challenges. Ideal candidates have 10+ years of experience in operational management and a strong...
-
Platform Engineer II: Drive Reliability
hace 2 días
Mexico City Mastercard A tiempo completoA leading global payments technology company based in Mexico City is seeking a Platform Engineer II to innovate and enhance customer experiences. In this role, you will be responsible for system reliability, scalability, and performance while working closely with engineering teams. Ideal candidates will have strong automation skills, knowledge of...
-
Site Reliability Engineer
hace 7 días
Mexico City The Functionary A tiempo completoSenior Site Reliability Engineer We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high‑capacity, and high‑performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...
-
Site Reliability Engineer
hace 7 días
Mexico City The Functionary A tiempo completoSenior Site Reliability Engineer We are looking for a Senior Site Reliability Engineer to build and maintain reliable, high‑capacity, and high‑performing systems that support our mission to protect and improve customer platforms, with a strong focus on reliability, security, performance, cost, and operational excellence. As a Site Reliability Engineer on...
-
Site Reliability Engineer
hace 4 semanas
Mexico City Sur Global A tiempo completoSite Reliability Engineer - 100% Remote in Mexico As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission‑critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud‑native, containerized services) and classic Windows production environments (IIS, SQL...
-
Site Reliability Engineer
hace 4 semanas
Mexico City Sur Global A tiempo completoSite Reliability Engineer - 100% Remote in Mexico As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission‑critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud‑native, containerized services) and classic Windows production environments (IIS, SQL...
-
Site Reliability Engineer
hace 3 semanas
Mexico City Royal Caribbean Group A tiempo completoTalent Acquisition @Royal Caribbean Group Journey with us! Combine your career goals and sense of adventure by joining our incredible team of employees at Royal Caribbean Group. We are proud to offer a competitive compensation and benefits package, and excellent career development opportunities, each offering unique ways to explore the world. We are proud to...
-
Senior Cloud Reliability Engineer — Remote
hace 2 semanas
Mexico City Zipdev A tiempo completoA technology company is seeking a System Reliability Engineer in Mexico City. The role involves designing and maintaining resilient systems on Google Cloud Platform, enhancing reliability practices, and automating operational tasks. Ideal candidates have 5+ years of experience, strong skills in GCP and container technologies, and a proactive mindset....
-
Site Reliability Engineer
hace 3 días
Mexico City ITJ A tiempo completoSite Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential Duties Include,...
-
Site Reliability Engineer
hace 2 días
Mexico City ITJ A tiempo completoSite Reliability Engineer (SRE). The Site Reliability Engineering team constantly practices the DevOps mindset to build and deploy distributed, fault-tolerant systems at scale. As part of this team, you will work with developers, operations, and product sponsors to help design, build, and deploy the critical infrastructure needed. Essential...