Current jobs related to Site Reliability Engineer - Ciudad de México, Ciudad de México - Ford Motor Company

  • Site Reliability Engineer

    2 weeks ago


    Ciudad de México, Ciudad de México - Thomson Reuters - Full-time

    Unlock the Power of Cloud Operations. Thomson Reuters is seeking a skilled Site Reliability Engineer to join our team. As a key member of our Cloud Operations team, you will be responsible for ensuring the reliability and performance of our cloud-based services. About the Role: We are looking for a highly motivated and experienced Site Reliability Engineer who...


  • Ciudad de México, Ciudad de México - Thomson Reuters - Full-time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems. Key Responsibilities: Design, implement, and maintain scalable and highly available cloud-based...


  • Ciudad de México, Ciudad de México - Azka IT Consulting - Full-time

    Azka IT Consulting is a leading IT services company that connects top talent with Latin American and US companies. We are seeking a skilled Site Reliability Engineer to join our team. Job Summary: The Site Reliability Engineer plays a critical role in designing, implementing, and maintaining highly available, scalable, and reliable systems. Key...

  • Site Reliability Engineer

    3 weeks ago


    Ciudad de México, Ciudad de México - Svitla Systems - Full-time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Svitla Systems. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Responsibilities: Design and implement automation to reduce toil and improve...


  • Ciudad de México, Ciudad de México - Azka IT Consulting - Full-time

    Azka IT Consulting is a leading IT talent connector between Latin America and the United States. We are seeking a skilled Site Reliability Engineer to join our team. Job Summary: The Site Reliability Engineer plays a critical role in designing, implementing, and maintaining highly available, scalable, and reliable systems. Key Responsibilities: Develop and maintain...

  • Site Reliability Engineer

    3 weeks ago


    Ciudad de México, Ciudad de México - Azka IT Consulting - Full-time

    Azka IT Consulting is a leading IT services company that connects top talent with Latin American and US companies. We are seeking a skilled Site Reliability Engineer to join our team. Job Summary: The Site Reliability Engineer plays a critical role in designing, implementing, and maintaining highly available, scalable, and reliable systems. Key...


  • Ciudad de México, Ciudad de México - Thales - Full-time

    Job Description: Thales is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our large-scale ODC services. Responsibilities: Design, build, and maintain scalable and reliable infrastructure using Infrastructure as Code...


  • Ciudad de México, Ciudad de México - Azka IT Consulting - Full-time

    Azka IT Consulting is a Mexican company that connects top IT talent with Latin American and United States companies. We are seeking a skilled Site Reliability Engineer to join our team. Job Requirements: The Site Reliability Engineer plays a crucial role in designing, implementing, and maintaining highly available, scalable, and reliable systems. Technical...


  • Ciudad de México, Ciudad de México - Thomson Reuters - Full-time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and infrastructure. Key Responsibilities: Design, implement, and maintain scalable and highly...

  • Site Reliability Engineer

    3 weeks ago


    Ciudad de México, Ciudad de México - Thales - Full-time

    Job Description: Thales is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our large-scale ODC services. Responsibilities: Design, build, and maintain scalable and reliable infrastructure using Infrastructure as Code...


  • Ciudad de México, Ciudad de México - Ford Motor Company - Full-time

    Job Title: Site Reliability Engineer. At Ford Motor Company, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, configuring, and maintaining our observability solutions to ensure optimal performance and reliability of our IT systems and applications. Key...

  • Site Reliability Engineer

    3 weeks ago


    Ciudad de México, Ciudad de México - Thales - Full-time

    Job Description: Thales is a leading provider of digital security solutions, and we're seeking a skilled Site Reliability Engineer to join our team. About the Role: As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our large-scale ODC services. You will work closely with development teams...

  • Site Reliability Engineer

    2 weeks ago


    Ciudad de México, Ciudad de México - Ford Motor Company - Full-time

    Job Title: Site Reliability Engineer. At Ford Motor Company, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, configuring, and maintaining our observability solutions to ensure optimal performance and reliability of our IT systems and applications. Key...


  • Ciudad de México, Ciudad de México - Virtualent - Full-time

    Site Reliability Engineer at Virtualent. At Virtualent, we're passionate about connecting top talent with the best opportunities. We're looking for a Site Reliability Engineer to join our team and help us deliver high-quality services to our clients. Design, implement, and maintain scalable and highly available...

  • Site Reliability Engineer

    4 weeks ago


    Ciudad de México, Ciudad de México - Thomson Reuters - Full-time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and highly available cloud-based...

  • Site Reliability Engineer

    3 weeks ago


    Ciudad de México, Ciudad de México - Epam - Full-time

    About the Role: We are seeking a skilled Site Reliability Engineer to join our team at EPAM. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Responsibilities: Design, build, test, and deploy changes to existing software. Enhance the company's IT infrastructure...

  • Site Reliability Engineer

    2 weeks ago


    Ciudad de México, Ciudad de México - Thomson Reuters - Full-time

    About the Role: In this exciting opportunity as a Site Reliability Engineer, you will play a crucial role in ensuring the smooth operation of our cloud-based services. Your primary responsibility will be to design, test, deliver, support, and maintain production services in our technical operations environment. Key Responsibilities: Provide skilled technical...


  • Ciudad de México, Ciudad de México - Medallia - Full-time

    Overview: Medallia is a pioneer in Experience Management, offering a leading SaaS platform, Medallia Experience Cloud, that helps organizations understand and manage experiences for various stakeholders. Our mission is to create a culture that values every person and experience, fostering a diverse and inclusive workforce. The Role and Team: The Site Reliability...


  • Ciudad de México, Ciudad de México - Hitachi Vantara Corporation - Full-time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Hitachi Vantara Corporation. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our cloud infrastructure, particularly on the Azure platform. Key Responsibilities: Manage and troubleshoot pipelines for client onboarding...

  • Site Reliability Engineer

    3 weeks ago


    Ciudad de México, Ciudad de México - Thomson Reuters - Full-time

    About the Role: As a Site Reliability Engineer at Thomson Reuters, you will play a critical role in ensuring the reliability and scalability of our cloud-based systems. You will work closely with cross-functional teams to design, implement, and maintain high-quality software systems that meet the needs of our customers. Key Responsibilities: Design and implement...

Site Reliability Engineer

2 months ago


Ciudad de México, Ciudad de México - Ford Motor Company - Full-time
About the Role

We are seeking a highly skilled Site Reliability Engineer to join our team at Ford Motor Company. As a key member of our IT organization, you will be responsible for designing, configuring, and maintaining our observability solutions to ensure optimal performance and reliability of our IT systems and applications.

Key Responsibilities
  1. Observability and Monitoring: Utilize advanced monitoring tools to detect and resolve issues affecting user experience, and automate alerting and remediation processes to reduce mean time to resolution (MTTR) and improve system uptime.
  2. Cloud Monitoring Services: Implement comprehensive monitoring and alerting solutions using GCP monitoring services and external services, and gather and analyze metrics from operating systems and applications to assist in performance tuning and fault finding.
  3. Tooling and Automation: Build efficient tooling that lowers the barrier to entry for engineering teams to plug in and benefit from observability-focused reliability practices, and develop and integrate tools for logging, monitoring, and alerting to enhance visibility into system performance.
  4. Troubleshooting and Collaboration: Troubleshoot issues and outages, working closely with development and operations teams to identify root causes and develop solutions, and participate in strategic planning for the technology roadmap, including scalability, cost-effectiveness, and risk management considerations related to observability infrastructure.
Requirements
  1. 6+ years of SRE observability engineering experience.
  2. 6+ years of experience in observability best practices working with Dynatrace or similar tools, delivering solutions across all environments, and integrating platforms and applications with monitoring and APM tools.
  3. Knowledge of CI/CD and infrastructure automation tools such as Jenkins, Puppet, Terraform, and Ansible.
  4. At least 4 to 5 years of hands-on experience with OpenShift and Docker/Kubernetes.
  5. Proficiency in implementing monitoring and observability solutions using GCP monitoring services such as Cloud Monitoring, Cloud Logging, and Cloud Trace.
  6. Deep understanding of IT infrastructure monitoring and observability best practices.
  7. Experience gathering and organizing large amounts of data for instrumentation in an enterprise monitoring solution.
  8. Experience recommending baseline monitoring thresholds, performance monitoring KPIs, and SLAs.
  9. At least 4 years of experience developing Grafana dashboards and standardizing metrics and monitoring (metrics, collection, dashboards); Grafana experience is a must.
  10. 3-5 years of experience with SQL and familiarity with at least one managed Kubernetes platform (EKS, AKS, GKE).
  11. Strong background in software engineering, with expertise in relevant programming languages (like Python, Java, Go) and cloud platforms (like AWS, GCP, Azure).
  12. Experience with container orchestration tools like Kubernetes.
Competencies and Skills
  1. Strong interpersonal and organizational skills.
  2. Strong verbal and written skills.
  3. Attention to detail.
  4. Excellent time management.
  5. Extraordinary teamwork and collaborative skills.