Principal Site Reliability Engineer

hace 2 meses


Ciudad de México, Ciudad de México Oracle A tiempo completo
Job Summary

We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Oracle. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure.

Key Responsibilities
  • Infrastructure Management
    • Design and implement high-availability, fault-tolerant systems
    • Develop and maintain automation scripts to streamline infrastructure management
    • Collaborate with development teams to ensure seamless integration with cloud services
  • Problem-Solving and Root Cause Analysis
    • Analyze complex problems related to Linux infrastructure and Oracle Cloud Infrastructure
    • Provide root cause analysis and recommendations for improvement
  • Technical Leadership
    • Lead technical discussions and provide guidance on infrastructure design and implementation
    • Collaborate with cross-functional teams to drive technical innovation and improvement
  • Documentation and Knowledge Sharing
    • Author technical documentation and standard operating procedures
    • Share knowledge and best practices with team members
Requirements
  • Proven experience in Site Reliability Engineering and automation
  • Experience in Linux Administration with good knowledge of Kernel-level debugging
  • Experience in debugging operating system performance issues and performance tuning
  • Experience working with fault-tolerant, highly available, high-efficiency, distributed and scalable systems
  • Expertise in developing scripts, utilities, and tools to automate routine or manual intensive tasks
  • Experience in application, compute, storage, and database solving for improving application reliability, scalability, availability
  • Experience in cloud infrastructure technologies
  • Experience in operations and problem management
  • Development experience using Python and building Infrastructure using Terraform
  • Experience in handling high-availability production applications
  • Experience working with global teams across different time zones
  • Possesses and demonstrates strong logical-thinking skills, full of intellectual curiosity and high for self-development
  • Ability to be a good teammate and the desire to learn and implement new Cloud technologies as needed
  • Good understanding of Agile software development principles including using common tools such as JIRA
  • Good understanding of cloud security, and compliance management including patching
  • Excellent interpersonal, verbal, and written communication skills
Qualifications
  • Proven experience working in IT Operations/Infrastructure team
  • Bachelor degree in Computer Science, Computer Engineering, Software Engineering, or related areas is helpful


  • Ciudad de México, Ciudad de México Oracle A tiempo completo

    Job DescriptionJob Title: Principal Site Reliability EngineerJob Summary:We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Oracle. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high availability, scalability, and...


  • Ciudad de México, Ciudad de México Oracle A tiempo completo

    Job DescriptionOverviewWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Oracle. As a key member of our infrastructure team, you will be responsible for designing and delivering mission-critical automation solutions that ensure the security, resiliency, scale, and performance of our cloud...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    Unlock the Power of Cloud OperationsThomson Reuters is seeking a skilled Site Reliability Engineer to join our team. As a key member of our Cloud Operations team, you will be responsible for ensuring the reliability and performance of our cloud-based services.About the RoleWe are looking for a highly motivated and experienced Site Reliability Engineer who...


  • Ciudad de México, Ciudad de México Oracle A tiempo completo

    Site Reliability and Automation ExpertiseWe are seeking a seasoned site reliability engineer to join our team at Oracle. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, scalability, and performance of our critical systems.Key ResponsibilitiesSolve complex problems related to Linux infrastructure and Oracle...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud-based...


  • Ciudad de México, Ciudad de México Azka IT Consulting A tiempo completo

    Azka IT Consulting is a leading IT services company that connects top talent with Latin American and US companies.We are seeking a skilled Site Reliability Engineer to join our team.Job SummaryThe Site Reliability Engineer plays a critical role in designing, implementing, and maintaining highly available, scalable, and reliable systems.Key...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México, Ciudad de México Svitla Systems A tiempo completo

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Svitla Systems. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Responsibilities:Design and implement automation to reduce toil and improve...


  • Ciudad de México, Ciudad de México Azka IT Consulting A tiempo completo

    Azka IT Consulting is a leading IT talent connector between Latin America and the United States.We are seeking a skilled Site Reliability Engineer to join our team.Job SummaryThe Site Reliability Engineer plays a critical role in designing, implementing, and maintaining highly available, scalable, and reliable systems.Key ResponsibilitiesDevelop and maintain...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México, Ciudad de México Azka IT Consulting A tiempo completo

    Azka IT Consulting is a leading IT services company that connects top talent with Latin American and US companies.We are seeking a skilled Site Reliability Engineer to join our team.Job SummaryThe Site Reliability Engineer plays a critical role in designing, implementing, and maintaining highly available, scalable, and reliable systems.Key...


  • Ciudad de México, Ciudad de México Thales A tiempo completo

    Job DescriptionThales is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our large-scale ODC services.ResponsibilitiesDesign, build, and maintain scalable and reliable infrastructure using Infrastructure as a Code...


  • Ciudad de México, Ciudad de México Azka IT Consulting A tiempo completo

    Azka IT Consulting is a Mexican company that connects top IT talent with Latin American and United States companies.We are seeking a skilled Site Reliability Engineer to join our team.Job RequirementsThe Site Reliability Engineer plays a crucial role in designing, implementing, and maintaining highly available, scalable, and reliable systems.Technical...


  • Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and infrastructure.Key ResponsibilitiesDesign, implement, and maintain scalable and highly...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México, Ciudad de México Thales A tiempo completo

    Job DescriptionThales is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our large-scale ODC services.ResponsibilitiesDesign, build, and maintain scalable and reliable infrastructure using Infrastructure as a Code...


  • Ciudad de México, Ciudad de México Ford Motor Company A tiempo completo

    Job Title: Site Reliability EngineerAt Ford Motor Company, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, configuring, and maintaining our observability solutions to ensure optimal performance and reliability of our IT systems and applications.Key...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México, Ciudad de México Thales A tiempo completo

    Job DescriptionThales is a leading provider of digital security solutions, and we're seeking a skilled Site Reliability Engineer to join our team.About the RoleAs a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our large-scale ODC services. You will work closely with development teams...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Ford Motor Company A tiempo completo

    Job Title: Site Reliability EngineerAt Ford Motor Company, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, configuring, and maintaining our observability solutions to ensure optimal performance and reliability of our IT systems and applications.Key...


  • Ciudad de México, Ciudad de México Virtualent A tiempo completo

    {"h2": "Site Reliability Engineer at Virtualent", "p": "At Virtualent, we're passionate about connecting top talent with the best opportunities. We're looking for a Site Reliability Engineer to join our team and help us deliver high-quality services to our clients.", "ul": [{"li": "Design, implement, and maintain scalable and highly available...

  • Site Reliability Engineer

    hace 4 semanas


    Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud-based...

  • Site Reliability Engineer

    hace 3 semanas


    Ciudad de México, Ciudad de México Epam A tiempo completo

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at EPAM. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.ResponsibilitiesDesign, build, test, and deploy changes to existing softwareEnhance the company's IT infrastructure...

  • Site Reliability Engineer

    hace 2 semanas


    Ciudad de México, Ciudad de México Thomson Reuters A tiempo completo

    About the RoleIn this exciting opportunity as a Site Reliability Engineer, you will play a crucial role in ensuring the smooth operation of our cloud-based services. Your primary responsibility will be to design, test, deliver, support, and maintain production services in our technical operations environment.Key ResponsibilitiesProvide skilled technical...