Senior Site Reliability Engineer

hace 6 días


Zapopan, México Oracle A tiempo completo

Are you someone with a passion for taking on big challenges? Are you interested in operating and working on the operations infrastructure for a large-scale, cutting edge cloud database service? If so, Oracle's MySQL HeatWave Service team on Oracle Cloud Infrastructure (OCI) can provide you the opportunity to build and operate a cloud service on a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in cloud products that meet the needs of customers who are tackling some of the world's biggest challenges.

MySQL is the world's most popular open source database. OCI is the industry's broadest and most integrated public cloud and helps organizations increase business agility, lower costs, and reduce IT complexity. MHS is built, operated and supported by the Oracle staff responsible for the MySQL products. MHS offers secure, stable, and performant MySQL services for those requiring an enterprise-class experience. The MHS team is responsible for developing, deploying and operating the cloud service framework powering Oracle's MySQL Database Service and HeatWave, MySQL's in-memory, query accelerator. We are a worldwide team of problem-solvers who are driven to deliver MySQL at cloud scale to meet the real-world needs of our customers. As a key leader on our DevOps our team, you will partner with Control Plane, Data Plane, Console and SRE colleagues to provide a secure, integrated, seamless, User Experience to customers managing their MySQL Database Systems.

**Responsibilities**
- Build observability, automation and tooling for a set of modern, cloud native, fault tolerant and scalable cloud database management services
- Contribute to operational activities such as writing runbooks, troubleshooting, operations automation, and instrumentation for metrics and events
- Develop infrastructure tooling and code to automate deployment and continuous verification of healthy service levels
- Solve reliability issues across the entire service architecture and its deployments
- Mentor junior team members
- Work productively in a fast-paced, team-oriented agile development environment
- Contribute to a healthy, supportive and inclusive team culture
- Work with geographically distributed teams and contribute to the success of your team and other related teams

**Qualifications**
- BE/BS/MS degree in Computer science or Computer Engineering or 4+ years related experience
- 3+ years experience including DevOps, Site Reliability Engineer (SRE), on-call rotations, working on highly scalable, distributed systems
- Highly proficient in at least one programming and/or scripting language (Python, Ruby, Java etc.), shell scripting, ssh, git, etc.
- Proficient in Linux/Unix systems administration
- Skilled at debugging and troubleshooting complex software and/or networking issues, performing root cause analysis
- Proficient in cloud development tools/infrastructure, and experienced in infrastructure automation through Terraform, Chef, Ansible, Puppet or similar
- Hands-on experience on at least one of the following cloud platforms: AWS, Azure, Google or OCI
- Procedurally oriented and willing a contributor to documentation and runbooks
- Proven ability to quickly learn new technical domains and then train others
- A productive, proactive team-player and good communicator who thrives when collaborating with others

**Preferred Qualifications**
- MySQL DBA experience a huge plus
- Experienced deploying code within change management procedures



  • Zapopan, Jalisco, México Oracle A tiempo completo

    We are looking for a skilled and motivated Cloud Region Build Site Reliability Engineer (SRE) to join our Oracle Cloud Infrastructure Region Build team. In this role, you will be responsible for building, deploying, and maintaining compute cloud infrastructure services across multiple regions to ensure high availability, scalability, and performance. You...

  • Site Reliability Engineer

    hace 2 semanas


    Zapopan, México GrainChain Inc A tiempo completo

    ¡Te estamos buscando, únete a GrainChain!Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de desarrollo y operaciones, asegurando la calidad y la entrega de soluciones de software.Somos una empresa de tecnología que ayuda a la industria agrícola a cerrar la brecha digital, con diferentes plataformas que...

  • Site Reliability Engineer

    hace 3 semanas


    Zapopan, México Oracle A tiempo completo

    Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate...

  • Site Reliability Engineer

    hace 2 semanas


    Zapopan, México BairesDev A tiempo completo

    WinDifferent specializes in helping businesses achieve rapid and sustainable growth through our powerful proprietary marketing system. Our data-driven solutions generate positive engagement that leads to ready-to-close opportunities, massively expanding sales pipelines and enabling companies to scale faster than the competition. As one of WinDifferent's...


  • Zapopan, Jalisco, México Oracle A tiempo completo

    DescriptionWe are looking for a skilled and motivated Cloud Region Build Site Reliability Engineer (SRE) to join our Oracle Cloud Infrastructure Region Build team. In this role, you will be responsible for building, deploying, and maintaining compute cloud infrastructure services across multiple regions to ensure high availability, scalability, and...


  • Zapopan, Jalisco, México GrainChain Inc A tiempo completo

    Estamos en busca de nuevos talentosGrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de desarrollo...


  • Zapopan, Jalisco, México GrainChain Inc A tiempo completo

    Estamos en busca de nuevos talentosGrainChaines una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de unSite Reliability Engineercapaz de integrar y automatizar las áreas de desarrollo y...


  • Zapopan, Jalisco, México GrainChain A tiempo completo

    Estamos en busca de nuevos talentosGrainChain es una empresa tecnológica dedicada a reducir la brecha digital en la industria agrícola. Nuestras plataformas facilitan las transacciones de manera rápida, seguras y sencillas para nuestros usuarios. Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de desarrollo...


  • Zapopan, México Oracle A tiempo completo

    Oracle Database Technology including RAC, Dataguard, Exadata and ASM/RMAN etc.- Technologies for scripted and orchestrated automation and Some understanding of Security fundamentals.- Development using Python, SQL/PlSql, Java/JavaScript, or Oracle APEXCareer Level - IC3DevOps/SRE - Shared Infrastructure and Engineered System Platform ServicesA unique...

  • Site Reliability Engineer

    hace 2 semanas


    Zapopan, México BairesDev A tiempo completo

    WinDifferent specializes in helping businesses achieve rapid and sustainable growth through our powerful proprietary marketing system. Our data-driven solutions generate positive engagement that leads to ready-to-close opportunities, massively expanding sales pipelines and enabling companies to scale faster than the competition.As one of WinDifferent's best...