Senior Site Reliability Engineer

6 months ago


Zapopan, Mexico · Oracle · Full-time

Are you someone with a passion for taking on big challenges? Are you interested in operating and working on the operations infrastructure for a large-scale, cutting-edge cloud database service? If so, Oracle's MySQL HeatWave Service team on Oracle Cloud Infrastructure (OCI) offers you the opportunity to build and operate a cloud service in a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in cloud products that meet the needs of customers who are tackling some of the world's biggest challenges.

MySQL is the world's most popular open source database. OCI is the industry's broadest and most integrated public cloud and helps organizations increase business agility, lower costs, and reduce IT complexity. MySQL HeatWave Service (MHS) is built, operated, and supported by the Oracle staff responsible for the MySQL products. MHS offers secure, stable, and performant MySQL services for those requiring an enterprise-class experience. The MHS team is responsible for developing, deploying, and operating the cloud service framework powering Oracle's MySQL Database Service and HeatWave, MySQL's in-memory query accelerator. We are a worldwide team of problem-solvers who are driven to deliver MySQL at cloud scale to meet the real-world needs of our customers. As a key leader on our DevOps team, you will partner with Control Plane, Data Plane, Console, and SRE colleagues to provide a secure, integrated, seamless user experience to customers managing their MySQL Database Systems.

**Responsibilities**
- Build observability, automation and tooling for a set of modern, cloud native, fault tolerant and scalable cloud database management services
- Contribute to operational activities such as writing runbooks, troubleshooting, operations automation, and instrumentation for metrics and events
- Develop infrastructure tooling and code to automate deployment and continuous verification of healthy service levels
- Solve reliability issues across the entire service architecture and its deployments
- Mentor junior team members
- Work productively in a fast-paced, team-oriented agile development environment
- Contribute to a healthy, supportive and inclusive team culture
- Work with geographically distributed teams and contribute to the success of your team and other related teams

**Qualifications**
- BE/BS/MS degree in Computer Science or Computer Engineering, or 4+ years of related experience
- 3+ years of experience in DevOps or Site Reliability Engineering (SRE), including on-call rotations and work on highly scalable, distributed systems
- Highly proficient in at least one programming or scripting language (Python, Ruby, Java, etc.), as well as shell scripting, SSH, and Git
- Proficient in Linux/Unix systems administration
- Skilled at debugging and troubleshooting complex software and/or networking issues, performing root cause analysis
- Proficient in cloud development tools/infrastructure, and experienced in infrastructure automation through Terraform, Chef, Ansible, Puppet or similar
- Hands-on experience with at least one of the following cloud platforms: AWS, Azure, Google Cloud, or OCI
- Procedurally oriented and a willing contributor to documentation and runbooks
- Proven ability to quickly learn new technical domains and then train others
- A productive, proactive team-player and good communicator who thrives when collaborating with others

**Preferred Qualifications**
- MySQL DBA experience is a huge plus
- Experience deploying code within change management procedures
