Empleos actuales relacionados con Senior Site Reliability Engineer - Guadalajara - Oracle

  • Site Reliability Engineer

    hace 42 minutos


    Guadalajara, Jalisco, México NTT DATA A tiempo completo

    SRE - Site Reliability EngineerWe are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...

  • Site Reliability Engineer

    hace 59 minutos


    Guadalajara, Jalisco, México NTT DATA North America A tiempo completo

    SRE – Site Reliability EngineerWe are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure...


  • Guadalajara, México f5 A tiempo completo

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. - Site Reliability Engineer III Why do you want to join our team? - Everything we do centers around people. That means we obsess over how to...

  • Site Reliability Engineer

    hace 4 semanas


    Guadalajara, México F5 A tiempo completo

    Everything we do centers around people.That means we obsess over how to make the lives of our customers, and their customers, better.And it means we prioritize a diverse F5 community where each individual can thrive.- Site Reliability Engineer IIIWhy do you want to join our team?- Everything we do centers around people.That means we obsess over how to make...

  • Site Reliability Engineer

    hace 3 semanas


    Guadalajara, México Valce Talent Solutions A tiempo completo

    We are looking for a Lead Site Reliability Engineer who takes the initiative on developing and maintain the system and services for our Cash Management Platform, automating the deployment process, ensuring system scaling, investigating and resolving outdates, identifying and implementing preventive measures proactively, collaborating with key stakeholders,...


  • Guadalajara, México Nextiva A tiempo completo

    At Nextiva, we create connected communication tools that help businesses stay in touch with their customers and teams. Over 100,000 companies rely on Nextiva for phone service and customer management tools. We're not your parent's phone company.Founded in 2008, Nextiva took on the trillion-dollar telecom industry and succeeded in changing the game by making...


  • Guadalajara, México Nextiva A tiempo completo

    At Nextiva, we create connected communication tools that help businesses stay in touch with their customers and teams. Over 100,000 companies rely on Nextiva for phone service and customer management tools. We're not your parent's phone company. Founded in 2008, Nextiva took on the trillion-dollar telecom industry and succeeded in changing the game by...


  • Guadalajara, México Nextiva Mexico A tiempo completo

    At Nextiva, we create connected communication tools that help businesses stay in touch with their customers and teams. Over 100,000 companies rely on Nextiva for phone service and customer management tools. We're not your parent's phone company.Founded in 2008, Nextiva took on the trillion-dollar telecom industry and succeeded in changing the game by making...


  • Guadalajara, México Capgemini Engineering A tiempo completo

    **Senior SRE - Capgemini**:We’re hiring a **Senior Site Reliability Engineer**to join a major telecom client through Capgemini Engineering. Join a collaborative team building and operating large-scale cloud platforms that power next‑generation connectivity and customer experiences. This is a hands‑on role where you’ll design, automate, secure, and...


  • Guadalajara, México C3 Ai A tiempo completo

    We are looking for a Senior Site Reliability Engineer to join our team in Guadalajara.**Responsibilities**:- Maximize system uptime and availability, ensuring functional and performance SLAs.- Establish end-to-end monitoring and alerting on all critical aspects.- Solve complex problems for critical services and build automation to prevent problem...

Senior Site Reliability Engineer

hace 2 semanas


Guadalajara, México Oracle A tiempo completo

Oracle
- s Cloud Infrastructure team is supporting and building Block Storage Service, it involves Support, Operation, Deployment at scale in a broadly distributed multi-tenant cloud environment, closely working with various engineering teams. Our customers run their businesses on our cloud, and our mission is to provide them with best-in-class Block storage capabilities in conjunction with other compute, storage, networking, database, security offerings.

We’re looking for hands-on engineers with a passion for solving problems in distributed systems, virtualized infrastructure, and highly available services. Joining Oracle will give you the opportunity to learn and help build innovative new systems from the ground up and operate services at scale. Engineers at every level can have significant technical and business impact while delivering critical enterprise level features during multiple parallel deployments.

As a member of the software reliability engineering, you will take an active role in the support and operation of Block Storage service.

As **Senior SRE member in the Block Storage **team you will be required to:

- **Monitor **our service and proactively debug operational issues.
- Work with internal and external teams to diagnose **performance issues **.
- **Support Automation **and maintain build and test systems including systems for performance and scalability testing.
- Improve efficiency of the **deployment **processes across a **fast-growing number of regions **through automation and scale improvements to tools and dashboards.
- Participate in our **on-call rotation **and resolve complex distributed issues through debugging, communication and collaboration across multiple SRE teams across OCI.
- Improve our operational capabilities by developing **runbooks, alarming, and building tools **and documentation that enable customers to self-diagnose problems.
- **Deploy our service in new regions **and help to automate this process

**Basic Qualifications**:

- 6+ years of **SRE/Devops/Automation experience in a Linux based environment**:

- Familiarity with Storage Technologies - **iSCSI, NVME, SAN/NAS, Block Storage etc**:

- 2+ years of experience with Linux shell scripting, and **Python**:

- Proficient with Linux based build and analysis tools (e.g. make, scons/cons, bazel)
- Familiarity with **CICD **environments
- Familiarity with Agile Development
- Proficient with commonly used networking protocols such as TCP/IP, HTTP
- Familiarity with docker containers
- Familiarity with databases, NoSQL systems, **storage and distributed persistence technologies.**:

- **Troubleshooting and performance tuning skills **.