Principal Site Reliability Developer
6 months ago
Applicants are required to read, write, and speak the following languages: English
**Role**: Site Reliability Engineer
**Location**: Guadalajara preferred
**Who are we looking for?**
**Roles and Responsibilities**
- Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production
- Participate in a follow-the-sun model for 24x7 support of OAC services
- Respond to incidents, own them, drive them to completion, and participate in root cause analysis
- Document various processes & runbooks; update existing processes
- Execute, with excellence, delivery of interim patches and hotfixes as required
- Work with various teams to take ownership of and resolve service failures and outages
- Monitor metrics and develop ways to improve the CI and CD tools utilized by the team
- Follow all best practices and procedures as established by the company
- Mentor and train other engineers and seek to continually improve processes
- Other duties as assigned
- A BS or MS in Computer Science, or equivalent
- Providing cloud networking, infrastructure, and service support, configuration, operations, tools, and processes
- Understanding of networking and TCP/IP fundamentals and of services such as DNS and HTTP
- Linux/Unix system administration, including system-level knowledge of Linux on OCI Gen 2 and creating and executing scripts
- Methodical approaches to troubleshooting and solving complex technical problems
- Producing documentation in support of developed work (KBs, runbooks, help guides)
- Utilizing agile methodologies
- Communicating effectively in a team environment
- Working with remote, global teams as well as individuals
- Working independently and in a self-directed manner
- Able to work extended weekday and weekend shifts as required for on-call, after-hours upgrades, and other duties as assigned
- 5+ years of experience running large-scale, customer-facing web services
- Oracle Cloud Infrastructure (OCI) or AWS, Azure, GCP compute, storage, and network operational experience.
- Programming and scripting languages (Python, Bash, JavaScript; additional experience with PHP, Groovy, Java, and/or Go is a plus)
- Using configuration management and automation tools such as Ansible, Puppet, or Chef in CI/CD workflows
- Containers and orchestration (Docker, Kubernetes, and docker-compose).
- Oracle database, MySQL (experience with MS SQL and/or NoSQL is a plus).
- Issue tracking and collaboration (Jira and Confluence).
-
Senior Site Reliability Developer
2 months ago
Zapopan, México | Oracle | Full-time
Senior Site Reliability Developer | Regular Employee | Zapopan, Jalisco | Oracle | 20.09.2024
Are you interested in the exciting challenges of building and operating large-scale distributed infrastructure for the cloud?...
-
Principal Site Reliability Engineer
3 weeks ago
Zapopan, Jalisco, México | Oracle | Full-time
Job Summary: Oracle is seeking a skilled Principal Site Reliability Engineer to join our team. As a Principal Site Reliability Engineer, you will be responsible for solving complex problems related to Linux infrastructure and Oracle Cloud Infrastructure. Responsibilities: Design and delivery of mission-critical automation with a focus on security, resiliency,...
-
Principal Site Reliability Developer
3 weeks ago
Zapopan, México | Oracle | Full-time
Technical Skills: Strong knowledge of Exadata, Real Application Clusters, Oracle database, storage, and Linux fundamentals. Oracle Exadata Database Machine and Oracle Cloud Infrastructure (OCI) certifications preferred. Knowledge of network fundamentals such as VCN, Ethernet, RoCE, TCP/IP, routing, DHCP, etc. Experience automating management of Linux...
-
Principal Site Reliability Developer
5 months ago
Zapopan, México | Oracle | Full-time
Be comfortable with mission-critical production issues and manage customer anxiety appropriately. We would like to see some combination of the following skills:
- 5+ years of software design or development experience or DevOps role with distributed, highly-scalable, maximum availability (HA, brownout), multi-node environments (partitioning, isolation with...
-
Principal Site Reliability Engineer
7 months ago
Zapopan, México | Oracle | Full-time
**Responsibilities**
- Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure
- Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA)
- Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
Site Reliability Developer
3 months ago
Zapopan, México | Oracle | Full-time
We are hiring for OCI Corporate Network and Security operations. You will be a member of the team responsible for network and security incident, change, and capacity management supporting the Oracle Corporate Network. Resolve complex problems related to network infrastructure services and build automation to prevent problem recurrence. Design, write, and...
-
Site Reliability Engineer
3 weeks ago
Zapopan, México | Oracle | Full-time
About the Job: At Oracle, we're seeking a talented and skilled Site Reliability Engineer to work on the Oracle Cloud Observability and Management platform. As a Site Reliability Engineer, you will solve interesting technical challenges by designing, deploying, and troubleshooting key Cloud services, platforms, and infrastructure, always thinking about...
-
Site Reliability Developer
6 months ago
Zapopan, México | Oracle | Full-time
**Job Description**: Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critical stack, with focus...
-
Senior Site Reliability Developer
8 months ago
Zapopan, México | Oracle | Full-time
Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with focus on security,...
-
Senior Site Reliability Developer
3 months ago
Zapopan, México | Oracle | Full-time
Career Level: IC3. Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack,...
-
Site Reliability Developer
6 months ago
Zapopan, México | Oracle | Full-time
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate...
-
Site Reliability Engineer
6 months ago
Zapopan, México | GrainChain Inc | Full-time
We're looking for you, join GrainChain! We are seeking a Site Reliability Engineer able to integrate and automate the development and operations areas, ensuring the quality and delivery of software solutions. We are a technology company that helps the agricultural industry close the digital divide, with different platforms that...
-
Principal Site Reliability Developer
1 month ago
Zapopan, México | Ll Oefentherapie | Full-time
Job Requirements: 8+ years of software design and development experience with distributed, highly-scalable, maximum availability (HA, brownout), multi-node environments (partitioning, isolation with VLAN, pkeys, qinq, vrf, evpn). Knowledge of server virtualization technologies: Xen, KVM, Linux containers, Docker, including vnuma, domain groups, SR-IOV....
-
Principal Site Reliability Developer
6 months ago
Zapopan, México | Oracle | Full-time
**OTA-LAD-MX** Would you like to contribute your own ideas on how to smartly deploy and manage large-scale distributed Database-as-a-Service offerings for the public and private clouds? Oracle’s Database development group designs and develops the Database-as-a-Service platform that drives Oracle's Database, Engineered Systems, Oracle Public & Private...
-
Principal Site Reliability Engineer
6 months ago
Zapopan, México | Oracle | Full-time
The role provides a mixture of production platform ownership as well as dedicated development time. You will solve challenging technical problems, identify improvements, and work on implementing your recommendations. You will also work directly with high-level developers on projects and work to blur the lines between traditional system operations and...
-
Principal Site Reliability Engineer
6 months ago
Zapopan, México | Oracle | Full-time
Career Level: IC4. The role provides a mixture of production platform ownership as well as dedicated development time. You will solve challenging technical problems, identify improvements, and work on implementing your recommendations. You will also work directly with high-level developers on projects and work to blur the lines between traditional system...
-
Site Reliability Engineer Lead
3 weeks ago
Zapopan, Jalisco, México | Oracle | Full-time
About the Role: We are seeking an experienced Site Reliability Engineer to join our team in Guadalajara. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services. The ideal candidate will have a strong background in DevOps, Linux/Unix system administration, and networking...
-
Site Reliability Engineering
4 weeks ago
Zapopan, México | Oracle | Full-time
**Job Description**: Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of Database cloud services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission...
-
Highly Skilled Oracle Database Cloud Expert
7 days ago
Zapopan, Jalisco, México | Oracle | Full-time
About the Role: We are seeking a highly skilled Senior Site Reliability Developer to join our team at Oracle.
-
Senior Site Reliability Architect
3 weeks ago
Zapopan, Jalisco, México | Oracle | Full-time
Are you looking for a challenging role in cloud infrastructure engineering? Oracle's Cloud Infrastructure is building its next generation of cloud technologies that operate in a highly distributed, available, scalable, and multi-tenant environment. Our mission is to provide customers with an enterprise-level cloud infrastructure platform that delivers...