Principal Site Reliability Engineer
hace 1 mes
The role provides a mixture of production platform ownership as well as dedicated development time. You will solve challenging technical problems, identify improvements and work on implementing your recommendations. You will also work directly with high level developers on projects and work to blur the lines between traditional system operations and development support.
In this role you will need to:
- Contribute to development of platform services including architecture, configuration, deployment, and support.
- Participate in prototyping new customer facing platform services.
- Stay informed of new technologies.
- Innovate.
- Architecture and design: create and improve current service deployment infrastructure using automation and the latest cloud capabilities to improve agility, reliability, and observability.
- Ownership: understand internal development processes end-to-end in order to streamline CI/CD pipelines.
- Migration: move/re-implement existing On-Prem services to Oracle Cloud Infrastructure in a manner that is secure and leverages the latest cloud services.
- Troubleshooting: have a deep understanding of our services and dependencies in order to respond quickly and efficiently to major incidents and minimize service disruptions when they occur. Root cause issues so that improvements can be made. Customer support, configuration, and address escalated issues in timely manner.
- Database support: Need expertise in cutting-edge products and technologies like Real Application Clusters, High Availability, Data Guard, Corruption, Backup and Recovery, RMAN, Performance, Memory Management, Parallel query, Query tuning, Storage, ASM, Security, Networking, Enterprise Manager etc. The Engineer should have good hands on experience on UNIX, Linux and/or Solaris platforms.
- BS in Computer Science or related technical field or equivalent practical experience.
- 6+ years of industry experience.
- Strong communication and analytical skills.
- Able to accurately estimate efforts and deliver on time.
- Experience with agile processes and general understanding of product development.
- Some understanding of Linux-based OS internals, virtualization solutions and Cloud services.
- Experience with configuration management tools.
- Strong knowledge of internet protocols.
- Exposure to troubleshooting network services.
- Understanding of the DevOps Toolchain components and how they fit together; experience developing automation and using open source tools, github, jenkins, rundeck, ansible.
- Understanding and experience with CI/CD practices.
- Strong technical knowledge of Web Logic and OBIEE.
- Strong technical background in cloud networking, storage, and security.
- Strong technical knowledge of Kubernetes, Docker, Registry.
- Strong technical knowledge of monitoring and building dashboards (e.g. Kibana, Prometheus, Grafana, etc).
- Ability to automate tasks using Python, or bash.
- Team player and able to work with others all skill levels.
-
Principal Site Reliability Engineer
hace 2 meses
Zapopan, Jal., México Ll Oefentherapie A tiempo completoJoinOCIMX We are looking to recruit a Site Reliability Engineer to the established Oracle Cloud Infrastructure (OCI) Enterprise Engineering team. The successful candidate will be located in Mexico and will mainly be responsible for defining and deploying key services with deep focus on architecture, production operations, performance management,...
-
Principal Site Reliability Engineer
hace 1 mes
Zapopan, México Oracle A tiempo completo**Responsibilities** - Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure - Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA) - Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
Principal Site Reliability Engineer
hace 2 días
Zapopan, México Oracle A tiempo completo**Responsibilities** - Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure - Act as escalation point for critical issues that may not have a documented procedure and provide Root Cause Analysis (RCA) - Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
Site Reliability Engineer
hace 2 meses
Zapopan, México GrainChain Inc A tiempo completo¡Te estamos buscando, únete a GrainChain! Estamos en búsqueda de un Site Reliability Engineer capaz de integrar y automatizar las áreas de desarrollo y operaciones, asegurando la calidad y la entrega de soluciones de software. Somos una empresa de tecnología que ayuda a la industria agrícola a cerrar la brecha digital, con diferentes plataformas que...
-
Principal Site Reliability Engineer
hace 2 semanas
Zapopan, Jalisco, México Oracle A tiempo completoJob DescriptionmdclpJoinOCIMXWe are looking to recruit a Site Reliability Engineer to the established Oracle Cloud Infrastructure (OCI) Enterprise Engineering team. The successful candidate will be located in Mexico and will mainly be responsible for defining and deploying key services with deep focus on architecture, production operations, performance...
-
Principal Site Reliability Engineer
hace 5 días
Zapopan, Jalisco, México Oracle A tiempo completoJob DescriptionmdclpJoinOCIMXWe are looking to recruit a Site Reliability Engineer to the established Oracle Cloud Infrastructure (OCI) Enterprise Engineering team. The successful candidate will be located in Mexico and will mainly be responsible for defining and deploying key services with deep focus on architecture, production operations, performance...
-
Principal Site Reliability Engineer
hace 6 días
Zapopan, Jalisco, México myGwork - LGBTQ+ Business Community A tiempo completoThis inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. ResponsibilitiesSolve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA)Understand the...
-
Principal Site Reliability Engineer
hace 2 semanas
Zapopan, Jalisco, México Oracle A tiempo completoResponsibilitiesJob DescriptionSolve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA)Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
Principal Site Reliability Engineer
hace 5 días
Zapopan, Jalisco, México Oracle A tiempo completoResponsibilitiesJob DescriptionSolve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA)Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
Principal Site Reliability Engineer
hace 5 días
Zapopan, Jalisco, México myGwork - LGBTQ+ Business Community A tiempo completoThis inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. ResponsibilitiesSolve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA)Understand the...
-
Senior Site Reliability Engineer
hace 1 mes
Zapopan, México Oracle A tiempo completoDevOps/Service Reliability Engineer - Shared Infrastructure and Engineered System Platform Services A unique opportunity to join a rapidly growing world-class team of engineer, implement, and operate cutting edge systems built on Oracle technologies that make up Oracle Cloud Core Framework solutions. As part of the global Oracle Cloud Strategic Solutions...
-
Site Reliability Engineer
hace 3 semanas
Zapopan, Jalisco, México Oracle A tiempo completoJob Description:- Address challenges related to infrastructure cloud services and create automation to prevent problem repetition.- Develop and deploy software to enhance the availability, scalability, and efficiency of Oracle products and services.- Design architectures and standards for large-scale distributed systems.- Assist in service capacity planning,...
-
Site Reliability Engineer
hace 5 días
Zapopan, Jalisco, México Oracle A tiempo completoJob Description:- Address challenges related to infrastructure cloud services and create automation to prevent problem repetition.- Develop and deploy software to enhance the availability, scalability, and efficiency of Oracle products and services.- Design architectures and standards for large-scale distributed systems.- Assist in service capacity planning,...
-
Principal Site Reliability Engineer
hace 1 mes
Zapopan, México Oracle A tiempo completoCareer Level - IC4 The role provides a mixture of production platform ownership as well as dedicated development time. You will solve challenging technical problems, identify improvements and work on implementing your recommendations. You will also work directly with high level developers on projects and work to blur the lines between traditional system...
-
Principal Site Reliability Engineer
hace 2 semanas
Zapopan, Jalisco, México Oracle A tiempo completoResponsibilitiesSolve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA) Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and services...
-
Principal Site Reliability Engineer
hace 5 días
Zapopan, Jalisco, México Oracle A tiempo completoResponsibilitiesSolve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA) Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and services...
-
Principal Site Reliability Developer
hace 1 mes
Zapopan, México Oracle A tiempo completoApplicants are required to read, write, and speak the following languages: English **Role**: Site Reliability Engineer **Location**: Guadalajara preferred **Who are we looking for?** **Roles and Responsibilities** - Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production - Participate in a...
-
Principal Site Reliability Developer
hace 3 días
Zapopan, México Oracle A tiempo completoApplicants are required to read, write, and speak the following languages: English **Role**: Site Reliability Engineer **Location**: Guadalajara preferred **Who are we looking for?** **Roles and Responsibilities** - Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production - Participate in a...
-
Senior Site Reliability Engineer
hace 1 mes
Zapopan, México Oracle A tiempo completoThe role provides a mixture of production platform Operations ownership as well as engineering. You will solve challenging technical problems, identify improvements, and work on implementing your recommendations. You will also work directly with high-level developers on projects and work to blur the lines between traditional system operations and development...
-
Site Reliability Engineer
hace 5 días
Zapopan, México Oracle A tiempo completoSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate...