Site Reliability Engineer
6 days ago
About the role:
We are building and expanding the next-generation Platform as a Service (PaaS) cloud and the next-generation cloud support experience to go with it. As our cloud service grows, we are expanding our team of energetic, customer-focused site reliability engineers (SREs). Our team performs an operational role in supporting Oracle's Exadata platform in Oracle Cloud Infrastructure (OCI).

Oracle Exadata is a full-stack solution that improves the performance, scale, security, and availability of an enterprise's Oracle databases. It incorporates more than 60 unique features, such as Smart Scan SQL offload, that are engineered with Oracle Database to accelerate OLTP, analytics, and machine learning applications. Exadata also reduces capital costs and management expenses by enabling IT departments to consolidate hundreds of databases onto a single system.

Blending the traditional roles of system administration, database engineering, and cloud disciplines, you'll be part of a team that supports this amazing machine. As part of the broader engineering organization, you will act as the voice of the customer to influence product features and plans to improve the customer experience. This role is integral to the success of our customer relationships and critical to the success of the platform.
Requirements:
- At least a bachelor's degree in Computer Science, MIS, or another technical field, or equivalent work experience.
- Experience installing and administering Linux operating systems.
- A good understanding of how the Linux kernel works (I/O, networking, etc.).
- Knowledge of or experience with virtual machines (KVM, OVM, ESX/ESXi, etc.).
- Knowledge of or experience with OEL or Red Hat Linux.
- Knowledge of or experience with cloud networking (load balancers, routing, subnets, etc.).
- Knowledge of or experience with TCP/IP-based networking.
- Knowledge of or experience with distributed computing, cloud concepts, and cloud platforms.
- Experience with troubleshooting hardware, software, and networking issues.
- Experience with Oracle databases including RAC, Data Guard, Oracle GI (Clusterware, ASM) & RMAN.
- Experience in Oracle Exadata Administration of both hardware & software.
- Previous experience in cloud technical support, operations, NOC or similar is preferred, but not required.
- BASH or other shell scripting experience.
- Knowledge of an advanced programming language, such as Python, is preferred, but not required.
- Proven ability to quickly learn new technical domains and then train others.
- Very strong analytical skills to identify problems and their root causes.
- Great verbal and written communication skills are very important to this role.
- Ability to collaborate effectively with internal and external customers.
Responsibilities:
- Actively participate in the resolution of complex technical issues with Oracle Cloud Infrastructure (OCI)'s Exadata Cloud Service.
- Work towards ensuring a highly available and scalable database service by developing and implementing solutions to complex problems and incidents.
- Create and execute best-practice recommendations and technical documents.
- Contribute to making our infrastructure simple, reliable, and easy to operate.
- You may be asked to perform periodic on-call duties.
Career Level - IC3
-
Site Reliability Developer 4
2 weeks ago
Zapopan, Jalisco, México - Oracle - Full-time
We are looking for a skilled and motivated Cloud Region Build Site Reliability Engineer (SRE) to join our Oracle Cloud Infrastructure Region Build team. In this role, you will be responsible for building, deploying, and maintaining compute cloud infrastructure services across multiple regions to ensure high availability, scalability, and...
-
Site Reliability Engineer
2 weeks ago
Zapopan, Jalisco, México - GrainChain - Full-time
We are looking for new talent. GrainChain is a technology company dedicated to closing the digital divide in the agricultural industry. Our platforms make transactions fast, secure, and simple for our users. We are looking for a Site Reliability Engineer able to integrate and automate the areas of development...
-
Site Reliability Engineer
2 weeks ago
Zapopan, Jalisco, México - GrainChain Inc - Full-time
We are looking for new talent. GrainChain is a technology company dedicated to closing the digital divide in the agricultural industry. Our platforms make transactions fast, secure, and simple for our users. We are looking for a Site Reliability Engineer able to integrate and automate the areas of development...
-
Site Reliability Developer 3
3 days ago
Zapopan, Jalisco, México - Oracle - Full-time
We are looking for a skilled and motivated Cloud Region Build Site Reliability Engineer (SRE) to join our Oracle Cloud Infrastructure Region Build team. In this role, you will be responsible for building, deploying, and maintaining compute cloud infrastructure services across multiple regions to ensure high availability, scalability, and...
-
Principal Site Reliability Engineer
3 days ago
Zapopan, Jalisco, México - Oracle - Full-time
As a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for...
-
Principal Site Reliability Developer
2 weeks ago
Zapopan, Jalisco, México - Oracle - Full-time
Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, focusing...
-
Principal Site Reliability Developer
1 day ago
Zapopan, Jalisco, México - Oracle - Full-time
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems....
-
Principal Site Reliability Engineer
24 hours ago
Zapopan, Jalisco, México - Oracle - Full-time
As a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal...
-
Principal Site Reliability Developer
3 days ago
Zapopan, Jalisco, México - Oracle - Full-time
Work with an elite team to provide Oracle Database Administration support for customer production systems in the Oracle Cloud, with the opportunity to work on the latest Oracle database releases and features as part of the cloud-first strategy. Provide DBA operational support with a high degree of customer service, technical expertise, and...