Principal Site Reliability Engineer
5 months ago
Career Level - IC4
The role provides a mixture of production platform ownership and dedicated development time. You will solve challenging technical problems, identify improvements, and work on implementing your recommendations. You will also work directly with high-level developers on projects and work to blur the lines between traditional system operations and development support.
In this role you will need to:
- Contribute to development of platform services including architecture, configuration, deployment, and support.
- Participate in prototyping new customer facing platform services.
- Stay informed of new technologies.
- Innovate.
- Architecture and design: create and improve current service deployment infrastructure using automation and the latest cloud capabilities to improve agility, reliability, and observability.
- Ownership: understand internal development processes end-to-end in order to streamline CI/CD pipelines.
- Migration: move/re-implement existing On-Prem services to Oracle Cloud Infrastructure in a manner that is secure and leverages the latest cloud services.
- Troubleshooting: have a deep understanding of our services and dependencies in order to respond quickly and efficiently to major incidents and minimize service disruptions when they occur. Root-cause issues so that improvements can be made. Provide customer support and configuration, and address escalated issues in a timely manner.
- Database support: expertise in cutting-edge products and technologies such as Real Application Clusters, High Availability, Data Guard, Corruption, Backup and Recovery, RMAN, Performance, Memory Management, Parallel Query, Query Tuning, Storage, ASM, Security, Networking, Enterprise Manager, etc. The engineer should have good hands-on experience with UNIX, Linux, and/or Solaris platforms.
- BS in Computer Science or related technical field or equivalent practical experience.
- 6+ years of industry experience.
- Strong communication and analytical skills.
- Able to accurately estimate efforts and deliver on time.
- Experience with agile processes and general understanding of product development.
- Some understanding of Linux-based OS internals, virtualization solutions and Cloud services.
- Experience with configuration management tools.
- Strong knowledge of internet protocols.
- Exposure to troubleshooting network services.
- Understanding of the DevOps toolchain components and how they fit together; experience developing automation and using open source tools such as GitHub, Jenkins, Rundeck, and Ansible.
- Understanding and experience with CI/CD practices.
- Strong technical knowledge of WebLogic and OBIEE.
- Strong technical background in cloud networking, storage, and security.
- Strong technical knowledge of Kubernetes, Docker, Registry.
- Strong technical knowledge of monitoring and building dashboards (e.g. Kibana, Prometheus, Grafana).
- Ability to automate tasks using Python or Bash.
- Team player, able to work with others at all skill levels.
-
Principal Site Reliability Engineer
2 weeks ago
Zapopan, Jalisco, México Oracle Full-time Job Summary: Oracle is seeking a skilled Principal Site Reliability Engineer to join our team. As a Principal Site Reliability Engineer, you will be responsible for solving complex problems related to Linux infrastructure and Oracle Cloud Infrastructure. Responsibilities: Design and delivery of mission-critical automation with a focus on security, resiliency,...
-
Principal Site Reliability Engineer
1 month ago
Zapopan, Jalisco, México Oracle Full-time Job Summary: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Oracle. As a key member of our infrastructure team, you will be responsible for designing and delivering mission-critical automation solutions that meet the needs of our customers. Responsibilities: Design and implement automation solutions to improve service...
-
Site Reliability Engineer
1 week ago
Zapopan, México Oracle Full-time About the Job: At Oracle, we're seeking a talented and skilled Site Reliability Engineer to work on the Oracle Cloud Observability and Management platform. As a Site Reliability Engineer, you will solve interesting technical challenges by designing, deploying, and troubleshooting key Cloud services, platforms, and infrastructure, always thinking about...
-
Principal Site Reliability Engineer
7 months ago
Zapopan, México Oracle Full-time **Responsibilities** - Solve complex problems related to Linux infrastructure and Oracle Cloud Infrastructure - Act as a partner concern point for critical issues that may not have a detailed procedure and provide Root Cause Analysis (RCA) - Understand the end-to-end configuration, technical dependencies, characteristics of production infrastructure and...
-
Site Reliability Engineer
6 months ago
Zapopan, México GrainChain Inc Full-time We're looking for you, join GrainChain! We are looking for a Site Reliability Engineer able to integrate and automate the development and operations areas, ensuring the quality and delivery of software solutions. We are a technology company that helps the agricultural industry close the digital gap, with different platforms that...
-
Senior Site Reliability Engineer
7 months ago
Zapopan, México Oracle Full-time DevOps/Service Reliability Engineer - Shared Infrastructure and Engineered System Platform Services. A unique opportunity to join a rapidly growing world-class team to engineer, implement, and operate cutting-edge systems built on Oracle technologies that make up Oracle Cloud Core Framework solutions. As part of the global Oracle Cloud Strategic Solutions...
-
Site Reliability Engineer Lead
2 weeks ago
Zapopan, Jalisco, México Oracle Full-time About the Role: We are seeking an experienced Site Reliability Engineer to join our team in Guadalajara. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services. The ideal candidate will have a strong background in DevOps, Linux/Unix system administration, and networking...
-
Principal Site Reliability Developer
2 weeks ago
Zapopan, México Oracle Full-time Technical Skills: Strong knowledge of Exadata, Real Application Clusters, Oracle Database, storage, and Linux fundamentals. Oracle Exadata Database Machine and Oracle Cloud Infrastructure (OCI) certifications preferred. Knowledge of network fundamentals such as VCN, Ethernet, RoCE, TCP/IP, routing, DHCP, etc. Experience automating management of Linux...
-
Principal Site Reliability Engineer
6 months ago
Zapopan, México Oracle Full-time The role provides a mixture of production platform ownership and dedicated development time. You will solve challenging technical problems, identify improvements, and work on implementing your recommendations. You will also work directly with high-level developers on projects and work to blur the lines between traditional system operations and...
-
Principal Site Reliability Developer
6 months ago
Zapopan, México Oracle Full-time Applicants are required to read, write, and speak the following languages: English **Role**: Site Reliability Engineer **Location**: Guadalajara preferred **Who are we looking for?** **Roles and Responsibilities** - Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production - Participate in a...
-
Senior Site Reliability Developer
1 month ago
Zapopan, México Oracle Full-time Regular Employee 20.09.2024 Are you interested in the exciting challenges of building and operating large-scale distributed infrastructure for the cloud?...
-
Site Reliability Engineer
6 months ago
Zapopan, México Oracle Full-time Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate...
-
Senior Site Reliability Engineer
7 months ago
Zapopan, México Oracle Full-time The role provides a mixture of production platform operations ownership and engineering. You will solve challenging technical problems, identify improvements, and work on implementing your recommendations. You will also work directly with high-level developers on projects and work to blur the lines between traditional system operations and development...
-
Principal Site Reliability Developer
5 months ago
Zapopan, México Oracle Full-time Be comfortable with mission-critical production issues and manage customer anxiety appropriately. We would like to see some combination of the following skills: - 5+ years of software design or development experience, or a DevOps role, with distributed, highly scalable, maximum availability (HA, brownout), multi-node environments (partitioning, isolation with...
-
Site Reliability Engineer
6 months ago
Zapopan, México Oracle Full-time Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate...
-
Site Reliability Engineer Jr
5 months ago
Zapopan, México GrainChain Inc Full-time We're looking for you, join GrainChain! We are looking for a **_Site Reliability Engineer Junior_** able to integrate and automate the development and operations areas, ensuring the quality and delivery of software solutions. We are a technology company that helps the agricultural industry close the digital gap, with different...
-
Site Reliability Engineering
3 weeks ago
Zapopan, México Oracle Full-time **Job Description**: Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of Database cloud services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission...
-
Senior Site Reliability Engineer
5 months ago
Zapopan, México Oracle Full-time Are you someone with a passion for taking on big challenges? Are you interested in operating and working on the operations infrastructure for a large-scale, cutting-edge cloud database service? If so, Oracle's MySQL HeatWave Service team on Oracle Cloud Infrastructure (OCI) can provide you the opportunity to build and operate a cloud service on a broadly...
-
Senior Site Reliability Engineer
5 months ago
Zapopan, México Oracle Full-time Project Description: Oracle Store is an eCommerce platform for selling Oracle's products and services to its customers and partners. It is a one-stop place for consumers to create, view, and manage various transactions, such as purchasing SW, HW, and cloud-based services, as well as to track orders, subscribe to memberships in various Oracle programs, renew...
-
Senior Site Reliability Engineer
6 months ago
Zapopan, México Oracle Full-time Oracle Database technology including RAC, Data Guard, Exadata, ASM/RMAN, etc. - Technologies for scripted and orchestrated automation and some understanding of security fundamentals. - Development using Python, SQL/PL/SQL, Java/JavaScript, or Oracle APEX. Career Level - IC3. DevOps/SRE - Shared Infrastructure and Engineered System Platform Services. A unique...