Site Reliability Developer
hace 2 semanas
Site Reliability Developer (Cloud DevOps)-22000EJV
**Applicants are required to read, write, and speak the following languages***: English
**Preferred Qualifications**
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.
A BS or MS in Computer Science, or equivalent. Identifies solutions to knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, technology security and compliance. Experience running large scale customer facing web services. Identifies solutions to understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting technical architecture of complex and highly scalable products. A minimum of 5+ years’ experience of running large scale customer facing web services.
Be comfortable with mission critical production issues and manage customer anxiety appropriately. We would like to see some combination of the following skills:
- 5+ years of software design or development experience or devops role with distributed, highly-scalable, maximum availability (HA, brownout), multi-node environments (partitioning, isolation with vlan, pkeys, qinq, vrf, evpn)
- Oncall
- Knowledge of server virtualization technologies: Xen, KVM Linux containers, docker including vnuma, domain groups, SR-IOV
- Knowledge of Linux kernel internals (memory management, scheduler, builds), TCP/IP Networking stack, Infiniband/ OFED Architecture (RDS, RoCE V2, OCFS2), Filesystems/volumes
- Familiar with x86 systems, network switches from either Cisco, Arista, Juniper, Mellanox, L3 top of switch routing (OSPF, BGP), Mellanox HCAs (CX3, CX5 and newer) programmer's guide
- Experience working with Cloud infrastructure APIs, REST API model, and developing REST APIs
- Demonstrate experience with Java, as well as strong experience with scripting languages such as Python, Bash.
- Strong troubleshooting and performance tuning skills, OPS or system administration
Knowledge on any of the following areas is a plus:
- Understand latest features of Exadata / Engineered systems, Oracle Grid Infrastructure and Database is a plus
- Familiar with Openstack and/or other Cloud infrastructure products is a plus
- Understanding and experience of Cloud Networking & Security (like Application Firewall, IPSec VPN, NAT, IPv6, websockets, TLS, certificates, tunneling protocols) architectures
- Strong understanding of I/O characteristics and storage systems
- A background in multi-tenant service offering and concepts on Service Level Availability a strong plus
- PCI, HIPAA audits, UK gov, security vulnerabilities remediation
**Detailed Description and Job Requirements**
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting,
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, Jalisco, México Finastra A tiempo completoResponsibilitiesWhat will you contribute?As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You'll be treating operations as a software engineering problem...
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, México Finastra A tiempo completoResponsibilitiesWhat will you contribute?As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software engineering...
-
Site Reliability Engineer
hace 5 días
Guadalajara, Jalisco, México myGwork - LGBTQ+ Business Community A tiempo completoThis inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. ResponsibilitiesWhat will you contribute?As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture...
-
Site Reliability Engineer
hace 6 días
Guadalajara, México myGwork - LGBTQ+ Business Community A tiempo completoThis inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. ResponsibilitiesWhat will you contribute?As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture...
-
Site Reliability Engineer
hace 4 semanas
Guadalajara, Jal., México Finastra A tiempo completoResponsibilities What will you contribute? As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software engineering...
-
Site Reliability Engineer
hace 4 semanas
Guadalajara, México Finastra A tiempo completoYour deliverables as a Site Reliability Engineer will include, but are not limited to, the following: - Work with containers and container orchestration systems such as Kubernetes - Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable - Collaborate with other engineers to implement operational...
-
Site Reliability Engineer Iii/network
hace 5 horas
Guadalajara, México f5 A tiempo completoEverything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Position Summary Software engineering is a core discipline at F5 for many roles. As a software engineer specializing in site reliability,...
-
Site Reliability Engineer
hace 4 semanas
Guadalajara, México Finastra A tiempo completoJob DescriptionYour deliverables as a Site Reliability Engineer will include, but are not limited to, the following:Work with containers and container orchestration systems such as Kubernetes Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable Identify and troubleshoot any availability and...
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, México Grid Dynamics A tiempo completoWe are seeking a strong Site Reliability Engineer with good technical expertise. Our client is the world's largest American retail chain sells supplying tools, construction products, and services with over 90 distribution centers throughout the United States to serve over 2,000 stores. As of 2020, this company is ranked in the Fortune 500 rankings of the...
-
Site Reliability Engineering Team Lead
hace 2 horas
Guadalajara, México myGwork - LGBTQ+ Business Community A tiempo completoThis inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. ResponsibilitiesWhat will you contribute?As a Site Reliability Engineer Lead your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless...
-
Site Reliability Engineer
hace 4 semanas
Guadalajara, México Azka IT Consulting A tiempo completoAZKA IT is a Mexican company that seeks and connects the best IT talent with Latin American and United States companies.We are looking for your talent as Site Reliability Engineer.Requirements:Mediator b/w development and application team, and good knowledge on automation Unix, Linux, Ubuntu, or Windows, Oracle, MYSQL, NOSQL Solutions.Key...
-
Site Reliability Engineering Team Lead
hace 2 semanas
Guadalajara, Jal., México Finastra A tiempo completoResponsibilities What will you contribute? As a Site Reliability Engineer Lead your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be leading an operations team whose aim is...
-
Site Reliability Engineer Iii
hace 6 días
Guadalajara, México f5 A tiempo completoEverything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Why do you want to join our team? F5 has innovated a consistent, cloud-native environment that can be deployed across multiple public...
-
Front End Developer
hace 4 semanas
Guadalajara, México Sonata Software Ltd A tiempo completo**Role: Sr Front End Developer** **Location: Guadalajara and Mexico City (Remote)** **Type of Hire: Full Time** **Sr Front End Developer** Secondary function is to act as Product Reliability engineer (PRE) for fulfillment teams. Supporting UI defects in production. **Mandatory Skills** - Sr Front End Developer with 10+ years of experience - Strong...
-
Sr Python
hace 4 semanas
Guadalajara, México Sonata Software Ltd A tiempo completo**Role: Sr Python & AWS Developer** **Location: Guadalajara & Mexico City (Remote)** **Type of Hire: Full Time** **Sr Python & AWS Backend developer (PRE).** **Product reliability engineer is responsible for** - Identifying, diagnosing, and resolving production issues - Conduct RCA to identify underlying issues. - Implement proactive solutions to prevent...
-
Site Reliability Engineer
hace 3 semanas
Guadalajara, México Daxx A tiempo completoSummary The Site Reliability Engineer (SRE) position requires a mix of strategic engineering and design along with hands-on, technical work. An ideal candidate will have experience building and managing infrastructure in AWS, and have coding skills to automate tasks and build tools to help with our service operations. The SRE will configure, tune,...
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, México Grid Dynamics A tiempo completoSummaryThe Site Reliability Engineer (SRE) position requires a mix of strategic engineering and design along with hands-on, technical work. An ideal candidate will have experience building and managing infrastructure in AWS, and have coding skills to automate tasks and build tools to help with our service operations. The SRE will configure, tune, and...
-
Senior Salesforce Developer
hace 6 días
Guadalajara, México Talent Accelerator A tiempo completoWe are looking for a Software Developer with at least 6 years of experience to join our team in Guadalajara. If you are passionate about software development and have experience in high-demand and transactional traffic environments, this could be the opportunity you've been waiting for! This is an on-site position at Guadalajara,...
-
Senior Pega Developer
hace 6 días
Guadalajara, México Talent Accelerator A tiempo completoWe are in search of a developer with experience in Pega technology to join our team in Guadalajara! If you have at least 6 years of experience in information technology, technology consulting, and enterprise solution architecture, you could be the person we are looking for! **This is an on-site position in Guadalajara, Jalisco.** **Responsibilities**: -...
-
Site Reliability Engineer II
hace 4 semanas
Guadalajara, México F5 A tiempo completoAt F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers...