Site Reliability Developer

2 weeks ago


Guadalajara, México Oracle Full-time

Site Reliability Developer (Cloud DevOps)-22000EJV

**Applicants are required to read, write, and speak the following languages**: English

**Preferred Qualifications**

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.

A BS or MS in Computer Science, or equivalent. Identifies solutions using knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, and technology security and compliance. Identifies solutions using an understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting the technical architecture of complex and highly scalable products. A minimum of 5 years' experience running large-scale customer-facing web services.

Be comfortable with mission critical production issues and manage customer anxiety appropriately. We would like to see some combination of the following skills:

- 5+ years of software design or development experience, or a DevOps role, with distributed, highly scalable, maximum-availability (HA, brownout), multi-node environments (partitioning, isolation with VLAN, pkeys, QinQ, VRF, EVPN)
- On-call
- Knowledge of server virtualization technologies: Xen, KVM, Linux containers, Docker, including vNUMA, domain groups, SR-IOV
- Knowledge of Linux kernel internals (memory management, scheduler, builds), TCP/IP networking stack, InfiniBand/OFED architecture (RDS, RoCE v2, OCFS2), filesystems/volumes
- Familiar with x86 systems, network switches from Cisco, Arista, Juniper, or Mellanox, L3 top of switch routing (OSPF, BGP), Mellanox HCAs (CX3, CX5 and newer) programmer's guide
- Experience working with Cloud infrastructure APIs, REST API model, and developing REST APIs
- Demonstrated experience with Java, as well as strong experience with scripting languages such as Python and Bash
- Strong troubleshooting and performance tuning skills, OPS or system administration

Knowledge of any of the following areas is a plus:

- Understanding of the latest features of Exadata / Engineered Systems, Oracle Grid Infrastructure, and Database
- Familiarity with OpenStack and/or other Cloud infrastructure products
- Understanding and experience of Cloud Networking & Security architectures (such as Application Firewall, IPsec VPN, NAT, IPv6, WebSockets, TLS, certificates, tunneling protocols)
- Strong understanding of I/O characteristics and storage systems
- A background in multi-tenant service offerings and concepts of Service Level Availability (a strong plus)
- PCI, HIPAA audits, UK gov, security vulnerability remediation



  • Site Reliability Engineer

    2 weeks ago


    Guadalajara, Jalisco, México Finastra Full-time

    Responsibilities What will you contribute? As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You'll be treating operations as a software engineering problem...

  • Site Reliability Engineer

    2 weeks ago


    Guadalajara, México Finastra Full-time

    Responsibilities What will you contribute? As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software engineering...


  • Guadalajara, Jalisco, México myGwork - LGBTQ+ Business Community Full-time

    This inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. Responsibilities What will you contribute? As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture...


  • Guadalajara, México myGwork - LGBTQ+ Business Community Full-time

    This inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. Responsibilities What will you contribute? As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture...

  • Site Reliability Engineer

    4 weeks ago


    Guadalajara, Jal., México Finastra Full-time

    Responsibilities What will you contribute? As a Site Reliability Engineer your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software engineering...

  • Site Reliability Engineer

    4 weeks ago


    Guadalajara, México Finastra Full-time

    Your deliverables as a Site Reliability Engineer will include, but are not limited to, the following: - Work with containers and container orchestration systems such as Kubernetes - Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable - Collaborate with other engineers to implement operational...


  • Guadalajara, México F5 Full-time

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Position Summary Software engineering is a core discipline at F5 for many roles. As a software engineer specializing in site reliability,...

  • Site Reliability Engineer

    4 weeks ago


    Guadalajara, México Finastra Full-time

    Job Description Your deliverables as a Site Reliability Engineer will include, but are not limited to, the following: - Work with containers and container orchestration systems such as Kubernetes - Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable - Identify and troubleshoot any availability and...

  • Site Reliability Engineer

    2 weeks ago


    Guadalajara, México Grid Dynamics Full-time

    We are seeking a strong Site Reliability Engineer with good technical expertise. Our client is the world's largest American retail chain supplying tools, construction products, and services, with over 90 distribution centers throughout the United States serving over 2,000 stores. As of 2020, this company is ranked in the Fortune 500 rankings of the...


  • Guadalajara, México myGwork - LGBTQ+ Business Community Full-time

    This inclusive employer is a member of myGwork – the largest global platform for the LGBTQ+ business community. Responsibilities What will you contribute? As a Site Reliability Engineer Lead your mission is to protect and advance the software & systems behind Finastra's Cloud hosted services running on Fusion Operate. Finastra believes in a blameless...

  • Site Reliability Engineer

    4 weeks ago


    Guadalajara, México Azka IT Consulting Full-time

    AZKA IT is a Mexican company that seeks and connects the best IT talent with Latin American and United States companies. We are looking for your talent as a Site Reliability Engineer. Requirements: Mediator between development and application teams, with good knowledge of automation, Unix, Linux, Ubuntu, or Windows, Oracle, MySQL, NoSQL solutions. Key...


  • Guadalajara, Jal., México Finastra Full-time

    Responsibilities What will you contribute? As a Site Reliability Engineer Lead your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on Fusion Operate. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be leading an operations team whose aim is...


  • Guadalajara, México F5 Full-time

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Why do you want to join our team? F5 has innovated a consistent, cloud-native environment that can be deployed across multiple public...

  • Front End Developer

    4 weeks ago


    Guadalajara, México Sonata Software Ltd Full-time

    **Role: Sr Front End Developer** **Location: Guadalajara and Mexico City (Remote)** **Type of Hire: Full Time** **Sr Front End Developer** Secondary function is to act as Product Reliability engineer (PRE) for fulfillment teams. Supporting UI defects in production. **Mandatory Skills** - Sr Front End Developer with 10+ years of experience - Strong...

  • Sr Python

    4 weeks ago


    Guadalajara, México Sonata Software Ltd Full-time

    **Role: Sr Python & AWS Developer** **Location: Guadalajara & Mexico City (Remote)** **Type of Hire: Full Time** **Sr Python & AWS Backend developer (PRE).** **Product reliability engineer is responsible for** - Identifying, diagnosing, and resolving production issues - Conduct RCA to identify underlying issues. - Implement proactive solutions to prevent...

  • Site Reliability Engineer

    3 weeks ago


    Guadalajara, México Daxx Full-time

    Summary The Site Reliability Engineer (SRE) position requires a mix of strategic engineering and design along with hands-on, technical work. An ideal candidate will have experience building and managing infrastructure in AWS, and have coding skills to automate tasks and build tools to help with our service operations. The SRE will configure, tune,...

  • Site Reliability Engineer

    2 weeks ago


    Guadalajara, México Grid Dynamics Full-time

    Summary The Site Reliability Engineer (SRE) position requires a mix of strategic engineering and design along with hands-on, technical work. An ideal candidate will have experience building and managing infrastructure in AWS, and have coding skills to automate tasks and build tools to help with our service operations. The SRE will configure, tune, and...


  • Guadalajara, México Talent Accelerator Full-time

    We are looking for a Software Developer with at least 6 years of experience to join our team in Guadalajara. If you are passionate about software development and have experience in high-demand and transactional traffic environments, this could be the opportunity you've been waiting for! This is an on-site position at Guadalajara,...

  • Senior Pega Developer

    6 days ago


    Guadalajara, México Talent Accelerator Full-time

    We are in search of a developer with experience in Pega technology to join our team in Guadalajara! If you have at least 6 years of experience in information technology, technology consulting, and enterprise solution architecture, you could be the person we are looking for! **This is an on-site position in Guadalajara, Jalisco.** **Responsibilities**: -...


  • Guadalajara, México F5 Full-time

    At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers...