SRE (Linux, Bash, Python, Jira, Cloud) / Guadalajara On-site

hace 1 semana


Guadalajara, Jalisco, México Oracle A tiempo completo

Job Description
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle Cloud product and services.

Design and develop designs, architectures, standards, and methods for large-scale distributed systems.

Facilitate service capacity planning and demand forecasting, software performance, analysis and system tuning. Requires 5+ years relevant experience.

Responsibilities
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.

Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Ownership of improvements upon on KPIs and SLOs for the entire service.

Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.

Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Use clear understanding of automation and orchestration principles.

Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations.

Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.

Provide on-call support, on a rotating basis to cover 12 hrs. and 7 days a week. Hybrid Work Environment, On Site and Home Office.

Technical Skills

  • Advanced Linux administration
  • Advanced Python: automation libraries
  • Advanced Bash/Shell Scripting
  • Networking knowledge base
  • Database knowledge
  • Solid understanding coding principals
  • Experience in maintaining and coding unit test
  • Comfortable working with Agile Methodologies
  • CI/CD Knowledge

Qualifications
Career Level - IC4

About Us
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation- or by calling in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.



  • Guadalajara, Jalisco, México Oracle A tiempo completo

    DescriptionSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle Cloud  product and services.Design and develop designs, architectures, standards, and methods for large-scale distributed...


  • Guadalajara, Jalisco, México Oracle A tiempo completo

    DescriptionThis role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments.ResponsibilitiesKey Responsibilities- End-to-end service ownership: design for telemetry, security, resiliency, scalability, and performance; lead sizing/architecture; drive service health reviews...


  • Guadalajara, Jalisco, México Oracle A tiempo completo

    DescriptionSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems....


  • Guadalajara, Jalisco, México Oracle A tiempo completo

    DescriptionManage a team that designs, develops, troubleshoots and debugs software programs for databases, applications, tools, networks etc.ResponsibilitiesAs a manager, you will lead people and apply your knowledge of SRE to manage tasks associated with cloud operations, developing, debugging or designing software applications, operating systems and...


  • Guadalajara, Jalisco, México tbo A tiempo completo

    We are looking for a highly skilled Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and efficiency of our platforms and services. The ideal candidate will have extensive hands-on experience in Kubernetes, cloud platforms, infrastructure automation, and observability, while also bringing an analytical mindset and...

  • SRE Developer

    hace 7 días


    Guadalajara, Jalisco, México TouchTunes A tiempo completo

    SRE DeveloperLocation:GuadalajaraYour mission in the SRE team:As a Site Reliability Engineer (SRE) embedded in our mobile app development squads, you will work side-by-side with backend and mobile engineers to ensure new features and services are reliable, scalable, and maintainable from day one. You'll bring an operational mindset into the development...


  • Guadalajara, Jalisco, México Oracle A tiempo completo

    Job DescriptionAbout the role:We are building and expanding the next generation Platform as a Service (PaaS) cloud and the next generation cloud support experience to go with it. As our cloud service grows, we are expanding our team of energetic, customer-focused site reliability engineers (SREs). Our team performs an operational role in supporting Oracle's...

  • Cloud Engineer

    hace 5 días


    Guadalajara, Jalisco, México NTT DATA North America A tiempo completo

    Req ID:343454NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a Cloud Engineer - AWS IoT to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Key Responsibilities:Cloud Architecture &...

  • Site Reliability Engineer

    hace 2 semanas


    Guadalajara, Jalisco, México FICO A tiempo completo

    FICO (NYSE: FICO)is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today and fulfill your career potentialThe Opportunity"The Site Reliability Engineer is an overlay of software development and systems engineering. Your responsibility is a full-stack support role, managing...

  • Site Reliability Engineer

    hace 2 semanas


    Guadalajara, Jalisco, México FICO A tiempo completo

    FICO (NYSE: FICO) is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today and fulfill your career potentialThe Opportunity"The Site Reliability Engineering group is a global team responsible for providing 24x7 operational support of the company's Cloud, SaaS, ASP and hosted...