SRE (Linux, Bash, Python, Jira, Cloud) / Guadalajara On-site
hace 1 semana
Job Description
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle Cloud product and services.
Design and develop designs, architectures, standards, and methods for large-scale distributed systems.
Facilitate service capacity planning and demand forecasting, software performance, analysis and system tuning. Requires 5+ years relevant experience.
Responsibilities
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Ownership of improvements upon on KPIs and SLOs for the entire service.
Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Use clear understanding of automation and orchestration principles.
Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations.
Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.
Provide on-call support, on a rotating basis to cover 12 hrs. and 7 days a week. Hybrid Work Environment, On Site and Home Office.
Technical Skills
- Advanced Linux administration
- Advanced Python: automation libraries
- Advanced Bash/Shell Scripting
- Networking knowledge base
- Database knowledge
- Solid understanding coding principals
- Experience in maintaining and coding unit test
- Comfortable working with Agile Methodologies
- CI/CD Knowledge
Qualifications
Career Level - IC4
About Us
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation- or by calling in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
-
SRE (Linux, Bash, Python, Jira, Cloud) / Guadalajara On-site
hace 2 semanas
Guadalajara, Jalisco, México Oracle A tiempo completoDescriptionSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle Cloud product and services.Design and develop designs, architectures, standards, and methods for large-scale distributed...
-
Principal SRE – Cloud Automation
hace 2 semanas
Guadalajara, Jalisco, México Oracle A tiempo completoDescriptionThis role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments.ResponsibilitiesKey Responsibilities- End-to-end service ownership: design for telemetry, security, resiliency, scalability, and performance; lead sizing/architecture; drive service health reviews...
-
Site Reliability Developer 3
hace 2 semanas
Guadalajara, Jalisco, México Oracle A tiempo completoDescriptionSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems....
-
Manager, ERP Cloud Operations
hace 2 semanas
Guadalajara, Jalisco, México Oracle A tiempo completoDescriptionManage a team that designs, develops, troubleshoots and debugs software programs for databases, applications, tools, networks etc.ResponsibilitiesAs a manager, you will lead people and apply your knowledge of SRE to manage tasks associated with cloud operations, developing, debugging or designing software applications, operating systems and...
-
Site Reliability Engineer
hace 7 días
Guadalajara, Jalisco, México tbo A tiempo completoWe are looking for a highly skilled Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and efficiency of our platforms and services. The ideal candidate will have extensive hands-on experience in Kubernetes, cloud platforms, infrastructure automation, and observability, while also bringing an analytical mindset and...
-
SRE Developer
hace 7 días
Guadalajara, Jalisco, México TouchTunes A tiempo completoSRE DeveloperLocation:GuadalajaraYour mission in the SRE team:As a Site Reliability Engineer (SRE) embedded in our mobile app development squads, you will work side-by-side with backend and mobile engineers to ensure new features and services are reliable, scalable, and maintainable from day one. You'll bring an operational mindset into the development...
-
Site Reliability Engineer
hace 1 semana
Guadalajara, Jalisco, México Oracle A tiempo completoJob DescriptionAbout the role:We are building and expanding the next generation Platform as a Service (PaaS) cloud and the next generation cloud support experience to go with it. As our cloud service grows, we are expanding our team of energetic, customer-focused site reliability engineers (SREs). Our team performs an operational role in supporting Oracle's...
-
Cloud Engineer
hace 5 días
Guadalajara, Jalisco, México NTT DATA North America A tiempo completoReq ID:343454NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a Cloud Engineer - AWS IoT to join our team in GDL, Jalisco (MX-JAL), Mexico (MX).Key Responsibilities:Cloud Architecture &...
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, Jalisco, México FICO A tiempo completoFICO (NYSE: FICO)is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today and fulfill your career potentialThe Opportunity"The Site Reliability Engineer is an overlay of software development and systems engineering. Your responsibility is a full-stack support role, managing...
-
Site Reliability Engineer
hace 2 semanas
Guadalajara, Jalisco, México FICO A tiempo completoFICO (NYSE: FICO) is a leading global analytics software company, helping businesses in 100+ countries make better decisions. Join our world-class team today and fulfill your career potentialThe Opportunity"The Site Reliability Engineering group is a global team responsible for providing 24x7 operational support of the company's Cloud, SaaS, ASP and hosted...