Principal Service Reliability Applications Developer
hace 1 semana
Own and scale mission-critical ERP/SaaS services while building intelligent, cloud-native capabilities. This role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments.
ResponsibilitiesKey Responsibilities
- End-to-end service ownership: design for telemetry, security, resiliency, scalability, and performance; lead sizing/architecture; drive service health reviews and process simplification.
- Incident management and prevention: lead postmortems/RCAs, coordinate fixes, define repair items, and implement data-driven prevention and continuous improvement.
- AI/ML and GenAI delivery: design and integrate solutions with LLMs, RAG, agentic workflows, and conversational AI; build low-latency model serving and retraining pipelines.
- Application engineering: develop performant microservices for distributed, containerized, cloud-native systems.
- Automation: eliminate toil by automating operational workflows, recovery procedures, code delivery, and configuration management; build internal tools and reusable scripts/services to accelerate delivery and reduce errors.
- Observability: define and implement monitoring, logging, alerting, and tracing strategies; establish SLOs/SLIs/error budgets; improve diagnostics and performance visibility for rapid triage.
- Cross-functional collaboration: partner with product, operations, and data teams to translate requirements into secure, scalable solutions; communicate effectively with technical and non-technical stakeholders.
Minimum Qualifications
- BS/MS in Computer Science or related field; 10+ years of software engineering in cloud environments.
- Strong in distributed systems/microservices using java / python; SQL/data modeling; python for AI/automation.
- SRE/DevOps expertise: systems and networking fundamentals, application security, observability, performance analysis, and incident response.
- Proven SDLC excellence: code quality, reviews, version control, CI/CD, testing, and release engineering.
- Excellent written and verbal communication; English fluency.
Preferred/Technical Skills
- AI/ML/GenAI: experience with foundational models, RAG, agentic architectures; model deployment, optimization, monitoring, and retraining.
- Cloud and containers: experience with containerization, orchestration, and resilient, fault-tolerant microservices.
- Observability: hands-on experience designing dashboards, alerts, traces, logs, and metrics; defining SLOs/SLIs and error budgets; on-call readiness and runbook quality.
- Operations: performance tuning across java / python and SQL for large-scale enterprise applications; strong Linux/Unix expertise; capacity planning and reliability reviews.
- Automation and scripting: proficiency in scripting to automate operational workflows, build tooling, and CI/CD tasks (e.g., shell scripting, python, configuration-as-code, task runners).
- Familiarity with enterprise ERP applications and standard DevOps tooling and practices.
QualificationsCareer Level - IC4
-
Principal Service Reliability Applications Developer
hace 2 semanas
Zapopan, Jalisco, México Oracle A tiempo completoJob DescriptionOwn and scale mission-critical ERP/SaaS services while building intelligent, cloud-native capabilities. This role requires a SRE mindset combined with AI/ML expertise and strong application engineering skills across public and private cloud environments.ResponsibilitiesKey ResponsibilitiesEnd-to-end service ownership: design for telemetry,...
-
Principal Site Reliability Developer
hace 2 días
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionWork with an elite team to provide Oracle Database Administration support for customer production systems in the Oracle Cloud, with the opportunity to work on the latest Oracle database releases and features as part of the cloud first strategy. Provide DBA operational support with a high degree of customer service, technical expertise, and...
-
Principal Site Reliability Engineer
hace 2 días
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionAs a senior member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for...
-
Principal Site Reliability Developer
hace 2 días
Zapopan, Jalisco, México Oracle A tiempo completoWork with an elite team to provide Oracle Database Administration support for customer production systems in the Oracle Cloud, with the opportunity to work on the latest Oracle database releases and features as part of the cloud first strategy. Provide DBA operational support with a high degree of customer service, technical expertise, and timeliness. ...
-
Applications Developer 2
hace 1 semana
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionAnalyze, design develop, troubleshoot and debug software programs for commercial or end user applications. Writes code, completes programming and performs testing and debugging of applications.ResponsibilitiesAs a member of the software engineering division, you will perform detailed design based on provided high level design specifications....
-
Site Reliability Developer 3
hace 2 días
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionWe are looking for a skilled and motivated Cloud Region Build Site Reliability Engineer (SRE) to join our Oracle Cloud Infrastructure Region Build team. In this role, you will be responsible for building, deploying, and maintaining compute cloud infrastructure services across multiple regions to ensure high availability, scalability, and...
-
Principal Consultant – Senior Developer
hace 3 días
Zapopan, Jalisco, México Genpact A tiempo completoReady to build the future with AI? At Genpact, we don't just keep up with technology—we set the pace. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow...
-
Software Developer
hace 3 días
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionThis position requires the candidate to contribute in All areas of the Autonomous Database Dedicated Service.The candidate will mainly play these two roles: as a Development Engineer and as Operations Engineer.===========================Operations Engineer Description:===========================Work with an elite team to provide Oracle Database...
-
Exadata Database
hace 7 horas
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionOracle Exadata serves as the cornerstone of Oracle Cloud's infrastructure. We are working on exciting projects that involve developing and delivering next-generation capabilities, with cutting-edge infrastructure features designed to seamlessly integrate with leading hyperscaler cloud providers. Our multi-cloud solutions offer customers enhanced...
-
Senior Java Developer
hace 1 semana
Zapopan, Jalisco, México Oracle A tiempo completoDescriptionAs a Developer of the software engineering division, you will apply your knowledge to architecture, design, prototype, troubleshooting and development of the various aspects of analytics platform. Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc.As a member of the software engineering...