Cloud Ops Engineer
3 weeks ago
POSITION SUMMARY
The Cloud Ops Engineer will support Amazon Web Services (AWS) and Linux/Windows environments. The Cloud Ops Engineer will be responsible for all aspects of the production lifecycle, maintenance, and administration, including but not limited to: infrastructure automation, continuous integration and deployment, product release and support, running a scalable production environment for hosting the ARCOS platform, maintaining application/database availability, and ensuring continuous 24x7 production uptime of our services. The Cloud Ops Engineer needs to be familiar with AWS, Apache, Tomcat, PostgreSQL, Oracle, Ansible, Jenkins, Jira, Confluence and SaaS operations.

ESSENTIAL JOB FUNCTIONS
This is intended as an outline of the essential functions of the position. Actual metrics that measure job performance may be set forth in separate performance management documentation.
· Design, develop and maintain scalable AWS solutions and infrastructure, including but not limited to: EC2, RDS, S3, DynamoDB, ElastiCache, and Route 53.
· Develop tooling and processes to automate the deployment of SaaS-based applications and their underlying operating systems and infrastructure.
· Perform PostgreSQL and Oracle database administration, including maintenance, troubleshooting, tuning, optimization, installation, upgrades, backup/recovery, and data migration.
· Partner with Engineering, Development, Quality Assurance, Professional Services, and Technical Support to ensure the success of the assigned product offerings and schedules.
· Engage in Agile team practices such as daily standups, backlog refinement, release planning and sprint planning.
· Coordinate configuration changes, installs, and upgrades with appropriate development teams and product owners while following company change control procedures.
· Participate in capacity planning to determine future infrastructure needs.
· Participate in 24x7 on-call responsibilities, maintaining the availability and performance of all customer-facing production services.
· Triage and participate in the resolution of complex problems, including network connectivity issues, that span multiple tiers of application/infrastructure.
· Implement monitoring and reporting capabilities to assist engineering in rapidly identifying issues.
· Actively monitor supported systems and respond promptly to security or usability concerns.
· Review application logs and analyze events using cloud-native services (e.g. CloudWatch, CloudTrail) or third-party SIEM tools (e.g. Splunk).
· Upgrade systems and processes as required for enhanced functionality and security compliance.
· Maintain product service level agreements.
· Accurately document all processes and procedures for routine and non-routine tasks.
· All other duties and responsibilities as assigned.

QUALIFICATIONS, REQUIREMENTS AND SKILLS
· Bachelor’s degree in Computer Science or related field, or equivalent work experience.
· 4-5 years of system administration experience, ideally in global management and operations of highly trafficked production applications. Experience working in a 24x7 SaaS environment is preferred.
· 4-5 years of experience designing solutions for and managing AWS services, including but not limited to: EC2, RDS, S3, DynamoDB, ElastiCache, WAF/Shield, Route 53, IAM and Directory Service.
· Experience with Linux and Windows system administration, automation and performance tuning.
· Experience with configuration management and infrastructure as code tools such as Ansible and Terraform.
· Experience with Apache, Nginx, Tomcat, Node.js/PM2.
· Experience with scripting languages, including Bash, Python and PowerShell.
· Knowledge of CI/CD technologies and best practices.
· Knowledge of PostgreSQL, Oracle, Docker, Jira, Confluence.
· Advanced knowledge of system vulnerability management and security best practices.
· Solid understanding of networking concepts and troubleshooting.
· Proven ability to work effectively with highly reliable, highly available mission-critical technologies, with demonstrated attention to detail and results while meeting deadlines.
· Ability to operate deployment automation, SaaS operations, internal and external SaaS infrastructure, security and cost management.
· Solid understanding of technical issues and opportunities related to modern cloud infrastructure and operations.
· Action-oriented, decisive approach to work, with the willingness to take a hands-on role when needed to ensure deliverables are met on time.
· High-energy, motivated self-starter able to take direction and manage tasks with minimal supervision within an energized, collaborative, and entrepreneurial environment.
· Excellent written and verbal communication skills.

Production Support/On-Call Duties: As a key member of our engineering team, you will address escalated production issues from customer support. Your responsibilities will include:
· Participating in a rotational on-call schedule to handle significant production issues.
· Rapidly diagnosing and resolving technical challenges that arise in production.
· Collaborating with customer support and engineering teams for seamless issue resolution.
· Maintaining clear communication and documentation during and after incidents.
· Leveraging these experiences to contribute to continuous process improvement.

Compensatory Time for On-Call Work: We value work-life balance and recognize the extra effort required during on-call rotations. For hours spent actively working on-call, compensatory time off is provided, unless the law requires otherwise. This ensures your commitment is appropriately acknowledged. Please coordinate with your manager regarding the approval and scheduling of compensatory time, to align with team needs and workload. Your contribution is essential in maintaining the smooth operation of our systems and in upholding high standards of customer satisfaction.
-
ML Ops Engineer: Cloud Pipelines
2 days ago
Mexico City | NTT DATA North America | Full-time
A leading technology services company is seeking a skilled ML Ops Engineer in Mexico City. The role involves building ML pipelines, developing APIs, and leveraging cloud technologies like AWS. Ideal candidates will have over 5 years of experience with machine learning tools and strong programming skills. This full-time position offers opportunities for...
-
Senior Cloud Sales Engineer
5 days ago
Mexico City | Matilda Cloud | Full-time
Senior Cloud Sales Engineer – Cloud Solutions Specialist (LATAM). Remote, full-time. About Matilda Cloud: Matilda Cloud is a pioneering force in cloud computing solutions, recognized globally for our innovative technology and premium service. Our mission is to empower businesses through transformative cloud solutions, enabling them to achieve unprecedented...
-
Ops Engineer: Automate Ops
4 weeks ago
Mexico City | Fintual | Full-time
A financial technology company is seeking an Ops Engineer to lead projects that improve its operations in Mexico. The role requires 3-4 years of experience, automation skills, and familiarity with data tools such as SQL. We offer a competitive salary of 50,000 MXN gross, 7 weeks of vacation, major medical insurance, and a stock options plan. ...
-
Senior Cloud Reliability Engineer
2 weeks ago
Mexico | Oracle | Full-time
A leading cloud solutions provider in Mexico is seeking a DevOps Engineer to enhance service reliability in Oracle Analytics. Responsibilities include troubleshooting, monitoring service metrics, and supporting 24/7 operations. Ideal candidates have a BS/MS in Computer Science, solid understanding of cloud networking, and previous experience in agile...
-
Senior CloudOps Engineer — AWS SaaS, 24/7 Uptime
2 weeks ago
Mexico | The Resume Database | Full-time
A leading SaaS solution provider in Mexico is seeking a Cloud Ops Engineer to support AWS and Linux/Windows environments. Responsibilities include managing infrastructure automation, continuous integration, and ensuring 24/7 uptime. The ideal candidate will have experience with AWS, PostgreSQL, and automation tools, along with strong problem-solving skills....
-
Retail Cloud Applications Engineer
2 weeks ago
Mexico | Oracle | Full-time
A leading cloud services company is seeking a Retail Application Engineer in Mexico. This role involves installing, configuring, and supporting retail applications, requiring expertise in cloud platforms and programming skills. The ideal candidate should have 3-5 years of experience in the IT industry, with strong abilities in troubleshooting and performance...
-
Site Reliability Lead: Cloud, Apps
2 weeks ago
Mexico | FICO | Full-time
A global analytics software company in Mexico is seeking a Site Reliability Engineer to support operational stability for Cloud and hosted applications. Responsibilities include managing cloud environments, incident resolution, and applying strong analytical skills. Ideal candidates will have experience with RedHat Linux, AWS, and containerization...
-
Cloud Engineer
6 days ago
Mexico City | Nasdaq | Full-time
Cloud Operations Engineer. The Role: We’re looking for a Cloud Operations Engineer to join our team, reporting to the Sr. Director, Platform Operations. In this role, you’ll help shape how we deliver reliable, scalable, and secure cloud-based solutions that support global...
-
Ops Engineer
3 weeks ago
Mexico City | Fintual | Full-time
An investment startup is seeking an Ops Engineer in Mexico. The role involves leading projects, automating processes, and using data tools. Project management experience and analytical skills are required. The offer includes a competitive salary starting at 50,000 MXN gross and benefits such as 7 weeks of vacation, medical insurance, and a stock...