DevOps & Cloud Architect Engineer

hace 1 semana


Ciudad de México, Ciudad de México Hyphametrics A tiempo completo

HyphaMetrics is an AI-powered media intelligence company redefining how the world understands audience behavior. Our technology unifies viewing across streaming, linear TV, social video, and gaming to deliver the industry's first person-level, cross-platform metric. We partner with leading networks, brands, and measurement platforms to unlock transparent, privacy-first insights that fuel the future of media.

We're a team of builders, scientists, and innovators who believe that data can - and should - tell a more human story. If you thrive in fast-moving environments, crave impact, and love shaping what's next, you'll feel right at home at HyphaMetrics.

Position Overview

HyphaMetrics is seeking a highly skilled DevOps & Cloud Architect Engineer. This role is critical to maintaining and enhancing our cloud infrastructure and CI/CD automation for current and future software development initiatives.

The ideal candidate will be responsible for ensuring 99.9% uptime of our online services to meet Service Level Agreements (SLA), implementing continuous improvements with each software release, and managing both on-premises and cloud-based infrastructure across GCP, AWS, and other platforms.

Reporting directly to our Tech Lead, this is a senior-level technical position requiring a minimum of 5 years of experience, strong expertise in DevOps practices, cloud architecture, and infrastructure automation. The role offers flexibility in work modality and the opportunity to work with cutting-edge technologies in an AI-powered media intelligence company.

Duties & Responsibilities
Core Responsibilities
  • Support the company's efforts in developing and refining the corporate vision
  • Work collaboratively with teams to develop and apply strategies that create long-term value in the company's evolution, including both internal and external efforts
  • Create effective alignment by working as a team to successfully resolve challenges across the company using empathetic, problem-solving, practical, and future-focused tactics
  • Inspire current and future HyphaMetrics employees by upholding and exercising the company's core values to help maintain a customer-first focused organization

  • Infrastructure Automation: Implement and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or Ansible

  • CI/CD Pipeline Management: Design and maintain continuous integration and continuous deployment pipelines to automate release and deployment processes using Jenkins or GitHub Actions
  • Cloud Infrastructure Management: Manage and optimize cloud infrastructure on platforms including AWS, Azure, and Google Cloud, ensuring high availability, security, and cost optimization
  • Monitoring and Logging: Implement robust monitoring and logging systems using tools such as Prometheus, Grafana, ELK stack, or Datadog to ensure high performance and rapid issue resolution
  • Containerization and Orchestration: Utilize Docker for containerization and Kubernetes for orchestration, managing both on-premises and cloud environments
  • Security Implementation: Apply security best practices in infrastructure, ensuring systems are secure, compliant, and resilient against threats
  • FinOps Management: Collaborate with finance and engineering teams to establish financial accountability for cloud usage, implement cost management practices, and provide forecasting and budgeting insights
  • System Uptime Management: Maintain 99.9% uptime of online services to meet Service Level Agreements (SLA)
  • Disaster Recovery: Implement weekly disaster recovery backups for software and databases, stored securely both locally and online
  • Security & Threat Prevention: Prevent and mitigate online attacks through proactive security measures
Additional Responsibilities (Cloud Architecture)

Note: These responsibilities are secondary in priority to core DevOps duties.

  • Architecture Design: Lead the design and architecture of scalable, maintainable, and high-performance cloud systems with ownership of key technical decisions
  • Performance Optimization: Collaborate with teams to optimize cloud resource usage for performance and cost efficiency
  • High Availability Design: Design and implement effective disaster recovery strategies and ensure high availability of services in cloud environments
  • Technology Evaluation: Assess and recommend emerging cloud technologies and tools that align with business goals
  • Cross-functional Collaboration: Work closely with product management, engineering, and operations teams to ensure cloud solutions meet business requirements and project timelines
Qualifications
Required Qualifications
  • Experience: Minimum 5 years of professional experience in DevOps, cloud infrastructure, or related roles
  • English Proficiency: 90% advanced level (reading, writing, speaking)
  • Level: Senior / Expert level professional
Technical Skills

1. Linux System Administration

  • File systems: permissions (chmod, chown), disk partitions, and volume mounting
  • Process management: identifying zombie processes, killing hung applications, managing background services (systemd)
  • SSH & Access: generating SSH keys, managing authorized_keys, secure tunneling to remote servers

2. Networking Fundamentals

  • Protocols: HTTP/HTTPS (status codes), TCP/IP, and DNS
  • Cloud Networking: VPCs, Subnets, Route Tables, and NAT Gateways
  • Firewalls: configuring Security Groups or iptables to allow traffic on specific ports

3. Scripting & Automation

  • Bash/Shell: writing scripts with loops, variables, and error handling
  • Python/Go: writing complex scripts that interact with APIs
  • API Interaction: RESTful APIs, JSON data formats, and authentication using tokens

4. Version Control

  • Branching models: Feature Branching vs Trunk-Based Development
  • Pull Requests: code review and merge conflict resolution
  • Tagging/Releases: semantic versioning and release tagging for deployment

5. Infrastructure as Code (IaC)

  • Declarative vs Imperative: understanding the difference between scripting vs desired state configuration
  • State Files: understanding state management and the risks of manual modification

6. Security (DevSecOps)

  • IAM: understanding Roles, Policies, and the Principle of Least Privilege
  • Secrets Management: never committing passwords or API keys to Git, injecting them as environment variables at runtime

7. Troubleshooting & Debugging

  • Log Analysis: ability to grep through massive log files to find root cause
  • Resource Analysis: identifying performance bottlenecks (CPU, RAM, Disk I/O, Network latency)
Technology Stack

From Day 1:

  • GCP GKE (Google Kubernetes Engine)
  • GitHub Repositories
  • PAS (Platform Application Services)
  • AWS (Amazon Web Services)
  • MongoDB

Core Tools & Technologies:

  • Foundation: Linux (Ubuntu/CentOS), Terminal/Bash, Git
  • Build & Deploy: Docker, Kubernetes (K8s), Python
  • CI/CD Automation: Jenkins, GitHub Actions
  • Infrastructure: GCP, AWS, Terraform, Ansible
  • Monitoring: Prometheus, Grafana, ELK Stack
Core Competencies
  • Technical Excellence: Deep expertise in DevOps practices, cloud infrastructure, and automation with ability to make key technical decisions independently
  • Problem-Solving: Strong troubleshooting and debugging skills with ability to identify root causes quickly and implement effective solutions
  • Automation Focus: Passion for automating processes and optimizing workflows to improve efficiency and reduce manual intervention
  • Security Mindset: Commitment to security best practices and compliance with ability to implement secure infrastructure and prevent threats
  • Collaboration: Effective team player who can work collaboratively with cross-functional teams including development, operations, and business stakeholders
  • Performance Under Pressure: Ability to work effectively under pressure, especially during critical incidents and system outages
  • Proactive Approach: Self-starter who identifies and addresses potential issues before they become problems
  • Continuous Learning: Commitment to staying current with emerging technologies and industry best practices
  • Communication: Strong verbal and written communication skills in both English (90%) and technical documentation
  • Adaptability: Flexibility to work in different modalities (remote/hybrid/on-site) and adjust to changing priorities and requirements
Preferred Qualifications
  • Experience with serverless architectures and event-driven systems
  • Experience with microservices architecture
  • Certifications in AWS, Azure, or GCP
  • Familiarity with Agile methodologies (Scrum, Kanban)
Additional Information
Work Environment
  • Work Modality: Flexible - remote, hybrid, or on-site based on team needs
  • Schedule Flexibility: Occasional flexibility required for off-hours software releases to avoid impact on team and core metrics during business hours
  • Methodology: Scrum and Kanban agile methodologies
  • Daily Tools: Jira, Slack, Email, WhatsApp
  • Equipment: we provide you with a MacBook Pro

  • Cloud Engineer

    hace 18 horas


    Ciudad de México, Ciudad de México DFX5 A tiempo completo

    We're Hiring: Cloud Engineer DevOps (AWS Specialist) – Join DFX5 DFX5 is on a mission to revolutionize customer experience through cutting-edge Generative AI, cloud-native solutions, and advanced analytics. As an AWS Advanced Partner and leader in AI-driven transformation, we deliver impactful solutions across industries including FSI, retail, insurance,...


  • Ciudad de México, Ciudad de México HyphaMetrics A tiempo completo

    HyphaMetrics is an AI-powered media intelligence company redefining how the world understands audience behavior. Our technology unifies viewing across streaming, linear TV, social video, and gaming to deliver the industry's first person-level, cross-platform metric. We partner with leading networks, brands, and measurement platforms to unlock transparent,...

  • L3 Senior Cloud Engineer

    hace 1 semana


    Ciudad de México, Ciudad de México Connectingology A tiempo completo

    Buscamos un L3 Senior Cloud Engineer con sólida experiencia en entornos cloud empresariales y operación crítica, que participe activamente en la ejecución, mantenimiento y evolución de nuestra infraestructura multicloud.Este rol trabaja de manera transversal con los equipos de Infraestructura y Aplicaciones , y actúa como nivel de escalamiento L3 / SME...

  • Senior DevOps Engineer

    hace 16 horas


    Ciudad de México, Ciudad de México PSL Group A tiempo completo

    Senior DevOps EngineerOur PurposeP\S\L Group is a global organization dedicated to putting information at the service of medicine. The companies and people of the P\S\L Group aim to improve medical care by serving those who need it, those who provide it and those who seek to improve it.Our primary purpose is to help clients increase the effectiveness of...

  • DevOps Engineer#

    hace 2 días


    Ciudad de México, Ciudad de México WTW A tiempo completo

    DescriptionWillis Towers Watson is looking for an experienced DevOps Engineer to join their team. This full-time role involves automating build processes, infrastructure, and software configuration management. The engineer will develop and maintain continuous integration platforms across multiple products and support their operations. The position requires...

  • Cloud Architect

    hace 5 días


    Ciudad de México, Ciudad de México Huawei Latinoamérica A tiempo completo

    Company DescriptionHuawei is a global leader in Information and Communication Technology (ICT) solutions. Through customer-focused innovation and strong partnerships, Huawei has established end-to-end strengths across carrier networks, enterprises, consumer markets, and cloud computing. Operating in over 170 countries and regions, Huawei serves more than...


  • Ciudad de México, Ciudad de México Global Applications Solution A tiempo completo

    Position: Lead/Principal Data Engineer/Data ArchitectClient Domain: Banking InstitutionWork StructureInitial Phase: Mandatory 3 weeks on-site in Mexico City (Santa Fe) for essential client immersion and team onboarding.Ongoing: Following the initial phase, the role transitions to fully remote .Note on Expenses: The client will not cover travel,...


  • Ciudad de México, Ciudad de México Hovland Barnes A tiempo completo

    Job DescriptionResponsable de mantener y supervisar la infraestructura y servicios basados en la nube de la empresa, garantizando la disponibilidad, seguridad, rendimiento y eficiencia de los sistemas cloud, asegurando que los servicios se mantengan operativos y cumpliendo con los estándares de calidad establecidos.ResponsabilidadesDesplegar, configurar y...


  • Ciudad de México, Ciudad de México ION A tiempo completo

    Lab49 is an award-winning specialist consultancy that creates bespoke technology in partnership with the most important companies in finance.We were founded in 2002 to bring Silicon Valley solutions to Wall Street's door. Since then, we have worked on successive waves of technological change, including distributed computing, high-speed automation, enterprise...

  • Cloud Architect

    hace 1 semana


    Ciudad de México, Ciudad de México Caylent A tiempo completo

    Caylent is a cloud native services company that helps organizations bring the best out of their people and technology using Amazon Web Services (AWS). We provide a full-range of AWS services including workload migrations and modernization, cloud native application development, DevOps, data engineering, security and compliance, and everything in between.At...