Site Reliability Engineer With Docker, Aws, Ec2

hace 2 semanas


Xico, México Ampstek A tiempo completo

Title: Site Reliability Engineer Location: 100% Remote Job Type: Contract Skills: SRE, EKS, EC2, Docker, AWS The Senior SRE role is ultimately responsible for ensuring the reliability, availability, and performance of our technology and systems directly supporting our end customers and internal customers. They will work closely with the product development and platform engineering teams to build and maintain scalable systems and robust automation that supports the company's business goals. The ideal candidate will have a history of successfully implementing and using tools like Terraform, Packer, Splunk, SignalFx, and other observability / IAC tools supporting systems with around the clock availability requirements. In addition, the ideal candidate will possess sufficient software skills to properly scrutinize and troubleshoot applications supporting our customers. They should have a strong aptitude for learning new technologies, embracing and driving solutions to challenging projects and problems. This role requires a seasoned engineer with the ability to collaborate across multiple cross‑functional teams while exhibiting a rich set of problem‑solving skills, along with being self‑motivated and having a passion for quality. Responsibilities Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance. Proactively gather and analyze both metric and log data from systems and applications to perform anomaly detection, performance tuning, capacity planning and fault isolation. Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability, security and performance standards. Partner closely with other teams on enterprise standards / best practices. Identify options for problem resolution and initiate corrective actions. Mentor junior members, document and share solutions. Qualifications Minimum 4 years' experience in any combination of software engineering roles of some type: SRE, DevOps, applications, services, tools / automation, release, etc. Minimum 3 years' experience with SRE / DevOps practices and automation tooling. Experience with observability solutions tools like Splunk, Datadog, SignalFx, etc. Experience deploying, maintaining and supporting software applications / services in the AWS ecosystem. Proactive approach to identifying problems and solutions. Experience writing code with one or more interpreted languages such as: Python, PHP, Perl, Ruby, Linux Shell. Experience with Terraform or Cloud Formation scripting. Experience with configuration management tools like Ansible, Chef or Puppet. Experience with standard software development best practices and tools such as code repositories (Git preferred). Experience executing in an agile software development environment. Good understanding of pricing / cost models across AWS services, especially compute, storage, and database offerings. Must be able to multitask and work well with changing priorities in a fast paced, 24x7 environment. Must be highly collaborative and be able to work in a team environment consisting of both technical and business people. Excellent communication, problem solving and customer service skills. A strong ability to learn and adapt to new technologies. Education: Bachelor's degree in computer science, science, engineering or workforce equivalent technical certifications preferred. Thanks, Aatmesh #J-18808-Ljbffr



  • Xico, México Nearsure A tiempo completo

    A technology solutions firm in Mexico is seeking a Senior Site Reliability Engineer. The ideal candidate will have over 8 years of software development experience with significant expertise in AWS, Kubernetes, and Docker. Responsibilities include designing infrastructure, enhancing reliability, and mentoring teams. This position offers a competitive salary...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Gsb Solutions A tiempo completo

    Important IT company At the Latin American level, growth requires:**SRE /GITHUB with terraform****Job description**:**_Key Responsibilities_**- Develop and maintain CI/CD pipelines using GitHub Actions to streamline the software development lifecycle.- Design, deploy, and manage AWS infrastructure, ensuring high availability and security.- Collaborate with...


  • Xico, México Ampstek A tiempo completo

    A leading tech company is seeking a Site Reliability Engineer responsible for ensuring the reliability, availability, and performance of technology systems. The ideal candidate will have a minimum of 4 years in software engineering, with strong expertise in SRE and DevOps practices, particularly in the AWS ecosystem. Responsibilities include developing...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services Location: Open to LATAM About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission-critical PostgreSQL environments.This role is ideal for an SRE who wants to grow into database...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services. Location: Open to LATAM. About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission‑critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Quantum World Technologies Inc. A tiempo completo

    Role: Site Reliability Engineer (SRE) – Database Services. Location: Open to LATAM. About the Role We are looking for a Site Reliability Engineer (SRE) to join the Database Engineering team and contribute to the reliability, resilience, and automation of mission‑critical PostgreSQL environments. This role is ideal for an SRE who wants to grow into...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Redholt A tiempo completo

    Hiring: On-Site Support Engineer - Linear Video Distribution (AWS Cloud)Location: Mexico OR ColombiaIndustry: Next-gen video distribution, cloud streaming, media technologyAre you passionate about live video delivery, cloud workflows, and supporting cutting-edge broadcast operations?A global media technology provider is seeking an On-Site Support Engineer to...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Redholt A tiempo completo

    Hiring: On‑Site Support Engineer - Linear Video Distribution (AWS Cloud) Location: Mexico OR Colombia Industry: Next‑gen video distribution, cloud streaming, media technology Are you passionate about live video delivery, cloud workflows, and supporting cutting‑edge broadcast operations? A global media technology provider is seeking an On‑Site Support...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Redholt A tiempo completo

    Hiring: On‑Site Support Engineer - Linear Video Distribution (AWS Cloud) Location: Mexico OR Colombia Industry: Next‑gen video distribution, cloud streaming, media technology Are you passionate about live video delivery, cloud workflows, and supporting cutting‑edge broadcast operations? A global media technology provider is seeking an On‑Site Support...

  • Site Reliability Engineer

    hace 2 semanas


    Xico, México Redholt A tiempo completo

    Hiring: On‑Site Support Engineer - Linear Video Distribution (AWS Cloud) Location: Mexico OR Colombia Industry: Next‑gen video distribution, cloud streaming, media technology Are you passionate about live video delivery, cloud workflows, and supporting cutting‑edge broadcast operations? A global media technology provider is seeking an On‑Site Support...