Cloud Site Reliability Engineer
hace 6 meses
**DESCRIPTION**:
Are you a skilled **Cloud Site Reliability Engineer with experience in AWS or GCP?**
If so, we have an exciting opportunity for you
We're currently seeking a Cloud Site Reliability Engineer to join our vibrant team.
This role offers the chance to help the product team in maximizing the reliability of software solutions and ensure that the energy needs of the planet are met. If you're ready to take your career to the next level, we'd love to hear from you
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
RESPONSIBILITIES
- Maintain and improve and optimize a LightOps monitoring system
- Share knowledge and actively collaborate with other teams in the organization for playbook development
- Be the liaison between the engineering and operations teams, in order to ensure proper communication and collaboration
**REQUIREMENTS**:
- Knowledge of OS Administration, PowerShell, Bash
- Experience with Automation using Scripting and Programming Languages
- Mastery of Observability and monitoring; Grafana/Prometheus and Open source (e.g. Loki)
- Familiarity with containerization (Docker), Kubernetes
- Hands-on experience in infrastructure performance and capacity planning (design, build or implementation)
- Competence in technologies such as NAT, DNS and DHCP
- Know-how of tools such as Git and PagerDuty
- Solid programming skills with Python and GO are a strong Plus
- Database skills (MongoDB, Oracle, Postgres)
- Disaster Recovery experience (backup georedundancy, recovery scripting)
- Networking e.g. VPN, Load Balancer, NSG or familiar with medium-> high Networking concepts
WE OFFER
- Career plan and real growth opportunities
- Unlimited access to LinkedIn learning solutions
- International Mobility Plan within 25 countries
- Constant training, mentoring, online corporate courses, eLearning and more
- English classes with a certified teacher
- Support for employee’s initiatives (Algorithms club, toastmasters, agile club and more)
- Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)
- Flexible work schedule and dress code
- Collaborate in a multicultural environment and share best practices from around the globe
- Hired directly by EPAM & 100% under payroll
- Law benefits (IMSS, INFONAVIT, 25% vacation bonus)
- Major medical expenses insurance: Life, Major medical expenses with dental & visual coverage (for the employee and direct family members)
- 13 % employee savings fund, capped to the law limit
- Grocery coupons
- 30 days December bonus
- Employee Stock Purchase Plan
- 12 vacations days plus 4 floating days
- Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th & 31st)
- Relocation bonus: transportation, 2 weeks of accommodation for you and your family and more
- Monthly non-taxable amount for the electricity and internet bills
CONDITIONS
-
Site Reliability Engineer
hace 6 meses
Desde casa, México thegetch mexico A tiempo completo**Función: Site Reliability Engineer** **Aperturas: más de 10 contrataciones** **Ubicación: - any city with TCS Office presence (Queretaro, Guadalajara, Mexico City or Monterrey)** **Salario: - 25-33 USD/hr** **Comunicación en inglés: avanzado** **Experiência: 4+ años** **Responsabilidades de Site Reliability Engineer**: Reúna y analice métricas...
-
Lead Site Reliability Engineer
hace 6 meses
Desde casa, México Tekshapers Inc A tiempo completo**Position : Lead Site Reliability Engineer** **Location : Remote** **Duration : Contract** - Lead and mentor a team of SREs to ensure operational excellence and maximize the reliability and availability of client systems. - Minimum 10 years of work experience in DevOps/SRE, including leadership roles. - Architect and design highly scalable and available...
-
Site Reliability Engineer
hace 3 meses
Desde casa, México Right Balance A tiempo completo**Overview** We're looking for a Site Reliability Engineer. Headquartered in Los Angeles, California, Right Balance provides top-tier technology talent for innovative companies in the US. We’re in the top 50 companies to watch in LA. **Engagement Details** Our client is a USA-based company producing video solutions with the mission to advance scientific...
-
Cloud Operations Engineer
hace 6 meses
Desde casa, México Consultoria Aguilar A tiempo completoCloud Operations Engineer / Site Reliability Engineer (SRE) Job Description: Cloud Operations Engineer / Site Reliability Engineer (SRE) About the Company: Datascore.ai, through its EngageIQ platform, specializes in enhancing lead generation and engagement. The company leverages advanced data science and AI to score and enrich leads, optimizing outreach...
-
Aws Cloud Site Reliability Engineer
hace 6 meses
Desde casa, México EPAM Systems A tiempo completo**DESCRIPTION**: Join EPAM as an **AWS Cloud Site Reliability Engineer.** In this role, you'll transfer security processes, manage authentication technologies, and support the implementation of a Palo Alto firewall. If you have 3+ years of experience with AWS, proficiency in designing and managing data migration processes, and superior communication...
-
Site Reliability Engineer
hace 1 semana
Desde casa, México Wise Athena A tiempo completo**Join Our Team as an SRE!** Wise Athena looking for a **Site Reliability Engineer (SRE)** to join our dynamic and innovative team! At our company, we’re revolutionizing Revenue Growth Management (RGM) with the power of AI. You will work with a passionate, forward-thinking team. This is a fully remote position. **Key Responsibilities** - **Problem...
-
Azure DevOps Site Reliability Engineer
hace 6 meses
Desde casa, México EPAM Systems A tiempo completo**DESCRIPTION**: Are you a skilled Azure DevOps Site Reliability Engineer with a passion for ensuring business continuity and helping businesses always be near their clients? Do you have experience in optimizing and supporting OSDU deployment, performing monitoring including incidents resolution, and suggesting improvements? If so, we have an exciting...
-
Site Reliability Engineer
hace 6 meses
Desde casa, México Synechron A tiempo completoSynechron is a self-funded, leading digital transformation Consulting firm focused on the financial services industry working to accelerate digital initiatives for Banks, Asset Managers and Insurance. We achieve this by providing our clients with innovative solutions that solve their most complex business challenges and combining Synechron’s unique,...
-
Cloud Network Engineer
hace 6 meses
Desde casa, México RED AMIGO DAL S.A.P.I. of C.V. S.O.F.O.M. E.N.R A tiempo completo**What´s Konfio?** A financial technology company dedicated to supporting the small and medium-sized companies in Mexico, developing and offering financial solutions to solve their main problems, and seeking to be the best ally of entrepreneurs with dreams and ambitions to create value, consolidate their well-being and contribute to the...
-
Senior Site Reliability Engineer
hace 6 meses
Desde casa, México EPAM Systems A tiempo completo**DESCRIPTION**: Join EPAM as a **Senior Site Reliability Engineer specializing in AWS!** In this role, you'll ensure fleet services reliability and availability under the SRE model. If you have a good track record of highly scalable, distributed systems projects and previous experience working as an SRE, we'd love to hear from you. EPAM is a leading...
-
Site Reliability Engineer Iii
hace 6 meses
Desde casa, México Cabify A tiempo completoDo you want to change the world? At Cabify, that’s what we’re doing. We aim to make cities better places to live by improving mobility for the people living in them, connecting riders to drivers, providing mobility alternatives such as scooters and mopeds and many others to come, all at the touch of a button. Maybe one day cities will be places where...
-
Senior Cloud Engineer
hace 6 meses
Desde casa, México BUSINESS EXCELLENCE ´PROFESSIONAL CONSULTING A tiempo completo**Senior Cloud Engineer (REMOTE)**: - At least 10 years in leadership positions with extensive experience in manufacturing industry. - Bachelor degree in engineering. - Advanced english level. - **Architecture Design**: Develop cloud architecture solutions that meet the organization's requirements for scalability, reliability, security, and performance. -...
-
Cloud Engineer
hace 6 meses
Desde casa, México merican Inc A tiempo completo**Cloud Engineer** **Remote** **Responsibilities**: - Cloud Infrastructure Design and Implementation: - Automation and Orchestration: - Security and Compliance: - Monitoring and Optimization: - Collaboration and Documentation: - Troubleshooting and Support: - Stay Current with Cloud Technologies: **Job Type**: Contract **Salary**: $170.00 - $255.00...
-
Site Reliability Engineer with Java
hace 6 meses
Desde casa, México EPAM Systems A tiempo completo**DESCRIPTION**: Join EPAM as a remote **Site Reliability Engineer specializing in Java.** In this role, you'll provide 24/7 on-call support for Java backend services, prepare and deploy patches, and assist in establishing top-of-the-line metrics and dashboards. If you have 5-8 years of experience as a DevOps/SRE, proficiency in Java, and experience with...
-
Senior Cloud Engineer
hace 6 meses
Desde casa, México Franklin Templeton Investments A tiempo completoAt Franklin Templeton, we’re advancing our industry forward by developing new and innovative ways to help our clients achieve their investment goals. Our dynamic and diversified firm spans asset management, wealth management, and fintech, offering many ways to help investors make progress toward their goals. Our talented teams working around the globe...
-
Senior C++ Software Engineer
hace 1 mes
Desde casa, México EPAM Systems A tiempo completoWe are on the lookout for a skilled **Senior C++ Software Engineer** with deep expertise in Site Reliability Engineering, Borg, Spanner, and Google Cloud Platform. As a critical member of our Engineering team, you'll engage with a prestigious global Google infrastructure project, deploying various cutting-edge backend and cloud technologies. Your...
-
Security Engineer
hace 6 meses
Desde casa, México Framework Science A tiempo completoFramework Science is on a MISSION that focuses on Exploring new technologies and building tomorrow’s Applications. This means we hire TOP Engineers and Designers by providing great benefits and pay so they can focus on solving what’s never been solved before. Our aim is to push the needle of innovation while enabling Technical staff to impact code or...
-
C++ Software Engineer
hace 1 mes
Desde casa, México EPAM Systems A tiempo completoWe are seeking an experienced **C++ Software Engineer** with expertise in Site Reliability Engineering, Borg, Spanner, and Google Cloud Platform. You will be an integral part of the Engineering team, working on a top-notch global Google infrastructure project involving a variety of modern backend and cloud technologies. Your role will involve...
-
Cloud Engineer
hace 6 meses
Desde casa, México ITKAWA A tiempo completo**Cloud Engineer -** **Santander** **Esquema de trabajo**: 100% Remoto. **Salario**: Abierto a negociar de acuerdo a experiência. **Duración**: Posibilidad de prórrogas y/o contratación a tiempo completo. **Educación**: Sistemas Computacionales, Informática, Mecatrónica, Electrónica y comunicaciones o afín. **Idioma**:Inglés...
-
Cloud Sysops Engineer
hace 6 meses
Desde casa, México The Cervantes Group A tiempo completoThe Cloud SysOps Engineer will be in charge of the execution of infrastructure updates, patching, including the compute and storage services and the integrity of the code that generate infrastructure (i.e. stacksets, modules). The ideal person can automate provisioning of infrastructure services (IaC) and will lead the configuration, monitoring and incident...