28 May | AgileEngine | Colombia
Apply on Kit Empleo: kitempleo.com.co/empleo/1as7db
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.
If you’re looking for a place to grow, make an impact, and work with people who care, we’d love to meet you.
About the Role
As a Site Reliability Engineer, you will play a key role in ensuring the stability and performance of a large-scale SaaS platform, directly impacting customer experience and trust. This role offers the chance to collaborate across engineering, support, and product teams while driving improvements in automation, monitoring, and cloud infrastructure. You’ll be part of a culture that values proactive problem-solving, continuous learning, and innovation, giving you the opportunity to grow your expertise in AWS, Kubernetes, Terraform, and DevSecOps practices while shaping the reliability of mission-critical systems.
What you will do
- Shift: Monday – Thursday 8AM – 7PM PST (11AM – 10PM EST) with rotating on-call;
- On-call shifts: every 6 weeks, one week as primary responder and the following week as secondary;
- Manage alerts daily, check systems, and escalate issues as needed;
- Be part of a team that provides 24×7 on-call support for critical SaaS events;
- Be available in case of emergencies when team members are not available or need help;
- Document issues and remediation steps;
- Proactively create appropriate monitors in the EKS/K8s ecosystem;
- Deploy to EKS/K8s clusters using Terraform and Helm;
- Learn and maintain existing infrastructure running under Docker Swarm;
- Improve existing infrastructure health by implementing checks and scripts to correct known issues;
- Maintain and develop deployment code;
- Implement/integrate new technologies in our Cloud Infrastructure;
- Collaborate with other teams and depa
📌 Site Reliability Engineer (Colombia)
🏢 AgileEngine
📍 Colombia