30 May | Cloudbeds | Colombia
Apply on Kit Empleo: kitempleo.com.co/empleo/1azbbn
Responsibilities
As a Sr. Site Reliability Engineer, you will be the guardian of our platform’s reliability and performance, ensuring millions of hospitality transactions flow seamlessly across the globe. You will architect and implement scalable AWS cloud solutions, maintain and support highly‑loaded Kubernetes (EKS) clusters, support the CI/CD process with ArgoCD and GitOps, automate platform deployments with Terraform, and develop and continuously improve product observability and monitoring systems using Grafana, Prometheus, Datadog, and CloudWatch. You will also participate in incident management and root‑cause analysis, optimize system performance, collaborate with development teams on monitoring best practices, and work with security teams to maintain security best practices and infrastructure support.
What You Bring to the Team
- Design and implement a reliable and scalable AWS architecture to meet the organization’s needs.
- Maintain and support highly loaded Kubernetes (EKS) clusters and infrastructure‑related components.
- Support the CI/CD process with ArgoCD and GitOps.
- Automate the platform deployments with Terraform infrastructure‑as‑code.
- Develop and continuously improve product observability and monitoring systems based on Grafana, Prometheus, Datadog, and CloudWatch.
- Respond to incidents and participate in incident management and root‑cause analysis, ensuring minimal impact on services.
- Optimize system performance and troubleshoot issues as they arise.
- Collaborate with development teams to establish monitoring best practices and ensure systems meet reliability targets.
- Collaborate with security teams to implement and maintain security best practices.
- Provide infrastructure support on rotation and guide other engineering teams.
Qualifications
- 5+ years of experience as a DevOps or SRE working within the AWS ecosystem.
- 5+ years of experience with Kubernetes (EKS) and Helm charts.
- Experience with designing, building, and supporting CI/CD pipelin
📌 Senior Site Reliability Engineer (Colombia)