28 may
|
Datavail Career Site
|
Colombia
28 may
Datavail Career Site
Colombia
Postúlate en Kit Empleo: kitempleo.com.co/empleo/1ass6a
Datavail is a leading provider of data management, application development, analytics, and cloud services, with more than 1,000 professionals helping clients build and manage applications and data via a world‑class tech‑enabled delivery platform and software solutions across all leading technologies. For more than 17 years, Datavail has worked with thousands of companies spanning different industries and sizes, and is an AWS Advanced Tier Consulting Partner, a Microsoft Solutions Partner for Data & AI and Digital & App Innovation (Azure), an Oracle Partner, and a MySQL Partner.
Job Description
You will own reliability for core services across multiple clouds, drive automation, and mentor more junior engineers. You will partner with developer teams to embed resilience into feature delivery.
Responsibilities
- Define and maintain SLIs/SLOs, monitor alignment and error budget usage
- Lead incident response and postmortems, implement corrective measures
- Automate operations tasks via tooling (e.g. auto‑remediation, scaling rules)
- Build, improve, and maintain CI/CD pipelines,
canary deployments, blue/green strategies
- Lead technical discussions with customers to align on reliability, scalability, and performance requirements
- Drive continuous platform improvements across the service lifecycle, including architecture, monitoring, and operational processes
- Implement and extend observability systems (metrics, tracing, log aggregation)
- Optimize performance and cost by tuning cloud services, autoscaling, resource rightsizing
- Design, deploy, and operate containerized workloads using Docker and Kubernetes in production environments
- Collaborate with dev teams to integrate resilience patterns (circuit breakers, bulkheading)
- Participate in architecture discussions around high availability, disaster recovery
- Mentor mid and junior SREs; conduct reliability design reviews
Must-have Qualifications
- 5–8 years of experience in a reliability or operations role
- Cloud‑
Postúlate en Kit Empleo: kitempleo.com.co/empleo/1ass6a
📌 Site Reliability Engineer (Colombia)
🏢 Datavail Career Site
📍 Colombia