AI Evaluation Engineer (Medellín)

AI Evaluation Engineer (Medellín)

28 may
|
Gramian Consulting
|
Medellín

28 may

Gramian Consulting

Medellín

About Us
Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high‑performing teams by matching them with professionals who truly fit their needs.
Role Overview
We are looking for a highly analytical and computationally strong professional with a solid research background in mathematics or quantitative fields. In this role, you will design advanced benchmark tasks for multi‑agent AI systems, focusing on complex mathematical reasoning, algorithmic problem‑solving, and verifiable computational outputs. You will craft challenging problems, build validation systems, and structure tasks that require decomposition into coordinated sub‑solutions.
Commitments
8 hours per day with an overlap of 4 hours with PST.
Employment Type
Contractor assignment (no medical/paid leave)
Duration
4 weeks+
Location
Bangladesh, Brazil, Colombia, Egypt, Ghana, India, Indonesia, Kenya, Nigeria, Turkey, Vietnam
Interview
Take home assessment (60min)



+ short interview
Responsibilities
- Design and build multi‑agent benchmark tasks requiring multi‑step mathematical reasoning and algorithmic problem‑solving
- Create complex, decomposable problems across domains such as:
- Competition mathematics
- Numerical analysis
- Combinatorial optimization
- Statistical inference
- Develop verification scripts to validate numerical outputs (with tolerance thresholds), proof correctness and logical steps, algorithmic outputs and constraints
- Write clear, structured problem statements with precise notation and defined outputs
- Design task decomposition strategies for parallel or multi‑agent execution
- Implement computational solutions and validation pipelines using Python
- Work with containerized environments (Docker) for reproducibility and evaluation

Requirements
- 5+ years in mathematics, quantitative research, or computational science
- Strong Pyth

📌 AI Evaluation Engineer (Medellín)
🏢 Gramian Consulting
📍 Medellín

Postulate a este anuncio

Muestra tus habilidades a la empresa, rellenar el formulario y deja un toque personal en la carta, ayudará el reclutador en la elección del candidato.

Suscribete a esta alerta:
Escribe tu dirección de correo electrónico, te permitirá de estar al tanto de los últimos empleos por: ai evaluation engineer (medellín) / medellín
Suscribete a esta alerta:
Escribe tu dirección de correo electrónico, te permitirá de estar al tanto de los últimos empleos por: ai evaluation engineer (medellín) / medellín