AI Evaluation Engineer (Medellín)

28 may

Gramian Consulting

Medellín

28 may

Gramian Consulting

Medellín

Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high-performing teams by matching them with professionals who truly fit their needs.
Role Overview
We are looking for an AI Evaluation Engineer with a strong research background to design and evaluate complex, multi-agent tasks used to benchmark next-generation AI systems. In this role, you will work at the intersection of research, data structuring, and AI evaluation , building high-quality tasks that require deep document understanding, structured reasoning, and multi-step synthesis. You will create datasets and evaluation frameworks that test whether AI agents can truly read, reason, and extract knowledge from large-scale unstructured data .
This is a high-precision, detail-oriented role requiring strong analytical thinking, structured problem decomposition, and the ability to translate research content into measurable evaluation tasks.
Commitments Required:

8 hours per day with an overlap of 4 hours with PST.
Employment type: Contractor assignment (no medical/paid leave)
Duration of contract: 5 weeks+
Location: Bangladesh, Brazil, Colombia, Egypt, Ghana, India, Indonesia, Kenya, Nigeria, Turkey, Vietnam
Interview: take home assessment (60min)
Responsibilities
- Build multi-agent benchmark tasks that require reading, analyzing, and synthesizing large document collections
- Curate real-world research corpora — academic papers, case studies, technical reports — and design questions that require comprehensive analysis
- Write structured ground-truth oracles (JSON) with specific, verifiable answers that prove the agent actually read the source material
- Design LLM judge prompts that evaluate agent output field-by-field against the oracle
- Create decomposition guides that split research across multiple parallel sub-agents (one per document, one per domain, the

📌 AI Evaluation Engineer (Medellín)
🏢 Gramian Consulting
📍 Medellín

Postulate a este anuncio

Muestra tus habilidades a la empresa, rellenar el formulario y deja un toque personal en la carta, ayudará el reclutador en la elección del candidato.

Asesor comercial de zona (Medellín)

14 jun

Kaszek

Medellín

14 jun
Kaszek
Medellín

Somos Internet Medellín, Antioquia, Colombia Join or sign in to find your next job Join to apply for the Asesor comercial de zona role at Somos Internet Continue with GoogleContinue with Google Em [...]

Soldador Argonero Medellín

13 jun

FAISMON

Medellín

13 jun
FAISMON
Medellín

Soldador TIG (tubería industrial) - Medellín Palabras principal: - Soldador TIG Medellín - Soldador de tubería Medellín - Soldador industrial Medellín - Soldadura TIG - Soldadura de tubería - [...]

Analista Planoteca y Diagnósticos Medellín

13 jun

Empresa reconocida

Medellín

13 jun
Empresa reconocida
Medellín

Compartir Facebook Empresa Asesórate Consultores Descripción de la Empresa Empresa de Gestión Humana dedicada a la procesos de selección Departamento Antioquia Localidad Medellin Salario [...]

Engineering Manager (Medellín)

15 jun

NewtonX

Medellín

15 jun
NewtonX
Medellín

The Role As NewtonX's first Engineering Manager in Colombia, this is a dual mandate: drive high‑velocity engineering delivery and act as the talent magnet who establishes Medellín as a core Newton [...]

AI Evaluation Engineer (Medellín)

AI Evaluation Engineer (Medellín)

Postulate a este anuncio

Asesor comercial de zona (Medellín)

Asesor comercial de zona (Medellín)

Soldador Argonero Medellín

Soldador Argonero Medellín

Suscribete a esta alerta:

Escribe tu dirección de correo electrónico, te permitirá de estar al tanto de los últimos empleos por: ai evaluation engineer (medellín) / medellín

Analista Planoteca y Diagnósticos Medellín

Analista Planoteca y Diagnósticos Medellín

Engineering Manager (Medellín)

Engineering Manager (Medellín)

Suscribete a esta alerta:

Escribe tu dirección de correo electrónico, te permitirá de estar al tanto de los últimos empleos por: ai evaluation engineer (medellín) / medellín