24 May | Crossing Hurdles | Colombia
Apply on Kit Empleo: kitempleo.com.co/empleo/1as4nj
Position: Swarm Bench Task Engineer (Data Analysis)
Type: Short-Term Contract (4 weeks)
Compensation: $15 per hour
Location: Remote
Commitment: 30-40 hours per week, with 4 hours of overlap with PST
Role Responsibilities
- Design and author multi-agent benchmark tasks centered on complex data analysis workflows
- Create realistic synthetic datasets or curate real-world-style datasets across domains such as finance, operations, security, or market analysis
- Build tasks that require agents to perform cross-referencing, anomaly detection, contradiction identification, and statistical computation across multiple sources
- Develop decomposition guides that split analytical work across specialist sub-agents such as financial, technical, security, or operations analysts
- Write precise oracle logic or verification scripts that validate specific analytical conclusions rather than generic summaries
- Create reproducible evaluation environments using Python and Docker
- Review task performance signals to ensure strong separation between weaker and stronger agentic systems
- Refine tasks to improve determinism, clarity, difficulty, and scoring quality
Requirements
- Years of strong experience in data analysis
- Strong proficiency in SQL and Python for data analysis and scripting (pandas, NumPy, or similar)
- Experience working with real-world, messy data in forms such as CSV and JSON files, logs, and reports
- Ability to design non-trivial analytical questions with clear, specific, and verifiable answers
- Solid understanding of statistical concepts including averages, distributions, outliers, and correlations
- Familiarity with AI coding benchmark environments (e.g., SWE-bench, Terminal-Bench)
- Comfortable with Docker, including writing Dockerfiles, building images, and debugging container issues
- Ability to work independently in a remote environment