Building intelligent systems at scale
Senior AI/ML Engineer with 9+ years of experience building agentic AI systems and large-scale ML infrastructure. Currently leading the Agent Composer platform at Contextual AI, transforming RAG pipelines into modular multi-agent systems used in production. Co-author of 14 papers across NeurIPS, ICML, ACL, and Nature.
9+ yrs
Engineering Experience
200K+
Monthly Requests Handled
14
Research Publications
99.98%
System Reliability
const juan = {
  role: "Technical Lead / Staff AI Engineer",
  stack: [
    "Python", "React", "Next.js", "TypeScript", "SQL",
    "Temporal", "GCP", "AWS", "LLMs", "Docker", "Redis"
  ],
  publications: [
    "NeurIPS (Best Paper)", "ICML", "ACL", "Nature"
  ],
  reliability: 99.98,
  monthlyRequests: 200_000,
  ships: true
}
Career
Professional Experience
9+ years of experience building production ML systems, from early-stage startups to industry-defining platforms.
Leading the design and development of modular, graph-based agent frameworks for scalable AI composition.
Key Achievements
- Designed modular, graph-based agent framework using DDD, enabling scalable composition across research and sales teams
- Operated production agentic systems at scale: 200K+ monthly requests, 99.98% reliability with strict SLAs
- Migrated orchestration to Temporal, improving evaluation success from 60% to 98% and increasing throughput 10x
- Built core platform modules (evaluation, query, feedback), accelerating internal developer velocity
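The graph-based composition described above can be illustrated with a minimal sketch. This is not the actual Agent Composer API; every name here (`AgentNode`, `AgentGraph`, the lambda steps) is hypothetical and only shows the general pattern of wiring agent steps into a traversable graph:

```python
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class AgentNode:
    """One step in the graph; `run` transforms shared state."""
    name: str
    run: Callable[[dict], dict]

@dataclass
class AgentGraph:
    nodes: dict = field(default_factory=dict)
    edges: dict = field(default_factory=dict)  # node name -> downstream node names

    def add(self, node, downstream=()):
        self.nodes[node.name] = node
        self.edges[node.name] = list(downstream)
        return self

    def execute(self, start, state):
        # Simple depth-first traversal; a production system would hand
        # scheduling to a durable orchestrator rather than recurse inline.
        state = self.nodes[start].run(state)
        for nxt in self.edges[start]:
            state = self.execute(nxt, state)
        return state

# A two-node RAG-style pipeline: retrieve documents, then generate.
graph = (
    AgentGraph()
    .add(AgentNode("retrieve", lambda s: {**s, "docs": ["d1", "d2"]}), ["generate"])
    .add(AgentNode("generate", lambda s: {**s, "answer": f"{len(s['docs'])} docs used"}))
)
result = graph.execute("retrieve", {"query": "example"})
```

Separating nodes from edges is what makes the composition scalable across teams: new agents plug in without touching existing ones.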
Tech lead for Dynabench, the collaborative AI evaluation platform used by industry leaders.
Built end-to-end analytics platforms and ML solutions for manufacturing optimization.
Delivered predictive analytics and computer vision solutions for construction management.
Depth by Domain · Toolkit
Technical Expertise & Skills
Agentic AI Platforms
At Contextual AI, I lead the Agent Composer team, designing modular, graph-based agent frameworks that convert RAG pipelines into scalable, observable distributed systems.
Domain-Driven Design with composable agent workflows enabling scalable composition across research, sales, and applied teams.
Temporal-based orchestration improving evaluation success from 60% to 98% with 10x throughput scaling.
Distributed logging, tracing, and metrics reducing MTTR and enabling data-driven reliability engineering.
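Temporal workflows need a running server, so as a rough stand-in for the durability it provides, here is a plain-Python sketch of retry-with-backoff semantics, the kind of behavior a workflow engine gives evaluation activities out of the box. The `flaky_eval` activity and all parameters are made up for illustration:

```python
import time

def run_with_retries(activity, max_attempts=3, base_delay=0.01):
    """Retry a transiently failing activity with exponential backoff."""
    for attempt in range(1, max_attempts + 1):
        try:
            return activity()
        except Exception:
            if attempt == max_attempts:
                raise  # exhausted: surface the failure to the caller
            time.sleep(base_delay * 2 ** (attempt - 1))

calls = {"n": 0}

def flaky_eval():
    # Hypothetical evaluation step that fails twice before succeeding.
    calls["n"] += 1
    if calls["n"] < 3:
        raise RuntimeError("transient failure")
    return "eval ok"

outcome = run_with_retries(flaky_eval)  # recovers after two transient failures
```

Moving this kind of retry logic out of application code and into an orchestrator is precisely what lifts success rates on long-running evaluations.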
Programming
Distributed Systems & Infra
AI / ML
Observability
Research
Selected Publications
Best Paper Award (Co-author)
Recognized for outstanding contribution to the field of machine learning evaluation and data-centric AI systems.
Nature Publication
Forthcoming publication in Nature on AI systems and their applications in scientific research.
Adversarial Nibbler: A Data-Centric Challenge
Advancing fairness-aware and data-centric evaluation practices for modern machine learning systems.
BabyLM Challenge Proceedings
Sample-efficient language modeling and robust evaluation methodologies for language model training.
Data-Centric AI Workshop
LSH methods for data deduplication in an artificial Wikipedia dataset
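As a rough illustration of the LSH-based deduplication idea behind that workshop entry, here is a minimal MinHash sketch in plain Python. The parameters (shingle size, number of hash functions) and sample texts are invented; real pipelines would use a tuned banding scheme over large corpora:

```python
import hashlib

def shingles(text, k=3):
    """Split text into overlapping k-token shingles."""
    tokens = text.split()
    return {" ".join(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}

def minhash(sh, num_hashes=64):
    """MinHash signature: per seeded hash, the minimum value over all shingles."""
    return [
        min(int(hashlib.md5(f"{seed}:{s}".encode()).hexdigest(), 16) for s in sh)
        for seed in range(num_hashes)
    ]

def est_jaccard(a, b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    sa, sb = minhash(shingles(a)), minhash(shingles(b))
    return sum(x == y for x, y in zip(sa, sb)) / len(sa)

# Near-duplicates score high; unrelated texts score near zero.
dup = est_jaccard("the quick brown fox jumps over the lazy dog",
                  "the quick brown fox jumps over the lazy cat")
diff = est_jaccard("the quick brown fox jumps over the lazy dog",
                   "completely unrelated text about language models here")
```

Because matching signature slots approximate Jaccard similarity, near-duplicate documents can be bucketed and dropped without pairwise comparison of the full corpus.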
14 Publications Total
Spanning evaluation, NLP, computer vision, and AI systems across top-tier venues.