Yiwen Lu

Yiwen Lu 陆逸文

I work on autonomous systems that learn to act without being explicitly programmed. I started with the mathematics of adaptive control: how to safely operate systems you don't fully understand. That led me to dexterous manipulation, capturing how humans use their hands and turning that into robot behavior. I also built infrastructure and models for agents that learn from real experience. Now I'm bringing those threads together in foundation models for dexterous manipulation.

Work

MinT project page showing Mind Lab Toolkit

MinT

At Mind Lab, I lead infrastructure for MinT: the RL system behind agents that learn from real experience. It takes care of distributed rollout, training orchestration, and evaluation so models can improve inside real tasks, not just static benchmarks.

Macaron-v1-preview visual header

Macaron-v1-preview

We used MinT to post-train Macaron-v1-preview, a 749B Mixture-of-LoRA agent model. The goal is personal-agent behavior that survives the messiness of real life: long conversations, tool use, generative UI, and tasks that change while the user and world change.

Eko logo

Eko

I helped build Eko, a framework for AI agents that work across browser and desktop. Most frameworks target one or the other; Eko handles both. Designed for real deployment: you can pause, inspect, and resume agents mid-task.

DexCanvas dataset showing human hands, simulated hands, and robot hands manipulating objects

DexCanvas

At DexRobot, my team built DexCanvas: 70 hours of human hand demonstrations expanded to 7,000 hours through physics-validated simulation. Unlike vision-only datasets, we capture contact forces, which is essential for learning manipulation that actually works.

RC car performing drift maneuvers on circular and figure-8 trajectories

Learning-Based Control

My PhD asked: how do you control a system you don't understand, safely, while learning? I developed controllers with a "circuit breaker" that guarantees safety and learns efficiently. Validated on real hardware, not just simulations.

Blog

Four-layer context hierarchy diagram

Is 200K Context Enough for Anybody?

A four-layer context hierarchy for coding agents: durable method, project-local truth, the current task bundle, and unattended execution policy.

Background

DexGEM Lab

Current

Founder

Mind Lab

2025–2026

Infra lead

DexRobot

2024–2025

Embodied AI, team lead

Harvard SEAS

2023

Visiting scholar

Tsinghua

2015–2025

PhD + BE, Control