About me

Building AI agents that reason, act, and improve themselves.

I am a Founding Member of Technical Staff at Eigen AI, where I build systems that make LLM agents better. I lead EigenData, a self-evolving multi-agent platform for synthesizing, auditing, and repairing function-calling data, and EigenLoop, an end-to-end post-training pipeline (SFT, GRPO/RL) for production tool-use agents.

Previously, I was a Research Scientist at Meta (GenAI), where I led agentic post-training that improved Llama 4 Maverick's function-calling BFCL score from ~50 to 72, and an Applied Scientist at Amazon, working on LLM reasoning for e-commerce.

I received my Ph.D. in Computer Science from Georgia Tech / Stanford in 2024, advised by Prof. Diyi Yang. My research spans LLM agents, post-training, and reasoning, with 20+ first-author publications at NeurIPS, ICLR, EMNLP, and ACL.

5,000+
Citations
24
h-index
20+
First-Author Pubs

Research Interests

LLM Agents & Tool Use LLM Post-Training (SFT/RL) LLM Reasoning Data-Efficient NLP Language Generation

News

Mar. 2026 Released EigenData — a self-evolving multi-agent platform for function-calling data synthesis, auditing, and repair.
Jan. 2026 Joined Eigen AI as Founding Member of Technical Staff, building next-gen agentic AI infrastructure.
2025 Led agentic post-training at Meta that improved Llama 4 Maverick's BFCL function-calling score from ~50 to 72.
2024 DARG accepted at NeurIPS 2024; DyVal accepted at ICLR 2024; Skills-in-Context at EMNLP 2024.
May 2024 Defended Ph.D. thesis at Georgia Tech / Stanford. Onward!

Selected Publications

View all →
EigenData: A Self-Evolving Multi-Agent Platform for Function-Calling Data Synthesis, Auditing, and Repair
Jiaao Chen, Jingyuan Qi, Mingye Gao, Wei-Chen Wang, Hanrui Wang, and Di Jin
arXiv, 2026 pdf
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang, Jiaao Chen, and Diyi Yang
NeurIPS, 2024 pdf
DyVal: Graph-informed Dynamic Evaluation of Large Language Models
Kaijie Zhu*, Jiaao Chen*, Jindong Wang, Neil Zhenqiang Gong, Diyi Yang, Xing Xie
ICLR, 2024 pdf
Skills-in-context Prompting: Unlocking Compositionality in Large Language Models
Jiaao Chen, Xiaoman Pan, Dian Yu, Kaiqiang Song, Xiaoyang Wang, Dong Yu and Jianshu Chen
EMNLP Findings, 2024 pdf
Eigen AI · 2026– Meta · 2025 Amazon · 2024 Tencent AI · 2023 Amazon AWS · 2022 GT/Stanford · Ph.D. 2024