Yekun Chai
My research focuses on pretraining for foundation models: how data recipes, tokenization, and scale govern the emergence of reasoning, code, and agentic capabilities.
I study which training factors remain predictive as models scale, which capability bottlenecks persist under scaling, and how early training decisions set the ceiling for what later training can amplify.
I have contributed to ERNIE, ERNIE-Code, and StarCoder 2.
News
| Aug 21, 2025 | Four papers accepted to EMNLP 2025. |
|---|---|
| May 16, 2025 | Curiosity-driven RLHF accepted to ACL 2025. [code] |
| Jan 23, 2025 | MA-RLHF accepted to ICLR 2025. [paper] [code] |