publications

* equal contribution · † corresponding author / project lead · ^ mentored intern or student

2026

  1. ERNIE 5.0 Technical Report
    Baidu ERNIE
    arXiv preprint arXiv:2602.04705, 2026

2025

  1. ACL
    Curiosity-Driven Reinforcement Learning from Human Feedback
    Haoran Sun*^Yekun Chai*†Shuohuan WangYu Sun, and 2 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025
  2. MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
    Yekun Chai*†Haoran Sun*^Huang FangShuohuan Wang, and 2 more authors
    In The Thirteenth International Conference on Learning Representations, Jul 2025
  3. Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code
    Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, and 37 more authors
    In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, Jan 2025

2024

  1. StarCoder 2 and The Stack v2: The Next Generation
    Anton Lozhkov , Raymond Li, Loubna Ben Allal, Federico Cassano, and 62 more authors
    Feb 2024
  2. Autoregressive Pre-Training on Pixels and Texts
    Yekun Chai, Qingyi Liu^, Jingwu Xiao^Shuohuan Wang, and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  3. EMNLP Oral
    On Training Data Influence of GPT Models
    Yekun Chai, Qingyi Liu^Shuohuan WangYu Sun, and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  4. Tokenization Falling Short: On Subword Robustness in Large Language Models
    Yekun Chai , Yewei Fang, Qiwei Peng, and Xuhong Li
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  5. GiLOT: Interpreting Generative Language Models via Optimal Transport
    Xuhong Li*, Jiamin Chen*Yekun Chai*, and Haoyi Xiong
    In Forty-first International Conference on Machine Learning, Nov 2024
  6. HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
    Qiwei Peng*Yekun Chai*, and Xuhong Li
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  7. ICLR Spotlight
    Tool-Augmented Reward Modeling
    Lei Li*^Yekun Chai*†Shuohuan WangYu Sun, and 3 more authors
    In The Twelfth International Conference on Learning Representations(top 5%) , May 2024

2023

  1. ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
    Yekun ChaiShuohuan Wang, Chao Pang, Yu Sun, and 2 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  2. ICASSP Oral
    Improved Training of Mixture-of-Experts Language GANs
    Yekun ChaiQiyue Yin , and Junge Zhang
    In 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jul 2023

2022

  1. Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
    Yekun ChaiShuohuan WangYu SunHao Tian, and 2 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2022, Dec 2022

2021

  1. Counter-Contrastive Learning for Language GANs
    Yekun ChaiHaidong ZhangQiyue Yin , and Junge Zhang
    In Findings of the Association for Computational Linguistics: EMNLP 2021, Nov 2021

2020

  1. ACL
    Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
    Yekun Chai, Shuo Jin, and Xinwen Hou
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020