publications

* indicates equal contribution; ^ denotes the student/researcher/intern I mentored.

2026

  1. ERNIE 5.0 Technical Report
    Baidu ERNIE
    arXiv preprint arXiv:2602.04705, 2026

2025

  1. EMNLP-Findings
    EvolKV: Evolutionary KV Cache Compression for LLM Inference
    Bohan Yu^, and Yekun Chai
    EMNLP Findings, 2025
  2. CodeMixBench: Evaluating Code-Mixing Capabilities of LLMs Across 18 Languages
    Yilun Yang^, and Yekun Chai
    EMNLP, 2025
  3. Understanding Subword Compositionality of Large Language Models
    Qiwei PengYekun Chai, and Anders Søgaard
    EMNLP, 2025
  4. Debiasing Multilingual LLMs in Cross-lingual Latent Space
    Qiwei Peng, Guimin Hu, Yekun Chai, and Anders Søgaard
    EMNLP, 2025
  5. ACL
    Curiosity-Driven Reinforcement Learning from Human Feedback
    Haoran Sun*^Yekun Chai*†Shuohuan WangYu Sun, and 2 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025
  6. MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
    Yekun Chai*†Haoran Sun*^Huang FangShuohuan Wang, and 2 more authors
    In The Thirteenth International Conference on Learning Representations, Jul 2025
  7. COLING-Industry
    Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code
    Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, and 37 more authors
    In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, Jan 2025
  8. COLING-Industry
    Graph-Augmented Open-Domain Multi-Document Summarization
    Xiaoping Shen, and Yekun Chai
    In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, Jan 2025

2024

  1. StarCoder 2 and The Stack v2: The Next Generation
    Anton Lozhkov , Raymond Li, Loubna Ben Allal, Federico Cassano, and 62 more authors
    Feb 2024
  2. Autoregressive Pre-Training on Pixels and Texts
    Yekun Chai, Qingyi Liu^, Jingwu Xiao^Shuohuan Wang, and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  3. EMNLP Oral
    On Training Data Influence of GPT Models
    Yekun Chai, Qingyi Liu^Shuohuan WangYu Sun, and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  4. EMNLP-Findings
    Tokenization Falling Short: On Subword Robustness in Large Language Models
    Yekun Chai , Yewei Fang, Qiwei Peng, and Xuhong Li
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  5. GiLOT: Interpreting Generative Language Models via Optimal Transport
    Xuhong Li*, Jiamin Chen*Yekun Chai*, and Haoyi Xiong
    In Forty-first International Conference on Machine Learning, Nov 2024
  6. LREC-COLING
    HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
    Qiwei Peng*Yekun Chai*, and Xuhong Li
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  7. ICLR Spotlight
    Tool-Augmented Reward Modeling
    Lei Li*^Yekun Chai*†Shuohuan WangYu Sun, and 3 more authors
    In The Twelfth International Conference on Learning Representations(top 5%) , May 2024

2023

  1. ACL-Findings
    ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
    Yekun ChaiShuohuan Wang, Chao Pang, Yu Sun, and 2 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  2. NeurIPS Datasets and Benchmarks
    M4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models
    Xuhong LiMengnan Du, Jiamin Chen, Yekun Chai, and 2 more authors
    In Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track, Jul 2023
  3. IJCNLP-AACLDemos
    ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
    Pengfei Zhu^, Chao Pang, Yekun Chai , Lei Li^, and 4 more authors
    In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics: System Demonstrations, Nov 2023
  4. ICASSPOral
    Improved Training of Mixture-of-Experts Language GANs
    Yekun ChaiQiyue Yin , and Junge Zhang
    In 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Nov 2023

2022

  1. EMNLP-Findings
    Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
    Yekun ChaiShuohuan WangYu SunHao Tian, and 2 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2022, Dec 2022
  2. ACL Oral
    Predicate-Argument Based Bi-Encoder for Paraphrase Identification
    Qiwei PengDavid Weir, Julie Weeds, and Yekun Chai
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May 2022
  3. SemEval
    X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection
    Yaqian Han, Yekun ChaiShuohuan WangYu Sun, and 4 more authors
    In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), Jul 2022

2021

  1. EMNLP-Findings
    Counter-Contrastive Learning for Language GANs
    Yekun ChaiHaidong ZhangQiyue Yin , and Junge Zhang
    In Findings of the Association for Computational Linguistics: EMNLP 2021, Nov 2021
  2. NAACL Workshop
    RefineCap: Concept-Aware Refinement for Image Captioning
    Yekun Chai, Shuo Jin, and Junliang Xing
    NAACL Workshop on Visually Grounded Interaction and Language (ViGiL), Jun 2021
  3. NAACL Workshop
    COIN: Conversational Interactive Networks for Emotion Recognition in Conversation
    Haidong Zhang*, and Yekun Chai*
    In Proceedings of the Third Workshop on Multimodal Artificial Intelligence, Jun 2021
  4. IJCNNoral
    Neural Text Classification by Jointly Learning to Cluster and Align
    Yekun ChaiHaidong ZhangQiyue Yin , and Junge Zhang
    In International Joint Conference on Neural Networks, Jun 2021

2020

  1. ACL
    Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
    Yekun Chai, Shuo Jin, and Xinwen Hou
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020