publications
* indicates equal contribution; ^ denotes the student/researcher/intern I mentored.
2026
- ERNIE 5.0 Technical ReportarXiv preprint arXiv:2602.04705, 2026
2025
- EMNLP-FindingsEvolKV: Evolutionary KV Cache Compression for LLM InferenceEMNLP Findings, 2025
- CodeMixBench: Evaluating Code-Mixing Capabilities of LLMs Across 18 LanguagesEMNLP, 2025
- Understanding Subword Compositionality of Large Language ModelsEMNLP, 2025
- Debiasing Multilingual LLMs in Cross-lingual Latent SpaceEMNLP, 2025
2024
2023
- IJCNLP-AACLDemosERNIE-Music: Text-to-Waveform Music Generation with Diffusion ModelsIn Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics: System Demonstrations, Nov 2023
- ICASSPOralImproved Training of Mixture-of-Experts Language GANsIn 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Nov 2023
2022
2021
- IJCNNoralNeural Text Classification by Jointly Learning to Cluster and AlignIn International Joint Conference on Neural Networks, Jun 2021