Yekun Chai

Dec 12, 2022	Diffusion Models: A Mathematical Guide from Scratch
Apr 17, 2022	Efficient Distributed Training: From DP to ZeRO and FlashAttention
Jan 10, 2022	Masking Strategies for Pre-trained Language Models: From MLM to T5