Zhou Enyu (周恩宇)
I am a Ph.D student in School of Computer Science at Fudan University, advised by Prof.Xuanjing Huang.
I obtained a bachelor’s degree from Intelligent Science and Technology (Honors Program) at Fudan University.
My research interests mainly lie in NLP, focusing on LLM alignment and reasoning.
📚 Selected Publications
Please visit my Google Scholar page for the full publication list.

(ICLR’2025) RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Enyu Zhou, Guodong Zheng, Binghai Wang, Zhiheng Xi, Shihan Dou, Rong Bao, Wei Shen, Limao Xiong, Jessica Fan, Yurong Mou, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang
- Proposed a fine-grained benchmarking framework for reward models, covering pairwise and Best-of-N paradigms, and demonstrated the strong correlation between reward model evaluation and downstream alignment tasks.

(AAAI’2025) Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
Shihan Dou, Yan Liu, Enyu Zhou, Tianlong Li, Haoxiang Jia, Limao Xiong, Xin Zhao, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang
- Proposed a meta-learning method to address distribution shifts in RLHF by alternating optimization of the reward model to adapt to shifted samples and distributions.

(ACL’2024) LoRAMoE: Alleviating world knowledge forgetting in large language models via MoE-style plugin
Shihan Dou*, Enyu Zhou*, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuan-Jing Huang
- Explored challenges of world knowledge forgetting during large-scale fine-tuning and proposed LoRAMoE, an architecture combining LoRA and MoE to mitigate knowledge conflicts and enhance multitask capabilities of LLMs.

(EMNLP’2023) RealBehavior: A Framework for Faithfully Characterizing Foundation Models’ Human-like Behavior Mechanisms
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang
- Developed a framework for faithfully characterizing LLM human-like behaviors based on psychometric theories, establishing automatic testing processes and investigating the effects of alignment training.
🎖 Honors and Awards
- 2024.12 Fudan University Huatai Securities Technology Scholarship
- 2023.06 Shanghai Outstanding Graduate Award
- 2021.12 National Scholarship for Undergraduates (Selected as an Outstanding Example)
- 2021.12 “Top Ten Students”, School of Information Science and Engineering, Fudan University
📖 Educations
- 2023.06 - Present, School of Computer Science, Fudan University
- 2019.09 - 2023.06, School of Information Science and Technology, Fudan University