๐Ÿ‘‹ About Me

Hi! I am a second-year masterโ€™s student at Tsinghua Shenzhen International Graduate School, Tsinghua University, under the supervision of Prof. Xiu Li. I received my bachelorโ€™s degree with honors from Shandong University in June 2023.

I am fortunate to have collaborated with many great researchers who have generously shared their guidance and insights. Currently, I am a research intern at Large Language Model Center, Shanghai AI Laboratory, advised by Dr. Biqing Qi. Previously, I interned at Intelligent Photonics and Electronics Center (IPEC), Shanghai AI Laboratory, advised by Dr. Chenjia Bai. Before that, I was a research intern at Institute for AI, Peking University, advised by Prof. Yali Du and Prof. Yaodong Yang.

Research Interests: My research focuses on Large Language Models (LLMs) and Reinforcement Learning (RL). I am currently interested in improving the reasoning and generalization capabilities of LLMs and exploring the potential of integrating LLMs to enhance RL algorithms, especially in the context of Reinforcement Learning from Human/AI Feedback (RLHF/RLAIF).

Please feel free to drop me an e-mail if you are interested in collaborating with me.

๐Ÿ”ฅ News

  • [2024.12] ย ๐ŸŽ‰ One paper accepted by AAAI 2025
  • [2024.05] ย ๐ŸŽ‰ One paper accepted by ICML 2024
  • [2024.01] ย ๐ŸŽ‰ One paper accepted by ICLR 2024
  • [2023.10] ย ๐ŸŽ‰ One paper accepted by OTML workshop at NeurIPS 2023
  • [2022.09] ย ๐ŸŽ‰ One paper accepted by NeurIPS 2022

๐Ÿ“ Publications

(* indicates equal contribution)

Preprints

Conference Papers

Workshop Papers

๐ŸŽ“ Education

  • Tsinghua University, 2023.09 - 2026.06
    Masterโ€™s student in Electronic and Information Engineering (AI)
  • Shandong University, 2019.09 - 2023.06
    B.S. in Statistics (Data Science & AI) with honors

๐ŸŽ– Honors and Awards

  • National Scholarship (Top 1%), 2022.12
  • National Scholarship (Top 1%), 2021.12
  • First Prize in China Undergraduate Mathematical Contest in Modeling (CUMCM) (Top 0.65%), 2021.11
  • Outstanding Student of Shandong Province (Top 0.6%), 2022.05
  • Outstanding Graduate of Shandong Province (Top 6%), 2023.04
  • Dishang Scholarship, 2022.10

๐Ÿ’ป Internships

๐Ÿ› ๏ธ Services

  • Conference Reviewer: NeurIPS (2024), ICLR (2025), AAMAS (2024), AISTATS (2025), ECAI (2024)
  • Workshop Reviewer: NeurIPS OTML (2023)