News

May 30, 2025

Our paper "Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search" 【新智元】 has been accepted to ICML 2025.

May 01, 2025

I will be joining Google DeepMind (Mountain View office) as a Student Researcher!

Apr 15, 2025

I will continue my research journey at Harvard as a PhD student!

Jan 30, 2025

Our papers "Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers" 【机器之心】, "Quantifying Generalization Complexity for Large Language Models", and "Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems" have been accepted to ICLR 2025.