News
May 30, 2025 | Our paper Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search 【新智元】 has been accepted to |
---|---|
May 01, 2025 | Will be joining Google DeepMind (Mountain View office) as a Student Researcher! |
Apr 15, 2025 | I will continue my research journey at Harvard as a PhD student! |
Jan 30, 2025 | Our papers Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers 【机器之心】, Quantifying Generalization Complexity for Large Language Models, Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems have been accepted to |