News | Zhenting Qi

Sep 18, 2025	Our paper EvoLM: In Search of Lost Language Model Training Dynamics has been accepted to `NeurIPS 2025` (oral).
May 30, 2025	Our paper Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search 【新智元】 has been accepted to `ICML 2025`.
May 01, 2025	Will be joining Google DeepMind (Mountain View office) as a Student Researcher, working on language model post-training.
Apr 15, 2025	I will continue my research journey at Harvard as a PhD student!
Jan 30, 2025	Our papers Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers 【机器之心】, Quantifying Generalization Complexity for Large Language Models, Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems have been accepted to `ICLR 2025`.