Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. satori.png
    Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
    Maohao Shen*, Guangtao Zeng*, Zhenting Qi*, Zhang-Wei Hong, Zhenfang Chen, and 5 more authors
    In the Forty-Second International Conference on Machine Learning (ICML), 2025
  2. rstar.png
    Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver
    Zhenting Qi, Mingyuan MA, Jiahang Xu, Li Lyna Zhang, Fan Yang, and 1 more author
    In The Thirteenth International Conference on Learning Representations (ICLR), 2025
  3. scylla.png
    Quantifying Generalization Complexity for Large Language Models
    Zhenting Qi, Hongyin Luo, Xuliang Huang, Zhuokai Zhao, Yibo Jiang, and 3 more authors
    In The Thirteenth International Conference on Learning Representations (ICLR), 2025
  4. rag.png
    Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
    Zhenting Qi, Hanlin Zhang, Eric P. Xing, Sham M. Kakade, and Himabindu Lakkaraju
    In The Thirteenth International Conference on Learning Representations (ICLR), 2025
  5. Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering
    Guangtao Zeng, Maohao Shen, Delin Chen, Zhenting Qi, Subhro Das, and 6 more authors
    arXiv preprint arXiv:2505.23604, 2025
  6. Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models
    Zidi Xiong, Chen Shan, Zhenting Qi, and Himabindu Lakkaraju
    arXiv preprint arXiv:2505.13774, 2025

2024

  1. P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
    Simeng Han, Aaron Yu, Rui Shen, Zhenting Qi, Martin Riddell, and 6 more authors
    arXiv preprint arXiv:2410.09207, 2024
  2. Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge
    Weihua Du, Qiushi Lyu, Jiaming Shan, Zhenting Qi, Hongxin Zhang, and 6 more authors
    arXiv preprint arXiv:2411.01796, 2024
  3. Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models
    Martin Pawelczyk, Lillian Sun, Zhenting Qi, Aounon Kumar, and Himabindu Lakkaraju
    arXiv preprint arXiv:2501.00418, 2024
  4. folio.png
    FOLIO: Natural Language Reasoning with First-Order Logic
    Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, and 30 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), Nov 2024

2023

  1. Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness
    Xiaoyu Tan, Shaojie Shi, Xihe Qiu, Chao Qu, Zhenting Qi, and 2 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track, Dec 2023
  2. QTSumm: Query-focused summarization over tabular data
    Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, and 6 more authors
    arXiv preprint arXiv:2305.14303, Dec 2023
  3. loft.png
    LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
    Yilun Zhao*, Zhenting Qi*, Linyong Nan, Lorenzo Jaime Flores, and Dragomir Radev
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023
  4. pillow.png
    PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching
    Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu, and 1 more author
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track, Dec 2023
  5. safer.png
    SaFER: A Robust and Efficient Framework for Fine-tuning BERT-based Classifier with Noisy Labels
    Zhenting Qi, Xiaoyu Tan, Chao Qu, Yinghui Xu, and Yuan Qi
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), Jul 2023
  6. OpenRT: An Open-source Framework for Reasoning Over Tabular Data
    Yilun Zhao, Boyu Mi, Zhenting Qi, Linyong Nan, Minghao Guo, and 2 more authors
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), Jul 2023
  7. RobuT: A systematic study of table QA robustness against human-annotated adversarial perturbations
    Yilun Zhao, Chen Zhao, Linyong Nan, Zhenting Qi, Wenlin Zhang, and 3 more authors
    arXiv preprint arXiv:2306.14321, Jul 2023

2022

  1. ReasTAP: Injecting table reasoning skills during pre-training via synthetic reasoning examples
    Yilun Zhao, Linyong Nan, Zhenting Qi, Rui Zhang, and Dragomir Radev
    arXiv preprint arXiv:2210.12374, Jul 2022
  2. Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model
    Zhenting Qi, Ruike Zhu, Zheyu Fu, Wenhao Chai, and Volodymyr Kindratenko
    In 2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI), Jul 2022