Publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software EngineeringarXiv preprint arXiv:2505.23604, 2025
- Measuring the Faithfulness of Thinking Drafts in Large Reasoning ModelsarXiv preprint arXiv:2505.13774, 2025
2024
- P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning ChainsarXiv preprint arXiv:2410.09207, 2024
- Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence ChallengearXiv preprint arXiv:2411.01796, 2024
- Generalizing Trust: Weak-to-Strong Trustworthiness in Language ModelsarXiv preprint arXiv:2501.00418, 2024
2023
- QTSumm: Query-focused summarization over tabular dataarXiv preprint arXiv:2305.14303, Dec 2023
- RobuT: A systematic study of table QA robustness against human-annotated adversarial perturbationsarXiv preprint arXiv:2306.14321, Jul 2023
2022
- ReasTAP: Injecting table reasoning skills during pre-training via synthetic reasoning examplesarXiv preprint arXiv:2210.12374, Jul 2022
- Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection ModelIn 2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI), Jul 2022