Open-Source Research Projects:
First Author Publications:
LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling
Yang Xiao, J Wang, R Yuan, C Xu, K Xu, W Li, P Liu
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
Yang Xiao, J Wang, Q Xu, C Song, C Xu, Y Cheng, W Li, P Liu
How far are llms from believable ai? a benchmark for evaluating the believability of human behavior simulation
Yang Xiao, Y Cheng, J Fu, J Wang, W Li, P Liu
DataLab: A Platform for Data Analysis and Intervention
Yang Xiao, Jinlan Fu, Weizhe Yuan, Vijay Viswanathan, Zhoumianze Liu, Yixin Liu, Graham Neubig, Pengfei Liu
paper
code
🏆 Outstanding Demo Award
(16 citations, ACL 2022)
Are All the Datasets in Benchmark Necessary? A Pilot Study
Yang Xiao, Jinlan Fu, See-Kiong Ng, Pengfei Liu
paper
(2 citations, NAACL 2022)
Collaborative Publications:
LIMO: Less is more for reasoning
Y Ye, Z Huang, Yang Xiao, E Chern, S Xia, P Liu
Aime-preview: A rigorous and immediate evaluation framework for advanced mathematical reasoning
Y Ye, Yang Xiao, T Mi, P Liu
Enhancing User Engagement in Socially-Driven Dialogue through Interactive LLM Alignments
J Wang, K Song, C Xu, C Song, Yang Xiao, D Li, L Qiu, W Li
Olympicarena: Benchmarking multi-discipline cognitive reasoning for superintelligent ai
Z Huang, Z Wang, S Xia, X Li, H Zhou, R Xu, RZ Fan, L Ye, E Chern, Y Ye, Yang Xiao, et al.
Towards a client-centered assessment of llm therapists by client simulation
J Wang, Yang Xiao, Y Li, C Song, C Xu, C Tan, W Li
On the Robustness of Reading Comprehension Models to Entity Renaming
Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren
ExplainaBoard: An Explainable Leaderboard for NLP
Pengfei Liu, Jinlan Fu, Yang Xiao, Weizhe Yuan, Shuaicheng Chang, et al.
paper
code
🏆 Best Demo Award
(75 citations, ACL 2021)