Research

Featured

Selected Publications

LLM Forecasting

Do Language Models Update their Forecasts with New Information?Zhangdie Yuan; Zifeng Ding; Andreas Vlachos(2025)Paper
FOReCAst: The Future Outcome Reasoning and Confidence Assessment BenchmarkZhangdie Yuan; Zifeng Ding; Andreas Vlachos(2025)PaperResource
TCP: A Benchmark for Temporal Constraint-Based PlanningZifeng Ding*; Sikuan Yan*; Zhangdie Yuan*; Xianglong Hu; Fangru Lin; Andreas Vlachos(2025)PaperResource
PRobELM: Plausibility Ranking Evaluation for Language ModelsZhangdie Yuan; Eric Chamoun; Rami Aly; Chenxi Whitehouse; Andreas Vlachos(2024)PaperResource

LLM Reasoning

Capturing Symmetry and Antisymmetry in Language Models through Symmetry-Aware Training ObjectivesZhangdie Yuan; Andreas Vlachos(2025)PaperResource
Can Pretrained Language Models (Yet) Reason Deductively?Zhangdie Yuan*; Songbo Hu*; Ivan Vulić; Anna Korhonen; Zaiqiao Meng(2023)PaperResource

LLM Factuality

Zero-Shot Fact-Checking with Semantic Triples and Knowledge GraphsZhangdie Yuan; Andreas Vlachos(2024)Paper
Varifocal Question Generation for Fact-CheckingNedjma Ousidhoum*; Zhangdie Yuan*; Andreas Vlachos(2022)PaperResource

* equal contribution