Research

Featured

Do Language Models Update their Forecasts with New Information?

Zhangdie Yuan; Zifeng Ding; Andreas Vlachos (2025)

FOReCAst: The Future Outcome Reasoning and Confidence Assessment Benchmark

Zhangdie Yuan; Zifeng Ding; Andreas Vlachos (2025)

TCP: A Benchmark for Temporal Constraint-Based Planning

Zifeng Ding^*; Sikuan Yan^*; Zhangdie Yuan^*; Xianglong Hu; Fangru Lin; Andreas Vlachos (2025)

PRobELM: Plausibility Ranking Evaluation for Language Models

Zhangdie Yuan; Eric Chamoun; Rami Aly; Chenxi Whitehouse; Andreas Vlachos (2024)

Capturing Symmetry and Antisymmetry in Language Models through Symmetry-Aware Training Objectives

Zhangdie Yuan; Andreas Vlachos (2025)

Can Pretrained Language Models (Yet) Reason Deductively?

Zhangdie Yuan^*; Songbo Hu^*; Ivan Vulić; Anna Korhonen; Zaiqiao Meng (2023)

Zero-Shot Fact-Checking with Semantic Triples and Knowledge Graphs

Zhangdie Yuan; Andreas Vlachos (2024)

Varifocal Question Generation for Fact-Checking

Nedjma Ousidhoum^*; Zhangdie Yuan^*; Andreas Vlachos (2022)

* equal contribution