← Back to Computation and Language
cs.CL

Mapping the scientific dependencies behind each major discovery

Eric Chamoun, Yizhou Chi, Yulong Chen, Rui Cao, Zifeng Ding, Michalis Korakakis, Andreas Vlachos

May 14, 2026

SciPaths introduces discovery pathway forecasting, a task that requires identifying enabling contributions necessary to realize a target scientific contribution and grounding them in prior work. The benchmark contains 262 expert-annotated and 2,444 silver pathways from ML and NLP papers, each documenting enabling contributions, their roles, and connections to prior work. Evaluation of frontier models shows current approaches struggle with this task, particularly in recovering methodological dependencies, though performance improves substantially when gold enabling contributions are provided—indicating that decomposition quality is the main bottleneck. The benchmark shifts focus from citation prediction and idea generation toward reasoning about scientific causality and prerequisite knowledge.
Published as SciPaths: Forecasting Pathways to Scientific Discovery arXiv:2605.14600
Read the original paper →