Reproduce the paper numbers from released predictions For each model we release its per-sample prediction dump on the validation split — the exact outputs extract_predicts produced — so you can ...