benchmark evidence
LegalBench
Legal reasoning benchmark covering 162 tasks from law school curricula. Average accuracy across all tasks.
activeupstream source ->
no matched results
No public rows from this source currently map to an active callable model.
what this result means
Legal reasoning benchmark covering 162 tasks from law school curricula. Average accuracy across all tasks.
This benchmark contributes direct public evidence. Read its scope before generalizing the result.
A win here is a win on LegalBench. Broad task pages require independent corroboration before naming a general winner.
source record
category: reasoning
metric: accuracy
matched models: 0
latest source date: date unavailable
direction: higher is better