AI x Bio Benchmark Saturation
Each dot is a biology benchmark that has saturated. The Y-axis shows how many months it took to saturate after introduction. Benchmarks introduced more recently saturate dramatically faster.
Since 2022, models halve the remaining performance gap on biology benchmarks every
…and the rate is accelerating
Based on 73 benchmarks across 11 domains
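To make the "halving the remaining gap" framing concrete, here is a minimal sketch that assumes the gap to a benchmark's ceiling decays exponentially with a fixed half-life. The function names and the numbers in the usage lines are hypothetical illustrations, not figures from this page; the actual halving period is not stated above and is not assumed here.

```python
import math


def remaining_gap(initial_gap: float, months_elapsed: float, half_life_months: float) -> float:
    """Gap left after `months_elapsed`, assuming it halves every `half_life_months`."""
    if half_life_months <= 0:
        raise ValueError("half_life_months must be positive")
    return initial_gap * 0.5 ** (months_elapsed / half_life_months)


def estimate_half_life(gap_start: float, gap_end: float, months_between: float) -> float:
    """Half-life (in months) implied by two gap measurements under exponential decay."""
    if not 0 < gap_end < gap_start:
        raise ValueError("expected 0 < gap_end < gap_start")
    return months_between * math.log(2) / math.log(gap_start / gap_end)


# Hypothetical example: a 40-point gap that halves every 6 months.
gap_after_year = remaining_gap(40.0, months_elapsed=12, half_life_months=6)

# Hypothetical example: a gap that shrank from 40 to 10 points over 12 months.
implied_half_life = estimate_half_life(40.0, 10.0, months_between=12)
```

Under this assumption a constant half-life gives a straight line on a log scale; an accelerating rate would show up as a half-life that shrinks when estimated over later time windows.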
The table shows which bio capability domains currently have active, unsaturated benchmarks for each evaluation type.
| Domain | Knowledge | Reasoning | Procedural | Agentic |
|---|---|---|---|---|
| Virology / Biosecurity | | | | |
| Genomics | | | | |
| Protein | | | | |
| Drug Discovery | | | | |
| Clinical | | | | |
| Bio NLP | | | | |
| Medical Imaging | | | | |
| Science QA | | | | |
| Agentic Bio | | | | |