LifeSciBench: AI Fails 64% of Life-Science Tasks 2026
June 24, 2026
OpenAI's LifeSciBench grades AI on 750 expert-written life-science research tasks. Its best model, GPT-Rosalind, passes just 36.1% — here's what that means.
OpenAI's LifeSciBench grades AI on 750 expert-written life-science research tasks. Its best model, GPT-Rosalind, passes just 36.1% — here's what that means.