OpenAI ships a biology frontier model
-
Nº XIV
- Date
- 13 May 2026
- Issue
- Fourteen
- Stories
- Five
- Editor
- Agentic Discovery
Five releases, one theme: agents are being graded on real biology now, not toy tasks.
OpenAI launches GPT-Rosalind
OpenAI shipped GPT-Rosalind, a frontier reasoning model tuned for chemistry, protein engineering, and genomics, with trusted-access deployments at Amgen, Moderna, the Allen Institute, and Thermo Fisher. It beats GPT-5.4 on 6 of 11 LABBench2 tasks — the biggest jump comes on CloningQA — and clears the 95th-percentile human-expert score on Dyno's RNA prediction task. A free Codex "Life Sciences research" plugin ships alongside, exposing 50+ scientific tools. The frontier-lab playbook for biology is now explicit: train a domain reasoning model, wire it to lab-relevant tools, partner with named pharma and research institutes.
ClaroAI-Bench grades reproducibility agents
ClaroAI-Bench scores agents on whether they can actually reproduce results from real biomedical papers — code execution, data wrangling, the full pipeline, not just Q&A. The benchmark joins a thin shelf of evaluations stressing agents on end-to-end science rather than isolated reasoning, and anchors a concrete reference for the reproducibility claims every bio-agent vendor now makes.
Agents extend PROTAC databases
Agentic literature extraction augments targeted protein degradation databases past what manual curation can sustain, pulling PROTAC and molecular glue data straight from papers into structured records. Curation throughput, long the binding constraint on degrader chemistry catalogs, stops being the bottleneck — adjacent to the reproducibility framing in #2 above, this is the curation-side counterpart.
MolDeTox tests fragment-by-fragment edits
MolDeTox benchmarks LLMs on stepwise fragment editing for molecular detoxification — can a model walk a toxic molecule to a safe analog one substructure at a time? Establishes a checkable yardstick for agentic medchem, where "the model suggested a fix" has been hard to grade objectively.
Generative platform targets RNA chemistry
A generative chemistry platform for small molecules targeting RNA gets a case study in chemical optimization, pushing generative design past the protein-target comfort zone. RNA-targeted small molecules — a notoriously hard modality — gain a deployable design loop.
Reply with your discoveries. A human reads them. Forward freely.
|