3 min read

Agents building models, models building drugs

Good Morning!

Six stories today spanning autonomous model construction, foodborne pathogen triage, and a benchmark that may save your team from chasing bigger parameters.


1. Agent builds biomedical ML models autonomously

Agent builds biomedical ML models autonomously

AIDO.Builder automates ML model construction for biomedical tasks end-to-end, according to a new bioRxiv preprint — the system takes a problem specification and autonomously selects architectures, trains, and evaluates models without researcher intervention. AIDO.Builder (AI-Driven Organism modeling, Builder module) sits on top of existing biological foundation models and acts as an orchestrating agent (a system that plans and executes multi-step tasks using tools) over the model-building pipeline. For computational biology groups that spend weeks hand-tuning architectures for each new omics task, this shifts the bottleneck from model construction to problem specification. Read More →

Why it matters: If the claims hold up on independent benchmarks, wet-lab groups without deep ML expertise gain a credible path to custom predictive models — which changes what any core facility or CRO should promise clients by end of year.


2. Bigger models don't win in drug discovery

Bigger models don't win in drug discovery

Scaling up model size does not reliably improve molecular property and activity prediction in drug discovery, according to a new arxiv benchmark across a range of AI-driven cheminformatics tasks. The paper finds that mid-scale models frequently match or beat larger ones on ADMET (absorption, distribution, metabolism, excretion, and toxicity) and bioactivity endpoints — meaning compute budgets for drug-discovery teams may be better spent on data curation than on parameter counts. Read More →


3. Framework for staged clinical AI autonomy

Framework for staged clinical AI autonomy

A staged-autonomy framework for clinical AI proposes moving from black-box model confidence scores to structured evidence requirements and supervision checkpoints before expanding model authority in care pathways. For clinical deployment teams navigating FDA's evolving guidance on AI/ML-based software as a medical device (SaMD), the framework offers a concrete accountability scaffold rather than a vendor trust-us claim. Read More →


4. ESM-2 embeddings triage foodborne pathogens

ESM-2 embeddings triage foodborne pathogens

Whole-proteome ESM-2 embeddings — ESM-2 being Meta's large protein language model — recover bacterial taxonomy and flag genomic outliers in foodborne pathogen surveillance without any task-specific training. For public-health genomics groups running routine sequencing of Salmonella or Listeria isolates, this is a geometry-aware triage layer that could slot in ahead of slower phylogenetic pipelines. Read More →


5. Self-improving agents gain 40% autonomously

Self-improving agents gain 40% autonomously

Open-source self-improving agents reportedly raised performance 40% without human intervention, via automated prompt and tool refinement loops described in an r/LLMDevs post — for bioinformatics pipeline agents where manual prompt tuning is a recurring cost, the pattern is worth pressure-testing on your own eval sets before trusting the headline number. Read More →


6. Security gateway for agent API traffic

Security gateway for agent API traffic

AgentPort open-sources a security gateway that sits between agents and external APIs, handling credential scoping, rate limiting, and audit logging — for any team running agents against clinical data APIs or EHR endpoints where access controls must be demonstrable to compliance reviewers. Read More →


Reply with your discoveries. A human reads them. Forward freely.