Dear OHDSI Community,
“It’s finally arrived. That wonderful month when you can put all your troubles aside, cast off those New Year’s resolutions you’ve already failed at, enjoy the freezing cold Northeastern US weather or the Australian heat, and JUST FOCUS ON PHENOTYPING!”
We are excited to launch Phenotype Phebruary 2026. Another ambitious community experiment designed to advance how we collectively build, evaluate, and refine phenotypes at scale.
28 days. ONE phenotype. An end‑to‑end, iterative development and evaluation challenge.
That’s our target for OHDSI in 2026. Are you ready?
What’s New This Year
Over the last decade, OHDSI has built a tool enabled structured workflow to support phenotype development and evaluation—Atlas, CohortDiagnostics, KEEPER, PheValuators. Then last year in the OHDSI Symposium, we did the Minds Meet Machines* when this workflow was challenged by AI!
And because this is the era of the reasoning models—and LLMs have shown promise to transform phenotype development—this year we’re introducing The Phenotype Challenge, a collaborative experiment to test an iterative, empirically grounded, AI‑assisted workflow.
We’ll explore whether we, as a community, can develop phenotypes through a cycle of:
- Development
- Evaluation
- Error analysis
- Refinement
- Re‑evaluation
All with help from LLM‑enhanced KEEPER, multiple data sources, and robust diagnostics.
Challenge Workflow
1. Vote on the Condition
We will begin with a community vote between two candidate conditions. The winning condition will be announced immediately after the poll closes.
2. Submit Your Initial Phenotype (Feb 1–13)
- All collaborators are invited to submit their best phenotype definition(s), using any method—rule‑based, ML‑based, hybrid—so long as it conforms to the OMOP CDM. You can submit more than one if it’s needed (e.g. specific, sensitive).
- Deadline: Friday, February 13.
3. Evaluation & Diagnostics (Feb 12–24)
We will evaluate all submitted definitions using:
- LLM‑enabled KEEPER to estimate PPV and sensitivity (phevaluator results will also be shared if possible)
- At least three observational data sources (Optum DOD, Optum EHR, JMD)
- A CohortDiagnostics app with the implementations across the same data sources
Collaborators will have access to diagnostics and profiles to analyze performance and identify error sources.
4. Iteration #1 — Second Submission Due Feb 25
Submit your refined definition by Wednesday, February 25.
We will re‑evaluate all updated submissions with the same process.
5. Iteration #2 — Final Submission Due Mar 3
Submit your final definition by Tuesday, March 3.
Final performance results will be announced mid‑March, including:
- Best overall performance
- Most improved
- Most generalizable
Working Group & Community Calls
Phenotype WG (Tuesday 9am EST)
- Feb 3: Support for LLM enabled literature review for existing phenotypes+ phenotype development in ATLAS + GenAI concept set generation (based on the learnings of minds meet machines)
- Feb 10: Discussion of evaluation results, diagnostics, and iteration strategies-*Clinical partners/collaborators’ participation will be very valuable in this session.
- Feb 17 & Feb 24: Optional office hours for iteration support
OHDSI Community Calls (Tuesdays)
- Feb 17: Update on submissions and early insights
- Feb 24: How KEEPER profiles reveal phenotype error sources
- Mar 3: Wrap‑up and reflections
Submission Requirements
Please include:
- An OMOP‑CDM‑compliant phenotype definition (ATLAS JSON preferred)
- High level description of your development method (any approach allowed, including ML)
- Able run on participating data sources platforms
Note: Data partners will run KEEPER locally. No patient profiles will be shared.
Data Partners: We Need You
If you operate an OMOP CDM instance, we encourage you to:
- Run submitted definitions on your data
- Share CohortDiagnostics results
- Execute KEEPER on your own data with manual review (no LLM)
- Execute LLM‑enabled KEEPER locally
Your participation strengthens generalizability and enriches community learning.
Let’s Build the Future of Phenotyping
Phenotype Phebruary 2026 is our opportunity to prototype a scalable, systematic, and AI‑enhanced phenotype development workflow—together.
We invite all collaborators to submit definitions, iterate with us, and learn from this community-wide experiment.
More details—including the condition poll and submission instructions—will be shared shortly on the OHDSI Forums
Let’s make Phenotype Phebruary 2026 our most innovative and impactful yet!
Warm regards,
The OHDSI Phenotype Workgroup
*Minds meet machine drafted manuscript:
For those who participate or contribute to minds meet machine- or those who just want to see the first draft, here is the drafted manuscript and related artifacts. Please feel free to edit and add your author information into the authorship table if you meet author criteria.
Azza Shoaibi PhD
Gowtham Rao MD, PhD