Mind Meets Machines - OHDSI Symposium 2025 - Phenotype Development and Evaluation Work Group

Gowtham_Rao · September 20, 2025, 8:55pm

Open Challenge to AI Innovators: Participate in the OHDSI 2025 “Collaborative Intelligence” Scientific Evaluation

Dear OHDSI Community and Innovators in Clinical Informatics,

Following up on the initial ‘Call for Collaboration’ and the detailed experimental design (the “Run-of-Show”) posted previously, we are now actively recruiting for the “AI Arm” of this experiment.

The proposed OHDSI 2025 Symposium session, "Collaborative Intelligence: Humans and AI in Concept Set Development," is a "Minds Meet Machines" challenge designed as a rigorous, hypothesis-driven scientific study. The scientific rigor of this evaluation depends on comparing expert human curation against the most advanced, distinct (multiple) AI pipelines available.

To ensure this is a world-class, scientific evaluation of Humans versus AI in clinical modeling, we must actively seek out and challenge the leaders in this space.

We have conducted an extensive search spanning academic literature (PubMed, preprints, informatics journals), conference proceedings, open-source repositories (GitHub), and proprietary announcements. Our objective was to identify the academic teams, open-source projects, and industry leaders who are actively developing methodologies for AI-assisted concept set development, value set management, and clinical coding (including OMOP, ICD, HPO, and others).

The organizations listed below are the identifier innovators in our systematic search. If you have not been identified, please reach out to us now and you will be invited.

If your organization is listed, it is because you have publicly claimed innovation in this domain. To contact you, we are posting this here, and we are emailing you directly.

We invite you to bring your solutions and expertise to this open collaboration. We challenge you to validate your methodologies within our standardized, blinded, randomized comparative framework using the ~28 defined phenotype conditions detailed in the experimental protocol (see the first post in this thread).

To the Innovators Listed: Are you up to the challenge?

To the OHDSI Community: The success of this collaboration relies on bringing these innovations together. If you collaborate with these teams or individuals, we encourage you to facilitate an introduction or forward this invitation.

Below is the comprehensive list of identified innovators and their publicly available contact information, compiled entirely through public domain searches.

OHDSI 2025 “AI Arm” Challenge: Outreach Tracker

Name	Key Person(s) / Public Contact	Why You (Focus/Innovation)	Link(s)
Academic & Research Initiatives
Columbia University DBMI	Chunhua Weng, PhD (chunhua@columbia.edu ); George Hripcsak, MD; Anna Ostropolets, MD, PhD anna.ostropolets@columbia.edu	Criteria2Query (C2Q) 3.0 (NLP to OMOP SQL); LLM for concept set curation (refining PHOEBE); Automated taxonomy learning.	Columbia DBMI
Vanderbilt University Medical Center (VUMC) DBMI	Wei-Qi Wei, MD, PhD: wei-qi.wei@vumc.org ; Chao Yan, PhD: chao.yan.1@vumc.org	Evaluation of LLMs (GPT-4, Claude) for generating executable phenotyping algorithms (SQL queries) adhering to a CDM.	Vanderbilt DBMI; Large Language Models Facilitate the Generation of Electronic Health Record Phenotyping Algorithms
Stanford University (AIMI / Shah Lab)	Nigam H. Shah, MBBS, PhD (nigam@stanford.edu )	Extensive research on LLM evaluation in healthcare, including automated billing code assignment and AI fairness frameworks.	The Stanford Center for Artificial Intelligence in Medicine and Imaging (AIMI)
UTHealth Houston & Mayo Clinic (Agentic MCP)	Hua Xu, PhD: Hua.Xu@uth.tmc.edu ; Hongfang Liu, PhD: Hongfang.Liu@uth.tmc.edu	Development of the Agentic Model Context Protocol (MCP) framework for zero-training, hallucination-preventive OMOP mapping using LLMs.	An Agentic Model Context Protocol Framework for Medical Concept Standardization
King’s College London (KCL) / Alan Turing Institute (MedCAT Team)	Prof. Richard Dobson: richard.j.dobson@kcl.ac.uk	Developers of MedCAT/CogStack; NLP for extracting/linking clinical concepts to UMLS/SNOMED CT; Integrating Transformer models.	Medical Concept Annotation Tool
UCSF - Bakar Computational Health Sciences Institute	Vivek Rudrapatna, MD, PhD: Vivek.Rudrapatna@ucsf.edu	Utilizing LLMs and NLP to transform unstructured narratives into structured data, including OMOP mapping and knowledge graphs (SPOKE).	UCSF Bakar Computational Health Sciences Institute
Mass General Brigham (MGB) Research Team	Heekyong Park, PhD (Corresponding Author): hpark25@mgb.org	Utilizing LLMs + RAG to improve phenotyping accuracy from unstructured EHR data, comparing against traditional ICD code methods.	A Comprehensive Evaluation of LLM Phenotyping Using Retrieval-Augmented Generation (RAG): Insights for RAG Optimization
Mount Sinai Health System (MSHS)	Eyal Klang, MD (Corresponding Author): eyal.klang@mountsinai.org	Evaluation of RAG-enhanced LLMs (GPT-4, Llama-3.1) for automating Emergency Department ICD-10-CM coding vs. human coders.	Assessing Retrieval-Augmented Large Language Model Performance in Emergency Department ICD-10-CM Coding Compared to Human Coders
Yale School of Medicine	rohan.khera@yale.edu ; Samah Fodeh, PhD (Corresponding Author): samah.fodeh@yale.edu	Novel Sentence Transformer-based NLP approaches for mapping EHR data (medications) to the OMOP CDM.	A Novel Sentence Transformer-based Natural Language Processing Approach for Schema Mapping of Electronic Health Records to the OMOP Common Data Model
CLH (Code Like Humans)	Univ. of Cambridge/Microsoft Research Authors andreas@motzfeldt.dk ; Yu-Neng Chuang (Corresponding Author): yc577@cam.ac.uk	LLM-based agentic framework designed to automate medical coding (ICD-10) by mirroring human processes.	Code Like Humans: A Multi-Agent Solution for Medical Coding
MedCodER	Univ. of Illinois Urbana-Champaign Authors sanmitra1@gmail.com ; Y. Zhang & J. Gao (Corresponding Authors): {yuz9, jgao8}@illinois.edu	Generative AI framework for automatic medical coding (ICD code prediction) using extraction, retrieval, and re-ranking.	MedCodER: A Generative AI Assistant for Medical Coding
Erasmus MC - Medical Informatics	Peter Rijnbeek, PhD (p.rijnbeek@erasmusmc.nl)	Exploring NLP and LLMs for extracting concepts from clinical text (including non-English languages) for phenotyping.	Erasmus MC - Medical Informatics
The Jackson Laboratory & Charité Berlin	Peter N. Robinson, MD, MSc (Peter.Robinson@jax.org)	Development of Human Phenotype Ontology (HPO); Computational deep phenotyping and ontology-based data models (Phenopackets).	The Jackson Laboratory & Charité Berlin
KOMAP	University of Florida and NVIDIA Authors tcai@hsph.harvard.edu	Knowledge-Driven Online Multimodal Automated Phenotyping; Uses knowledge graph embeddings on EHR concepts to generate feature lists.	Knowledge-Driven Online Multimodal Automated Phenotyping System
Open-Source Tools & Consortia
The Monarch Initiative	Melissa Haendel, PhD; Chris Mungall, PhD info@monarchinitiative.org	International consortium for deep phenotyping; NLP/ML to extract phenotypic information and standardize using ontologies (HPO).	Monarch Initiative
Llettuce	University of Manchester Authors g.figueredo@nottingham.ac.uk R.A.C. and G.D. (Corresponding Authors): {rebecca.croft, goran.davidovic}@manchester.ac.uk	Open-source tool using local LLMs, semantic search, and fuzzy matching for converting medical terms into the OMOP standard vocabulary.	Llettuce: An Open Source Natural Language Processing Tool for the Translation of Medical Terms into Uniform Clinical Encoding
RAG-HPO	Baylor/Texas Children’s Hospital Authors Jennifer.Posey@bcm.edu	Python tool using RAG to enhance LLM accuracy in assigning Human Phenotype Ontology (HPO) terms from medical free-text.	Improving Automated Deep Phenotyping Through Large Language Models Using Retrieval Augmented Generation
GCAF (Generalised Codelist Automation Framework)	A. Aslam (Corresponding Author): a.aslam@sheffield.ac.uk	Framework and Python repository for automating the development of clinical codelists (Readcodes, SNOMED).	An automation framework for clinical codelist development validated with UK data from patients with multiple long-term conditions
CLAMP (Clinical Language Annotation, Modeling, and Processing)	Hua Xu, PhD (Hua.Xu@uth.tmc.edu )	Comprehensive clinical NLP toolkit; Tools for entity recognition and mapping clinical concepts to UMLS/SNOMED CT.	CLAMP Clinical Language Annotation, Modeling, and Processing Toolkit
OHNLP/omop_mcp	jaerongahn@gmail.com	Open Health NLP; Open-source Model Context Protocol (MCP) server for mapping clinical terminology to OMOP concepts using LLMs.	GitHub omop_mcp
pyomop	bell@nuchange.ca	Python library for OMOP CDM; Includes LLM-based natural language queries and LLM-agent assisted FHIR to OMOP mapping.	GitHub pyomop ; Vibe Coding FHIR to OMOP - Bell Eapen MD, PhD.
MedRAG / MIRAGE Benchmark	Univ. at Buffalo, NCBI/NLM/NIH Authors	Toolkit (MedRAG) and benchmark (MIRAGE) for the systematic evaluation of RAG systems in the medical domain.	Benchmarking Retrieval-Augmented Generation for Medicine
OHDSI Community Efforts
OHDSI Generative AI Workgroup	OHDSI Community Members	Official workgroup dedicated to the application of generative AI for RWE, including automated taxonomy learning and concept sets.	Large Language Models Can Enhance OHDSI Evidence Generation Mission
CHIMERA	OHDSI Contributors	Initiative focused on automatic concept set creation and mapping to standard OMOP codes within the OHDSI ATLAS platform.	CHIMERA: Automatic Concept Set Creation and Mapping to Standard OMOP Codes in ATLAS
LLM-Powered OMOP Conversion Tool	Artem Naumenko (OHDSI: ents) daniel@hyperunison.com	Community initiative utilizing an LLM-first approach for automated structural and semantic mapping to accelerate OMOP conversion.	Accelerating OMOP Conversion with free LLM-Powered Tool – Looking for Test Users
Proprietary Software & Industry
RWE & Data Harmonization Platforms
IQVIA	Contact Page	AI solutions to accelerate OMOP conversions and AI-driven analytics for RWE generation.	IQVIA HealthGrade AI
Evidentli	info@evidentli.com	Piano and Auto-Mapper tools; Uses ML/NLP to automate the standardization of medical concepts for OHDSI CDM transformation.	Evidentli
OM1	info@om1.com	PhenOM AI Platform; AI-powered digital phenotyping to create digital phenotypic “fingerprints” for cohort identification.	OM1
Nference	Google Search	nferX AI platform; Advanced NLP to normalize data and map concepts to standardized ontologies for RWE generation.	nference
Truveta	Google Search	“Truveta Language Model” (TLM) to clean, harmonize, and normalize disparate EHR data, including extensive concept mapping.	Truveta
Terminology & NLP Providers
IMO Health	CustomerSupport@imohealth.com	LLMs for automated medical coding (ICD-10, CPT, SNOMED CT) and “smarter value set management,” enhanced with RAG and proprietary terminology.	IMO Health
John Snow Labs	info@johnsnowlabs.com	Healthcare NLP & LLM library; Healthcare-specific LLMs to extract entities and map them to standardized terminologies (RxNorm, ICD, SNOMED, HPO).	John Snow Labs
Autonomous & AI-Assisted Coding
Solventum (Formerly 3M HIS)	Helps.us@solventum.com	360 Encompass™ system; AI-powered autonomous and computer-assisted coding solutions.	Solventum
Fathom Health	hi@fathomhealth.com	Deep learning and LLMs to automate medical coding and auditing (E/M, CPT/HCPCS, ICD).	Fathom Health
BUDDI.AI	Google Search	CODING.AI; Deep learning platform to automate medical coding (CPT, ICD 10).	BUDDI.AI
Nym Health	Contact Nym \| Trusted Medical Coding & Billing Software	Autonomous medical coding using “Clinical Language Understanding (CLU)” to translate notes into billing codes (ICD-10, CPT).	Nym Health
CodaMetrix	hello@codametrix.com	AI-Powered Contextual Coding Automation Platform; Uses NLP and a “knowledge graph” to improve coding quality.	CodaMetrix
Maverick Medical AI	info@maverick-ai.com	Autonomous medical coding platform powered by deep learning, aiming for high “Direct-to-Bill” rate.	Maverick Medical AI
MediCodio	info@medicodio.ai	AI-powered coding automation; Hybrid strategy using LLMs/AI with SNOMED CT for clinical interpretation, then converting to ICD-10.	Medicodio
AGS Health	https://www.agshealth.com/contact-us/	Intelligent Authorization® Autonomous Coding; NLP/ML to automate ICD-10-CM, PCS, CPT, and E&M code assignment.	AGS Health
AutoICD	info@autoicdapi.com	AI-powered Clinical Coding API; NLP to process unstructured texts and generate structured data in SNOMED-CT and ICD10.	AutoICD
MediMobile (Genesis)	support@MediMobile.com	Autonomous medical coding powered by AI (Genesis); Automates CPT & ICD-10 coding in real-time from documentation.	MediMobile
Ambient AI & Documentation
Abridge	Connect with AI Healthcare Experts \| Contact Abridge	Ambient AI for medical conversations; Real-time mapping of spoken clinical concepts to ICD-10, SNOMED, CPT.	Abridge
Suki AI	support@suki.ai	AI-powered voice solutions; Automatically generates suggested ICD-10 and CPT codes based on captured clinical context.	Suki AI
Prometheus	sarah_seager@epam.com	Prometheus is a powerful and versatile platform that can enable collaborative research and data analytics in the healthcare industry. Its comprehensive features, compatibility with industry standards, and focus on security and collaboration make it a valuable tool for researchers and healthcare organizations seeking to leverage the power of data to improve patient outcomes.	https://solutionshub.epam.com/solution/prometheus

Call to Action for the AI Arm

This experiment is intended to generate high-quality, peer-reviewable evidence regarding the integration of AI into the phenotyping lifecycle.

If your organization is interested in submitting your AI pipeline for this evaluation, please REPLY to this forum thread or contact me directly rao@ohdsi.org .

As noted in the experimental stipulations:

“To ensure reliability and fairness, the AI-generated concept sets have to be generated prior to the symposium using the exact same standardized inputs provided to the human teams.”

“AI Methodology Transparency: The AI prompt and system settings (e.g., agent, version, temperature) used for the pre-generated AI concept sets will be documented.” However, you do not have to reveal your proprietary secrets. Also this is not an opportunity to market yourself.

We will coordinate with participating teams on the next steps for this crucial pre-symposium generation phase.

Sincerely,

Gowtham Rao

On behalf of the Phenotype Development and Evaluation Workgroup Organizing Leads & Contributors (Gowtham A Rao MD, PhD; Azza A Shoaibi PhD; Joel Swerdel PhD; Jack Murphy MPH, PhD)