| Academic & Research Initiatives |
|
|
|
|
| Columbia University DBMI |
Chunhua Weng, PhD (chunhua@columbia.edu ); George Hripcsak, MD; Anna Ostropolets, MD, PhD anna.ostropolets@columbia.edu |
Criteria2Query (C2Q) 3.0 (NLP to OMOP SQL); LLM for concept set curation (refining PHOEBE); Automated taxonomy learning. |
Columbia DBMI |
|
| Vanderbilt University Medical Center (VUMC) DBMI |
Wei-Qi Wei, MD, PhD: wei-qi.wei@vumc.org ; Chao Yan, PhD: chao.yan.1@vumc.org |
Evaluation of LLMs (GPT-4, Claude) for generating executable phenotyping algorithms (SQL queries) adhering to a CDM. |
Vanderbilt DBMI; Large Language Models Facilitate the Generation of Electronic Health Record Phenotyping Algorithms |
|
| Stanford University (AIMI / Shah Lab) |
Nigam H. Shah, MBBS, PhD (nigam@stanford.edu ) |
Extensive research on LLM evaluation in healthcare, including automated billing code assignment and AI fairness frameworks. |
The Stanford Center for Artificial Intelligence in Medicine and Imaging (AIMI) |
|
| UTHealth Houston & Mayo Clinic (Agentic MCP) |
Hua Xu, PhD: Hua.Xu@uth.tmc.edu ; Hongfang Liu, PhD: Hongfang.Liu@uth.tmc.edu |
Development of the Agentic Model Context Protocol (MCP) framework for zero-training, hallucination-preventive OMOP mapping using LLMs. |
An Agentic Model Context Protocol Framework for Medical Concept Standardization |
|
| Kingâs College London (KCL) / Alan Turing Institute (MedCAT Team) |
Prof. Richard Dobson: richard.j.dobson@kcl.ac.uk |
Developers of MedCAT/CogStack; NLP for extracting/linking clinical concepts to UMLS/SNOMED CT; Integrating Transformer models. |
Medical Concept Annotation Tool |
|
| UCSF - Bakar Computational Health Sciences Institute |
Vivek Rudrapatna, MD, PhD: Vivek.Rudrapatna@ucsf.edu |
Utilizing LLMs and NLP to transform unstructured narratives into structured data, including OMOP mapping and knowledge graphs (SPOKE). |
UCSF Bakar Computational Health Sciences Institute |
|
| Mass General Brigham (MGB) Research Team |
Heekyong Park, PhD (Corresponding Author): hpark25@mgb.org |
Utilizing LLMs + RAG to improve phenotyping accuracy from unstructured EHR data, comparing against traditional ICD code methods. |
A Comprehensive Evaluation of LLM Phenotyping Using Retrieval-Augmented Generation (RAG): Insights for RAG Optimization |
|
| Mount Sinai Health System (MSHS) |
Eyal Klang, MD (Corresponding Author): eyal.klang@mountsinai.org |
Evaluation of RAG-enhanced LLMs (GPT-4, Llama-3.1) for automating Emergency Department ICD-10-CM coding vs. human coders. |
Assessing Retrieval-Augmented Large Language Model Performance in Emergency Department ICD-10-CM Coding Compared to Human Coders |
|
| Yale School of Medicine |
rohan.khera@yale.edu ; Samah Fodeh, PhD (Corresponding Author): samah.fodeh@yale.edu |
Novel Sentence Transformer-based NLP approaches for mapping EHR data (medications) to the OMOP CDM. |
A Novel Sentence Transformer-based Natural Language Processing Approach for Schema Mapping of Electronic Health Records to the OMOP Common Data Model |
|
| CLH (Code Like Humans) |
Univ. of Cambridge/Microsoft Research Authors andreas@motzfeldt.dk ; Yu-Neng Chuang (Corresponding Author): yc577@cam.ac.uk |
LLM-based agentic framework designed to automate medical coding (ICD-10) by mirroring human processes. |
Code Like Humans: A Multi-Agent Solution for Medical Coding |
|
| MedCodER |
Univ. of Illinois Urbana-Champaign Authors sanmitra1@gmail.com ; Y. Zhang & J. Gao (Corresponding Authors): {yuz9, jgao8}@illinois.edu |
Generative AI framework for automatic medical coding (ICD code prediction) using extraction, retrieval, and re-ranking. |
MedCodER: A Generative AI Assistant for Medical Coding |
|
| Erasmus MC - Medical Informatics |
Peter Rijnbeek, PhD (p.rijnbeek@erasmusmc.nl) |
Exploring NLP and LLMs for extracting concepts from clinical text (including non-English languages) for phenotyping. |
Erasmus MC - Medical Informatics |
|
| The Jackson Laboratory & Charité Berlin |
Peter N. Robinson, MD, MSc (Peter.Robinson@jax.org) |
Development of Human Phenotype Ontology (HPO); Computational deep phenotyping and ontology-based data models (Phenopackets). |
The Jackson Laboratory & Charité Berlin |
|
| KOMAP |
University of Florida and NVIDIA Authors tcai@hsph.harvard.edu |
Knowledge-Driven Online Multimodal Automated Phenotyping; Uses knowledge graph embeddings on EHR concepts to generate feature lists. |
Knowledge-Driven Online Multimodal Automated Phenotyping System |
|
| Open-Source Tools & Consortia |
|
|
|
|
| The Monarch Initiative |
Melissa Haendel, PhD; Chris Mungall, PhD info@monarchinitiative.org |
International consortium for deep phenotyping; NLP/ML to extract phenotypic information and standardize using ontologies (HPO). |
Monarch Initiative |
|
| Llettuce |
University of Manchester Authors g.figueredo@nottingham.ac.uk R.A.C. and G.D. (Corresponding Authors): {rebecca.croft, goran.davidovic}@manchester.ac.uk |
Open-source tool using local LLMs, semantic search, and fuzzy matching for converting medical terms into the OMOP standard vocabulary. |
Llettuce: An Open Source Natural Language Processing Tool for the Translation of Medical Terms into Uniform Clinical Encoding |
|
| RAG-HPO |
Baylor/Texas Childrenâs Hospital Authors Jennifer.Posey@bcm.edu |
Python tool using RAG to enhance LLM accuracy in assigning Human Phenotype Ontology (HPO) terms from medical free-text. |
Improving Automated Deep Phenotyping Through Large Language Models Using Retrieval Augmented Generation |
|
| GCAF (Generalised Codelist Automation Framework) |
A. Aslam (Corresponding Author): a.aslam@sheffield.ac.uk |
Framework and Python repository for automating the development of clinical codelists (Readcodes, SNOMED). |
An automation framework for clinical codelist development validated with UK data from patients with multiple long-term conditions |
|
| CLAMP (Clinical Language Annotation, Modeling, and Processing) |
Hua Xu, PhD (Hua.Xu@uth.tmc.edu ) |
Comprehensive clinical NLP toolkit; Tools for entity recognition and mapping clinical concepts to UMLS/SNOMED CT. |
CLAMP Clinical Language Annotation, Modeling, and Processing Toolkit |
|
| OHNLP/omop_mcp |
jaerongahn@gmail.com |
Open Health NLP; Open-source Model Context Protocol (MCP) server for mapping clinical terminology to OMOP concepts using LLMs. |
GitHub omop_mcp |
|
| pyomop |
bell@nuchange.ca |
Python library for OMOP CDM; Includes LLM-based natural language queries and LLM-agent assisted FHIR to OMOP mapping. |
GitHub pyomop ; Vibe Coding FHIR to OMOP - Bell Eapen MD, PhD. |
|
| MedRAG / MIRAGE Benchmark |
Univ. at Buffalo, NCBI/NLM/NIH Authors |
Toolkit (MedRAG) and benchmark (MIRAGE) for the systematic evaluation of RAG systems in the medical domain. |
Benchmarking Retrieval-Augmented Generation for Medicine |
|
| OHDSI Community Efforts |
|
|
|
|
| OHDSI Generative AI Workgroup |
OHDSI Community Members |
Official workgroup dedicated to the application of generative AI for RWE, including automated taxonomy learning and concept sets. |
Large Language Models Can Enhance OHDSI Evidence Generation Mission |
|
| CHIMERA |
OHDSI Contributors |
Initiative focused on automatic concept set creation and mapping to standard OMOP codes within the OHDSI ATLAS platform. |
CHIMERA: Automatic Concept Set Creation and Mapping to Standard OMOP Codes in ATLAS |
|
| LLM-Powered OMOP Conversion Tool |
Artem Naumenko (OHDSI: ents) daniel@hyperunison.com |
Community initiative utilizing an LLM-first approach for automated structural and semantic mapping to accelerate OMOP conversion. |
Accelerating OMOP Conversion with free LLM-Powered Tool â Looking for Test Users |
|
| Proprietary Software & Industry |
|
|
|
|
| RWE & Data Harmonization Platforms |
|
|
|
|
| IQVIA |
Contact Page |
AI solutions to accelerate OMOP conversions and AI-driven analytics for RWE generation. |
IQVIA HealthGrade AI |
|
| Evidentli |
info@evidentli.com |
Piano and Auto-Mapper tools; Uses ML/NLP to automate the standardization of medical concepts for OHDSI CDM transformation. |
Evidentli |
|
| OM1 |
info@om1.com |
PhenOM AI Platform; AI-powered digital phenotyping to create digital phenotypic âfingerprintsâ for cohort identification. |
OM1 |
|
| Nference |
Google Search |
nferX AI platform; Advanced NLP to normalize data and map concepts to standardized ontologies for RWE generation. |
nference |
|
| Truveta |
Google Search |
âTruveta Language Modelâ (TLM) to clean, harmonize, and normalize disparate EHR data, including extensive concept mapping. |
Truveta |
|
| Terminology & NLP Providers |
|
|
|
|
| IMO Health |
CustomerSupport@imohealth.com |
LLMs for automated medical coding (ICD-10, CPT, SNOMED CT) and âsmarter value set management,â enhanced with RAG and proprietary terminology. |
IMO Health |
|
| John Snow Labs |
info@johnsnowlabs.com |
Healthcare NLP & LLM library; Healthcare-specific LLMs to extract entities and map them to standardized terminologies (RxNorm, ICD, SNOMED, HPO). |
John Snow Labs |
|
| Autonomous & AI-Assisted Coding |
|
|
|
|
| Solventum (Formerly 3M HIS) |
Helps.us@solventum.com |
360 Encompassâą system; AI-powered autonomous and computer-assisted coding solutions. |
Solventum |
|
| Fathom Health |
hi@fathomhealth.com |
Deep learning and LLMs to automate medical coding and auditing (E/M, CPT/HCPCS, ICD). |
Fathom Health |
|
| BUDDI.AI |
Google Search |
CODING.AI; Deep learning platform to automate medical coding (CPT, ICD 10). |
BUDDI.AI |
|
| Nym Health |
Contact Nym | Trusted Medical Coding & Billing Software |
Autonomous medical coding using âClinical Language Understanding (CLU)â to translate notes into billing codes (ICD-10, CPT). |
Nym Health |
|
| CodaMetrix |
hello@codametrix.com |
AI-Powered Contextual Coding Automation Platform; Uses NLP and a âknowledge graphâ to improve coding quality. |
CodaMetrix |
|
| Maverick Medical AI |
info@maverick-ai.com |
Autonomous medical coding platform powered by deep learning, aiming for high âDirect-to-Billâ rate. |
Maverick Medical AI |
|
| MediCodio |
info@medicodio.ai |
AI-powered coding automation; Hybrid strategy using LLMs/AI with SNOMED CT for clinical interpretation, then converting to ICD-10. |
Medicodio |
|
| AGS Health |
https://www.agshealth.com/contact-us/ |
Intelligent AuthorizationÂź Autonomous Coding; NLP/ML to automate ICD-10-CM, PCS, CPT, and E&M code assignment. |
AGS Health |
|
| AutoICD |
info@autoicdapi.com |
AI-powered Clinical Coding API; NLP to process unstructured texts and generate structured data in SNOMED-CT and ICD10. |
AutoICD |
|
| MediMobile (Genesis) |
support@MediMobile.com |
Autonomous medical coding powered by AI (Genesis); Automates CPT & ICD-10 coding in real-time from documentation. |
MediMobile |
|
| Ambient AI & Documentation |
|
|
|
|
| Abridge |
Connect with AI Healthcare Experts | Contact Abridge |
Ambient AI for medical conversations; Real-time mapping of spoken clinical concepts to ICD-10, SNOMED, CPT. |
Abridge |
|
| Suki AI |
support@suki.ai |
AI-powered voice solutions; Automatically generates suggested ICD-10 and CPT codes based on captured clinical context. |
Suki AI |
|
| Prometheus |
sarah_seager@epam.com |
Prometheus is a powerful and versatile platform that can enable collaborative research and data analytics in the healthcare industry. Its comprehensive features, compatibility with industry standards, and focus on security and collaboration make it a valuable tool for researchers and healthcare organizations seeking to leverage the power of data to improve patient outcomes. |
https://solutionshub.epam.com/solution/prometheus |
|