Hello OHDSI community,
I’m interested in connecting with others who have mapped — or are currently mapping — clinical trial datasets and/or public research datasets to the OMOP CDM.
We are currently working on mapping study-specific data from the Anti-Amyloid in Asymptomatic Alzheimer’s Disease (A4) secondary prevention trial to OMOP and would welcome insight from anyone who has tackled similar work.
Example: A4 Study Dataset
The A4 study includes 1,147 cognitively unimpaired individuals (ages 65–85) randomized to placebo (n=583) or solanezumab (n=564), followed for 240 weeks across 67 international sites. All participants entered with a Clinical Dementia Rating – Global (CDR-G) of 0.
Questions for the Community
- CDR and other cognitive scales
- Have you mapped CDR (including box-level granularity) to standard vocabularies?
- Did you use LOINC concepts for total scores and custom concepts for box-level components?
- How did you handle derived composites (e.g., cognition vs function)?
- Clinical trial–specific variables
- How are others representing:
- Randomization arm
- Study visit structure (e.g., week 0–240 schedule)
- Analysis-derived variables (e.g., progression defined as change at two consecutive visits)?
- Are you using OBSERVATION, MEASUREMENT, FACT_RELATIONSHIP, or custom extensions?
- Public datasets
- Has anyone mapped datasets like ADNI or other longitudinal research cohorts to OMOP?
- Are there shared mapping specifications we should align with?
Motivation
Our goal is to:
- Enable standardized analytics across trial and observational datasets
- Explore reproducible phenotyping of progression in pre-symptomatic Alzheimer’s disease
- Evaluate whether cognitive and functional subcomponents (e.g., CDR memory box vs community affairs) behave differently across amyloid strata
If anyone has experience with mapping these types of datasets, or prior forum threads to share, we would greatly appreciate it.
Happy to share our mapping decisions and lessons learned as this progresses.
Thanks in advance,
Ben