OHDSI Home | Forums | Wiki | Github

OMOP CDM and Clinical Trials

I was asked to organize a group around this past proposal in the CDM workgroup

fixed link:
http://www.ohdsi.org/web/wiki/doku.php?id=documentation:next_cdm:clinical_trial

In addition to CDM, we also have “conventions” (see here Per CDM tutorial slides 5 visit types?).

If you are interested in creating conventions (or CDM changes) for clinical trials data, please reply to this thread.

Make sure to delete the edit from the link so you can see it properly:

http://www.ohdsi.org/web/wiki/doku.php?id=documentation:next_cdm:clinical_trial

1 Like

@Vojtech_Huser:

Conventions? You don’t think we need some data structures? What about the arms?

This is an area that I and a couple of my colleagues would like to get involved with. Has the WG met yet?

@christian - I agree that trial arms are also in scope.

I also want to document that there is communication with Columbia U on how to unify some clinical trial data conventions. (possibly to be used by the phenotypeDataExtraction package) (find similar patients given an a trial NCT)

For representing enrollment and withdrawing for a trial identified by an NCT, possible parameters would be SNOMED codes from here

OMOP vocabulary seems to have also some SNOMED national extension (for example, scottish study is represented under this concept)

to keep the conversation going:

Here is possible representation of trial enrollment using OBSERVATION table.

entering a trial - Enrollement (start)
Observation_concept_id: http://www.ohdsi.org/web/atlas/#/concept/4163733 (Patient consented to clinical trial)
value_as_string: NCT000035135

leaving the trial early (due to a decision of a patient (no longer interested) or researcher (non compliance) (end)

*Observation_concept_id:*http://www.ohdsi.org/web/atlas/#/concept/4087907 (Patient withdrawn from trial)
value_as_string: NCT000035135

fully completing the trial (end)
Observation_concept_id: http://www.ohdsi.org/web/atlas/#/concept/40482840 Completion of clinical trial
value_as_string: NCT000035135

Please, sign me up

I was discussing study data transformation with someone (think SDTM data transformed to ADAM data).

Here is another issue for the group to discuss:

To capture raw data as well as “for analysis” transformed data - I propose to use the same table (e.g., measurement) and use type columns to indicate raw vs. transformed. (just to put a proposal to the discussion)

Not capturing raw data is probably not an option. This would make OMOP inferior to current standards.

To capture transformation steps (captured in Define.XML), OMOP METADATA table could be used for that purpose.

I would start with the analysis. Because that is what the OMOP CDM is. And that’s were the use cases live. Raw data are pre-ETL in this world.

I am interested.

@Vojtech_Huser - Since there were a few posters at the Symposium on using OMOP and RWE for Clinical Trial analysis, thought it might make sense to try and revive this thread.

I see a few levels of items to consider for being able to fully leverage the power of OMOP CDM with RCTs. I don’t think this is exhaustive per se, but thought it might help spur the discussion.

A: Data Model Interoperability:

  1. CDISC SDTM conversion to OMOP CDM
  2. CDISC ADaM conversion to OMOP CDM
  3. OMOP conversion back to CDISC

B: RCT Administrative Records:

  1. Patient Screening (not the recorded determination of eligibility, but that the screening took place)
  2. Patient Randomization

C: RCT Visit Type (e.g., was the visit for screening, baseline, follow-up, etc.)

D: Non-standard Interventions and Outcomes (e.g., how to record and deal with investigational therapies, unvalidated PROs, etc.)

@Vojtech_Huser - I’m wondering where you’d recommend is the best place to capture the issues and lessons learned from our respective CDISC conversion efforts?

2 Likes

I like your proposed topics. I focused mostly on how trial data can co-exist next to EHR data when both are ETLed into OMOP.

Some of my observations are summarized here:project/sdtm-omop at master · vojtechhuser/project · GitHub

One community effort could be to produce “SOURCE_TO_CONCEPT” map for the lab test below: (that we saw in the sample data)

(Also tagging @Craig_Mayer in our clinical research informatics team)

(observations also pasted for those who do not click the link)

Sample data conversion results

  • sample CDISC data differes in format from trial RWD (3 trials)
    • person identifier used (SUBJID, RSUBJID,RUSUBJID))
      • large sample needed but not available; must also be recent (trial started and finished in the last 5 years)
  • SDTM manual is very long, in PDF format (this should change with CDISC SHARE initaitive)
  • Demographic (DM) domain does not contain year of birth. It is not clear at what time age is calculated. (at first visit of the subject?; day 1 of the trial) (convention)
  • trial data may be organized by visit number (not absolute date)
  • Demographic (DM) domain data on screen failure or arm: missing suitable concept_id (issue) (tentatively using LOINC 68839-0| Research Note) (todo new concept)
  • LOINC codes will be used in SDTM (in 2018, FDA clarified a LOINC strategy) so detailed mapping is not pursued.
  • Assignment into arms can be in COHORT table or as OBSERVATION or other table (convention).
  • Vital Signs (VS) domain is very similar to Laboratory (LB) domain. Both have standard units and original value and numeric value. LB has exact datetime, whereas VS has only date (issue).
  • _source_value columns were sometimes overloaded with multiple elements separated by delimiters (issue)
  • Medical History (MH) domain captures a condition at a given visit (date1) but
    records onset date (date2). Date2 can be null (not known) (convention) (issue)

update: the trial arm term will be in SNOMED CT in Jan 2019 :slight_smile:!
(I submitted it)

Please vote/comment on this proposed high priority convention for study data Convention for representiong assignment of person to a clinical trial · Issue #37 · OHDSI/Themis · GitHub

Also, we would need row in METADATA for date.

@Vojtech_Huser What is the status of this proposal? Is there an agreement on the convention within Themis?

A next step would be to also capture the details on the study in OMOP, like name, inclusion criteria and arms. A new Study domain/table might be necessary. Or has the assumption been so far that one dataset represents the data from one clinical trial?

To complement today’s discussion in the trial WG. Here are the links I pasted during the chat

https://github.com/vojtechhuser/project/tree/master/sdtm-omop/omop-output-csv
https://github.com/vojtechhuser/project/tree/master/sdtm-omop/sdtm-input-csv

Summary of the meeting also coming soon.

One question was trial vocabulary domain or whether trial visit type should live in the existing visit type vocabulary domain.

I have just written a small overview in this thread: New working group: Clinical Trials

1 Like

OMOP currently captures visits. Where a patient AND provider interact.

If a patient logs into PHR or provider mails a letter to the patient, it is a one sided event. Just patient does something or just provider does something.

From patient care and research point of view (e.g., on PHR side, ability to recruit into a applicable research study via PHR), such events matter and can be important for research.

For the clinical study WG - that could be an interesting topic to discuss. You can also post your ideas here: https://github.com/OHDSI/CommonDataModel/issues/287

Not just for clinical research–I would imagine for some treatment programs of the public health variety, this would be important to capture (I’m thinking of things like TB and HIV, where outreach is a routine part of clinical care). I’m working with a team to develop a EMR in Eswatini that’s pretty HIV-focused, and we are looking to build out an “outreach” module because that’s a critical part of our care program that we get evaluated on.

t