First the questions, then the background:
QUESTIONS: Is anybody using OMOP CDM as their data delivery format to give data extracts to investigators? If so, how do you do this, what data elements/fields to you provide, and how is it working for your investigators? What training do you need to give investigators when you hand over a data set in OMOP CDM format?
BACKGROUND: The University of Colorado uses OMOP for its integrated research data warehouse. We want to begin delivering data sets to our investigators in OMOP format for a couple of reasons: (1) it ensures direct provenance between what is delivered and what is in the warehouse, and (2) it introduces the OMOP CDM to our investigator community.
That said, I am considering violating what I just wrote by creating a version of the CDM that pre-joins all of the concept_ids and pulls in some of the key fields in the CONCEPT table such as concept_code and concept_name (and maybe other fields). Some of the reasons include (1) easier for recipients not comfortable with multi-table joins to work with the data or at least reduce the number of joins in a query and (2) eliminates the need to either ship or always require access to some version of the CONCEPT table. For sophisticated users, we could provide access to CONCEPT_ANCESTOR and/or CONCEPT_SYNONYMS.
At some point I hope to introduce ATLAS as an alternative/integrated data delivery + analytics environment but we are probably a long way off from being able to do this. Even so, I can’t envision ATLAS becoming the dominant method used by most of our investigators for their work (just the really smart ones……! ← that’s for Patrick…… ).
Any insights on using OMOP as a data extraction format would be appreciated.
Thanks,
Michael