OHDSI Home | Forums | Wiki | Github

Network study: Concept Prevalence

(Anna Ostropolets) #1

We want to announce a new network study: https://github.com/OHDSI/StudyProtocolSandbox/tree/master/ConceptPrevalence

The full protocol can be found here: https://github.com/OHDSI/StudyProtocolSandbox/blob/master/ConceptPrevalence/extras/ConceptPrevalenceStudyProtocol_v0.1.docx

We want to study the usage patterns of Concepts across different OMOP CDM instances. This in itself could be useful information to answer many questions, but we have a concrete reason: For any one medical entity, the granularity of codes captured in a data source can vary greatly. For example, Chronic Kidney Disorder stage II can be coded as ICD9 code 585.2 Chronic kidney disease, Stage II (mild); 585.9 Chronic kidney disease, unspecified or even as 586 Renal failure, unspecified. However, this information is key for any cohort definition. Currently, researchers have no way of knowing whether a certain concept with high granularity is even available for selection, or whether they have to use a generic concept in combination with some auxiliary information to define the cohort correctly. Each data source instance is a black box and knowledge about the distribution of the concepts is limited to the very instance researchers have access to. But OHDSI Network Studies are dependent on cohort definitions that work across the network.

In an ideal world, a cohort definition tool like ATLAS would have access to the distribution of all concepts in the community. We would like to make that a reality and collect counts for all:

  • Unique values in the *_concept_id fields

  • Unique values in the *_source concept_id fields

  • Mappings between them

As a side effect, we would also get a better understanding of the dynamics of that distribution over time, and we could draw conclusions about the impact of erroneous mappings.

I’d welcome everybody to participate; tagging some of the people that might be interested: @SCYou @Christian_Reich @Patrick_Ryan @jduke @Rijnbeek @rkboyce @Daniella_Meeker

(Melanie Philofsky) #2

Hi @aostropolets!

Colorado would like to participate! What’s the timeline?

(Jeon Ho Kyun) #3

Hi, I’m Ho-kyun working with SCYOU.
We would like to participate in the study too. :grinning:

(Rkboyce) #4

Hi, I would be delighted to try and participate! Great idea.

(Andrew Williams) #5

Tufts would like to participate.

(Kristin Kostka, MPH) #6

Hi @aostropolets! The IQVIA team would be happy to participate too. Is there a study package we could review?

(Karthik) #7

Count us in at Columbia! Then again my post is redundant :slight_smile:

(Anna Ostropolets) #8

It’s such wonderful news that you all decided to participate! I’ve almost lost hope :slight_smile:
Timeline: the study itself doesn’t require a lot of preliminary work, so I’d say we expect to get some results in a couple of months.
R package can be found here; happy to answer your questions if any.

(Ajit Londhe) #9

Hi @aostropolets, it looks like the SQL overlaps a lot with classic Achilles analyses and some new ones added by @AnthonyMolinaro. Specifically, characterizing concept ids (classic) and characterizing source concept ids (new ones).

Not suggesting you should change anything here, but there could be an opportunity to leverage Achilles results for the deliverable.


Hey @aostropolets, @Frank and I will work on this as well. :slight_smile:

(Anna Ostropolets) #11

Thanks, Ajit! I know that Achilles produces a lot of useful results, but figured out that it may be more convenient just to run the package that readily spits out the tables :slight_smile: Do you feel Achilles is more convenient?

(Anna Ostropolets) #12

Great to hear! I added my email to the readme file; will also duplicate it here: ao2671@cumc.columbia.edu
Please feel free to send the results to this email once they are ready.
@MPhilofsky, @Hokyun, @rkboyce, @Andrew, @krfeeney, @cukarthik, @AnthonyMolinaro thanks a lot for your interest!

EHR data to OMOP CDM Work Group
(Anna Ostropolets) #13

I’m happy to see that we’ve been getting the first results! Thanks to @Hokyun and @mattspotnitz for pioneering :slight_smile:
As you all know, the deadline for the OHDSI Symposium submission is in three weeks, so we are planning to submit some preliminary results. So, it would be great if we can get more data next week to prepare the abstract and finalize the project over the summer :slight_smile: