We’re developing a general-purpose causal inference platform to estimate drug effects on a large scale, primarily using a new user cohort study design. As a first step, we’d like to assess its performance using an established ground truth set, and consider using the OMOP drug safety reference set and replicating the results in Ryan at al’s paper (link). We’d appreciate any help and thoughts re the following:
For a given target drug, the control cohort (or comparator group) is defined using an active comparator drug (or set of such drugs) which “shares the same indication as the target drug but falls within a different therapeutic class”. What vocabularies were used to determine drugs’ indication and therapeutic class?
AFAIK indication and therapeutic class are characteristics of drugs, but the reference set’s concepts are ingredients. So, what drugs are considered for a given active ingredient, all or some subset (e.g., having an ingredient as the single active one)?
Also, the mapping of drugs, and even more so ingredients, to indication/therapeutic class is a one-to-many mapping; does “same indication” requires that the target and comparator drugs have identical sets of indications or just that they share an indication? Similarly, does “different therapeutic class” mean disjoint sets or non-identical?
Finally, is the specific list of the drugs used in each comparison available and can be shared?
In addition, we’d be happy to learn of any additional reference sets that can be used, or studies we can try to replicate, to evaluate our platform performance.
None. Was done manually. I know what you want: You want to automate this selection. This was discussed a few times before, like here.
The problem is it doesn’t work very well. Because these indications are not at the same level that usual
diagnostic codes have. I give you an example. There is an antibiotic amikacin. One of the indications is “Pediatric fever without a source”. There is no such a SNOMED concept, because the Condition is “fever”, and the fact that is pediatric is part of the demographic information (year of birth less than 18 years away from current year). So, it gets mapped to “fever”. There are tons of drugs to treat fever, including aspirin, diphenhydramine, caffeine. Hardly comparators for an i.v. antibiotic.
So, nice idea, but won’t work.
Exactly. Ingredients can have very different indications depending in which combination they are.
Another good idea, but this stuff is not clean enough for any such heuristics to work.
We’ve done a lot of work establishing the OHDSI Methods Benchmark. Currently the only description is these two posters, presented at the 2017 OHDSI US Symposium:
We are working on a full paper, but it taking some time to get it accepted. The Methods Benchmark is implemented in the MethodEvaluation package, as described in this vignette.
Thanks @Christian_Reich and @schuemie for the links and helpful insights. Re the 2013 drug safety work, can you share the drug(s) used as comparator(s) for each tested ingredient?