Since our goal is to create a process that can update the data on a continuous basis, I would recommend we re-engineer Olivier’s process so we can automatically execute it at every refresh rather than add a dependency on Olivier to run it every time.
We can use the actual data you provided to test our implementation.
I agree, let’s try to recreate the process in a format that works with our OHDSI framework. I’m having it loaded internally now so that I can see what is going on. I already see a few things that I didn’t take away from the article, so I’m looking into those more. For example, why are tags associated with “precoordinated” pulled, and what is being done with indications?
Sounds good. Let’s do it! I will help, and we can always ask questions of folks at NLM.
Christian - this is data that NLM created that improves upon the Avillach method for identifying PubMed titles and abstracts about drug-HOI associations using MeSH indexing.
@rkboyce - Do you know of any documentation other than the README? I’m asking because it looks like they did things beyond what the paper describes, and I don’t know what some of their columns mean (for example, in DRUGS.DRUG_ROLE, what are i and c?).