Running OHDSI tools on CPRD

Hi Everyone, I have been following the OHDSI from afar and I am testing the water before jumping in the journey (so, this is my first post in the forum and I haven’t clicked all the links on the website yet). I am exploring using OHDSI tools on the CPRD database we have in house but first I have to download them and get them working. Is there a step by step guide (OHDSI for dummies? readme files?) somewhere on the website on how I go about dowloading the tools and converting CPRD into CDM? I noted that the converstion to CDM V4 was already developped, is there one available for CPRD and CDM V5? How much time would you recommend I budget for this? (ballpark estimate of how long it takes to complete the steps from downloading to having the set up ready to run a query…for someone who understands the instructions, don’t worry about guessing whether or not I will understand the instructions :))

@rizem:

Welcome to the family. Yes, you should first click on every button before you dare posting anything here! :smile:

BTW: Talk to CPRD. They are planning on building and distributing an OMOPed version of their data. If you can wait you may save yourself a lot of work.

Since CPRD is already converted and the scripts are available you don’t have to invest in the work of developing all that from scratch. However, if you need a V5 converter and it’s not there the Open Source rule comes into place: If something isn’t there it’s your job.

Step by step guide: Try this. When you run into trouble post here and somebody will help you.

General experience with OMOPing data: 8-12 weeks for soup to nuts. In this case, you are new, but you already have something to start with.

Good luck.

Hi @rizem

@amatcho has made an open-source ETL available for CPRD to convert to CDM. It’s posted here: https://github.com/OHDSI/ETL-CDMBuilder/blob/master/man/CPRD_ETL_CDM_V4.doc . At Janssen, we have run the CDM v4->v5 conversion SQL to make our initial v5 instance (https://github.com/OHDSI/CommonDataModel/tree/master/Version4%20To%20Version5%20Conversion), and Amy is polishing up her v5 ETL script as we speak, and will post all the document and code on github for everyone to see.

I agree with @Christian_Reich, if you can get the data directly from CPRD already in OMOP CDM v5 format, that’ll save you a bunch of work, and let you focus on the more fun stuff: generating evidence from standardized, large-scale analysis tools!

Welcome to the journey!

Wow, really impressive. Are there some key studies from CPRD we could try to replicate just to test the process and data- that would kind of fun.

Shout out to @amatcho for her paper where she replicated a CPRD study done
on the raw source using OMOP CDM v4:

Hi @rizem,
Thanks Patrick, I agree with @Christian_Reich also, easier to get the data already transformed by CPRD. However, if their conversion dates are too far in the future for you, I should have the v5 ETL/code on github soon. Though you won’t want to do a v4 conversion of your data, the v4 ETL and the replication paper Patrick referred to above can help you get started with understanding mappings etc. Let me know if you have any questions!