Please take a look at the new vocabulary we built from scratch. The building process has a new name: Athena, which according to Patrick stands for Automated Terminology Harmonization, Extraction and Normalization for Analytics. The vocabulary was built in Version 5.0 from the sources and back-converted to V4.5.
You can now download the vocabulary both in Version 5.0 and Version 4.5 from the new download page at http://ohdsi.org/web/athena.
Please note: The rules of building the vocabulary used to be hard-wired and not documented very explicitly in V4 (nobody to blame but ourselves). We have now rebuilt everything right in V5. We might have missed something. Therefore, I am only sending the announcement to this group. After a little bit of further scrubbing and maybe some "user testing" by you we will alert everybody. So, don't be surprised if we announce a few updates in the next couple of days. I will keep you up to date.
We also did a whole bunch of fixes and changes. Here is a list of the most relevant ones:
Refactoring of all the Drug vocabularies and relationships between them: RxNorm, NDC, SPL, VA Product and Class, NDF-RT and ATC
Addition of PCORNet concepts and mappings
Addition of LOINC, HCPCS and CPT4 relationships
For Observations and Measurements, we now map from Source Concepts to Standard Concepts (Maps to) and to Value Concepts (Maps to value). For example, ICD9CM codes for personal history are now mapped to the Observation for personal history, and the actual condition goes into the field value_as_concept. Note that this won't work in V4.
Revised mapping from ICD9CM and ICD10.
Revised mapping from Read according to the NHS
Revised domain assignment rules for LOINC, Read, ICD9CM, ICD10, SNOMED, HCPCS and CPT4.
Tons of little additions to Type Concepts
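To illustrate the "Maps to" / "Maps to value" item in the list above, here is a minimal sketch of how a source concept might be resolved into its Standard Concept and its Value Concept. It assumes relationship rows shaped like (concept_id_1, relationship_id, concept_id_2); the function name resolve_mapping and all concept IDs are made-up placeholders, not real vocabulary content.

```python
from collections import namedtuple

# Assumed row shape for a relationship record; the IDs used below are placeholders.
Relationship = namedtuple("Relationship", "concept_id_1 relationship_id concept_id_2")


def resolve_mapping(source_concept_id, relationships):
    """Return (standard_concept_ids, value_concept_ids) for one source concept.

    Hypothetical helper: it only filters the rows it is given and does not
    check validity dates or whether the targets are actually Standard Concepts.
    """
    standard_ids = [
        r.concept_id_2
        for r in relationships
        if r.concept_id_1 == source_concept_id and r.relationship_id == "Maps to"
    ]
    value_ids = [
        r.concept_id_2
        for r in relationships
        if r.concept_id_1 == source_concept_id and r.relationship_id == "Maps to value"
    ]
    return standard_ids, value_ids


# Placeholder data: 1001 stands in for a "personal history of ..." source code,
# 2001 for the personal-history Observation, 3001 for the actual condition.
rows = [
    Relationship(1001, "Maps to", 2001),
    Relationship(1001, "Maps to value", 3001),
    Relationship(1002, "Maps to", 2002),
]

standard_ids, value_ids = resolve_mapping(1001, rows)
print(standard_ids, value_ids)
```

In this sketch the first list would feed the observation concept and the second list the value field; a source concept without a "Maps to value" row simply yields an empty second list.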
We haven't quite finished all the homework for CPRD, which is mostly Gemscript. It is happening now. We also want to work with Parsa on a proper information model, which will help quality, and to compare and potentially incorporate CIEL (OpenMRS) and Med (George's shop).
Let me know what you think.
The release that Mark Khayter announced yesterday is not related to this. It is an October V5 on the IMEDS website. All vocabulary releases will happen from here going forward. I apologize for that confusion.
@Christian_Reich - Is this a replacement or an update to V5? Was the "October V5" the first official V5, or a draft? I just want to keep my datasets in order.
I recommend re-downloading. I found a couple of additional issues with the files. The latest files on the vocabulary download website have now been corrected.
VOCABULARY and CONCEPT are refusing to load successfully using the provided load file. I tested on Postgres, but I assume these issues affect other DBs as well.
VOCABULARY.csv
Extra column on several entries (SNOMED, LOINC…), "latest_update", that is not present in the OMOP v5 DDL. Causes COPY command to fail.
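For anyone who wants to locate the offending rows before running the load, here is a rough sketch that reports every line whose column count differs from what the DDL expects. The function name find_bad_rows, the tab delimiter and the sample data are assumptions for illustration only; adjust the delimiter and expected column count to match the actual file and table definition.

```python
import csv
import io


def find_bad_rows(lines, expected_columns, delimiter="\t"):
    """Return (line_number, column_count) for every row with an unexpected width.

    `lines` is any iterable of text lines, e.g. an open file object.
    Quoting is disabled so that stray quote characters do not merge fields.
    """
    reader = csv.reader(lines, delimiter=delimiter, quoting=csv.QUOTE_NONE)
    bad = []
    for line_number, row in enumerate(reader, start=1):
        if len(row) != expected_columns:
            bad.append((line_number, len(row)))
    return bad


# Made-up sample: the third line carries one extra trailing column.
sample = (
    "vocabulary_id\tvocabulary_name\n"
    "A\tVocabulary A\n"
    "B\tVocabulary B\t2014-10-01\n"
)

print(find_bad_rows(io.StringIO(sample), expected_columns=2))
```

To run it against a real file, pass an open file handle instead of the StringIO object, for example `open("VOCABULARY.csv", newline="")`.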
I loaded Version 4.5 into an Oracle instance and found two things:
I didn't find a DDL file for V4 in the VocabImport sub-folder that came with the zip file, so I used the DDL from http://omop.org/Vocabularies. However, that DDL has the Drug Approval table, which does not seem to be provided. Is this one no longer being maintained?
I believe @rkboyce was referring to the Oracle DDL to create the V4.5 tables. The vocabimport directory contains the Oracle CTL statements to load the vocabulary tables.