Our group receives data from several collaborators, all of whom have done their own, independent OMOP conversion of their EHR records. Our goal is to map all this data into our own internal database. We run into problems because different institutions use different concept ids for the same measurements.
For example:
BMI can be represented as 4245997 or 3038553
Body weight can be represented as 3025315 or 3013762
Diastolic BP can be represented by 3012888 or 4154790
PSA can be represented by 42529229 or 3013603 (not exactly the same concept but are used interchangeably in EHRs)
And anti-Smith Ab can be represented by 3016921 or 3001263 (and 4 others depending on who did the conversion).
This problem was detailed in this recent publication:
(We have a poster at the OHDSI symposium on this topic too.)
While some of the OMOP concepts that map to common measurements have slight differences, in a real world implementation of an EHR dataset, they dont make any material difference.
Is there a “canonical” set of concept ids that could be (or has been) implemented uniformly to avoid these issues?
Thanks