I have an Rx claims dataset where I cannot match 75% of the 11-digit NDCs to the OMOP CDM when I try to use a simple join in SQL.
However, sometimes I search in Athena and find the NDC.
Here is one example where I searched ‘83730000626’ in Athena and found a concept with code ‘08373000626’. I’m confused by this example, because it’s not as simple as matching the first 9 digits in my search to the CDM concept.
Is this a matter of taking 0’s in selective places and placing them in other locations to improve the match? Is there an algorithm for this that Athena uses?