We are often faced with situations where we need to show the probability of an event. This is most relevant for covering clinical trials datasets. 95% of these data require to reflect the probability of facts of adverse events with any effect - side effects, efficacy and safety of treatment, probabilistic assessment in relation to the prescription of the drug.
Another group that needs to reflect the probability is etiology and poisoning, when a probabilistic cause of the disease is required.
Third group: complications and consequences. For example, necrosis of the skin of the forearm with a probable cause: insertion of a catheter.
There are already concepts in the real world that describe probabilities.
For example, The WHO-UMC causality assessment system (https://www.who.int/medicines/areas/quality_safety/safety_efficacy/WHOcausality_assessment.pdf), Adverse Drug Reaction Probability Scale (Naranjo) (Adverse Drug Reaction Probability Scale (Naranjo) in Drug Induced Liver Injury - LiverTox - NCBI Bookshelf),
Maria & Victorino (M & V) System of Causality Assessment (Maria and Victorino (M & V) System of Causality Assessment in Drug Induced Liver Injury - LiverTox - NCBI Bookshelf), Roussel Uclaf Causality Assessment (Roussel Uclaf Causality Assessment Method (RUCAM) in Drug Induced Liver Injury - LiverTox - NCBI Bookshelf).
When we encounter this kind of data, we are using custom concepts, not standardized ones. Scales are often not specified and they are source-specific. And OMOP needs to describe this, link and use precordinated concepts in the measurement domain, where the reasons will be placed in value. So my suggestion is to create such concepts:
Event Probability
Subsumes
1 Almost certain probability of an event caused by
2 Likely probability of an event caused by
3 Unlikely probability of an event caused by
4 Very likely probability of an event caused by
5 Very unlikely probability of an event caused by
In older versions (up to 5.3 inclusive), these precoordinated concepts can easily be connected with an event through the fact_relationship table, and in the higher versions, direct links can be used. It also makes it possible not to create a huge number of unnecessary links.
Example of use (tuberculosis associated with taking infliximab):
MEASUREMENT table
concept name: Almost certain probability of an event caused by
| measurement_id | xxx |
| value_as_concept_id | 937368 | (infliximab, RxNorm)
CONDITION_OCCURRENCE table
| condition_occurrence_id | 434557 | (Tuberculosis, SNOMED)
FACT_RELATIONSHIP table
| domain_concept_id_1 | 434557 |
| domain_concept_id_2 | xxx |
| relationship_concept_id | 4165382 | (Associated with)
Another option for using these concepts may also be proposed. The new concepts will serve to indicate the connection between the two events. In this case, they will belong to the observation domain. The disadvantage of this idea is impossibility of using it in the latest versions. And it will also require building reverse links (Almost certain cause of, Likely cause of, Unlikely cause of, Very likely cause of, Very unlikely cause of).
CONDITION_OCCURRENCE table
| condition_occurrence_id | 434557 | (Tuberculosis, SNOMED)
DRUG_EXPOSURE table
| drug_exposure_id | 937368 | (infliximab, RxNorm)
FACT_RELATIONSHIP table
| domain_concept_id_1 | 434557 |
| domain_concept_id_2 | 937368 |
| relationship_concept_id | xxx | (Almost certain probability of event caused by)
@Vlad_Korsik @Alexdavv @Dymshyts @Philip_Solovyev @ElenaTrach @MPhilofsky