Please use this spot to document:
- Describe the issue/topic?
- What do we know about this topic? What has been discussed?
- What are recommendations for how to handle this issue/topic?
- What next steps should be taken?
Related posts:
OHDSI Home | Forums | Wiki | Github |
Please use this spot to document:
Related posts:
@Christian_Reich has a very good proposal linked above. It covers smokeless tobacco consumption and nicotine smoking, not marijuana or other substances. Since we don’t have a lot of time during our Themis workshop, we need to be focused on a solution for nicotine.
The proposal:
Creates a hierarchy with smoking at the top
Covers e-cigarettes/vape pens, pipe, passive/2nd hand smoke, smokeless, etc.
Records intensity of smoking into trivial to very heavy categories.
Previous or ex-smoker gets relegated to “History of” category like all history of concepts
One item not covered is the “pack years” concept. There is not a standard concept_id for this idea. UK Biobank has a non-standard concept_id = 35811050. Do we make this concept standard since there isn’t a standard concept_id for this idea? Then have the observation.value_as_number contain the # of pack years? We will need to keep it outside the hierarchy because pack years can’t be determined to be trivial versus heavy because the age of a person matters. Example: 20 packyear record is a very different category for a 30yo person versus an 80 yo person. And as Christian and Oleg point out, pack years is a measure of cumulative damage versus cigarettes per day which is current usage.
This proposal also does not cover:
Survey data. Survey data has a working group and should continue to be modeled and debated there.
Tobacco and nicotine data are not easy for a number of reasons. It’s not always recorded, it’s patient reported, it’s not always reported accurately, it can begin years or decades before our Observation Period starts, there’s lots of free text out there, etc. Our goal is solid conventions covering the majority of use cases. And as our data, vocabularies, recording methods, and use cases evolve, so will the conventions and the CDM.
Thoughts?
On Sunday, October 16th at the CDM workshop we discussed the above issue. We came to the following conclusions:
1. Describe the issue/topic?
Currently, the conventions lack guidance on how tobacco usage should be stored in the CDM. There are multiple standard concept_ids with the same/similar meaning in different domains.
2. What do we know about this topic? What has been discussed?
This has been discussed on the forums for many years. And was also discussed at the original Themis meeting in 2018. See the forum post above for the most recent and comprehensive discussion.
3. What are recommendations for how to handle this issue/topic?
Proposed Solution: Tobacco and nicotine data are not easy for a number of reasons. It’s not always recorded, they are patient reported data, it’s not always reported accurately, it can begin years or decades before our Observation Period starts, there’s lots of free text out there, etc. Our goal is solid conventions covering the majority of use cases. And as our data, vocabularies, recording methods, and use cases evolve, so will the conventions and the CDM.
*We don’t need the exact number of cigarettes (it is probably false precision anyway)
*Cigarette frequency is for current usage
You map your data in at the granularity you have. So, if the record is Heavy Cigarette smoker, you pick the concept_id representing this idea. If you have a flag for Cigarette smoker: yes, then you pick the concept_id representing this idea.
4. What next steps should be taken?
The Vocabulary team will notify us which concept_ids will be standard for each idea above. And we can put this information into the CDM conventions
The Vocab team will make some concepts non-standard and map them to standards and create new hierarchy
Do we have a concept for each of these, or will we need to create some de-novo?
We’ll recreate the whole story de novo in OMOP Extension to make it all clean and not confusing by old hierarchy, other relationships and different flavors of meaning.
Like we did in Specialty and Visit?
There we mostly preserved the source semantics, structure, and entire donor vocabularies.
Rather Cancer modifier…
Do we have a convention agreed already?
If yes, when we can expect the proper concepts?
We’re planning to release them in December
@Alexdavv what are your thoughts about working with the SDOs to get the content added to the appropriate vocabulary (LOINC or SNOMED) rather than creating the concepts in the OMOP Extension?
We - and probably other organizations - currently store concept codes for tobacco smoking information natively in our EHR (SNOMED concept ids when captured in social history, LOINC codes when captured in flowsheets)
Only concepts from industry standard vocabularies - not OMOP Extension concepts of course - can be mapped natively in the EHR.
Best,
Piper
Because it would take an enormous amount of time. Even within OMOP, it took us years to agree, author the content, and release (hopefully, this week)?! Besides that, in most cases it’s another way around - they have much more content than we need. And the content is modeled and organized in the hierarchy in another way that we can easily adopt in OMOP.