We have data collected through yearly questionnaires. Some of the questions are answered remotely, while others are answered by a nurse performing evaluations on the patient. For each questionnaire, I have the year and month it was completed, and it contains several questions such as:
- When was the patient diagnosed with a condition?
- Is the patient currently diagnosed with a condition?
- Observations/findings/measurements on the patient
I have several questions about the conventions for mapping some of these aspects to OMOP CDM:
- Regarding episodes, visits, and observation periods: should all questionnaires be grouped into one episode, with each questionnaire recorded as a visit_occurrence and an observation_period that lasts only for the day of the questionnaire?
- How should we record in the CDM the transformations applied to dates? For example, how can we flag dates that were set to the first day of the month because the day was missing?
- What duration should I assign to a condition when all I know is whether the patient reported currently having it at each questionnaire? Should I assume it lasts between questionnaires? For example, if the patient reports having hypertension in June 2019, does not answer in September 2020, reports having it in August 2021, and reports not having it in December 2021, what should we put as condition_occurrence.condition_start_date and condition_occurrence.condition_end_date? Should there be several occurrences that each last only for the day of the questionnaire, or a single occurrence that spans the whole period?
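
To make the last question concrete, here is a rough Python sketch of the two options I am considering for the hypertension example. The `responses` list, the function names, the choice to set the missing day to the 1st, and the choice to end the spanning record at the last positive answer are all illustrative assumptions on my part, not an established OMOP convention.

```python
from datetime import date

# Hypothetical responses for one patient: (questionnaire date, answer).
# The day is unknown in the source, so it is set to the 1st here (an assumption).
# True = reports hypertension, False = reports not having it, None = no answer.
responses = [
    (date(2019, 6, 1), True),
    (date(2020, 9, 1), None),
    (date(2021, 8, 1), True),
    (date(2021, 12, 1), False),
]


def per_questionnaire(responses):
    """Option A: one record per positive answer, lasting only that day.

    Each tuple is (condition_start_date, condition_end_date).
    """
    return [(d, d) for d, answer in responses if answer is True]


def spanning(responses):
    """Option B: merge positive answers into one record until a negative answer.

    Missing answers (None) are assumed not to interrupt the condition, and the
    record is assumed to end at the last positive answer.
    """
    records, start, last = [], None, None
    for d, answer in responses:
        if answer is True:
            if start is None:
                start = d
            last = d
        elif answer is False and start is not None:
            records.append((start, last))
            start = last = None
    if start is not None:
        # Condition still reported at the most recent questionnaire.
        records.append((start, last))
    return records


option_a = per_questionnaire(responses)
option_b = spanning(responses)
```

With the example above, option A gives two one-day records (June 2019 and August 2021), while option B gives a single record from June 2019 to August 2021.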
Looking forward to your thoughts and suggestions!