Hello everyone,
I am currently working with the Visit Occurrence table using EHR inpatient data (NHS-APC). In this context, the visit_start_date
corresponds to the admission date, and the visit_end_date
reflects the discharge date. However, I’ve encountered a significant issue where around 11% of the data has a discharge date earlier than the admission date.
I’m seeking advice on how to handle these inconsistencies. Would it be best to remove the affected rows entirely, given the data integrity concerns? Or would it be more appropriate to replace the incorrect discharge dates with the corresponding admission dates?
Any guidance or suggestions would be greatly appreciated.
Thank you!