The National Library of Medicine (NLM) has created an ETL using Spark SQL to transform T-MSIS Analysis Files (TAF) Research Identifiable Files (RIF) into OMOP CDM. The goal of the ETL was to ease integration of TAF data into network studies. For OMOP practitioners who have access to a 100% sample of TAF RIF via the Chronic Conditions Warehouse, consider using our ETL as a starting point on your OMOP conversion, if desired. We do not anticipate major updates or ongoing development of this code. The NLM is not providing any long-term support for OMOP ETL implementation at this time.
Documentation to support deployment is forthcoming.
The documentation is being edited; but we have a 20+ page word file explaining how to use this. Its not quite ready but almost. I have no objection to posting this on OHDSI Github, but I do have a question of how I can update the code on your Github?
Our project wraps up in late October of this year. That might be a good time because the documentation would be ready. We are also doing a reconciliation process with SOC Medicaid ETL Team and its possible there will be small changes to the code depending on what we find.