During our first study (TxPathways), we needed a short way to communicate how large is a given CDM dataset and how large the input or output cohort is.
I understand that there is Achilles to providing a graphical view of many parameters (e.g., age, lenght of observation periods), but it is not possible to create a quick extract from Achiles to email a collaborator.
For example as IMEDs user, I can not see the Achilles JSON data (either as tables in database or as .JSON files)
I would like to propose an idea of a set of simple SQL code file that would output just a few selected simple parameters (either just plain output when running the query or output into a temp table in a database and listing that table)
for example: (this is real data for CCAE dataset in IMEDS)
MEASURE RESULT EXPLANATION
G1 141,805,491 count of patients
G2 20,328,289,601 count of events
D2 90,024,522 count of patients with at least 1 Dx and 1 Rx
D3 112,148,500 count of patients with at least 1 Dx and 1 Proc
D4 5,939,621 count of patients with at least 1 Obs, 1 Dx and 1 Rx
D5 277,975 count of deceased patients
My feeling is that perhaps not all CDM adopters/users made Achilles work (in an afternoon) and have a spare web server for viewing the results. They may be situations where Achilles may be too complex to use.
I would be interested in what people think of a quick few-numbers-snapshot of the data. (I have a tentative name (IRIS) for this approach.
We would have a first non-J&J [non-Regenstrief] software tool… (excluding terminology products here)