Hi everyone! I’ve also posted on the Working Group teams page, so sorry for the spam. I’m new to the community (Yale just released the internal beta of our OMOP analytics setup). My lab is developing a Python-based dataset pipeline similar to HADES FeatureExtraction (with a few aspects of ACHILLES), specific to the emergency dept. We would like to release the package as a public tool and we would love some feedback!
Mainly, we use PySpark Dataframes as the underlying data structures to operate over. Is this common or do other teams use other structures? I’m guessing since HADES is in R, R dataframes are used, but how about any python shops? (e.g. pandas dataframes)? Thank you in advance and sorry if I’m not making sense, still new to this!