Hello again! Sorry for the late response, I had to talk with my professor to decide on some details of the project.
What we are trying to do is to use the OSIM2 data for online learning using several multi-armed bandit algorithms. This unfortunately requires case-by-case data, where each data sample has a context (age and gender), condition, drug and an outcome. For OSIM2 data, a row of the desired dataset would look like this:
PERSON_ID, CONDITION_CONCEPT_ID, DRUG_CONCEPT_ID, OUTCOME
The drug in DRUG_CONCEPT_ID is the drug that was given for the condition in CONDITION_CONCEPT_ID of the person identified by PERSON_ID, and OUTCOME is whether a side effect or health benefit (or possibly nothing) occured as a result.
I am well aware that OSIM2 might not have been designed with such a use case in mind. It seems it would be quite difficult to generate new data in this manner, especially considering how the risk signals are normally infused after the data is generated. Nonetheless, I found the OSIM2 transition probability matrices from someone else in my research group and I am currently trying to generate new data in the way my project requires. If you have any recommendations about how I could progress or if you know of a more suitable dataset I could use, I would be happy to listen!
By the way, let me know if you would like to upload the transition probability matrices to the FTP server for possible future use, I can send them if it would be helpful for anyone.