With five CDM databases and separate results from each, how can I calculate the results for one big database made up of all five?

(panpan) #1

Dear all,
I am an OHDSI CDM user and a researcher, and I have run into a problem. Suppose I have five CDM databases, each with 10,000 people. For various reasons, I can only calculate the parameters of interest within each database separately; I am not permitted to pool the 50,000 people from the five databases and calculate the parameters for the total. My question is: if I can only obtain separate results from the five databases, can I still derive the statistical results for all 50,000 people, for example for an Analysis of Variance (ANOVA)?

Best regards
Thank you very much!

(Christian Reich) #2

Why can’t you pool the 50,000 into one database, @pandamiao?

(Seng Chan You) #3

@pandamiao Though I’m not sure I fully understand what you want, we’re developing something similar. Because the protocol has not been developed yet, I cannot give you the details now. It will be announced at the OHDSI Korea symposium, and I can tell you more after that. Still, it is not related to ANOVA.

(Martijn Schuemie) #4

For something as simple as an ANOVA, couldn’t you just compute the 2x2 table (aggregate statistics, so shareable) per database, and then sum them across databases, computing the ANOVA on the combined 2x2 table?
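
One way to read this suggestion is sketched below in Python. It assumes a continuous outcome and a one-way ANOVA, in which case the shareable aggregates would be per-group counts, sums, and sums of squares rather than a literal 2x2 table. That substitution is an assumption, not something stated in the thread. The function names and the `Summary` layout are hypothetical.

```python
from typing import Dict, List, Tuple

# Aggregate for one group in one database:
# (number of people, sum of values, sum of squared values).
Summary = Tuple[int, float, float]


def pool_summaries(per_database: List[Dict[str, Summary]]) -> Dict[str, Summary]:
    """Add up the per-group aggregates shared by each database."""
    pooled: Dict[str, Summary] = {}
    for database in per_database:
        for group, (n, total, total_sq) in database.items():
            pn, ptotal, ptotal_sq = pooled.get(group, (0, 0.0, 0.0))
            pooled[group] = (pn + n, ptotal + total, ptotal_sq + total_sq)
    return pooled


def one_way_anova_f(pooled: Dict[str, Summary]) -> Tuple[float, int, int]:
    """Return (F, df_between, df_within) from pooled per-group aggregates."""
    total_n = sum(n for n, _, _ in pooled.values())
    grand_mean = sum(total for _, total, _ in pooled.values()) / total_n
    # Between-group sum of squares: group sizes times squared mean offsets.
    ss_between = sum(
        n * (total / n - grand_mean) ** 2 for n, total, _ in pooled.values()
    )
    # Within-group sum of squares: sum(x^2) - (sum(x))^2 / n for each group.
    ss_within = sum(
        total_sq - total * total / n for n, total, total_sq in pooled.values()
    )
    df_between = len(pooled) - 1
    df_within = total_n - len(pooled)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within
```

Because the three aggregates are additive, the F statistic computed this way matches what a one-way ANOVA on the pooled person-level data would give, without any person-level data leaving a database.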

(David Madigan) #5

Or, more generally, just do a simple inverse-variance-weighted meta-analysis.
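
A minimal Python sketch of the fixed-effect version of this idea follows. It assumes each database reports an effect estimate and its standard error on the same scale. The function name and inputs are hypothetical.

```python
import math
from typing import Sequence, Tuple


def inverse_variance_pool(
    estimates: Sequence[float], standard_errors: Sequence[float]
) -> Tuple[float, float]:
    """Fixed-effect pooled estimate and its standard error."""
    # Each database is weighted by the inverse of its variance (1 / SE^2).
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled, pooled_se
```

Only the estimate and standard error from each database are needed, so this works when person-level data cannot be shared.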

(Asiyah Lin) #6

@David_Madigan With an “inverse-variance-weighted meta-analysis”, you are assuming that there is no heterogeneity across the databases. @pandamiao, is that the case?
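
One common way to check this assumption from the same shareable inputs is Cochran's Q together with the I² statistic. This is offered as a sketch only, since the thread does not name a specific test. The function name is hypothetical, and the inputs are assumed to be per-database estimates and standard errors.

```python
from typing import Sequence, Tuple


def cochran_q_and_i2(
    estimates: Sequence[float], standard_errors: Sequence[float]
) -> Tuple[float, int, float]:
    """Return (Q, degrees of freedom, I^2) for a set of database estimates."""
    weights = [1.0 / se ** 2 for se in standard_errors]
    pooled = sum(w * e for w, e in zip(weights, estimates)) / sum(weights)
    # Q: weighted squared deviations of each estimate from the pooled estimate.
    q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, estimates))
    df = len(estimates) - 1
    # I^2: share of the variation attributed to heterogeneity, floored at 0.
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return q, df, i2
```

Under the no-heterogeneity assumption, Q approximately follows a chi-squared distribution with `df` degrees of freedom. A large Q or I² would suggest that a random-effects model may be more appropriate than the fixed-effect pooling.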