Most of the OHDSI tools work intensively with the database server, using SQL statements that are often more complicated than those used in other software, with multiple joins and large intermediary results. A nice example is this query in Achilles.
My question: do people have tips on how to optimize performance of database platforms for the OHDSI tools?
I heard @Christophe_Lambert and @msuchard complain about slow performance of PostgreSQL. @Rijnbeek, didn’t I hear you have managed to boost performance of PostgreSQL significantly with some minor tweaks?
I have indeed been playing around with this. It seems that out of the box PostgreSQL is setup with a very basic configuration tuned for wide compatibility rather than performance. There are many tuning variables (for example memory settings) that have effect on the performance:
Now we have our new powerful server with 700+ Gb Ram and many core we will be playing around a bit more with these settings to see their effect on performance, I will let you know the results.
Was there a tuning variable that you found to be especially important to change? Could you post the configuration file you used before you switched to your monster server?