We are in the process of ramping up our Atlas usage and wanting to increase the amount of people running jobs on it.
However, we have run into a substantial bottleneck in the time it takes to run these jobs, generally cohort creation, they take upwards of 10-20 hours and some even longer.
We have the CDM data on a remote postgres database, we have about 9 million patients in the database where the cohort creation is happening.
Was looking for a place to start investigating what is causing the large runtimes, would having the WebAPI on the same server as the database decrease time, generally increasing the resources on the server the WebAPI is currently deployed on?
Any help would be greatly appreciated and happy to share specs and any other relevant information.
Best,
Alex