I came upon this the other day and believe we may be able to leverage it in the near future.
Docker on Yarn Slide Presentation
Presented by Daniel Templeton (Cloudera)
They speak of Docker being a first class citizen of Apache Yarn with the integration of Apache Slider.
The Apache Yarn latest documentation already speaks to this topic
Imagine if you will, being able to host some of OHDSI’s tools within the Hadoop cluster, leverage the clusters scalability, central data source (HDFS/Hive/Impala) from an OHDSI Docker defined container.
Now, I’ve only begun to look into this.
Yet, I already see many benefits…
Thoughts/Suggestions/Ideas ???