OHDSI Home | Forums | Wiki | Github

OHDSI (Docker) + Hadoop Yarn (Docker on YARN)

I came upon this the other day and believe we may be able to leverage it in the near future.

Docker on Yarn Slide Presentation

Presented by Daniel Templeton (Cloudera)

They speak of Docker being a first class citizen of Apache Yarn with the integration of Apache Slider.

The Apache Yarn latest documentation already speaks to this topic

Imagine if you will, being able to host some of OHDSI’s tools within the Hadoop cluster, leverage the clusters scalability, central data source (HDFS/Hive/Impala) from an OHDSI Docker defined container.

Now, I’ve only begun to look into this.
Yet, I already see many benefits…

Thoughts/Suggestions/Ideas ???

thx very much

t