I’ve been working on getting Achilles working against a Hadoop datastore, and I wanted to give an update as I’ve got a couple of queries working now.
This has involved changes to SqlRender (to support Impala as a query language), DatabaseConnector (to connect over JDBC), and Achilles (to add some explicit casts).
I’ve written up some instructions describing how to run against Hadoop at https://github.com/tomwhite/Achilles/blob/impala/README-impala.md. You should be able to try it out if you have a Hadoop cluster with Impala running on it.
The remaining work includes:
- Get all of the Achilles 5 queries working
- Figure out a better way of loading JARs for the Impala JDBC driver
- Add unit tests for SqlRender and DatabaseConnector
I’d love to get some feedback to see if I’m going in the right direction. I’d also welcome collaboration on any of the remaining work.
Thanks,
Tom