Need your help and advise on setting up the DQD (Data Quality Dashboard) with Impala or Spark. I referred to the link “Connecting to Various Database Platforms • DatabaseConnector” but could not get much information from the setup perspective for Impala or Spark. My preference would be first on Impala and second on Spark. Would be great if anyone can please share their views on this.
Please note that our cluster is Kerberized hence need to ensure that the setup would be working accordingly. We are using Cloudera CDP product.
Any help in regards is much appreciated.