Data360 Analyze

 View Only
  • 1.  HIVE Drivers - Which Post to use?

    Posted 04-28-2020 13:41

    We want to configure Data3Sixty to connect to HIVE. There are two posts referencing different drivers and configurations. What's the difference between these two posts and how do we determine which one to use. Thank you in advance!

    Connecting to HIVE as a data source
    Setting up Analyze to connect to HIVE with Kerberos



  • 2.  RE: HIVE Drivers - Which Post to use?

    Employee
    Posted 04-29-2020 02:11

    The Setting up Analyze to connect to HIVE with Kerberos article relates to the Data360 Analyze product.

    The Connecting to HIVE as a data source article relates to the use of the Data360 DQ+ product. 

    Data360 DQ+ runs within a Hadoop cluster and can be configured to connect to Hive data sources.  In contrast, Data360 Analyze does not have Hadoop as a prerequisite for its base operation (i.e. it can be run on a standalone server or desktop machine) but it can be configured to connect to a Hive data source in a remote Hadoop cluster.

    As a consequence of the reliance on Hadoop for the base operation of Data360 DQ+ it has been possible to certify specific versions of the drivers for use with Data360 DQ+.

    Typically customers are looking for Data360 Analyze to connect to the particular distribution and version of Hadoop cluster that has been deployed by their organization (e.g. for use as an enterprise-wide data lake). The article for Data360 Analyze refers to examples of the drivers that could be used. However, it cautions that you may need to use other drivers, depending on the particular Hadoop cluster you want to connect to (if you are operating in a multi-cluster environment you may need different drivers for each cluster). We recommended that you confer with the administrators of the Hadoop cluster to determine which specific drivers are required to access the cluster. The Hadoop distribution vendors (e.g. Cloudera or Hortonworks) typically provide access to JDBC drivers on their websites but they may recommend/require you to obtain the driver from an edge node within the cluster.

     



  • 3.  RE: HIVE Drivers - Which Post to use?

    Posted 04-29-2020 07:11

    Thanks for the speedy response and the help I was hoping to get. Perfect reminder about the drivers, I think we'll need to confirm the drivers with our cluster administrator. Thanks Adrian!