Hive Crawl

Hive Crawling helps to store existing hive schema's metadata in mongoDB so that Data Transformation can use it to build pipelines on top of that data.

Currently, Hive crawling is supported only if the Hive schema is on the same cluster where Infoworks is deployed.

Creating a Hive Source

To create a Hive source, follow these steps:

  • Click Admin and click Sources icon in the sidebar.
  • Click New Source.
  • Enter the Source Name and select Hive under the Source Type drop-down.
  • Click Save Settings.

Configuring the Hive Source

To configure the Hive source that you just created, follow these steps:

  • Click Sources on the main menu on the top of the page.
  • Click the Hive source you just created.
  • Click Settings icon. The following page opens.
  • Enter values for all the required fields.

NOTE: Ensure that the source schema that you provide is already existing in Hive.

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard