Infoworks Installation on EMR

Install Infoworks manually on the edge node using the following steps:

  • Download the tarball using the following command: wget <link-to-download>; contact your Infoworks support representative for the download link and substitute it for <link-to-download>.
  • Extract the tarball to the /opt folder using the following command: sudo tar xzf infoworks--emr.tar.gz -C /opt
  • Change the ownership of the installation directory to the respective user using the following command: sudo chown -R {{USER}}:{{USER}} /opt/infoworks
  • Change the directory to Infoworks bin using the following command: cd /opt/infoworks/bin
  • Start services using the following command: ./start.sh all
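
The steps above can be collected into a small script. This is a minimal sketch, not an Infoworks-provided installer: DOWNLOAD_URL, TARBALL, IW_USER, DRY_RUN, run, and install_infoworks are illustrative names, and the script assumes that wget saves the download under the tarball name used in the extract step. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of the manual installation steps. All variable and function names
# here are illustrative assumptions, not part of Infoworks.
set -eu

# Placeholder values; replace with the link from Infoworks support.
DOWNLOAD_URL="${DOWNLOAD_URL:-<link-to-download>}"
# Assumed to match the file name that wget writes for the download link.
TARBALL="${TARBALL:-infoworks--emr.tar.gz}"
# User that should own /opt/infoworks; defaults to the current user.
IW_USER="${IW_USER:-$(id -un)}"
# 1 = print the commands only, 0 = execute them.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

install_infoworks() {
  run wget "$DOWNLOAD_URL"
  run sudo tar xzf "$TARBALL" -C /opt
  run sudo chown -R "$IW_USER:$IW_USER" /opt/infoworks
  run cd /opt/infoworks/bin
  run ./start.sh all
}
```

Calling install_infoworks with DRY_RUN left at 1 lists the five commands for review; setting DRY_RUN=0 before the call executes them in order and stops at the first failure because of set -e.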

Provide the following details when prompted:

  • Enter distribution home: /usr/lib
  • Enter the NameNode hostname: <masternode_private_DNS>
  • Enter the path for Infoworks HDFS home (default: /user/{{USER}}); press Enter to use the default value.
  • Enter the HiveServer2 thrift server hostname: <masternode_private_DNS>
  • Enter the username for hive (default: {{USER}}); press Enter to use the default value.
  • Enter the password for the username (default: {{USER}}); press Enter to use the default value, or enter the user's password.
  • Enter the hive schema for Data Foundry (default: iw_df_workspace); press Enter to use the default value.
  • Enter the Spark master hostname: <masternode_private_DNS>
  • Enter the Spark home directory: /usr/lib/spark
  • Enter the hostname for accessing the Infoworks UI; press Enter to use the default value.
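
Before answering the prompts, it can help to confirm that the values exist on the edge node. The following is a hypothetical pre-flight check, not part of the Infoworks tooling: MASTER_DNS, DIST_HOME, SPARK_DIR, check_dir, check_host, and preflight are illustrative names, and the host check assumes getent is available (as it typically is on Linux).

```shell
#!/bin/sh
# Hypothetical pre-flight check for the values entered at the prompts.
# Names below are illustrative assumptions, not Infoworks settings.
MASTER_DNS="${MASTER_DNS:-<masternode_private_DNS>}"
DIST_HOME="${DIST_HOME:-/usr/lib}"
SPARK_DIR="${SPARK_DIR:-/usr/lib/spark}"

# Report whether a directory exists; return non-zero if it does not.
check_dir() {
  if [ -d "$1" ]; then
    echo "ok: $1"
  else
    echo "missing: $1"
    return 1
  fi
}

# Report whether a hostname resolves through the system resolver.
check_host() {
  if getent hosts "$1" >/dev/null 2>&1; then
    echo "ok: $1 resolves"
  else
    echo "unresolved: $1"
    return 1
  fi
}

# Run every check and return non-zero if any of them failed.
preflight() {
  status=0
  check_dir "$DIST_HOME" || status=1
  check_dir "$SPARK_DIR" || status=1
  check_host "$MASTER_DNS" || status=1
  return "$status"
}
```

Set MASTER_DNS to the master node's private DNS name and call preflight; any line starting with "missing" or "unresolved" points at a value to correct before running ./start.sh all.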

Once all the services have started, start the orchestrator services using the following command: ./start.sh orchestrator