Infoworks Installation on EMR
Install Infoworks in the EdgeNode manually using the following steps:
- Download tar ball using the following command: wget <link-to-download>; reach out to your Infoworks support representative to get the link to download and replace with the link.
- Extract the tar to the /opt folder using the following command: sudo tar xzf infoworks--emr.tar.gz -C /opt
- Change the permissions to the respective user using the following command: sudo chown -R {{USER}}:{{USER}} /opt/infoworks
- Change the directory to Infoworks bin using the following command: cd /opt/infoworks/bin
- Start services using the following command: ./start.sh all
Provide the following details (mentioned in bold):
- Enter distribution home: /usr/lib
- Enter the NameNode hostname: <masternode_private_DNS>
- Enter the path for Infoworks HDFS home default: /user/{{USER}}; click Enter to use default value.
- Enter the HiveServer2 thrift server hostname: <masternode_private_DNS>
- Enter the username for hive default: {{USER}}; click Enter to use default value.
- Enter the password for the username: default: {{USER}}; click Enter to use default value or enter the password of the user.
- Enter the hive schema for Data Foundry default: iw_df_ workspace:; click Enter to use default value.
- Enter the Spark master hostname: <masternode_private_DNS>
- Enter the Spark home directory: /usr/lib/spark
- Enter the hostname for accessing Infoworks UI; click Enter to use default value.
Once all the services are started, run orchestrator services using the following command: ./start.sh orchestrator to .
