DataFoundry 2.9.1
2.9.1
Core Functionalities
Title
Message
Create new category
What is the title of your new category?
Edit page index title
What is the title of the page index?
Edit category
What is the new title of your category?
Edit link
What is the new title and URL of your link?
Installing HDI with ESP
Copy Markdown
Open in ChatGPT
Open in Claude
Prerequisites
Following are the prerequisites to install HDInsight cluster with ESP:
- Install and configure Azure Active Directory.
- Create a group for Cluster Administration in Azure Active Directory.
- Create a Cluster Admin User. The user does not require special rights or groups.
- Set up different groups for the following roles in Azure Active Directory for Infoworks Authentication: System Admin, Database Admin, Data Analyst, Data Modeller.
- Install and configure Azure Active Directory Domain Services (AADDS) from the Azure Portal.
- Enable Secure LDAP for AADDS in Azure Portal. The default is false.
- Create an SSL certificate. The certificate can be self-signed but a certificate from a trusted issuer is recommended. The certificate must be a wildcard for the AADDS domain. For example, *.myaadds.org. For details on the set up, see Configure secure LDAP (LDAPS) for an Azure AD Domain Services managed domain.
- Go to security and ensure that only inbound access to specific IP address is allowed, and not to the internet. Else, LDAP will be exposed to the public internet.
- To set up HDInsight cluster in a different resource group, ensure a VNet is created in a new resource group and peered with Azure AD VNet and vice-versa.
- Set DNS provided by this VNet as the IPs of DNS services provided by AADDS, or a DNS forwarder that can resolve DNS queries to the AADDS domain.
- Create a managed identity. Assign the HDInsight Domain Service Contributor role to the managed identity within the AADDS Azure resource.
For more details on ESP, see Configure a HDInsight cluster with Enterprise Security Package by using Azure Active Directory Domain Services.
HDInsight HBase Cluster Installation
Create an HDInsight HBase Cluster version 3.6. with ESP option selected. Ensure that an associated VNet and Subnet is available. Note the following when creating the cluster:
- Cluster Name
- Cluster Ambari Login User Name
- Cluster Ambari Login Password
- LDAP Admin User Password (Admin user when creating HDInsight cluster)
Following are the steps to setup the HBase Clusters (choose Custom):
Basics
- Enter the following details:
| Field | Description |
|---|---|
| Cluster Name | Name of the cluster |
| Subscription | Subscription ID |
| Cluster type | HBase O/S: Linux Version: HBase 1.1.2 (HDI 3.6) |
| Cluster login username | Ambari login username |
| Cluster login password | Ambari login password |
| Secure Shell (SSH) username | SSH username |
| Use cluster login password for SSH (Checkbox) | Check this box if the SSH and cluster login password must be the same |
| Resource group | Select the existing resource group |
| Location | Resource group location |
- Click Select and click Next.

Security and Networking
- Enter the following details:
| Field | Description |
|---|---|
| Enterprise Security Package | |
| Enterprise Security Package | Select the Enabled option. |
| AADDS DNS Domain Name | Domain name |
| Cluster Admin User | LDAP admin user for HDInsight cluster |
| Cluster access group | Groups to access the HDInsight cluster |
| LDAPS URL | Retain the default url, ldaps://*.com:636 |
| Virtual Network | |
| Virtual network | The VNet created as mentioned in the prerequisite |
| Subnet | Subnet associated with the VNet |
| Identity | |
| Subscription | Subscription ID |
| User-assigned managed identity | Managed identity created as mentioned in the prerequisite |
- Click Next.

Storage
- Enter the following details:
| Field | Description |
|---|---|
| Storage Account Settings | |
| Primary storage type | Select Azure Storage |
| Storage account (new) | Create a new storage account |
| Container | (auto-populated) |
| Additional storage account | Leave the field blank |
| Data Lake Storage Gen1 access | Leave the field blank |
| Metastore Settings | |
| Select a SQL database for Hive | Leave the field blank |
| Select a SQL database for Oozie | Leave the field blank |
| Select a SQL database for Ranger | Leave the field blank |
- Click Next.

Applications[Optional]
Cluster Size
- Enter the following details:
| Description | Value |
|---|---|
| Cluster Size | |
| Number of Region nodes | 10 |
| Region node size (10 nodes) | D13 v2 |
| Head node size (2 nodes) | D13 v2 |
| Zookeeper node sizes | A4 v2 |
- Click Select and click Next.

Script Actions[Optional]
Summary
- Verify that the information is correct and create the clusters accordingly.

- Click Create.
Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard
Last updated on
Next to read:
Infoworks Installation on Existing HDI with ESPFor more details, refer to our Knowledge Base and Best Practices!
For help, contact our support team!
(C) 2015-2022 Infoworks.io, Inc. and Confidential
Discard Changes
Do you want to discard your current changes and overwrite with the template?
Archive Synced Block
Message
Create new Template
What is this template's title?
Delete Template
Message