Installing HDI with ESP

Prerequisites

Following are the prerequisites to install HDInsight cluster with ESP:

  • Install and configure Azure Active Directory.
  • Create a group for Cluster Administration in Azure Active Directory.
  • Create a Cluster Admin User. The user does not require special rights or groups.
  • Set up different groups for the following roles in Azure Active Directory for Infoworks Authentication: System Admin, Database Admin, Data Analyst, Data Modeller.
  • Install and configure Azure Active Directory Domain Services (AADDS) from the Azure Portal.
  • Enable Secure LDAP for AADDS in Azure Portal. The default is false.
  • Create an SSL certificate. The certificate can be self-signed but a certificate from a trusted issuer is recommended. The certificate must be a wildcard for the AADDS domain. For example, *.myaadds.org. For details on the set up, see Configure secure LDAP (LDAPS) for an Azure AD Domain Services managed domain.
  • Go to security and ensure that only inbound access to specific IP address is allowed, and not to the internet. Else, LDAP will be exposed to the public internet.
  • To set up HDInsight cluster in a different resource group, ensure a VNet is created in a new resource group and peered with Azure AD VNet and vice-versa.
  • Set DNS provided by this VNet as the IPs of DNS services provided by AADDS, or a DNS forwarder that can resolve DNS queries to the AADDS domain.
  • Create a managed identity. Assign the HDInsight Domain Service Contributor role to the managed identity within the AADDS Azure resource.

For more details on ESP, see Configure a HDInsight cluster with Enterprise Security Package by using Azure Active Directory Domain Services.

HDInsight HBase Cluster Installation

Create an HDInsight HBase Cluster version 3.6. with ESP option selected. Ensure that an associated VNet and Subnet is available. Note the following when creating the cluster:

  • Cluster Name
  • Cluster Ambari Login User Name
  • Cluster Ambari Login Password
  • LDAP Admin User Password (Admin user when creating HDInsight cluster)

Following are the steps to setup the HBase Clusters (choose Custom):

Basics

  • Enter the following details:
FieldDescription
Cluster NameName of the cluster
SubscriptionSubscription ID
Cluster typeHBase, O/S: Linux, Version: HBase 1.1.2 (HDI 3.6)
Cluster login usernameAmbari login username
Cluster login passwordAmbari login password
Secure Shell (SSH) usernameSSH username
Use cluster login password for SSH (Checkbox)Check this box if the SSH
and cluster login password
must be the same
Resource groupSelect the existing resource group
LocationResource group location
  • Click Select and click Next.

Security and Networking

  • Enter the following details:
FieldDescription
Enterprise Security Package

Enterprise Security PackageSelect the Enabled option.
AADDS DNS Domain NameDomain name
Cluster Admin UserLDAP admin user
for HDInsight cluster
Cluster access groupGroups to access the HDInsight cluster
LDAPS URLRetain the default url, ldaps://*.com:636
Virtual Network
Virtual networkThe VNet created as mentioned
in the prerequisite
SubnetSubnet associated with the VNet
Identity
SubscriptionSubscription ID
User-assigned managed identityManaged identity created as
mentioned in the prerequisite
  • Click Next.

Storage

  • Enter the following details:
FieldDescription
Storage Account Settings
Primary storage typeSelect Azure Storage
Storage account (new)Create a new storage account
Container(auto-populated)
Additional storage accountLeave the field blank
Data Lake Storage Gen1 accessLeave the field blank
Metastore Settings
Select a SQL database for HiveLeave the field blank
Select a SQL database for OozieLeave the field blank
Select a SQL database for RangerLeave the field blank
  • Click Next.

Applications[Optional]

Cluster Size

  • Enter the following details:
DescriptionValue
Cluster Size
Number of Region nodes10
Region node size (10 nodes)D13 v2
Head node size (2 nodes)D13 v2
Zookeeper node sizesA4 v2
  • Click Select and click Next.

Script Actions[Optional]

Summary

  • Verify that the information is correct and create the clusters accordingly.
  • Click Create.
Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard