Configuring External Kerberos Server for HA EMR

EMR with multiple master nodes requires external Kerberos.

Following is the procedure to configure the Kerberos server:

Launching EC2 Server

  • Login to AWS console and navigate to EC2.
  • Click the Launch Instance button.
  • In the Choose an Amazon Machine Image (AMI) page, click the Select button for the Amazon Linux 2 AMI.
  • In the next steps, select the VPC, security group, and key for authentication and launch an instance. This is the External Kerberos server.

NOTE Ensure that the 749 and 88 ports are open to the EMR cluster and Edge node.

Configuring Network and Keys to SSH

  • SSH into the Kerberos server.
  • Install the Kerberos server packages using the following command:

sudo yum install krb5-libs krb5-server krb5-workstation

  • Define the Kerberos realm for the cluster by modifying the /etc/krb5.conf configuration file. The following is an example to configure a Kerberos server with a realm, REALM.DOMAIN (INFOWORKS.IO), on a host, ip-10-00-03-218.ec2.internal.
Copy
  • Define the Kerberos realm for the cluster by modifying the /var/kerberos/krb5kdc/kdc.conf configuration file. The following is an example:
Copy
  • If required, modify the /var/kerberos/krb5kdc/kadm5.acl file to grant the appropriate permissions to the admin.
  • Create the KDC database using the following command:

kdb5_util create -s

  • Enter the appropriate password for the KDC database master key.
  • Start the Kerberos daemons using the following commands:

root@kdc-server$ /sbin/service krb5kdc start

root@kdc-server$ /sbin/service kadmin start

  • To start Kerberos automatically on system restart, run the following commands:

root@kdc-server$ /sbin/chkconfig krb5kdc on

root@kdc-server$ /sbin/chkconfig kadmin on

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard