What is Elastic MapReduce(EMR) Encryption?

  • You can encrypt at rest or in transit or both
  • They exist as a separate entity within EMR
  • By default, the instances within a cluster don’t encrypt data at rest
  • The instances within EMR are created from pre-configured AMIs (Amazon Machine Images)
  • You must use EMR version 5.7.0 or later to use custom AMIs and encrypt the root device volume for specific compliance reasons.

EMR encryption with EBS

  • Linux Unified Key Setup — You can specify AWS KMS to be used as your key management provider or use a Custom Key provider
  • Open-Source HDFS Encryption — Secure Hadoop RPC use SASL, Data encryption of HDFS Block transfer use the AES 256

EMR encryption with S3

  • EMR supports SSE-S3 or SSE-KMS for server-side encryption
  • You can also use CSE-KMS or CSE-C for encryption before storage
  • PEM — you need to create PEM certificates and reference its zip file in S3
  • Custom — you need a custom certificate provider as a Java class
  • Hadoop MapReduce Encrypted Shuffle uses TLS
  • Secure Hadoop PRC uses SASL
  • Data encryption of HDFS Block Transfer uses AES-256
  • When using EMR version 5.6.0 and later, any internal communication between Presto nodes use SSL/TLS
  • Tez Shuffle Handler uses TLS
  • Akka protocol uses TLS
  • Block transfer service uses SASL and 3DES
  • Ensure that the role assigned to your EC2 instances within the cluster has the relevant permissions to enable access to the CMK
  • Add the relevant role to the Key users for the CMK
  • Data is encrypted and decrypted transparently without requiring changes to the application code
  • Each HDFS encryption zone has its own KMS Key; by default, EMR uses the Hadoop KMS, but you can also select an alternative
  • Each file is encrypted by a different data key, which is encrypted with the HDFS encryption zone key; it’s not possible to move files between encryption zones!
Nadtakan Futhoem — Sr. Software Engineer




Founder of Nadtakan.com & Serverless Cloud developer. Follow me on Twitter https://twitter.com/nadtakanfuthoem

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

{UPDATE} Crazy Cars Race Hack Free Resources Generator

The Snapshot for MILKY airdrop

Drip.Community doing an airdrop

Wiretapping and Visualising the Malware Traffic

10 Security Mistakes You Might Be Making On Or Offline

Is my computer hacked?

What is a Penetration Tester?

Unable to connect to the Synchronization Service Manager

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Nadtakan Futhoem

Nadtakan Futhoem

Founder of Nadtakan.com & Serverless Cloud developer. Follow me on Twitter https://twitter.com/nadtakanfuthoem

More from Medium

Dealing with terabytes to exabytes in AWS Cloud Migrations..!

Connecting Robo 3T with DocumentDB outside AWS VPC.

File Viewer for the Cloud

How to Create an encrypted AWS RDS Database from an unencrypted Database