What the module creates
Depending on which cluster type you configure, the module can provision:
- EMR clusters on EC2 using instance fleets or instance groups (public or private subnets)
- EMR Serverless applications
- EMR Virtual Clusters running on Amazon EKS
- EMR Studios
- Security groups for `master`, `core`, and `task` nodes, plus a `service` security group for private clusters
- IAM roles for autoscaling, the EMR service, and EC2 instance profiles
Cluster types
EC2 clusters
Deploy Spark, Hive, and other frameworks on EC2 using instance fleets or instance groups in public or private subnets.
Serverless
Run Spark and Hive workloads without managing cluster infrastructure using EMR Serverless.
Virtual clusters (EKS)
Run EMR workloads on your existing Amazon EKS clusters using EMR on EKS virtual clusters.
Studios
Create collaborative development environments for data engineers and scientists using EMR Studio.
Requirements
| Tool | Minimum version |
|---|---|
| Terraform | >= 1.5.7 |
| AWS provider (hashicorp/aws) | >= 6.35 |
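
The minimums in the table can be expressed as version constraints in the configuration that calls the module. The block below is a minimal sketch of what a consuming root module might declare, not a copy of the module's own constraints file.

```hcl
terraform {
  # Minimum Terraform CLI version listed in the requirements table
  required_version = ">= 1.5.7"

  required_providers {
    aws = {
      source = "hashicorp/aws"
      # Minimum AWS provider version listed in the requirements table
      version = ">= 6.35"
    }
  }
}
```

With these constraints in place, `terraform init` fails early if the installed CLI or the selected provider is older than the listed minimums.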
VPC and subnet tagging
Module sources
The module is published to the Terraform Registry as four sub-modules, one per cluster type. Pin to a specific version to avoid unexpected changes.

If you are new to AWS EMR, start with the Quickstart to deploy a private EC2 cluster with an instance fleet in a few minutes.
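
As an illustration of version pinning, module blocks might look like the following sketch. The source address `terraform-aws-modules/emr/aws`, the `//modules/serverless` sub-module path, and the module labels `emr_ec2` and `emr_serverless` are assumptions that this page does not state, and `<pinned-version>` is a placeholder rather than a real release. Substitute the addresses and a published version shown on the Registry.

```hcl
# Assumed source address for the EC2 cluster module; verify on the Registry.
module "emr_ec2" {
  source  = "terraform-aws-modules/emr/aws"
  version = "<pinned-version>" # placeholder: replace with an exact release, e.g. "X.Y.Z"

  # Cluster inputs omitted in this sketch
}

# Assumed sub-module path for EMR Serverless; verify on the Registry.
module "emr_serverless" {
  source  = "terraform-aws-modules/emr/aws//modules/serverless"
  version = "<pinned-version>" # placeholder: keep in step with the pin above

  # Application inputs omitted in this sketch
}
```

An exact version string gives the most reproducible plans. A pessimistic constraint (`~>`) is a common alternative when patch releases should be picked up automatically.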