Instance groups only support a single subnet and Availability Zone. Use
ec2_attributes.subnet_id (not subnet_ids) when configuring an instance-group cluster.Group types
The module exposes three group variables, one per node role:| Variable | Node role | Resource |
|---|---|---|
master_instance_group | Master | Inline in aws_emr_cluster |
core_instance_group | Core | Inline in aws_emr_cluster |
task_instance_group | Task | Separate aws_emr_instance_group resource |
master_instance_group
master_instance_group
The master group controls the primary node that coordinates the cluster.
EC2 instance type for the master node, for example
"m5.xlarge".Number of master instances to launch. Use
3 for high-availability master configurations.Display name for the group.
Maximum Spot price in USD per instance-hour. When set, EMR launches master nodes as Spot instances.
EBS volumes to attach. See EBS config for nested attributes.
core_instance_group
core_instance_group
The core group stores data in HDFS and runs compute tasks.
EC2 instance type for core nodes.
Number of core instances to launch.
Display name for the group.
Maximum Spot price in USD per instance-hour.
JSON string containing an EMR autoscaling policy document. When provided, EMR uses this policy to automatically scale the core group.
EBS volumes to attach. See EBS config for nested attributes.
task_instance_group
task_instance_group
The task group adds compute-only capacity. Task nodes do not store HDFS data.
EC2 instance type for task nodes.
Number of task instances to launch.
Display name for the group.
Maximum Spot price in USD per instance-hour. Task nodes are good candidates for Spot because they hold no HDFS data.
JSON string containing an EMR autoscaling policy document.
JSON string for per-group application configuration overrides.
EBS volumes to attach. See EBS config for nested attributes.
Whether EBS optimization is enabled for the instance type. Defaults to
true.EBS config
All three group types accept anebs_config list to attach additional EBS volumes to each instance.
Volume size in GiB.
EBS volume type, for example
"gp3" or "io2".Provisioned IOPS. Only valid for
io1 and io2 volume types.Throughput in MiB/s. Only valid for
gp3 volumes.Number of volumes of this configuration to attach per instance.