Skip to content

Terraform module for terraform-gcp-tamr-wrapper

License

Notifications You must be signed in to change notification settings

Datatamer/terraform-gcp-tamr-wrapper

Repository files navigation

Requirements

Name Version
terraform >= 1.0.0
google >= 4.6.0

Providers

No providers.

Modules

Name Source Version
bigtable git::https://github.com/Datatamer/terraform-gcp-bigtable.git v3.0.0
cloud_sql git::https://github.com/Datatamer/terraform-gcp-tamr-cloud-sql.git v6.1.0
config git::https://github.com/Datatamer/terraform-gcp-tamr-config.git v3.0.1
gcs_buckets git::https://github.com/Datatamer/terraform-gcp-tamr-buckets.git v3.0.0
iam git::https://github.com/Datatamer/terraform-gcp-tamr-iam v2.0.1
tamr_vm git::https://github.com/Datatamer/terraform-gcp-tamr-vm.git v2.2.1

Resources

No resources.

Inputs

Name Description Type Default Required
deployment_name name to use as the base for all resources created. NOTE: if you change this it will recreate all resources string n/a yes
project_id project_id for resources to be deployed into string n/a yes
region GCP region to deploy resources into string n/a yes
subnet_self_link subnetwork self_link to deploy resources onto string n/a yes
tamr_bigtable_max_nodes Max number of nodes to scale up to number n/a yes
tamr_instance_image Image to use for Tamr VM boot disk string n/a yes
tamr_instance_service_account email of service account to attach to the tamr instance. If not given will create a new service account for tamr to use. string n/a yes
tamr_zip_uri gcs location to download tamr zip from string n/a yes
zone GCP zone to deploy resources into string n/a yes
additional_admin_users list of additional entities to give admin permissions to provisioned resources list(string) [] no
additional_read_users list of additional entities to give read only permissions to provisioned resources list(string) [] no
bucket_locations Location for the gcs buckets, default is US string "US" no
dataproc_cluster_master_disk_size Size of disk to use on dataproc master disk number 1000 no
dataproc_cluster_master_instance_type Instance type to use as dataproc master string "n1-highmem-4" no
dataproc_cluster_worker_machine_type machine type of default worker pool string "n1-standard-16" no
dataproc_cluster_worker_num_instances Number of default workers to use number 4 no
dataproc_cluster_worker_num_local_ssds Number of localssds to attach to each worker node number 2 no
dataproc_image_version Dataproc image version string "2.0" no
dataproc_network_tags list of network tags to attach to each dataproc node list(string) [] no
enable_cloud_sql_private_ip Whether the Cloud SQL instance should use a public or private ip address bool false no
force_destroy force destroy potentially persistent resources, like bigtable/gcs bool false no
labels Labels to attach to created resources map(string) {} no
pre_install_bash Bash to be run before Tamr is installed.
Likely to be used to meet Tamr's prerequisites, if not already met by the image. (https://docs.tamr.com/new/docs/requirements )
This will only be run once before Tamr is installed, unless Tamr fails to install. This bash will also be run on subsequent attempts to install Tamr, so it is recommended that this bash is idempotent.
string "" no
private_network_id The VPC network from which the Cloud SQL instance is accessible for private IP string "" no
require_ssl Set if SSL is required for the Cloud SQL instance. bool false no
spark_driver_memory Amount of memory spark should allocate to spark driver string "12G" no
spark_executor_cores Amount of cores spark should allocate to each spark executor number 5 no
spark_executor_instances number of spark executor instances number 12 no
spark_executor_memory Amount of memory spark should allocate to each spark executor string "13G" no
spark_properties_override json blob of spark properties to override, if not set will use a default set of properties that should work for most use cases string "" no
sql_disk_size size of the disk to use on the tamr sql instance number 10 no
sql_disk_type The disk type to use on the cloud SQL instance. should be either PD_SSD or PD_STANDARD string "PD_SSD" no
sql_pg_version version of postgres to use on CloudSQL instance string "POSTGRES_12" no
sql_tier the machine type to use for the sql instance string "db-custom-2-4096" no
ssl_mode Set up the SSL mode. If SSL is required, this value is TRUSTED_CLIENT_CERTIFICATE_REQUIRED. string "ALLOW_UNENCRYPTED_AND_ENCRYPTED" no
tamr_bigtable_min_nodes Min number of nodes to scale down to number 1 no
tamr_cloud_sql_sslmode if ssl is disabled or not.Possible values include disable , allow , prefer , require , verify-ca and verify-full string "disable" no
tamr_instance_disk_size size of the boot disk number 100 no
tamr_instance_disk_type boot disk type string "pd-ssd" no
tamr_instance_metadata custom metadata to attach to created VM map(string) {} no
tamr_instance_tags list of network tags to attach to instance list(string) [] no
tamr_license_key Set a tamr license key string "" no

Outputs

Name Description
instance_ip An arbitrary value that changes each time the resource is replaced.
tamr_config_file full tamr config file
tamr_instance_self_link full self link of created tamr vm
tamr_service_account service account tamr is using
tmpl_dataproc_config dataproc config
tmpl_startup_script rendered metadata startup script