Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
Updated Sep 1, 2022 - Python
Apache Hudi examples designed to be run on Amazon Elastic MapReduce (EMR) via EMR Studio or EMR Notebooks.
Apache Iceberg examples designed to be run on Amazon Elastic MapReduce (EMR) via EMR Studio or EMR Notebooks.
Delta Lake examples designed to be run on Amazon Elastic MapReduce (EMR) via EMR Studio or EMR Notebooks.
Cluster Creation using Terraform.
Hudi Workshop using Terraform.
Pig Workshop using CloudFormation.
PennBook is a highly scalable implementation of the core functionality of facebook.com. It uses a Node.js server, React.js for the frontend, and Hadoop-ecosystem libraries such as Apache Spark, along with AWS Elastic MapReduce, for its big data functionality.
Hive Workshop using CloudFormation.
EMR Notebooks and SageMaker using Terraform.
Orchestrating Amazon EMR with AWS Step Functions using Terraform.
Spark-based ETL using Terraform.
Pig Workshop using Terraform.
EMR Managed Scaling using Terraform.
Presto Workshop using Terraform.
EMR Notebooks and SageMaker using CloudFormation.
Hudi Workshop using CloudFormation.
Orchestrating Amazon EMR with AWS Step Functions using CloudFormation.
EMR Managed Scaling using CloudFormation.
Spark-based ETL using CloudFormation.