#
aws-emr
Here are 96 public repositories matching this topic...
AWS Auto Terminate Idle AWS EMR Clusters Framework is an AWS based solution using AWS CloudWatch and AWS Lambda using a Python script that is using Boto3 to terminate AWS EMR clusters that have been idle for a specified period of time.
python
emr
aws
automation
cloudformation
aws-lambda
serverless
etl
bigdata
cloudwatch
aws-emr
idle
amazon-web-services
boto3
aws-cloudformation
aws-cloudwatch
datalake
cft
terminate
python-3-7
-
Updated
Sep 13, 2021 - Python
Bits of code I use during live demos
amazon-emr
aws-emr
aws-cloudformation
aws-athena
amazon-athena
emr-cluster
aws-cloudformation-templates
emr-notebooks
live-demos
-
Updated
Mar 12, 2022 - Jupyter Notebook
A collection of airflow sample workflows for data processing on aws
-
Updated
Dec 1, 2017 - Python
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
python
aws
big-data
spark
aws-emr
pyspark
dataengineering
big-data-analytics
ec2-spot
emr-cluster
wordcloud-generator
ec2-spot-instances
-
Updated
Sep 24, 2021 - Python
Create Data Lake on AWS S3 to store dimensional tables after processing data using Spark on AWS EMR cluster
apache-spark
aws-s3
aws-emr
pyspark
data-engineering
data-lake
json-format
udacity-nanodegree
spark-dataframes
dimensional-model
star-schema
etl-pipeline
-
Updated
Oct 10, 2019 - Python
A cookiecutter template for working with PySpark on AWS EMR
python
aws
data-science
spark
cookiecutter
aws-emr
pyspark
jupyterhub
cookiecutter-python
cloudformation-template
cookiecutter-template
cookiecutter-datascience
-
Updated
Aug 30, 2020 - Python
A large-scale data framework that will enable us to store and analyze financial market data and drive future predictions for investment.
aws
twitter
big-data
hive
hadoop
tweets
python3
data-warehouse
aws-emr
stock-prices
nasdaq
nyse
emr-cluster
snowflake-schema
star-schemas
warehousing-stock-data
-
Updated
Mar 7, 2020 - TSQL
Cloud-based AI / ML workflow and data application development framework
flow
aws
data-science
machine-learning
cloud
ai
spark
aws-lambda
serverless
aws-emr
pyspark
feature-engineering
scala-spark
event-based
aws-glue
sagemaker-notebook
low-code-framework
sagemaker-notebook-instance
bring-your-own-account
-
Updated
Mar 11, 2022 - Python
Lambda to start EMR and run a map reduce job
-
Updated
Aug 16, 2019 - Python
Generic python library that enables to provision emr clusters with yaml config files (Configuration as Code)
-
Updated
Apr 30, 2021 - Python
MapReduce Analysis on Amazon Food Review Dataset (Big-Data)
-
Updated
Aug 6, 2017
My AWS Playground
aws-lambda
aws-s3
aws-apigateway
aws-emr
aws-cognito
aws-serverless
aws-vpc
aws-cloudfront
aws-codecommit
aws-acm
aws-amplify
aws-appsync
aws-cdk
aws-msk
aws-copilot
-
Updated
Mar 27, 2022 - Python
Code and documentation for the demonstration example of the real-time bushfire alerting with the Complex Event Processing (CEP) in Apache Flink on Amazon EMR and a simulated IoT sensor network as described on the AWS Big Data Blog: Real-time bushfire alerting with Complex Event Processing in Apache Flink on Amazon EMR and IoT sensor network.
-
Updated
Sep 14, 2018 - Java
This repository will be used to understand data science and data engineering concepts
-
Updated
Oct 14, 2020 - Scala
Build modern workflows with AWS MWAA, AWS Step Functions, AWS Glue, and AWS EMR
-
Updated
May 18, 2021 - Python
Data Lake hosted on the AWS EMR cluster with S3 buckets used as source and output storages. The analysis was done using AWS Athena.
-
Updated
Jun 27, 2020 - Python
Machine Learning on a Large 12 GB dataset with Pyspark on AWS EMR
-
Updated
May 20, 2021 - Jupyter Notebook
AWS 및 AWS를 이용한 Data Lake 구성 이해
-
Updated
Oct 25, 2021
Data Analysis Exercise over Walmart Stock
python
linux
scala
spark
hadoop
aws-emr
aws-ec2
walmart-data-analysis
data-analysis-exercise
walmart-stock
-
Updated
Jun 26, 2019 - Jupyter Notebook
Improve this page
Add a description, image, and links to the aws-emr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the aws-emr topic, visit your repo's landing page and select "manage topics."