TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
-
Updated
Feb 24, 2022 - Scala
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
This code is used to build & run a Docker container for performing predictions against a Spark ML Pipeline.
JPMML-SparkML plugin for converting LightGBM-Spark models to PMML
Recommendation engine in Java. Based on an ALS algorithm (Apache Spark). Train a new model after N seconds.
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
Free High-Quality Financial Data in Azure
Transformation of Akamai Logs with Spark ETL and discover of Values and similarities in logs used SparkML and H2O ML
BigData Engineering Capstone Project with Tech-stack : Linux, MySQL, sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, git
Twitter Sentiment Analysis using Spark, MongoDB, and Google Cloud
Predicting the arrival delay time of commercial flights
Repo for using scala in a kaggle house price prediction.
Using SparkML to build different machine learning models for simulating a small scale of big data management
Online latent state estimation with Spark
Topic modeling from Facebook news pages
A machine learning at scale demo on flight delay prediction. The project includes an exploration of a series of data transformation and ML pipelines in Apache Spark (via Databricks).
Add a description, image, and links to the sparkml topic page so that developers can more easily learn about it.
To associate your repository with the sparkml topic, visit your repo's landing page and select "manage topics."