Here are
46 public repositories
matching this topic...
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Updated
Aug 9, 2020
Scala
Apache Spark examples exclusively in Java
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
Updated
Oct 21, 2018
Python
💰 A bot for maximizing the borrow subreddit
Updated
Feb 13, 2017
JavaScript
Media Management System: ingestion, processing, encoding, delivery, ...
Extensible streaming ingestion pipeline on top of Apache Spark
Updated
Aug 6, 2020
Scala
Parallel Streaming Transformation Loader
Updated
Apr 23, 2019
Java
👥 [WIP] An experimental High Available Reverse Proxy for Massive Asynchronous Message Consumption
Biocaddie Data Processing Pipeline. A data ingestion pipeline that collects and transforms original metadata information to a unified metadata model, called DatA Tag Suite (DATS).
Receiving end of new worker to push data across DC boundaries
Updated
Mar 10, 2020
Java
Fast and sustainable Elasticsearch ingestion, migration, and cloning
Tagbase is a Flask application which provides OpenAPI REST endpoints for ingestion of various files into the Tagbase SQL database
Updated
Feb 7, 2020
PLSQL
Updated
Dec 4, 2018
Scala
Spark with Java - chapter 9
Updated
Jun 21, 2020
Java
Microservice to ingest data from Replicate and push it into DAF. Warning: this repo is deprecated.
Updated
Jan 14, 2018
Java
Generic data ingestion for Elasticsearch to be visualized by Kibana.
Updated
Jul 9, 2020
JavaScript
Broadway is a distributed actor-based processing server optimized for high-speed data/file ingestion
Updated
Mar 29, 2016
Scala
A data pipeline management platform
Updated
Jul 30, 2020
JavaScript
Updated
Feb 24, 2017
JavaScript
Ingest a file in GEDCOM format into MongoDB
Updated
Jul 11, 2019
Python
Data ingestor that reads and parses executive orders from wikisource
Updated
Apr 2, 2017
Python
sqoop import scripts for oracle,mysql,db2 and sql server
Updated
Feb 18, 2019
Shell
Periodically ingest incremental updates (inserts / deletes) into BigQuery using Cloud Composer / Airflow orchestration workflow
Updated
Dec 12, 2019
Python
Ingestion REST api that writes to a Kafka topic
Updated
Sep 28, 2017
JavaScript
Python based ingestion, SQL, Hadoop, Bash scripting
Updated
Feb 21, 2018
Python
DAILP Ingest (of Cherokee language data from Google Sheets)
Updated
Mar 23, 2020
Clojure
API Gateway -> Kinesis Data Streams -> Kinesis Data Firehose -> S3
Updated
Feb 14, 2019
Shell
A distributed processing/orchestration server and ETL for NodeJS
Updated
Apr 22, 2017
Scala
Ingestion Server for Magen Data Leak Prevention Software
Updated
Mar 16, 2018
Python
"Disco" is the name of the automated media ingestion system developed at RXMusic (PCMusic). This repository contains the ansible scripts for the deployment, administration, and management of the Disco cluster.
Improve this page
Add a description, image, and links to the
ingestion
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
ingestion
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.