Compare tables within or across databases
-
Updated
Apr 13, 2023 - Python
Compare tables within or across databases
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
This repository provides various demos/examples of using Snowpark for Python.
An open source development framework to help you build data workflows and modern data architecture on AWS.
A Data Platform built for AWS, powered by Kubernetes.
Code and data for the Modern Polars book
Recohut - Learn data engineering, data science
Index for online reading materials in order to learn Python and backend development/engineering concepts from scratch and develop a mastery sufficient for Senior/Principal Backend Engineers and Data Engineers
Build, test, deploy, iterate - Dev and prod tool for data science pipelines
Data engineering interviews Q&A for data community by data community
Predict stock price based on financial news feeds
Apply for a job at Olist's Data Team: https://olist.gupy.io/
Found a data engineering challenge or participated in a selection process ? Share with us!
Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.
Dockerizing an Apache Spark Standalone Cluster
Instant search for and access to many datasets in Pyspark.
Forecasting Solar Power: Analysis of using a LSTM Neural Network
Add a description, image, and links to the dataengineering topic page so that developers can more easily learn about it.
To associate your repository with the dataengineering topic, visit your repo's landing page and select "manage topics."