Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
-
Updated
Mar 5, 2023 - Python
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Simple and Distributed Machine Learning
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
A native Rust library for Delta Lake, with bindings into Python
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
Databricks Terraform Provider
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Apache Spark Connector for Azure Cosmos DB
Manage your Databricks deployments and CI with code.
Testing framework for Databricks notebooks
Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational foundations using SageMath.
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities
Capture deep metrics on one or all assets within a Databricks workspace
machine learning for genomic variants
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Bloat-free, no BS cloud storage SDK.
Tools for Deploying Databricks Solutions in Azure
Add a description, image, and links to the databricks topic page so that developers can more easily learn about it.
To associate your repository with the databricks topic, visit your repo's landing page and select "manage topics."