Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
-
Updated
Mar 5, 2023 - Python
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
A Scala kernel for Jupyter
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Qubole Sparklens tool for performance tuning Apache Spark
The Internals of Spark SQL
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Use SQL to build ELT pipelines on a data lakehouse.
Apache Spark™ and Scala Workshops
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
MCW Big data analytics and visualization
Spark Structured Streaming / Kafka / Cassandra / Elastic
A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype
SQL Parsers for BigData, built with antlr4.
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.
To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."