Apache Spark
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Here are 6,618 public repositories matching this topic...
At the moment, the relu_layer op does not allow threshold configuration, whereas the legacy RELU op does.
We should add a threshold configuration option to relu_layer.
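As a rough illustration of what a configurable threshold could mean, here is a minimal pure-Python sketch. The function name `thresholded_relu` and its `threshold` parameter are hypothetical and are not the library's API. The sketch assumes that values at or below the threshold map to zero, which may differ from the legacy RELU op's exact semantics.

```python
from typing import List


def thresholded_relu(inputs: List[float], threshold: float = 0.0) -> List[float]:
    """Hypothetical helper: pass a value through only if it exceeds `threshold`.

    Assumption: values at or below the threshold are replaced by 0.0.
    With the default threshold of 0.0 this reduces to the ordinary ReLU.
    """
    return [x if x > threshold else 0.0 for x in inputs]


if __name__ == "__main__":
    values = [-1.0, 0.5, 2.0]
    print(thresholded_relu(values))                 # ordinary ReLU
    print(thresholded_relu(values, threshold=1.0))  # stricter cutoff
```

Keeping the default at 0.0 would leave existing callers unaffected if such an option were added.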
Feature request
Overview
SBT tests currently run sequentially. It would be good to reduce the total test runtime by parallelizing the SBT tests.
Motivation
SBT tests are taking longer and longer, which does not scale. Although we have already split the tests for the various Scala versions into two CI builds in the repo, each build still takes a long time. This is a burden for local testing as
I have a simple regression task (using a LightGBMRegressor) where I want to penalize negative predictions more than positive ones. Is there a way to achieve this with the default regression LightGBM objectives (see https://lightgbm.readthedocs.io/en/latest/Parameters.html)? If not, is it possible to define a custom regression objective (there are many examples for the default LightGBM model) and pass it in?
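One way to picture such a custom objective is as a function that returns the per-sample gradient and Hessian of the loss with respect to the prediction, which is the general shape gradient-boosting libraries expect. The sketch below is pure Python and is only an illustration: the name `asymmetric_l2_objective` and the `penalty` parameter are hypothetical, "negative predictions" is assumed to mean predictions whose value is below zero, and whether the Spark LightGBMRegressor wrapper accepts a callable like this is not stated in the question.

```python
from typing import List, Tuple


def asymmetric_l2_objective(
    y_true: List[float], y_pred: List[float], penalty: float = 3.0
) -> Tuple[List[float], List[float]]:
    """Hypothetical asymmetric squared-error objective.

    Loss per sample: w * (pred - true)^2, where w = penalty if pred < 0, else 1.
    Returns the first and second derivatives with respect to the prediction,
    treating w as constant on each side of zero.
    """
    grad, hess = [], []
    for true, pred in zip(y_true, y_pred):
        weight = penalty if pred < 0 else 1.0
        grad.append(2.0 * weight * (pred - true))
        hess.append(2.0 * weight)
    return grad, hess


if __name__ == "__main__":
    g, h = asymmetric_l2_objective([1.0, 1.0], [-1.0, 2.0], penalty=3.0)
    print(g, h)
```

A larger `penalty` pushes the booster harder away from sub-zero predictions; setting it to 1.0 recovers plain squared error.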
Used Spark version
Spark Version: 2.4.4
Used Spark Job Server version
SJS version: v0.11.1
Deployed mode
client on Spark Standalone
Actual (wrong) behavior
I can't get the config when I post a job with 'sync=true'. This is what I get:
http://localhost:8090/jobs/ff99479b-e59c-4215-b17d-4058f8d97d25/config
{"status":"ERROR","result":"No such job ID ff99479b-e59c-4215-b17d-4058f8d97d25"
Created by Matei Zaharia
Released May 26, 2014
- Repository: apache/spark
- Website: spark.apache.org
- Wikipedia
Describe the bug
Using a time dimension on a runningTotal measure on Snowflake produces a query that mixes quoted and unquoted column names. The query then fails, because Snowflake has specific rules about quoted identifiers. Specifically:
So "date_from" <> date_from
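The mismatch follows from how Snowflake resolves identifiers: an unquoted identifier is stored and resolved in uppercase, while a double-quoted identifier keeps its exact case. The sketch below models only that rule; `resolve_identifier` is a hypothetical helper written for illustration, not part of any Snowflake client.

```python
def resolve_identifier(identifier: str) -> str:
    """Hypothetical model of Snowflake identifier resolution.

    Quoted identifiers keep their exact case (a doubled quote inside stands
    for one literal quote); unquoted identifiers are folded to uppercase.
    """
    if len(identifier) >= 2 and identifier.startswith('"') and identifier.endswith('"'):
        return identifier[1:-1].replace('""', '"')
    return identifier.upper()


if __name__ == "__main__":
    quoted = resolve_identifier('"date_from"')
    unquoted = resolve_identifier("date_from")
    print(quoted, unquoted, quoted == unquoted)
```

Under this rule the two spellings only refer to the same column when the quoted form is already uppercase, for example `"DATE_FROM"`.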
To Reproduce
Steps to reproduce