Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Describe the bug
Using a time dimension on a runningTotal measure on Snowflake mixes quoted and unquoted columns in the query. This fails the query, because Snowflake has specific rules about quoted columns. Specifically:

All unquoted column names are treated as upper case
Quoted column names are case sensitive.

So "date_from" <> date_from

To Reproduce
Steps to reproduce

At this moment relu_layer op doesn't allow threshold configuration, and legacy RELU op allows that.
We should add configuration option to relu_layer.

Feature request

Overview

SBT tests currently run sequentially. It would be good to reduce the total test runtime by parallelizing the SBT tests.

Motivation

SBT tests are taking longer and longer. This is not scalable. While we have already split various version of Scala tests into two CI builds in the repo, each build takes a long time. This is a burden for local testing as

I have a simple regression task (using a LightGBMRegressor) where I want to penalize negative predictions more than positive ones. Is there a way to achieve this with the default regression LightGBM objectives (see https://lightgbm.readthedocs.io/en/latest/Parameters.html)? If not, is it somehow possible to define (many example for default LightGBM model) and pass a custom regression objective?

Used Spark version
Spark Version: 2.4.4
Used Spark Job Server version
SJS version: v0.11.1

Deployed mode
client on Spark Standalone

Actual (wrong) behavior
I can't get config, when post a job with 'sync=true'. I got it：
http://localhost:8090/jobs/ff99479b-e59c-4215-b17d-4058f8d97d25/config
{"status":"ERROR","result":"No such job ID ff99479b-e59c-4215-b17d-4058f8d97d25"

Apache Spark

Here are 6,618 public repositories matching this topic...

apache / spark

getredash / redash

yeasy / docker_practice

cube-js / cube.js

eclipse / deeplearning4j

aalansehaiyang / technology-talk

horovod / horovod

zhisheng17 / flink-learning

heibaiying / BigData-Notes

FavioVazquez / ds-cheatsheets

wangzhiwubigdata / God-Of-BigData

Angel-ML / angel

h2oai / h2o-3

apache / zeppelin

Alluxio / alluxio

delta-io / delta

Feature request

Overview

Motivation

PipelineAI / pipeline

DataTalksClub / data-engineering-zoomcamp

intel-analytics / BigDL

yahoo / TensorFlowOnSpark

microsoft / SynapseML

Cyb3rWard0g / HELK

spark-notebook / spark-notebook

databricks / koalas

spark-jobserver / spark-jobserver

JohnSnowLabs / spark-nlp

douban / dpark

RoaringBitmap / RoaringBitmap

apache / incubator-linkis

ballista-compute / ballista

Related Topics