big-data
Here are 2,195 public repositories matching this topic...
Something that could help the documentation would be a glossary that certain terms could link to. A pointer would be a good candidate, or C data structures (struct/union) in general. Also extension type, terms like extern or inline, etc. It wouldn't have to replicate a complete specification o
Problem:
catboost version: 0.23.2
Operating System: all
Tutorial: https://github.com/catboost/tutorials/blob/master/custom_loss/custom_metric_tutorial.md
Impossible to use custom metric (C++).
Code example
from catboost import CatBoost
train_data = [[1, 4, 5, 6],
Hello. I would like to suggest an additional feature for development.
On the Sessions tab, please add a function for saving frequently used search queries.
In some cases, the default implementations are very inefficient (e.g. Map.replaceAll and forEach fetch all entries and iterate over them locally). This was improved on the member side, where the cluster version is available, and in some cases we opted for using entry processors instead.
On the client side, the cluster version is not available, which means it still ends up using the default versi
PrestoDB (https://prestodb.io) is widely used as a SQL frontend for many different data sources, including Elasticsearch and even files in S3. It would be very nice if a connector were available for Vespa.
Hi, if my Spark app uses two storage types, both S3 and Azure Data Lake Store Gen2, could I set spark.delta.logStore.class=org.apache.spark.sql.delta.storage.AzureLogStore, org.apache.spark.sql.delta.storage.S3SingleDriverLogStore?
Thanks in advance
Currently, insert and query share the same resource (the Max Process Count control). When queries run at high TPS, inserts fail with "error: too many process". I think separating the resources for insert and query makes sense, to ensure enough resources for inserts. It would work like YARN, where insert and query use different resource quotas.
Or, as a simpler way, can we set a ratio for Insert and