big-data
Here are 2,284 public repositories matching this topic...
The latest copy of the CPython grammar tests in test_grammar.py has several @skips and FIXMEs. Some of them seem easy to fix (e.g. parser bugs or missing warnings that would be helpful); others are entire features. We should fix the easy ones and make sure there are tickets for the rest.
Problem:
catboost version: 0.23.2
Operating System: all
Tutorial: https://github.com/catboost/tutorials/blob/master/custom_loss/custom_metric_tutorial.md
It is impossible to use a custom metric (C++).
Code example
from catboost import CatBoost
train_data = [[1, 4, 5, 6],
- There is a lot of white space on either side; can that be reduced?
- Allow all count columns, except perhaps the far-right one, to be hidden, or allow any column to be hidden via a per-column menu.
- It would be nice to be able to resize columns.
- Support adding 1 or 2 more sub-fields (once I hide the columns there will be room! :) )
We have added default serializers to the 4.2 series here:
hazelcast/hazelcast#17934
There is a problem with backward compatibility. If a user had a CustomSerializer for Optional in 4.1, there is no way to use that serializer in 4.2, and Hazelcast will throw java.lang.IllegalArgumentException: [class java.util.Optional] serializer cannot be overridden
Users can basi
PrestoDB (https://prestodb.io) is widely used as a SQL frontend for many different data sources, including Elasticsearch and even files in S3. It would be very nice if a connector were available for Vespa.
Hi, if my Spark app uses two storage types, both S3 and Azure Data Lake Storage Gen2, could I set spark.delta.logStore.class=org.apache.spark.sql.delta.storage.AzureLogStore, org.apache.spark.sql.delta.storage.S3SingleDriverLogStore?
Thanks in advance.

Currently, insert and query share the same resource (the Max Process Count control). When queries run at high TPS, inserts fail with an error ("error: too many process"). I think separating the resources for insert and query would make sense, to ensure enough resources for inserts. It looks like, with Yarn, insert and query use different resource quotas.
Or, as a simpler approach, can we set a ratio for Insert and