big-data
Here are 2,155 public repositories matching this topic...
The latest copy of the CPython grammar tests in test_grammar.py has several @skips and FIXMEs. Some of them seem easy to fix, e.g. some parser bugs or missing warnings that would be helpful, others are entire features. We should fix the easy ones and make sure there are tickets for the rest.
Problem:
catboost version: 0.23.2
Operating System: all
Tutorial: https://github.com/catboost/tutorials/blob/master/custom_loss/custom_metric_tutorial.md
It is impossible to use a custom metric (C++).
Code example
from catboost import CatBoost
train_data = [[1, 4, 5, 6],
Would you like to add more error handling for return values from functions like the following?
- malloc ⇒ moloch_trie_init
- [MOLOCH_LOCK_INIT](https://github.com/aol/moloch/blob/664ffe25810380f12823941c210
Today IMap.values() and IMap.values(Predicate) calls are blocking.
I would like to use IMap.values(Predicate) in a Jet Pipeline. This is possible, but I need to declare the stage as nonCooperative, which will have an impact on the pipeline's scalability.
Would it be possible to have an async (non-blocking) version of these calls?
Thank you very much for all the hard work!
PrestoDB (https://prestodb.io) is widely used as a SQL frontend for many different data sources, including ElasticSearch and even files in S3. It would be very nice if a connector were available for Vespa.
Series.reindex
Implement Series.reindex.
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.reindex.html
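For context, the behaviour being requested can be illustrated with a minimal pure-Python sketch. It is not the pandas implementation and not the project's intended design: `reindex_sketch` is a hypothetical helper, and a plain dict stands in for a Series. It only shows the core idea of conforming data to a new index, filling labels that had no value with NaN (or a caller-supplied fill value).

```python
import math
from typing import Any, Dict, Hashable, Iterable, List, Tuple


def reindex_sketch(
    data: Dict[Hashable, Any],
    new_index: Iterable[Hashable],
    fill_value: Any = math.nan,
) -> List[Tuple[Hashable, Any]]:
    """Conform `data` to `new_index` (hypothetical helper, not a pandas API).

    Labels present in `data` keep their values; labels that are missing
    receive `fill_value`. Labels in `data` but not in `new_index` are dropped.
    """
    return [(label, data.get(label, fill_value)) for label in new_index]


# A dict standing in for a Series indexed by "a", "b", "c".
series = {"a": 1.0, "b": 2.0, "c": 3.0}

# Reorder, drop "b", and introduce the unseen label "z".
result = reindex_sketch(series, ["c", "a", "z"])

# Same call, but with an explicit fill value instead of NaN.
filled = reindex_sketch(series, ["c", "a", "z"], fill_value=0.0)
```

The result is returned as a list of (label, value) pairs so that the order of the new index is preserved explicitly.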

Currently, insert and query share the same resource (the Max Process Count control). When queries run at high TPS, inserts fail with an error ("error: too many process"). I think separating the resources for insert and query makes sense, to ensure enough resources for insert. It looks like with Yarn, insert and query use different resource quotas.
Or, as a simpler approach, can we set a ratio for Insert and