Updated Aug 29, 2020 - Go
hdfs
Here are 559 public repositories matching this topic...
Currently, when calling pytest, we use -m "not udf" to skip backends with missing or broken udf support. For all other features, we've been using the decorator `@pytest.mark.skip_backends([SQLite], reason='not supported')` instead.
It would be good to use the decorator to skip udf tests for backends that don't implement them. So, calling the tests for a backend skips them automatically if the back
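A marker-driven skip of this kind could be wired up in a conftest.py roughly as follows. This is only a sketch under assumptions, not the project's actual implementation: the `--backend` option, the `skip_reason` helper, and the use of plain backend-name strings instead of backend classes are all hypothetical.

```python
# conftest.py (sketch; option name and helper are hypothetical)


def skip_reason(backend, markers):
    """Return the skip reason if `backend` is listed in a skip_backends marker, else None."""
    for marker in markers:
        # First positional argument of the marker is the list of backends to skip.
        skipped = marker.args[0] if marker.args else []
        if backend in skipped:
            return marker.kwargs.get("reason", "backend skipped")
    return None


def pytest_addoption(parser):
    # Hypothetical option naming the backend under test, e.g. --backend=sqlite.
    parser.addoption("--backend", default=None, help="backend under test")


def pytest_configure(config):
    # Register the marker so pytest does not warn about an unknown mark.
    config.addinivalue_line(
        "markers", "skip_backends(names, reason): skip the test for the listed backends"
    )


def pytest_runtest_setup(item):
    import pytest  # local import keeps skip_reason importable without pytest

    backend = item.config.getoption("--backend")
    reason = skip_reason(backend, item.iter_markers(name="skip_backends"))
    if reason is not None:
        pytest.skip(reason)
```

With this in place, a udf test decorated with `@pytest.mark.skip_backends(["sqlite"], reason="udf not supported")` would be skipped when the suite runs with `--backend=sqlite`, without needing -m "not udf".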
Updated Aug 28, 2020 - C++
Similar to how Unix ls works, the parameter could be -t
Updated Aug 18, 2020 - Java
Updated Feb 18, 2020 - JavaScript
Updated Aug 25, 2020 - Python
Updated Aug 19, 2020 - Java
Migrate to goavro v2
Updated Aug 24, 2020 - Java
Updated Apr 24, 2018 - Scala
Updated Jul 7, 2020 - Jupyter Notebook
Updated Aug 6, 2019 - Java
Updated Aug 30, 2019 - Java
Given the new key-value store event stream, it'd be nice to have something like:
$ skein kv events <application id> [options...]
where the process blocks, and logs the event stream to the console until interrupted. This would be useful for debugging, as well as demos.
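The blocking behaviour described here could look roughly like the loop below. It is a minimal sketch that assumes the event stream is exposed as some Python iterable; the `log_events` name and the `events` argument are hypothetical and do not reflect skein's actual API.

```python
import sys


def log_events(events, out=sys.stdout):
    """Write each event to `out` until the stream ends or Ctrl-C is pressed.

    `events` is any iterable of printable event objects (hypothetical).
    Returns the number of events that were logged.
    """
    count = 0
    try:
        for event in events:  # blocks while waiting for the next event
            out.write(f"{event}\n")
            out.flush()  # show events immediately, useful for demos
            count += 1
    except KeyboardInterrupt:
        # Interrupting the process is the normal way to stop the command.
        pass
    return count
```

Returning the count rather than exiting keeps the loop easy to test; a CLI entry point could call it and then exit with status 0.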
Assuming a valid hive-site.xml, it will be possible to determine the active hive warehouse HDFS directory and HiveServer2 and Metastore URIs.
From there we should be able to perform a directory analysis on the hive warehouse parent directory and then all HDFS locations that represent tables / partitions.
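As an illustration of the first step, the relevant settings could be read from hive-site.xml along these lines. This is a sketch, assuming the standard Hadoop-style `<configuration>/<property>` layout and the usual Hive property names; the `read_hive_site` helper is hypothetical, and the directory analysis itself is not shown.

```python
import xml.etree.ElementTree as ET

# Standard Hive property names for the warehouse directory, Metastore, and HiveServer2.
WANTED = (
    "hive.metastore.warehouse.dir",
    "hive.metastore.uris",
    "hive.server2.thrift.bind.host",
    "hive.server2.thrift.port",
)


def read_hive_site(xml_text):
    """Return a dict of the WANTED properties (None when a property is absent)."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return {key: props.get(key) for key in WANTED}
```

Note that hive.metastore.uris may hold a comma-separated list of URIs; the sketch returns it as a single raw string and leaves splitting to the caller.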
Updated Aug 27, 2020 - Java
Updated Jun 18, 2020 - Java
Another feature that I would like to have in smart_open is support for direct reading/writing of the data to/from ftp.
Anyone else interested in this? Does this make sense?