reinforcement-learning
Here are 5,396 public repositories matching this topic...
-
Updated
Sep 29, 2020 - Python
-
Updated
Oct 20, 2020 - C#
-
Updated
Jul 5, 2020 - Python
-
Updated
Oct 2, 2020 - Python
-
Updated
Aug 10, 2020 - Jupyter Notebook
-
Updated
Oct 2, 2020
Vcpkg is a C++ dependency management system that makes installation and consumption as a dependency very easy. We should support this for VW to allow consuming the lib as easy as possible.
Instructions for creating a new package can be found here: https://github.com/microsoft/vcpkg/blob/master/docs/examples/packaging-github-repos.md
-
Updated
Jul 29, 2020 - Python
-
Updated
Oct 19, 2020 - C++
-
Updated
Oct 10, 2020 - Python
-
Updated
Oct 13, 2020 - Python
-
Updated
Oct 9, 2020
-
Updated
Aug 5, 2020 - Python
-
Updated
Oct 14, 2020 - Python
-
Updated
Oct 15, 2020 - Jupyter Notebook
Bidirectional RNN
Is there a way to train a bidirectional RNN (like LSTM or GRU) on trax nowadays?
-
Updated
Sep 14, 2020 - Python
-
Updated
Dec 14, 2019 - Jupyter Notebook
-
Updated
Mar 18, 2020 - JavaScript
-
Updated
Oct 18, 2020 - Jupyter Notebook
-
Updated
Oct 17, 2020
-
Updated
Aug 22, 2020 - Jupyter Notebook
-
Updated
Oct 16, 2020
-
Updated
Jun 21, 2019 - C++
-
Updated
Sep 10, 2020 - Python
-
Updated
Jun 30, 2020 - Jupyter Notebook
How to use Watcher / WatcherClient over tcp/ip network?
Watcher seems to ZMQ server, and WatcherClient is ZMQ Client, but there is no API/Interface to config server IP address.
Do I need to implement a class that inherits from WatcherClient?
-
Updated
Oct 9, 2020
Improve this page
Add a description, image, and links to the reinforcement-learning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the reinforcement-learning topic, visit your repo's landing page and select "manage topics."
What is the problem?
It seems that
ray.util.multiprocessing.Pool.starmapdoes not work with iterable created from zip, unless it's explicitly converted to a list etc. For example, this one works: