gym

The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:

numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0

Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:

episode rewards may n

🐛 Bug

The documentation of the DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) specifies that the log_interval parameter is "The number of timesteps before logging". However, when it is set to 1 (or any other value), logging does not happen at that pace; it happens every log_interval episodes instead (not timesteps). In the example below, this means every 200 timesteps.
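The difference between the two interpretations of log_interval can be sketched with a toy loop. This does not call stable-baselines3; the function `logging_timesteps` and the fixed episode length of 200 steps are assumptions made only for illustration.

```python
def logging_timesteps(total_timesteps, episode_length, log_interval, per_episode):
    """Return the timesteps at which a log line would be emitted.

    Hypothetical helper: episodes are assumed to last exactly
    `episode_length` steps, which a real environment does not guarantee.
    """
    logged = []
    episodes_done = 0
    for t in range(1, total_timesteps + 1):
        episode_ended = t % episode_length == 0
        if episode_ended:
            episodes_done += 1
        if per_episode:
            # Behaviour reported in the issue: count episodes, not steps.
            if episode_ended and episodes_done % log_interval == 0:
                logged.append(t)
        elif t % log_interval == 0:
            # Behaviour described in the documentation: count timesteps.
            logged.append(t)
    return logged


by_episode = logging_timesteps(1000, 200, 1, per_episode=True)
by_timestep = logging_timesteps(1000, 200, 1, per_episode=False)
print(by_episode)        # [200, 400, 600, 800, 1000]
print(len(by_timestep))  # 1000
```

With log_interval=1 and 200-step episodes, the per-episode reading logs once every 200 timesteps, while the per-timestep reading would log at every step.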

Use case

Get better results for exercises, and especially ingredients, with full text search

Proposal

If using postgres, we should use its full text search capabilities so that we get better results and smooth out typos (search in exercises.api.views and nutrition.api.views). A short check of the connection engine should make it easy to use the current filter if that's not the case. W
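A minimal sketch of the engine check, assuming a Django project. The function name `search_by_name`, the `name` field, and the `icontains` fallback are all hypothetical stand-ins; the project's current filter may look different.

```python
def search_by_name(queryset, term, vendor):
    """Filter `queryset` by `term`, using full text search only on PostgreSQL.

    `vendor` is expected to be the value of django.db.connection.vendor.
    The "name" field is an assumption made for this sketch.
    """
    if vendor == "postgresql":
        # Imported lazily so the fallback branch works on other backends.
        from django.contrib.postgres.search import SearchQuery, SearchVector

        return queryset.annotate(search=SearchVector("name")).filter(
            search=SearchQuery(term)
        )
    # Assumed stand-in for the existing filter on other database engines.
    return queryset.filter(name__icontains=term)
```

In a view this might be called with `connection.vendor` from `django.db`, so that the PostgreSQL branch is taken only when that backend is configured.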

Per this comment in #12

There seem to be some fragile spots in our code that might fail easily. I suggest adding more unit tests for the following:

Custom agents (there's only VPG and PPO on CartPole-v0 as of now. We should preferably add more to cover discrete-offpolicy, continuous-offpolicy and continuous-onpolicy)
Evaluation for the Bandits and Classical agents
Testing of convergence of agents as proposed i
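As an illustration of what a convergence test could look like, here is a toy sketch that does not use the library's agents. The epsilon-greedy routine `run_epsilon_greedy`, the two-armed Bernoulli bandit, and all parameter values are assumptions chosen for this example.

```python
import random


def run_epsilon_greedy(arm_probs, steps, epsilon, seed):
    """Run a toy epsilon-greedy agent on a Bernoulli bandit.

    Returns the estimated value and the pull count of each arm.
    """
    rng = random.Random(seed)  # fixed seed keeps the test deterministic
    counts = [0] * len(arm_probs)
    values = [0.0] * len(arm_probs)
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(len(arm_probs))  # explore
        else:
            arm = max(range(len(arm_probs)), key=lambda a: values[a])  # exploit
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        counts[arm] += 1
        # Incremental mean update of the value estimate.
        values[arm] += (reward - values[arm]) / counts[arm]
    return values, counts


def test_converges_to_best_arm():
    values, counts = run_epsilon_greedy([0.2, 0.8], steps=5000, epsilon=0.1, seed=0)
    # The agent should both prefer and rate the better arm more highly.
    assert counts[1] > counts[0]
    assert values[1] > values[0]
```

A test of this shape checks the outcome of learning rather than only that the training loop runs without errors.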

Here are 698 public repositories matching this topic...

hill-a / stable-baselines

DLR-RM / stable-baselines3

wger-project / wger

werner-duvaud / muzero-general

araffin / rl-baselines-zoo

uvipen / Super-mario-bros-A3C-pytorch

uvipen / Super-mario-bros-PPO-pytorch

deepdrive / deepdrive

vwxyzjn / cleanrl

ZhiqingXiao / rl-book

DLR-RM / rl-baselines3-zoo

araffin / robotics-rl-srl

ritchieng / deep-learning-wizard

germain-hug / Deep-RL-Keras

MorvanZhou / pytorch-A3C

medipixel / rl_algorithms

navneet-nmk / pytorch-rl

SforAiDl / genrl

uvipen / AirGesture

StepNeverStop / RLs

denisyarats / drq

lubusIN / laravel-gymie

sail-sg / envpool

AcutronicRobotics / gym-gazebo2

koulanurag / ma-gym

ikostrikov / jaxrl

LucasAlegre / sumo-rl

denisyarats / pytorch_sac

zfw1226 / gym-unrealcv

mpSchrader / gym-sokoban
