#deep-q-network

Here are 238 public repositories matching this topic...

fredcallaway commented Jun 29, 2017

I was surprised to see this loss function because it is generally used when the target is a distribution (i.e., sums to 1). That is not the case for the advantage estimate. However, I worked out the math, and it does appear to be doing the right thing, which is neat!

I think this trick should be mentioned in the code.
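
The comment above does not name the loss or show the code, so the following is only a sketch of one plausible reading: it assumes the loss is categorical cross-entropy and that the "target" is a one-hot action vector scaled by the advantage. Under those assumptions, the cross-entropy reduces to the usual policy-gradient surrogate, even though the target does not sum to 1. All names here (`categorical_cross_entropy`, `policy_gradient_loss`, and the sample values) are hypothetical and are not taken from the repository.

```python
import numpy as np

def categorical_cross_entropy(target, probs, eps=1e-12):
    # Standard form: -sum_i target_i * log(probs_i), one value per row.
    return -np.sum(target * np.log(probs + eps), axis=-1)

def policy_gradient_loss(actions, advantages, probs, eps=1e-12):
    # Policy-gradient surrogate: -advantage * log pi(action | state), per row.
    chosen = probs[np.arange(len(actions)), actions]
    return -advantages * np.log(chosen + eps)

# Made-up policy outputs for two states and three actions.
probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.6, 0.3]])
actions = np.array([0, 2])          # actions actually taken
advantages = np.array([1.5, -0.4])  # advantage estimates (assumed values)

# Assumed "target": one-hot action scaled by the advantage.
# Its rows sum to the advantage, not to 1.
target = np.eye(probs.shape[1])[actions] * advantages[:, None]

ce = categorical_cross_entropy(target, probs)
pg = policy_gradient_loss(actions, advantages, probs)

# Only the taken action has a nonzero target entry, so both losses agree.
print(np.allclose(ce, pg))
```

Because every entry of the target except the taken action is zero, the sum inside the cross-entropy collapses to a single term, which is why the two quantities match.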

The purpose of this repository is to build prototypes as case studies for the proof-of-concept (PoC) and research-and-development (R&D) work that I have written about on my website. The main research topics are auto-encoders in relation to representation learning, statistical machine learning for energy-based models, generative adversarial networks (GANs), deep reinforcement learning such as Deep Q-Networks, semi-supervised learning, and neural network language models for natural language processing.

  • Updated Jul 11, 2020
  • Python
