Skip to content
#

deep-q-network

Here are 230 public repositories matching this topic...

fredcallaway
fredcallaway commented Jun 29, 2017

I was surprised to see this loss function because it is generally used when the target is a distribution (i.e. sums to 1). This is not the case for the advantage estimate. However, I worked out the math and it does appear to be doing the right thing which is neat!

I think this trick should be mentioned in the code.

The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) that I have written in my website. Especially, Natural Language Processing, Statistical Machine Learning, and Deep Reinforcement Learning are main topics.

  • Updated Apr 6, 2020
  • Python

Improve this page

Add a description, image, and links to the deep-q-network topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deep-q-network topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.