awjuliani/rl-tutorial-2.ipynb

Last active May 2, 2018 18:55


Reinforcement Learning Tutorial 2 (Cart Pole problem)

rl-tutorial-2.ipynb

Heeseok commented Nov 17, 2016 • edited

tGrad,bggg = sess.run([newGrads,begining],feed_dict={observations: epx, input_y: epy, advantages: discounted_epr})

The variable begining is not defined.

I changed the code like this:

tGrad = sess.run(newGrads, feed_dict={observations: epx, input_y: epy, advantages: discounted_epr})

And it works well.

LoveDLWujing commented Nov 3, 2017

log(P(y|x)) = (1 - input_y) * log(probability) + input_y * log(1 - probability), and the loss function in the code above happens to give the same result as this maximum-likelihood expression.
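
The equivalence can be checked numerically. The sketch below is only an illustration: the notebook's code is not quoted in this thread, so the compact form in `loglik_compact` is an assumption about how the notebook writes its log-likelihood, and all function names here are hypothetical. It uses plain Python instead of TensorFlow so that it runs on its own, and it only considers input_y equal to 0 or 1.

```python
import math


def loglik_compact(input_y, probability):
    # ASSUMED form of the notebook's log-likelihood (not quoted in this thread).
    # For input_y = 1 the argument reduces to (1 - probability);
    # for input_y = 0 it reduces to probability.
    return math.log(input_y * (input_y - probability)
                    + (1 - input_y) * (input_y + probability))


def loglik_bernoulli(input_y, probability):
    # The expression from the comment above.
    return ((1 - input_y) * math.log(probability)
            + input_y * math.log(1 - probability))


def max_difference(probabilities=(0.1, 0.25, 0.5, 0.9)):
    # Largest absolute gap between the two forms over both labels.
    return max(abs(loglik_compact(y, p) - loglik_bernoulli(y, p))
               for y in (0, 1)
               for p in probabilities)


print(max_difference())
```

If the assumed compact form matches the notebook, the printed gap should be zero up to floating-point error, which is what the comment above claims for labels restricted to 0 and 1.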
