Add Ex 7.2 #77

JChunX · 2021-01-10T23:22:10Z

This is my attempt at Exercise 7.2. For the experiment, I used the Markov reward process from Example 7.1 to compare the RMS errors of the original n-step TD method and the sum-of-TD-errors method.
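For readers who have not opened the diff, here is a minimal sketch of how such a comparison could be set up. It is not the code in this PR. It assumes a 19-state random walk with rewards of -1 on the left exit and +1 on the right exit, `gamma = 1`, and hypothetical helper names `run_episode` and `rms_error`. The only difference between the two variants is the error term: the standard method uses `G - V[S_tau]` with the current estimates, while the alternative sums TD errors that were stored at the time each transition was observed.

```python
import numpy as np

def run_episode(V, n, alpha, rng, use_td_error_sum, n_states=19, gamma=1.0):
    """Run one episode of an assumed random walk and update V in place.

    States 1..n_states are nonterminal; 0 and n_states + 1 are terminal
    and their entries in V are never updated (they stay at 0).
    """
    states = [(n_states + 1) // 2]   # start in the middle state
    rewards = [0.0]                  # rewards[i] is R_i; index 0 is unused
    deltas = []                      # TD errors, frozen when first computed
    T = float("inf")
    t = 0
    while True:
        if t < T:
            s = states[t]
            s_next = s + (1 if rng.random() < 0.5 else -1)
            if s_next == n_states + 1:
                r = 1.0
            elif s_next == 0:
                r = -1.0
            else:
                r = 0.0
            states.append(s_next)
            rewards.append(r)
            # delta_t uses the value estimates as they are at time t.
            deltas.append(r + gamma * V[s_next] - V[s])
            if s_next in (0, n_states + 1):
                T = t + 1
        tau = t - n + 1              # time whose state estimate is updated
        if tau >= 0:
            end = int(min(tau + n, T))
            if use_td_error_sum:
                # Variant: discounted sum of the stored TD errors.
                err = sum(gamma ** (k - tau) * deltas[k]
                          for k in range(tau, end))
            else:
                # Standard n-step TD: n-step return minus current estimate.
                G = sum(gamma ** (i - tau - 1) * rewards[i]
                        for i in range(tau + 1, end + 1))
                if tau + n < T:
                    G += gamma ** n * V[states[tau + n]]
                err = G - V[states[tau]]
            V[states[tau]] += alpha * err
        if tau == T - 1:
            break
        t += 1

def rms_error(n, alpha, use_td_error_sum, episodes=10, runs=20,
              n_states=19, seed=0):
    """Average RMS error over states, episodes, and independent runs."""
    # True values for this walk: linear from -1 to +1 across the states.
    true_v = 2 * np.arange(1, n_states + 1) / (n_states + 1) - 1
    rng = np.random.default_rng(seed)
    total = 0.0
    for _ in range(runs):
        V = np.zeros(n_states + 2)
        for _ in range(episodes):
            run_episode(V, n, alpha, rng, use_td_error_sum, n_states)
            total += np.sqrt(np.mean((V[1:-1] - true_v) ** 2))
    return total / (runs * episodes)
```

Calling `rms_error(n, alpha, False)` and `rms_error(n, alpha, True)` over a grid of `n` and `alpha` values with the same seed gives paired numbers that can be plotted against each other. For `n = 1` the two variants coincide, which is a useful sanity check before comparing larger `n`.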

Add Ex 7.2

d477e56
