Skip to content

Conversation

@JChunX
Copy link

@JChunX JChunX commented Jan 10, 2021

This is my attempt at ex 7.2. For the experiment, I used the Markov reward process found in example 7.1 to compare RMS errors for the original n-step method and the sum of TD errors method.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant