Convergence of Model-Based Temporal Difference Learning for Control

H.P. van Hasselt, M.A. Wiering

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Original languageUndefined/Unknown
Title of host publicationProceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL)
Pages60-67
Number of pages8
Publication statusPublished - 2007

Cite this