Two Novel On-policy Reinforcement Learning Algorithms based on TD(lambda)-methods

M.A. Wiering, H.P. van Hasselt

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Original languageUndefined/Unknown
Title of host publicationProceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL)
Pages280-287
Number of pages8
Publication statusPublished - 2007

Cite this