Computing Optimal Stationary Policies for Multi-objective Markov Decision Processes

M.A. Wiering, E.D. de Jong

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Original languageUndefined/Unknown
Title of host publicationProceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL)
Pages158-165
Number of pages8
Publication statusPublished - 2007

Cite this