Learning optimal policies in MDPs with value function discovery

M. Onderwater, S. Bhulai, R.D. van der Mei

Research output: Contribution to JournalArticleAcademicpeer-review

Original languageEnglish
Pages (from-to)7-9
JournalACM Sigmetrics Performance Evaluation Review
Volume43
Issue number2
DOIs
Publication statusPublished - 2015

Cite this