Learning optimal policies in MDPs with value function discovery

Research output: Contribution to Journal, Article, Academic, peer-review

Original language: English
Pages (from-to): 7-9
Journal: ACM Sigmetrics Performance Evaluation Review
Volume: 43
Issue number: 2
Publication status: Published - 2015
