Original language | English |
---|---|
Pages (from-to) | 7-9 |
Journal | ACM SIGMETRICS Performance Evaluation Review |
Volume | 43 |
Issue number | 2 |
Publication status | Published - 2015 |
Learning optimal policies in MDPs with value function discovery
M. Onderwater, S. Bhulai, R.D. van der Mei
Research output: Contribution to Journal › Article › Academic › peer-review