Abstract
We show how single-run-based measure-valued differentiation gradient estimators can be obtained. The key idea is to apply a change of measure a posterior to the mathematical analysis of the derivative. From the point of view of the likelihood ratio method, we show that likelihood ratio type gradient estimators can be applied in situations where the mathematical conditions needed for applying a likelihood ratio analysis are not met. © 2004 IEEE.
Original language | English |
---|---|
Pages (from-to) | 1843-1846 |
Number of pages | 4 |
Journal | IEEE Transactions on Automatic Control |
Volume | 49 |
Issue number | 10 |
DOIs | |
Publication status | Published - 2004 |