julbern

julbern OP t1_ivyy0g1 wrote

The result from stochastic optimal control that we use in the paper ("verification theorem") originates mainly from the work of M. Pavon (1989) and P. Dai Pra (1991).

Perhaps it is best to start with the lecture notes of R. Van Handel (2007).

For books on the topic, I can further suggest:

  1. W. H. Fleming and R. W. Rishel (1975)
  2. W. H. Fleming and H. M. Soner (2006)
  3. H. Pham (2009)

Some more recent works in this direction are the following:

  1. B. Tzen and M. Raginsky (2019)
  2. N. Nüsken and L. Richter (2021)
  3. M. Pavon (2022)
3

julbern OP t1_ivy9l7n wrote

The generative process in this paper is given by an ODE and the diffusivity coefficient in the corresponding Fokker-Planck equation is thus zero. In this case, the verification theorem basically reduces to the instantaneous change of variables formula (Chen et al., 2018).

On the other hand, the solution to the Poisson equation (with homogeneous Dirichlet boundary condition) considered in the paper also has a stochastic representation based on an SDE with a corresponding stopping time (leading to "walk-on-spheres" methods). It would be quite interesting to merge these viewpoints.

1