Giung Nam, 2026-03-30
Controlled diffusion process
CDP
A controlled diffusion process is defined as the solution to the SDE:
with the reference drift , the diffusion coefficient , and the control drift . The corresponding reverse-time process satisfies:
where is the marginal density of at time , i.e., . FPE provides a forward-time evolution of :
KLD
Let and be path measures on the space of continuous paths , induced by SDEs with the same initial law, the same reference drift , and the same diffusion coefficient , but governed by different controls and . Assuming absolute continuity , the KLD of with respect to is given by:
Stochastic Optimal Control
SOC
The SOC formulation aims to determine the optimal control that minimally perturbs the reference dynamics starting from while minimizing the cost:
where is the running cost, and is the terminal cost. We define the value function as the optimal cost-to-go from any fixed point :
which solves the HJB equation and satisfies the backward-time evolution in the SDE:
Here, the optimal control is written in terms of the value function:
Feynman-Kac-formulated SOC
Introducing the desirability function converts the non-linear HJB equation into the linear PDE for :
which, via FKF, yields
Here, the optimal control is written in terms of the desirability function:
Lagrangian mechanics
The Lagrangian, action, generalized momentum, and generalized force are:
The Hamiltonian and canonical equation are:
The HJB equation can be rewritten in terms of the Hamiltonian:
Option pricing
Substituting , , , , yields:
which is formally equivalent to the Black-Scholes equation for an underlying asset price , a call option price , a volatility , and a risk-free interest rate .
Dynamic Schrodinger Bridge
DSB
The DSB formulation aims to determine the optimal pair of control and density that minimally perturbs the reference dynamics subject to initial and terminal marginal constraints:
We reframe this as an unconstrained optimization problem by introducing a Lagrangian with a Lagrange multiplier :
which yields the minimizer as the solution to the HJB-FP system:
Here, the optimal control is written in terms of the Lagrange multiplier:
Hopf-Cole-transformed DSB
Applying the change of variables , defined as and , transforms the non-linear HJB-FP system into the linear system for a Schrodinger potential :
where the solution is given by:
Here, the optimal control is written in the Schrodinger potential:
SOC-formulated DSB
The DSB formulation can be interpreted as a SOC formulation with the following costs:
Substituting these yields the SOC formulation equivalent to the DSB formulation:
DOT-formulated DSB
The DSB formulation can be interpreted as a DOT formulation with the following velocity field:
Reparameterizing the objective yields the DOT formulation equivalent to the DSB formulation:
DDPM as DSB
The standard continuous diffusion involving a Gaussian forward transition kernel:
can be described by defining the reference dynamics as:
By designing such that approximately reaches starting from , the system can be interpreted as a DSB problem where the reference process is pre-aligned with the target boundary conditions. Thus, the optimal controls vanishes:
FM as DSB
The standard flow matching involving a Dirac delta transition kernel:
can be described by defining the reference dynamics as:
By taking , the DSB problem converges to a classical DOT problem, where the underlying stochastic particle dynamics collapse into purely deterministic trajectories. For a specific pair , the conditional velocity field is simply the straight-line displacement: . With an independent coupling, i.e., , the PF-ODE is given by:
Note that we have the following forward transition when :
Controlled diffusion process
CDP
A controlled diffusion process $(\bm{X}{t}^{\bm{u}} \in \mathbb{R}^{d}){t \in [0,T]}$ is defined as the solution to the SDE: $$ \begin{align} \mathrm{d}\bm{X}{t}^{\bm{u}} &= \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma_{t}\bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t}\mathrm{d}\bm{B}{t}, \end{align} $$ with the reference drift $\bm{f} : \mathbb{R}^{d} \times [0, T] \rightarrow \mathbb{R}^{d}$, the diffusion coefficient $\sigma{t} : [0, T] \rightarrow \mathbb{R}{\geq 0}$, and the control drift $\bm{u} : \mathbb{R}^{d} \times [0, T] \rightarrow \mathbb{R}^{d}$. The corresponding reverse-time process $( \bar{\bm{X}}{\bar{t}}^{\bm{u}} = \bm{X}{T - \bar{t}}^{\bm{u}} ){\bar{t} \in [0,T]}$ satisfies: $$ \begin{align}\textstyle \mathrm{d}\bar{\bm{X}}{\bar{t}}^{\bm{u}} &= \left[ - \bm{f}(\bar{\bm{X}}{\bar{t}}^{\bm{u}}, T - \bar{t}) - \sigma_{T - \bar{t}}\bm{u}(\bar{\bm{X}}{\bar{t}}^{\bm{u}}, T - \bar{t}) + \sigma{T - \bar{t}}^{2}\nabla\log{p_{T - \bar{t}}^{\bm{u}}(\bar{\bm{X}}{\bar{t}}^{\bm{u}})} \right] \mathrm{d}\bar{t} + \sigma{T - \bar{t}}\mathrm{d}\bar{\bm{B}}{\bar{t}}, \end{align} $$ where $p{t}^{\bm{u}}$ is the marginal density of $\bm{X}{t}^{\bm{u}}$ at time $t$, i.e., $\bm{X}{t}^{\bm{u}} \sim p_{t}^{\bm{u}}$. FPE provides a forward-time evolution of $p_{t}^{\bm{u}}$: $$ \begin{align}\textstyle \partial_{t} p_{t}^{\bm{u}}(\bm{x}) = -\nabla \cdot \left{ \left[ \bm{f}(\bm{x}, t) + \sigma_{t} \bm{u}(\bm{x}, t) \right] p_{t}^{\bm{u}}(\bm{x}) \right} + \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\bm{u}}(\bm{x}). \end{align} $$
KLD
Let $\mathbb{P}^{\bm{u}}$ and $\mathbb{P}^{\tilde{\bm{u}}}$ be path measures on the space of continuous paths $C([0,T];\mathbb{R}^{d})$, induced by SDEs with the same initial law, the same reference drift $\bm{f}$, and the same diffusion coefficient $\sigma_{t}$, but governed by different controls $\bm{u}$ and $\tilde{\bm{u}}$. Assuming absolute continuity $\mathbb{P}^{\bm{u}} \ll \mathbb{P}^{\tilde{\bm{u}}}$, the KLD of $\mathbb{P}^{\bm{u}}$ with respect to $\mathbb{P}^{\tilde{\bm{u}}}$ is given by: $$ \begin{align} D_{\text{KL}} \left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\tilde{\bm{u}}} \right) &\textstyle = \mathbb{E}{\bm{X}{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \frac{1}{2} \int_{0}^{T} \left\lVert \bm{u}(\bm{X}{t}^{\bm{u}}, t) - \tilde{\bm{u}}(\bm{X}{t}^{\bm{u}}, t) \right\rVert^{2} \mathrm{d}t \right], \ &\textstyle = \int_{0}^{T} \int_{\mathbb{R}^{d}} \left[ \frac{1}{2} \left\lVert \bm{u}(\bm{x}, t) - \tilde{\bm{u}}(\bm{x}, t) \right\rVert^{2} p_{t}^{\bm{u}}(\bm{x}) \right] \mathrm{d}\bm{x} \mathrm{d}t. \end{align} $$
Stochastic Optimal Control
SOC
The SOC formulation aims to determine the optimal control $\bm{u}^{\ast}$ that minimally perturbs the reference dynamics starting from $\pi_{0}$ while minimizing the cost: $$ \begin{align} \inf_{\bm{u}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right) + \mathbb{E}{\bm{X}{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{0}^{T} c(\bm{X}{t}^{\bm{u}}, t) \mathrm{d}t + \Phi(\bm{X}{T}^{\bm{u}}) \right], \ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t} \mathrm{d} \bm{B}{t}, \bm{X}{0}^{\bm{u}} \sim \pi_{0}, \end{align} $$ where $c : \mathbb{R}^{d} \times [0,T] \rightarrow \mathbb{R}$ is the running cost, and $\Phi : \mathbb{R}^{d} \rightarrow \mathbb{R}$ is the terminal cost. We define the value function $V_{t} : \mathbb{R}^{d} \rightarrow \mathbb{R}$ as the optimal cost-to-go from any fixed point $(\bm{x}, t)$: $$ \begin{align} V_{t}(\bm{x}) & \textstyle = \inf_{\bm{u}} \mathbb{E}{\bm{X}{t:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{t}^{T} \left( \frac{1}{2}\left\lVert \bm{u}(\bm{X}{s}^{\bm{u}}, s) \right\rVert^{2} + c(\bm{X}{s}^{\bm{u}}, s) \right) \mathrm{d}s + \Phi(\bm{X}{T}^{\bm{u}}) \mid \bm{X}{t}^{\bm{u}} = \bm{x} \right] \ & \textstyle = \inf_{\bm{u}} \mathbb{E}{\bm{X}{t:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{t}^{\tau} \left( \frac{1}{2}\left\lVert \bm{u}(\bm{X}{s}^{\bm{u}}, s) \right\rVert^{2} + c(\bm{X}{s}^{\bm{u}}, s) \right) \mathrm{d}s + V_{\tau}(\bm{X}{\tau}^{\bm{u}}) \mid \bm{X}{t}^{\bm{u}} = \bm{x} \right], \end{align} $$ which solves the HJB equation and satisfies the backward-time evolution in the SDE: $$ \begin{align}\textstyle \partial_{t} V_{t} & \textstyle = - \left[ \langle \bm{f}, \nabla V_{t} \rangle + \frac{\sigma_{t}^{2}}{2} \Delta V_{t} \right] + \frac{\sigma_{t}^{2}}{2} \left\lVert \nabla V_{t} \right\rVert^{2} - c, \quad\text{s.t.}\quad V_{T} = \Phi, \ \mathrm{d}V_{t} & \textstyle = \left( \frac{\sigma_{t}^{2}}{2} \left\lVert \nabla V_{t} \right\rVert^{2} - c + \langle \sigma_{t} \bm{u}, \nabla V_{t} \rangle \right) \mathrm{d}t + \nabla V_{t}^\top \sigma_{t} \mathrm{d}\bm{B}{t}. \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in terms of the value function: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = - \sigma{t} \nabla V_{t}(\bm{x}). \end{align} $$
Feynman-Kac-formulated SOC
Introducing the desirability function $\phi_{t} = e^{-V_{t}}$ converts the non-linear HJB equation into the linear PDE for $\phi_{t}$: $$ \begin{align}\textstyle \partial_{t} \phi_{t} = - \left[ \langle \bm{f}, \nabla \phi_{t} \rangle + \frac{\sigma_{t}^{2}}{2} \Delta \phi_{t} \right] + c \phi_{t}, \quad\text{s.t.}\quad \phi_{T} = \exp{\left(-\Phi\right)}, \end{align} $$ which, via FKF, yields $$ \begin{align}\textstyle \phi_{t}(\bm{x}) = \mathbb{E}{\bm{X}{0:T} \sim \mathbb{P}^{\bm{0}}} \left[ \exp{\left( -\int_{t}^{T} c(\bm{X}{s}, s) \mathrm{d}s - \Phi(\bm{X}{T}) \right)} \mid \bm{X}{t} = \bm{x} \right]. \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in terms of the desirability function: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma{t} \nabla \log{\phi_{t}(\bm{x})}. \end{align} $$
Lagrangian mechanics
The Lagrangian, action, generalized momentum, and generalized force are: $$ \begin{align} L(\bm{x}, \dot{\bm{x}}, t) & \textstyle = \frac{1}{2} \left\lVert \frac{\dot{\bm{x}} - \bm{f}(\bm{x}, t)}{\sigma_{t}} \right\rVert^{2} + c(\bm{x}, t), \ S[\bm{x}] & \textstyle = \int_{0}^{T} L(\bm{x}, \dot{\bm{x}}, t) \mathrm{d}t + \Phi(\bm{x}{T}), \ \bm{p} & \textstyle = \frac{\partial L}{\partial\dot{\bm{x}}} = \frac{\bm{u}}{\sigma{t}}, \ \dot{\bm{p}} & \textstyle = \frac{\partial L}{\partial\bm{x}} = -(\nabla \bm{f})^{\top} \bm{p} + \nabla c. \end{align} $$ The Hamiltonian and canonical equation are: $$ \begin{align} H(\bm{x}, \bm{p}, t) & \textstyle = \langle \bm{p}, \dot{\bm{x}} \rangle - L(\bm{x}, \dot{\bm{x}}, t) \ & \textstyle = \langle \bm{p}, \bm{f}(\bm{x}, t) \rangle + \frac{\sigma_{t}^{2}}{2} \left\lVert \bm{p} \right\rVert^{2} - c(\bm{x}, t), \ \dot{\bm{x}} & \textstyle = \frac{\partial H}{\partial \bm{p}} = \bm{f}(\bm{x}, t) + \sigma_{t}^{2} \bm{p}, \ \dot{\bm{p}} & \textstyle = -\frac{\partial H}{\partial \bm{x}} = -(\nabla \bm{f})^{\top} \bm{p} + \nabla c. \end{align} $$ The HJB equation can be rewritten in terms of the Hamiltonian: $$ \begin{align}\textstyle \partial_{t} V_{t} + \frac{\sigma_{t}^{2}}{2} \Delta V_{t} = H(\bm{x}, -\nabla V_{t}, t). \end{align} $$
Option pricing
Substituting $x = \log{S}$, $\phi_{t}(x) = C(e^{x}, t)$, $\sigma_{t} = \sigma$, $c(x, t) = r$, $\bm{f}(x, t) = r - \frac{1}{2}\sigma^{2}$ yields: $$ \begin{align}\textstyle \partial_{t} C + r S \partial_{S} C + \frac{1}{2} \sigma^{2} S^{2} \partial_{SS} C - rC = 0, \end{align} $$ which is formally equivalent to the Black-Scholes equation for an underlying asset price $S(t)$, a call option price $C(S, t)$, a volatility $\sigma$, and a risk-free interest rate $r$.
Dynamic Schrodinger Bridge
DSB
The DSB formulation aims to determine the optimal pair of control and density that minimally perturbs the reference dynamics subject to initial and terminal marginal constraints: $$ \begin{align} \inf_{\bm{u}, p_{t}^{\bm{u}}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right), \ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t} \mathrm{d} \bm{B}{t}, \bm{X}{0}^{\bm{u}} \sim \pi_{0}, \bm{X}{T}^{\bm{u}} \sim \pi{T}. \end{align} $$ We reframe this as an unconstrained optimization problem by introducing a Lagrangian with a Lagrange multiplier $\psi_{t} : \mathbb{R}^{d} \rightarrow \mathbb{R}$: $$ \begin{align}\textstyle \mathcal{L}(p_{t}^{\bm{u}}, \bm{u}, \psi_{t}) = \int_{0}^{T} \int_{\mathbb{R}^{d}} \left{ \frac{1}{2} \left\lVert \bm{u} \right\rVert^{2} p_{t}^{\bm{u}} + \psi_{t} \cdot \left[ \partial_{t}p_{t}^{\bm{u}} + \nabla \cdot \left[ \left( \bm{f} + \sigma_{t}\bm{u} \right) p_{t}^{\bm{u}} \right] - \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\bm{u}} \right] \right} \mathrm{d}\bm{x} \mathrm{d}t, \end{align} $$ which yields the minimizer $(\bm{u}^{\ast}, p_{t}^{\ast})$ as the solution to the HJB-FP system: $$ \begin{align} \begin{cases} \partial_{t}\psi_{t} = -\frac{\sigma_{t}^{2}}{2} \left\lVert \nabla \psi_{t} \right\rVert^{2} - \langle \nabla \psi_{t}, \bm{f} \rangle - \frac{\sigma_{t}^{2}}{2} \Delta \psi_{t}, \ \partial_{t}p_{t}^{\ast} = - \nabla \cdot \left[ \left( \bm{f} + \sigma_{t}^{2}\nabla\psi_{t} \right) p_{t}^{\ast} \right] + \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\ast}, \end{cases} \quad \text{s.t.} \quad \begin{cases} p_{0}^{\ast} = \pi_{0}, \ p_{T}^{\ast} = \pi_{T}. \end{cases} \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in terms of the Lagrange multiplier: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \psi_{t}(\bm{x}). \end{align} $$
Hopf-Cole-transformed DSB
Applying the change of variables $(\psi_{t}, p_{t}^{\ast}) \mapsto (\phi_{t}, \hat{\phi}{t})$, defined as $\psi{t} = \log{\phi_{t}}$ and $p_{t}^{\ast} = \phi_{t}\hat{\phi}{t}$, transforms the non-linear HJB-FP system into the linear system for a Schrodinger potential $(\phi{t}, \hat{\phi}{t})$: $$ \begin{align} \begin{cases} \partial{t}\phi_{t} = -\langle \nabla \phi_{t}, \bm{f} \rangle - \frac{\sigma_{t}^{2}}{2} \Delta \phi_{t}, \ \partial_{t}\hat{\phi}{t} = - \nabla \cdot (\hat{\phi}{t} \bm{f}) + \frac{\sigma_{t}^{2}}{2} \Delta \hat{\phi}{t}, \end{cases} \quad \text{s.t.} \quad \begin{cases} p{0}^{\ast} = \phi_{0}\hat{\phi}{0}, \ p{T}^{\ast} = \phi_{T} \hat{\phi}{T}, \end{cases} \end{align} $$ where the solution is given by: $$ \begin{align} \begin{cases} \phi{t}(\bm{x}) = \int_{\mathbb{R}^{d}} \mathbb{P}{T \mid t}^{\bm{0}}(\bm{y} \mid \bm{x}) \phi{T}(\bm{y}) \mathrm{d}\bm{y}, \ \hat{\phi}{t}(\bm{x}) = \int{\mathbb{R}^{d}} \mathbb{P}{t \mid 0}^{\bm{0}}(\bm{x} \mid \bm{y}) \hat{\phi}{0}(\bm{y}) \mathrm{d}\bm{y}, \end{cases} \quad \text{s.t.} \quad \begin{cases} \pi_{0}(\bm{x}) = \phi_{0}(\bm{x}) \hat{\phi}{0}(\bm{x}), \ \pi{T}(\bm{x}) = \phi_{T}(\bm{x}) \hat{\phi}{T}(\bm{x}). \end{cases} \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in the Schrodinger potential: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma{t} \nabla \log{\phi_{t}(\bm{x})}. \end{align} $$
SOC-formulated DSB
The DSB formulation can be interpreted as a SOC formulation with the following costs: $$ \begin{align}\textstyle c(\bm{x},t)=0, \quad \Phi(\bm{x}) = \log{\frac{\hat{\phi}{T}(\bm{x})}{\pi{T}(\bm{x})}}. \end{align} $$ Substituting these yields the SOC formulation equivalent to the DSB formulation: $$ \begin{align} \inf_{\bm{u}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right) + \mathbb{E}{\bm{X}{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \log{\frac{\hat{\phi}{T}(\bm{X}{T}^{\bm{u}})}{\pi_{T}(\bm{X}{T}^{\bm{u}})}} \right], \ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma{t} \bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t} \mathrm{d} \bm{B}{t}, \bm{X}{0}^{\bm{u}} \sim \pi_{0}. \end{align} $$
DOT-formulated DSB
The DSB formulation can be interpreted as a DOT formulation with the following velocity field: $$ \begin{align}\textstyle \bm{v}(\bm{x}, t) = \bm{u}(\bm{x}, t) + \frac{\sigma_{t}}{2} \nabla\log{p_{t}(\bm{x})}. \end{align} $$ Reparameterizing the objective yields the DOT formulation equivalent to the DSB formulation: $$ \begin{align} \inf_{\bm{v}} & \textstyle \quad \int_{0}^{T} \int_{\mathbb{R}^{d}} \left[ \left( \frac{1}{2} \left\lVert \bm{v} \right\rVert^{2} + \frac{\sigma_{t}^{2}}{8} \left\lVert \nabla \log{p_{t}} \right\rVert^{2} - \frac{1}{2} \langle \nabla \log{p_{t}}, \bm{f} \rangle \right) \bm{p}{t}^{\bm{u}} \right] \mathrm{d}\bm{x} \mathrm{d}t, \ \text{s.t.} & \textstyle \quad \partial{t} p_{t}^{\bm{u}} = -\nabla \cdot \left{ \left[ \bm{f} + \sigma_{t} \bm{v} \right] p_{t}^{\bm{u}} \right}, p_{0}^{\bm{u}} = \pi_{0}, p_{T}^{\bm{u}} = \pi_{T}. \end{align} $$
DDPM as DSB
The standard continuous diffusion involving a Gaussian forward transition kernel: $$ \begin{align}\textstyle q(\bm{x}{t} \mid \bm{x}{0}) = \mathcal{N}\left( \bm{x}{t} ; \sqrt{\alpha{t}}\bm{x}{0}, (1 - \alpha{t}) \bm{I} \right), \quad\text{where } \alpha_{t} = \exp{\left( -\int_{0}^{t} \beta_{s} \mathrm{d}s \right)}, \end{align} $$ can be described by defining the reference dynamics $\mathbb{P}^{\bm{0}}$ as: $$ \begin{align}\textstyle \mathrm{d}\bm{X}{t} = \bm{f}(\bm{X}{t}, t)\mathrm{d}t + \sigma_{t}\mathrm{d}\bm{B}{t}, \quad\text{ where } \bm{f}(\bm{x}, t) = -\frac{1}{2}\beta{t}\bm{x}, ; \sigma_{t} = \sqrt{\beta_{t}}. \end{align} $$ By designing $\beta_{t} : [0,T] \rightarrow \mathbb{R}{\geq 0}$ such that $\mathbb{P}^{\bm{0}}$ approximately reaches $\pi{T} = \mathcal{N}(\bm{0},\bm{I})$ starting from $\pi_{0} = p_{\text{data}}$, the system can be interpreted as a DSB problem where the reference process is pre-aligned with the target boundary conditions. Thus, the optimal controls vanishes: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \log{\phi_{t}(\bm{x})} = \sqrt{\beta_{t}} \left( \nabla\log{p_{t}^{\ast}}(\bm{x}) - \nabla\log{\hat{\phi}_{t}}(\bm{x}) \right) = \bm{0}. \end{align} $$
FM as DSB
The standard flow matching involving a Dirac delta transition kernel: $$ \begin{align}\textstyle q(\bm{x}{t} \mid \bm{x}{0}, \bm{x}{T}) = \lim{\sigma \to 0} \mathcal{N}\left( \bm{x}{t} ; \frac{(T-t)\bm{x}{0} + t\bm{x}{T}}{T}, \frac{t(T-t)\sigma^{2}}{T}\bm{I} \right), \end{align} $$ can be described by defining the reference dynamics $\mathbb{P}^{\bm{0}}$ as: $$ \begin{align}\textstyle \mathrm{d}\bm{X}{t} = \bm{f}(\bm{X}{t}, t)\mathrm{d}t + \sigma\mathrm{d}\bm{B}{t}, \quad\text{ where } \bm{f}(\bm{x}, t) = 0, ; \sigma \to 0. \end{align} $$ By taking $\sigma \to 0$, the DSB problem converges to a classical DOT problem, where the underlying stochastic particle dynamics collapse into purely deterministic trajectories. For a specific pair $(\bm{x}{0}, \bm{x}{T}) \sim \pi_{0,T}$, the conditional velocity field is simply the straight-line displacement: $\bm{u}^{\ast}(\bm{x}, t) = \frac{1}{T}(\bm{x}{T} - \bm{x}{0})$. With an independent coupling, i.e., $\pi_{0,T}(\bm{x}{0}, \bm{x}{T}) = \pi_{0}(\bm{x}{0}) \pi{T}(\bm{x}{T})$, the PF-ODE is given by: $$ \begin{align}\textstyle \mathrm{d}\bm{X}{t}^{\ast} = \bm{v}^{\ast}(\bm{X}{t}^{\ast}, t) \mathrm{d}t, \quad \bm{v}^{\ast}(\bm{x}, t) = \mathbb{E}{\pi_{0,T}} \left[ \frac{1}{T} (\bm{x}{T} - \bm{x}{0}) \mid \bm{X}{t}^{\ast}=\bm{x} \right]. \end{align} $$ Note that we have the following forward transition when $\pi{T} = \mathcal{N}(\bm{0}, \bm{I})$: $$ \begin{align}\textstyle q(\bm{x}{t} \mid \bm{x}{0}) = \mathcal{N}\left( \bm{x}{t} ; (1-\frac{t}{T})\bm{x}{0}, (\frac{t}{T})^{2}\bm{I} \right). \end{align} $$