Notes on Schrodinger bridges
Notes & Scribbles

Giung Nam, 2026-03-30

Controlled diffusion process

CDP

A controlled diffusion process (XtuRd)t[0,T](\bm{X}_{t}^{\bm{u}} \in \mathbb{R}^{d})_{t \in [0,T]} is defined as the solution to the SDE:

dXtu=[f(Xtu,t)+σtu(Xtu,t)]dt+σtdBt,\begin{align} \mathrm{d}\bm{X}_{t}^{\bm{u}} &= \left[ \bm{f}(\bm{X}_{t}^{\bm{u}}, t) + \sigma_{t}\bm{u}(\bm{X}_{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma_{t}\mathrm{d}\bm{B}_{t}, \end{align}

with the reference drift f:Rd×[0,T]Rd\bm{f} : \mathbb{R}^{d} \times [0, T] \rightarrow \mathbb{R}^{d}, the diffusion coefficient σt:[0,T]R0\sigma_{t} : [0, T] \rightarrow \mathbb{R}_{\geq 0}, and the control drift u:Rd×[0,T]Rd\bm{u} : \mathbb{R}^{d} \times [0, T] \rightarrow \mathbb{R}^{d}. The corresponding reverse-time process (Xˉtˉu=XTtˉu)tˉ[0,T]( \bar{\bm{X}}_{\bar{t}}^{\bm{u}} = \bm{X}_{T - \bar{t}}^{\bm{u}} )_{\bar{t} \in [0,T]} satisfies:

dXˉtˉu=[f(Xˉtˉu,Ttˉ)σTtˉu(Xˉtˉu,Ttˉ)+σTtˉ2logpTtˉu(Xˉtˉu)]dtˉ+σTtˉdBˉtˉ,\begin{align}\textstyle \mathrm{d}\bar{\bm{X}}_{\bar{t}}^{\bm{u}} &= \left[ - \bm{f}(\bar{\bm{X}}_{\bar{t}}^{\bm{u}}, T - \bar{t}) - \sigma_{T - \bar{t}}\bm{u}(\bar{\bm{X}}_{\bar{t}}^{\bm{u}}, T - \bar{t}) + \sigma_{T - \bar{t}}^{2}\nabla\log{p_{T - \bar{t}}^{\bm{u}}(\bar{\bm{X}}_{\bar{t}}^{\bm{u}})} \right] \mathrm{d}\bar{t} + \sigma_{T - \bar{t}}\mathrm{d}\bar{\bm{B}}_{\bar{t}}, \end{align}

where ptup_{t}^{\bm{u}} is the marginal density of Xtu\bm{X}_{t}^{\bm{u}} at time tt, i.e., Xtuptu\bm{X}_{t}^{\bm{u}} \sim p_{t}^{\bm{u}}. FPE provides a forward-time evolution of ptup_{t}^{\bm{u}}:

tptu(x)={[f(x,t)+σtu(x,t)]ptu(x)}+σt22Δptu(x).\begin{align}\textstyle \partial_{t} p_{t}^{\bm{u}}(\bm{x}) = -\nabla \cdot \left\{ \left[ \bm{f}(\bm{x}, t) + \sigma_{t} \bm{u}(\bm{x}, t) \right] p_{t}^{\bm{u}}(\bm{x}) \right\} + \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\bm{u}}(\bm{x}). \end{align}

KLD

Let Pu\mathbb{P}^{\bm{u}} and Pu~\mathbb{P}^{\tilde{\bm{u}}} be path measures on the space of continuous paths C([0,T];Rd)C([0,T];\mathbb{R}^{d}), induced by SDEs with the same initial law, the same reference drift f\bm{f}, and the same diffusion coefficient σt\sigma_{t}, but governed by different controls u\bm{u} and u~\tilde{\bm{u}}. Assuming absolute continuity PuPu~\mathbb{P}^{\bm{u}} \ll \mathbb{P}^{\tilde{\bm{u}}}, the KLD of Pu\mathbb{P}^{\bm{u}} with respect to Pu~\mathbb{P}^{\tilde{\bm{u}}} is given by:

DKL(PuPu~)=EX0:TuPu[120Tu(Xtu,t)u~(Xtu,t)2dt],=0TRd[12u(x,t)u~(x,t)2ptu(x)]dxdt.\begin{align} D_{\text{KL}} \left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\tilde{\bm{u}}} \right) &\textstyle = \mathbb{E}_{\bm{X}_{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \frac{1}{2} \int_{0}^{T} \left\lVert \bm{u}(\bm{X}_{t}^{\bm{u}}, t) - \tilde{\bm{u}}(\bm{X}_{t}^{\bm{u}}, t) \right\rVert^{2} \mathrm{d}t \right], \\ &\textstyle = \int_{0}^{T} \int_{\mathbb{R}^{d}} \left[ \frac{1}{2} \left\lVert \bm{u}(\bm{x}, t) - \tilde{\bm{u}}(\bm{x}, t) \right\rVert^{2} p_{t}^{\bm{u}}(\bm{x}) \right] \mathrm{d}\bm{x} \mathrm{d}t. \end{align}

Stochastic Optimal Control

SOC

The SOC formulation aims to determine the optimal control u\bm{u}^{\ast} that minimally perturbs the reference dynamics starting from π0\pi_{0} while minimizing the cost:

infuDKL(PuP0)+EX0:TuPu[0Tc(Xtu,t)dt+Φ(XTu)],s.t.dXtu=[f(Xtu,t)+σtu(Xtu,t)]dt+σtdBt,X0uπ0,\begin{align} \inf_{\bm{u}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right) + \mathbb{E}_{\bm{X}_{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{0}^{T} c(\bm{X}_{t}^{\bm{u}}, t) \mathrm{d}t + \Phi(\bm{X}_{T}^{\bm{u}}) \right], \\ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}_{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}_{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}_{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma_{t} \mathrm{d} \bm{B}_{t}, \bm{X}_{0}^{\bm{u}} \sim \pi_{0}, \end{align}

where c:Rd×[0,T]Rc : \mathbb{R}^{d} \times [0,T] \rightarrow \mathbb{R} is the running cost, and Φ:RdR\Phi : \mathbb{R}^{d} \rightarrow \mathbb{R} is the terminal cost. We define the value function Vt:RdRV_{t} : \mathbb{R}^{d} \rightarrow \mathbb{R} as the optimal cost-to-go from any fixed point (x,t)(\bm{x}, t):

Vt(x)=infuEXt:TuPu[tT(12u(Xsu,s)2+c(Xsu,s))ds+Φ(XTu)Xtu=x]=infuEXt:TuPu[tτ(12u(Xsu,s)2+c(Xsu,s))ds+Vτ(Xτu)Xtu=x],\begin{align} V_{t}(\bm{x}) & \textstyle = \inf_{\bm{u}} \mathbb{E}_{\bm{X}_{t:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{t}^{T} \left( \frac{1}{2}\left\lVert \bm{u}(\bm{X}_{s}^{\bm{u}}, s) \right\rVert^{2} + c(\bm{X}_{s}^{\bm{u}}, s) \right) \mathrm{d}s + \Phi(\bm{X}_{T}^{\bm{u}}) \mid \bm{X}_{t}^{\bm{u}} = \bm{x} \right] \\ & \textstyle = \inf_{\bm{u}} \mathbb{E}_{\bm{X}_{t:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{t}^{\tau} \left( \frac{1}{2}\left\lVert \bm{u}(\bm{X}_{s}^{\bm{u}}, s) \right\rVert^{2} + c(\bm{X}_{s}^{\bm{u}}, s) \right) \mathrm{d}s + V_{\tau}(\bm{X}_{\tau}^{\bm{u}}) \mid \bm{X}_{t}^{\bm{u}} = \bm{x} \right], \end{align}

which solves the HJB equation and satisfies the backward-time evolution in the SDE:

tVt=[f,Vt+σt22ΔVt]+σt22Vt2c,s.t.VT=Φ,dVt=(σt22Vt2c+σtu,Vt)dt+VtσtdBt.\begin{align}\textstyle \partial_{t} V_{t} & \textstyle = - \left[ \langle \bm{f}, \nabla V_{t} \rangle + \frac{\sigma_{t}^{2}}{2} \Delta V_{t} \right] + \frac{\sigma_{t}^{2}}{2} \left\lVert \nabla V_{t} \right\rVert^{2} - c, \quad\text{s.t.}\quad V_{T} = \Phi, \\ \mathrm{d}V_{t} & \textstyle = \left( \frac{\sigma_{t}^{2}}{2} \left\lVert \nabla V_{t} \right\rVert^{2} - c + \langle \sigma_{t} \bm{u}, \nabla V_{t} \rangle \right) \mathrm{d}t + \nabla V_{t}^\top \sigma_{t} \mathrm{d}\bm{B}_{t}. \end{align}

Here, the optimal control u\bm{u}^{\ast} is written in terms of the value function:

u(x,t)=σtVt(x).\begin{align} \bm{u}^{\ast}(\bm{x}, t) = - \sigma_{t} \nabla V_{t}(\bm{x}). \end{align}

Feynman-Kac-formulated SOC

Introducing the desirability function ϕt=eVt\phi_{t} = e^{-V_{t}} converts the non-linear HJB equation into the linear PDE for ϕt\phi_{t}:

tϕt=[f,ϕt+σt22Δϕt]+cϕt,s.t.ϕT=exp(Φ),\begin{align}\textstyle \partial_{t} \phi_{t} = - \left[ \langle \bm{f}, \nabla \phi_{t} \rangle + \frac{\sigma_{t}^{2}}{2} \Delta \phi_{t} \right] + c \phi_{t}, \quad\text{s.t.}\quad \phi_{T} = \exp{\left(-\Phi\right)}, \end{align}

which, via FKF, yields

ϕt(x)=EX0:TP0[exp(tTc(Xs,s)dsΦ(XT))Xt=x].\begin{align}\textstyle \phi_{t}(\bm{x}) = \mathbb{E}_{\bm{X}_{0:T} \sim \mathbb{P}^{\bm{0}}} \left[ \exp{\left( -\int_{t}^{T} c(\bm{X}_{s}, s) \mathrm{d}s - \Phi(\bm{X}_{T}) \right)} \mid \bm{X}_{t} = \bm{x} \right]. \end{align}

Here, the optimal control u\bm{u}^{\ast} is written in terms of the desirability function:

u(x,t)=σtlogϕt(x).\begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \log{\phi_{t}(\bm{x})}. \end{align}

Lagrangian mechanics

The Lagrangian, action, generalized momentum, and generalized force are:

L(x,x˙,t)=12x˙f(x,t)σt2+c(x,t),S[x]=0TL(x,x˙,t)dt+Φ(xT),p=Lx˙=uσt,p˙=Lx=(f)p+c.\begin{align} L(\bm{x}, \dot{\bm{x}}, t) & \textstyle = \frac{1}{2} \left\lVert \frac{\dot{\bm{x}} - \bm{f}(\bm{x}, t)}{\sigma_{t}} \right\rVert^{2} + c(\bm{x}, t), \\ S[\bm{x}] & \textstyle = \int_{0}^{T} L(\bm{x}, \dot{\bm{x}}, t) \mathrm{d}t + \Phi(\bm{x}_{T}), \\ \bm{p} & \textstyle = \frac{\partial L}{\partial\dot{\bm{x}}} = \frac{\bm{u}}{\sigma_{t}}, \\ \dot{\bm{p}} & \textstyle = \frac{\partial L}{\partial\bm{x}} = -(\nabla \bm{f})^{\top} \bm{p} + \nabla c. \end{align}

The Hamiltonian and canonical equation are:

H(x,p,t)=p,x˙L(x,x˙,t)=p,f(x,t)+σt22p2c(x,t),x˙=Hp=f(x,t)+σt2p,p˙=Hx=(f)p+c.\begin{align} H(\bm{x}, \bm{p}, t) & \textstyle = \langle \bm{p}, \dot{\bm{x}} \rangle - L(\bm{x}, \dot{\bm{x}}, t) \\ & \textstyle = \langle \bm{p}, \bm{f}(\bm{x}, t) \rangle + \frac{\sigma_{t}^{2}}{2} \left\lVert \bm{p} \right\rVert^{2} - c(\bm{x}, t), \\ \dot{\bm{x}} & \textstyle = \frac{\partial H}{\partial \bm{p}} = \bm{f}(\bm{x}, t) + \sigma_{t}^{2} \bm{p}, \\ \dot{\bm{p}} & \textstyle = -\frac{\partial H}{\partial \bm{x}} = -(\nabla \bm{f})^{\top} \bm{p} + \nabla c. \end{align}

The HJB equation can be rewritten in terms of the Hamiltonian:

tVt+σt22ΔVt=H(x,Vt,t).\begin{align}\textstyle \partial_{t} V_{t} + \frac{\sigma_{t}^{2}}{2} \Delta V_{t} = H(\bm{x}, -\nabla V_{t}, t). \end{align}

Option pricing

Substituting x=logSx = \log{S}, ϕt(x)=C(ex,t)\phi_{t}(x) = C(e^{x}, t), σt=σ\sigma_{t} = \sigma, c(x,t)=rc(x, t) = r, f(x,t)=r12σ2\bm{f}(x, t) = r - \frac{1}{2}\sigma^{2} yields:

tC+rSSC+12σ2S2SSCrC=0,\begin{align}\textstyle \partial_{t} C + r S \partial_{S} C + \frac{1}{2} \sigma^{2} S^{2} \partial_{SS} C - rC = 0, \end{align}

which is formally equivalent to the Black-Scholes equation for an underlying asset price S(t)S(t), a call option price C(S,t)C(S, t), a volatility σ\sigma, and a risk-free interest rate rr.

Dynamic Schrodinger Bridge

DSB

The DSB formulation aims to determine the optimal pair of control and density that minimally perturbs the reference dynamics subject to initial and terminal marginal constraints:

infu,ptuDKL(PuP0),s.t.dXtu=[f(Xtu,t)+σtu(Xtu,t)]dt+σtdBt,X0uπ0,XTuπT.\begin{align} \inf_{\bm{u}, p_{t}^{\bm{u}}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right), \\ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}_{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}_{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}_{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma_{t} \mathrm{d} \bm{B}_{t}, \bm{X}_{0}^{\bm{u}} \sim \pi_{0}, \bm{X}_{T}^{\bm{u}} \sim \pi_{T}. \end{align}

We reframe this as an unconstrained optimization problem by introducing a Lagrangian with a Lagrange multiplier ψt:RdR\psi_{t} : \mathbb{R}^{d} \rightarrow \mathbb{R}:

L(ptu,u,ψt)=0TRd{12u2ptu+ψt[tptu+[(f+σtu)ptu]σt22Δptu]}dxdt,\begin{align}\textstyle \mathcal{L}(p_{t}^{\bm{u}}, \bm{u}, \psi_{t}) = \int_{0}^{T} \int_{\mathbb{R}^{d}} \left\{ \frac{1}{2} \left\lVert \bm{u} \right\rVert^{2} p_{t}^{\bm{u}} + \psi_{t} \cdot \left[ \partial_{t}p_{t}^{\bm{u}} + \nabla \cdot \left[ \left( \bm{f} + \sigma_{t}\bm{u} \right) p_{t}^{\bm{u}} \right] - \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\bm{u}} \right] \right\} \mathrm{d}\bm{x} \mathrm{d}t, \end{align}

which yields the minimizer (u,pt)(\bm{u}^{\ast}, p_{t}^{\ast}) as the solution to the HJB-FP system:

{tψt=σt22ψt2ψt,fσt22Δψt,tpt=[(f+σt2ψt)pt]+σt22Δpt,s.t.{p0=π0,pT=πT.\begin{align} \begin{cases} \partial_{t}\psi_{t} = -\frac{\sigma_{t}^{2}}{2} \left\lVert \nabla \psi_{t} \right\rVert^{2} - \langle \nabla \psi_{t}, \bm{f} \rangle - \frac{\sigma_{t}^{2}}{2} \Delta \psi_{t}, \\ \partial_{t}p_{t}^{\ast} = - \nabla \cdot \left[ \left( \bm{f} + \sigma_{t}^{2}\nabla\psi_{t} \right) p_{t}^{\ast} \right] + \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\ast}, \end{cases} \quad \text{s.t.} \quad \begin{cases} p_{0}^{\ast} = \pi_{0}, \\ p_{T}^{\ast} = \pi_{T}. \end{cases} \end{align}

Here, the optimal control u\bm{u}^{\ast} is written in terms of the Lagrange multiplier:

u(x,t)=σtψt(x).\begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \psi_{t}(\bm{x}). \end{align}

Hopf-Cole-transformed DSB

Applying the change of variables (ψt,pt)(ϕt,ϕ^t)(\psi_{t}, p_{t}^{\ast}) \mapsto (\phi_{t}, \hat{\phi}_{t}), defined as ψt=logϕt\psi_{t} = \log{\phi_{t}} and pt=ϕtϕ^tp_{t}^{\ast} = \phi_{t}\hat{\phi}_{t}, transforms the non-linear HJB-FP system into the linear system for a Schrodinger potential (ϕt,ϕ^t)(\phi_{t}, \hat{\phi}_{t}):

{tϕt=ϕt,fσt22Δϕt,tϕ^t=(ϕ^tf)+σt22Δϕ^t,s.t.{p0=ϕ0ϕ^0,pT=ϕTϕ^T,\begin{align} \begin{cases} \partial_{t}\phi_{t} = -\langle \nabla \phi_{t}, \bm{f} \rangle - \frac{\sigma_{t}^{2}}{2} \Delta \phi_{t}, \\ \partial_{t}\hat{\phi}_{t} = - \nabla \cdot (\hat{\phi}_{t} \bm{f}) + \frac{\sigma_{t}^{2}}{2} \Delta \hat{\phi}_{t}, \end{cases} \quad \text{s.t.} \quad \begin{cases} p_{0}^{\ast} = \phi_{0}\hat{\phi}_{0}, \\ p_{T}^{\ast} = \phi_{T} \hat{\phi}_{T}, \end{cases} \end{align}

where the solution is given by:

{ϕt(x)=RdPTt0(yx)ϕT(y)dy,ϕ^t(x)=RdPt00(xy)ϕ^0(y)dy,s.t.{π0(x)=ϕ0(x)ϕ^0(x),πT(x)=ϕT(x)ϕ^T(x).\begin{align} \begin{cases} \phi_{t}(\bm{x}) = \int_{\mathbb{R}^{d}} \mathbb{P}_{T \mid t}^{\bm{0}}(\bm{y} \mid \bm{x}) \phi_{T}(\bm{y}) \mathrm{d}\bm{y}, \\ \hat{\phi}_{t}(\bm{x}) = \int_{\mathbb{R}^{d}} \mathbb{P}_{t \mid 0}^{\bm{0}}(\bm{x} \mid \bm{y}) \hat{\phi}_{0}(\bm{y}) \mathrm{d}\bm{y}, \end{cases} \quad \text{s.t.} \quad \begin{cases} \pi_{0}(\bm{x}) = \phi_{0}(\bm{x}) \hat{\phi}_{0}(\bm{x}), \\ \pi_{T}(\bm{x}) = \phi_{T}(\bm{x}) \hat{\phi}_{T}(\bm{x}). \end{cases} \end{align}

Here, the optimal control u\bm{u}^{\ast} is written in the Schrodinger potential:

u(x,t)=σtlogϕt(x).\begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \log{\phi_{t}(\bm{x})}. \end{align}

SOC-formulated DSB

The DSB formulation can be interpreted as a SOC formulation with the following costs:

c(x,t)=0,Φ(x)=logϕ^T(x)πT(x).\begin{align}\textstyle c(\bm{x},t)=0, \quad \Phi(\bm{x}) = \log{\frac{\hat{\phi}_{T}(\bm{x})}{\pi_{T}(\bm{x})}}. \end{align}

Substituting these yields the SOC formulation equivalent to the DSB formulation:

infuDKL(PuP0)+EX0:TuPu[logϕ^T(XTu)πT(XTu)],s.t.dXtu=[f(Xtu,t)+σtu(Xtu,t)]dt+σtdBt,X0uπ0.\begin{align} \inf_{\bm{u}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right) + \mathbb{E}_{\bm{X}_{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \log{\frac{\hat{\phi}_{T}(\bm{X}_{T}^{\bm{u}})}{\pi_{T}(\bm{X}_{T}^{\bm{u}})}} \right], \\ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}_{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}_{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}_{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma_{t} \mathrm{d} \bm{B}_{t}, \bm{X}_{0}^{\bm{u}} \sim \pi_{0}. \end{align}

DOT-formulated DSB

The DSB formulation can be interpreted as a DOT formulation with the following velocity field:

v(x,t)=u(x,t)+σt2logpt(x).\begin{align}\textstyle \bm{v}(\bm{x}, t) = \bm{u}(\bm{x}, t) + \frac{\sigma_{t}}{2} \nabla\log{p_{t}(\bm{x})}. \end{align}

Reparameterizing the objective yields the DOT formulation equivalent to the DSB formulation:

infv0TRd[(12v2+σt28logpt212logpt,f)ptu]dxdt,s.t.tptu={[f+σtv]ptu},p0u=π0,pTu=πT.\begin{align} \inf_{\bm{v}} & \textstyle \quad \int_{0}^{T} \int_{\mathbb{R}^{d}} \left[ \left( \frac{1}{2} \left\lVert \bm{v} \right\rVert^{2} + \frac{\sigma_{t}^{2}}{8} \left\lVert \nabla \log{p_{t}} \right\rVert^{2} - \frac{1}{2} \langle \nabla \log{p_{t}}, \bm{f} \rangle \right) \bm{p}_{t}^{\bm{u}} \right] \mathrm{d}\bm{x} \mathrm{d}t, \\ \text{s.t.} & \textstyle \quad \partial_{t} p_{t}^{\bm{u}} = -\nabla \cdot \left\{ \left[ \bm{f} + \sigma_{t} \bm{v} \right] p_{t}^{\bm{u}} \right\}, p_{0}^{\bm{u}} = \pi_{0}, p_{T}^{\bm{u}} = \pi_{T}. \end{align}

DDPM as DSB

The standard continuous diffusion involving a Gaussian forward transition kernel:

q(xtx0)=N(xt;αtx0,(1αt)I),where αt=exp(0tβsds),\begin{align}\textstyle q(\bm{x}_{t} \mid \bm{x}_{0}) = \mathcal{N}\left( \bm{x}_{t} ; \sqrt{\alpha_{t}}\bm{x}_{0}, (1 - \alpha_{t}) \bm{I} \right), \quad\text{where } \alpha_{t} = \exp{\left( -\int_{0}^{t} \beta_{s} \mathrm{d}s \right)}, \end{align}

can be described by defining the reference dynamics P0\mathbb{P}^{\bm{0}} as:

dXt=f(Xt,t)dt+σtdBt, where f(x,t)=12βtx,  σt=βt.\begin{align}\textstyle \mathrm{d}\bm{X}_{t} = \bm{f}(\bm{X}_{t}, t)\mathrm{d}t + \sigma_{t}\mathrm{d}\bm{B}_{t}, \quad\text{ where } \bm{f}(\bm{x}, t) = -\frac{1}{2}\beta_{t}\bm{x}, \; \sigma_{t} = \sqrt{\beta_{t}}. \end{align}

By designing βt:[0,T]R0\beta_{t} : [0,T] \rightarrow \mathbb{R}_{\geq 0} such that P0\mathbb{P}^{\bm{0}} approximately reaches πT=N(0,I)\pi_{T} = \mathcal{N}(\bm{0},\bm{I}) starting from π0=pdata\pi_{0} = p_{\text{data}}, the system can be interpreted as a DSB problem where the reference process is pre-aligned with the target boundary conditions. Thus, the optimal controls vanishes:

u(x,t)=σtlogϕt(x)=βt(logpt(x)logϕ^t(x))=0.\begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \log{\phi_{t}(\bm{x})} = \sqrt{\beta_{t}} \left( \nabla\log{p_{t}^{\ast}}(\bm{x}) - \nabla\log{\hat{\phi}_{t}}(\bm{x}) \right) = \bm{0}. \end{align}

FM as DSB

The standard flow matching involving a Dirac delta transition kernel:

q(xtx0,xT)=limσ0N(xt;(Tt)x0+txTT,t(Tt)σ2TI),\begin{align}\textstyle q(\bm{x}_{t} \mid \bm{x}_{0}, \bm{x}_{T}) = \lim_{\sigma \to 0} \mathcal{N}\left( \bm{x}_{t} ; \frac{(T-t)\bm{x}_{0} + t\bm{x}_{T}}{T}, \frac{t(T-t)\sigma^{2}}{T}\bm{I} \right), \end{align}

can be described by defining the reference dynamics P0\mathbb{P}^{\bm{0}} as:

dXt=f(Xt,t)dt+σdBt, where f(x,t)=0,  σ0.\begin{align}\textstyle \mathrm{d}\bm{X}_{t} = \bm{f}(\bm{X}_{t}, t)\mathrm{d}t + \sigma\mathrm{d}\bm{B}_{t}, \quad\text{ where } \bm{f}(\bm{x}, t) = 0, \; \sigma \to 0. \end{align}

By taking σ0\sigma \to 0, the DSB problem converges to a classical DOT problem, where the underlying stochastic particle dynamics collapse into purely deterministic trajectories. For a specific pair (x0,xT)π0,T(\bm{x}_{0}, \bm{x}_{T}) \sim \pi_{0,T}, the conditional velocity field is simply the straight-line displacement: u(x,t)=1T(xTx0)\bm{u}^{\ast}(\bm{x}, t) = \frac{1}{T}(\bm{x}_{T} - \bm{x}_{0}). With an independent coupling, i.e., π0,T(x0,xT)=π0(x0)πT(xT)\pi_{0,T}(\bm{x}_{0}, \bm{x}_{T}) = \pi_{0}(\bm{x}_{0}) \pi_{T}(\bm{x}_{T}), the PF-ODE is given by:

dXt=v(Xt,t)dt,v(x,t)=Eπ0,T[1T(xTx0)Xt=x].\begin{align}\textstyle \mathrm{d}\bm{X}_{t}^{\ast} = \bm{v}^{\ast}(\bm{X}_{t}^{\ast}, t) \mathrm{d}t, \quad \bm{v}^{\ast}(\bm{x}, t) = \mathbb{E}_{\pi_{0,T}} \left[ \frac{1}{T} (\bm{x}_{T} - \bm{x}_{0}) \mid \bm{X}_{t}^{\ast}=\bm{x} \right]. \end{align}

Note that we have the following forward transition when πT=N(0,I)\pi_{T} = \mathcal{N}(\bm{0}, \bm{I}):

q(xtx0)=N(xt;(1tT)x0,(tT)2I).\begin{align}\textstyle q(\bm{x}_{t} \mid \bm{x}_{0}) = \mathcal{N}\left( \bm{x}_{t} ; (1-\frac{t}{T})\bm{x}_{0}, (\frac{t}{T})^{2}\bm{I} \right). \end{align}

Controlled diffusion process

CDP

A controlled diffusion process $(\bm{X}{t}^{\bm{u}} \in \mathbb{R}^{d}){t \in [0,T]}$ is defined as the solution to the SDE: $$ \begin{align} \mathrm{d}\bm{X}{t}^{\bm{u}} &= \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma_{t}\bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t}\mathrm{d}\bm{B}{t}, \end{align} $$ with the reference drift $\bm{f} : \mathbb{R}^{d} \times [0, T] \rightarrow \mathbb{R}^{d}$, the diffusion coefficient $\sigma{t} : [0, T] \rightarrow \mathbb{R}{\geq 0}$, and the control drift $\bm{u} : \mathbb{R}^{d} \times [0, T] \rightarrow \mathbb{R}^{d}$. The corresponding reverse-time process $( \bar{\bm{X}}{\bar{t}}^{\bm{u}} = \bm{X}{T - \bar{t}}^{\bm{u}} ){\bar{t} \in [0,T]}$ satisfies: $$ \begin{align}\textstyle \mathrm{d}\bar{\bm{X}}{\bar{t}}^{\bm{u}} &= \left[ - \bm{f}(\bar{\bm{X}}{\bar{t}}^{\bm{u}}, T - \bar{t}) - \sigma_{T - \bar{t}}\bm{u}(\bar{\bm{X}}{\bar{t}}^{\bm{u}}, T - \bar{t}) + \sigma{T - \bar{t}}^{2}\nabla\log{p_{T - \bar{t}}^{\bm{u}}(\bar{\bm{X}}{\bar{t}}^{\bm{u}})} \right] \mathrm{d}\bar{t} + \sigma{T - \bar{t}}\mathrm{d}\bar{\bm{B}}{\bar{t}}, \end{align} $$ where $p{t}^{\bm{u}}$ is the marginal density of $\bm{X}{t}^{\bm{u}}$ at time $t$, i.e., $\bm{X}{t}^{\bm{u}} \sim p_{t}^{\bm{u}}$. FPE provides a forward-time evolution of $p_{t}^{\bm{u}}$: $$ \begin{align}\textstyle \partial_{t} p_{t}^{\bm{u}}(\bm{x}) = -\nabla \cdot \left{ \left[ \bm{f}(\bm{x}, t) + \sigma_{t} \bm{u}(\bm{x}, t) \right] p_{t}^{\bm{u}}(\bm{x}) \right} + \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\bm{u}}(\bm{x}). \end{align} $$

KLD

Let $\mathbb{P}^{\bm{u}}$ and $\mathbb{P}^{\tilde{\bm{u}}}$ be path measures on the space of continuous paths $C([0,T];\mathbb{R}^{d})$, induced by SDEs with the same initial law, the same reference drift $\bm{f}$, and the same diffusion coefficient $\sigma_{t}$, but governed by different controls $\bm{u}$ and $\tilde{\bm{u}}$. Assuming absolute continuity $\mathbb{P}^{\bm{u}} \ll \mathbb{P}^{\tilde{\bm{u}}}$, the KLD of $\mathbb{P}^{\bm{u}}$ with respect to $\mathbb{P}^{\tilde{\bm{u}}}$ is given by: $$ \begin{align} D_{\text{KL}} \left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\tilde{\bm{u}}} \right) &\textstyle = \mathbb{E}{\bm{X}{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \frac{1}{2} \int_{0}^{T} \left\lVert \bm{u}(\bm{X}{t}^{\bm{u}}, t) - \tilde{\bm{u}}(\bm{X}{t}^{\bm{u}}, t) \right\rVert^{2} \mathrm{d}t \right], \ &\textstyle = \int_{0}^{T} \int_{\mathbb{R}^{d}} \left[ \frac{1}{2} \left\lVert \bm{u}(\bm{x}, t) - \tilde{\bm{u}}(\bm{x}, t) \right\rVert^{2} p_{t}^{\bm{u}}(\bm{x}) \right] \mathrm{d}\bm{x} \mathrm{d}t. \end{align} $$

Stochastic Optimal Control

SOC

The SOC formulation aims to determine the optimal control $\bm{u}^{\ast}$ that minimally perturbs the reference dynamics starting from $\pi_{0}$ while minimizing the cost: $$ \begin{align} \inf_{\bm{u}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right) + \mathbb{E}{\bm{X}{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{0}^{T} c(\bm{X}{t}^{\bm{u}}, t) \mathrm{d}t + \Phi(\bm{X}{T}^{\bm{u}}) \right], \ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t} \mathrm{d} \bm{B}{t}, \bm{X}{0}^{\bm{u}} \sim \pi_{0}, \end{align} $$ where $c : \mathbb{R}^{d} \times [0,T] \rightarrow \mathbb{R}$ is the running cost, and $\Phi : \mathbb{R}^{d} \rightarrow \mathbb{R}$ is the terminal cost. We define the value function $V_{t} : \mathbb{R}^{d} \rightarrow \mathbb{R}$ as the optimal cost-to-go from any fixed point $(\bm{x}, t)$: $$ \begin{align} V_{t}(\bm{x}) & \textstyle = \inf_{\bm{u}} \mathbb{E}{\bm{X}{t:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{t}^{T} \left( \frac{1}{2}\left\lVert \bm{u}(\bm{X}{s}^{\bm{u}}, s) \right\rVert^{2} + c(\bm{X}{s}^{\bm{u}}, s) \right) \mathrm{d}s + \Phi(\bm{X}{T}^{\bm{u}}) \mid \bm{X}{t}^{\bm{u}} = \bm{x} \right] \ & \textstyle = \inf_{\bm{u}} \mathbb{E}{\bm{X}{t:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \int_{t}^{\tau} \left( \frac{1}{2}\left\lVert \bm{u}(\bm{X}{s}^{\bm{u}}, s) \right\rVert^{2} + c(\bm{X}{s}^{\bm{u}}, s) \right) \mathrm{d}s + V_{\tau}(\bm{X}{\tau}^{\bm{u}}) \mid \bm{X}{t}^{\bm{u}} = \bm{x} \right], \end{align} $$ which solves the HJB equation and satisfies the backward-time evolution in the SDE: $$ \begin{align}\textstyle \partial_{t} V_{t} & \textstyle = - \left[ \langle \bm{f}, \nabla V_{t} \rangle + \frac{\sigma_{t}^{2}}{2} \Delta V_{t} \right] + \frac{\sigma_{t}^{2}}{2} \left\lVert \nabla V_{t} \right\rVert^{2} - c, \quad\text{s.t.}\quad V_{T} = \Phi, \ \mathrm{d}V_{t} & \textstyle = \left( \frac{\sigma_{t}^{2}}{2} \left\lVert \nabla V_{t} \right\rVert^{2} - c + \langle \sigma_{t} \bm{u}, \nabla V_{t} \rangle \right) \mathrm{d}t + \nabla V_{t}^\top \sigma_{t} \mathrm{d}\bm{B}{t}. \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in terms of the value function: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = - \sigma{t} \nabla V_{t}(\bm{x}). \end{align} $$

Feynman-Kac-formulated SOC

Introducing the desirability function $\phi_{t} = e^{-V_{t}}$ converts the non-linear HJB equation into the linear PDE for $\phi_{t}$: $$ \begin{align}\textstyle \partial_{t} \phi_{t} = - \left[ \langle \bm{f}, \nabla \phi_{t} \rangle + \frac{\sigma_{t}^{2}}{2} \Delta \phi_{t} \right] + c \phi_{t}, \quad\text{s.t.}\quad \phi_{T} = \exp{\left(-\Phi\right)}, \end{align} $$ which, via FKF, yields $$ \begin{align}\textstyle \phi_{t}(\bm{x}) = \mathbb{E}{\bm{X}{0:T} \sim \mathbb{P}^{\bm{0}}} \left[ \exp{\left( -\int_{t}^{T} c(\bm{X}{s}, s) \mathrm{d}s - \Phi(\bm{X}{T}) \right)} \mid \bm{X}{t} = \bm{x} \right]. \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in terms of the desirability function: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma{t} \nabla \log{\phi_{t}(\bm{x})}. \end{align} $$

Lagrangian mechanics

The Lagrangian, action, generalized momentum, and generalized force are: $$ \begin{align} L(\bm{x}, \dot{\bm{x}}, t) & \textstyle = \frac{1}{2} \left\lVert \frac{\dot{\bm{x}} - \bm{f}(\bm{x}, t)}{\sigma_{t}} \right\rVert^{2} + c(\bm{x}, t), \ S[\bm{x}] & \textstyle = \int_{0}^{T} L(\bm{x}, \dot{\bm{x}}, t) \mathrm{d}t + \Phi(\bm{x}{T}), \ \bm{p} & \textstyle = \frac{\partial L}{\partial\dot{\bm{x}}} = \frac{\bm{u}}{\sigma{t}}, \ \dot{\bm{p}} & \textstyle = \frac{\partial L}{\partial\bm{x}} = -(\nabla \bm{f})^{\top} \bm{p} + \nabla c. \end{align} $$ The Hamiltonian and canonical equation are: $$ \begin{align} H(\bm{x}, \bm{p}, t) & \textstyle = \langle \bm{p}, \dot{\bm{x}} \rangle - L(\bm{x}, \dot{\bm{x}}, t) \ & \textstyle = \langle \bm{p}, \bm{f}(\bm{x}, t) \rangle + \frac{\sigma_{t}^{2}}{2} \left\lVert \bm{p} \right\rVert^{2} - c(\bm{x}, t), \ \dot{\bm{x}} & \textstyle = \frac{\partial H}{\partial \bm{p}} = \bm{f}(\bm{x}, t) + \sigma_{t}^{2} \bm{p}, \ \dot{\bm{p}} & \textstyle = -\frac{\partial H}{\partial \bm{x}} = -(\nabla \bm{f})^{\top} \bm{p} + \nabla c. \end{align} $$ The HJB equation can be rewritten in terms of the Hamiltonian: $$ \begin{align}\textstyle \partial_{t} V_{t} + \frac{\sigma_{t}^{2}}{2} \Delta V_{t} = H(\bm{x}, -\nabla V_{t}, t). \end{align} $$

Option pricing

Substituting $x = \log{S}$, $\phi_{t}(x) = C(e^{x}, t)$, $\sigma_{t} = \sigma$, $c(x, t) = r$, $\bm{f}(x, t) = r - \frac{1}{2}\sigma^{2}$ yields: $$ \begin{align}\textstyle \partial_{t} C + r S \partial_{S} C + \frac{1}{2} \sigma^{2} S^{2} \partial_{SS} C - rC = 0, \end{align} $$ which is formally equivalent to the Black-Scholes equation for an underlying asset price $S(t)$, a call option price $C(S, t)$, a volatility $\sigma$, and a risk-free interest rate $r$.

Dynamic Schrodinger Bridge

DSB

The DSB formulation aims to determine the optimal pair of control and density that minimally perturbs the reference dynamics subject to initial and terminal marginal constraints: $$ \begin{align} \inf_{\bm{u}, p_{t}^{\bm{u}}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right), \ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma_{t} \bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t} \mathrm{d} \bm{B}{t}, \bm{X}{0}^{\bm{u}} \sim \pi_{0}, \bm{X}{T}^{\bm{u}} \sim \pi{T}. \end{align} $$ We reframe this as an unconstrained optimization problem by introducing a Lagrangian with a Lagrange multiplier $\psi_{t} : \mathbb{R}^{d} \rightarrow \mathbb{R}$: $$ \begin{align}\textstyle \mathcal{L}(p_{t}^{\bm{u}}, \bm{u}, \psi_{t}) = \int_{0}^{T} \int_{\mathbb{R}^{d}} \left{ \frac{1}{2} \left\lVert \bm{u} \right\rVert^{2} p_{t}^{\bm{u}} + \psi_{t} \cdot \left[ \partial_{t}p_{t}^{\bm{u}} + \nabla \cdot \left[ \left( \bm{f} + \sigma_{t}\bm{u} \right) p_{t}^{\bm{u}} \right] - \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\bm{u}} \right] \right} \mathrm{d}\bm{x} \mathrm{d}t, \end{align} $$ which yields the minimizer $(\bm{u}^{\ast}, p_{t}^{\ast})$ as the solution to the HJB-FP system: $$ \begin{align} \begin{cases} \partial_{t}\psi_{t} = -\frac{\sigma_{t}^{2}}{2} \left\lVert \nabla \psi_{t} \right\rVert^{2} - \langle \nabla \psi_{t}, \bm{f} \rangle - \frac{\sigma_{t}^{2}}{2} \Delta \psi_{t}, \ \partial_{t}p_{t}^{\ast} = - \nabla \cdot \left[ \left( \bm{f} + \sigma_{t}^{2}\nabla\psi_{t} \right) p_{t}^{\ast} \right] + \frac{\sigma_{t}^{2}}{2} \Delta p_{t}^{\ast}, \end{cases} \quad \text{s.t.} \quad \begin{cases} p_{0}^{\ast} = \pi_{0}, \ p_{T}^{\ast} = \pi_{T}. \end{cases} \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in terms of the Lagrange multiplier: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \psi_{t}(\bm{x}). \end{align} $$

Hopf-Cole-transformed DSB

Applying the change of variables $(\psi_{t}, p_{t}^{\ast}) \mapsto (\phi_{t}, \hat{\phi}{t})$, defined as $\psi{t} = \log{\phi_{t}}$ and $p_{t}^{\ast} = \phi_{t}\hat{\phi}{t}$, transforms the non-linear HJB-FP system into the linear system for a Schrodinger potential $(\phi{t}, \hat{\phi}{t})$: $$ \begin{align} \begin{cases} \partial{t}\phi_{t} = -\langle \nabla \phi_{t}, \bm{f} \rangle - \frac{\sigma_{t}^{2}}{2} \Delta \phi_{t}, \ \partial_{t}\hat{\phi}{t} = - \nabla \cdot (\hat{\phi}{t} \bm{f}) + \frac{\sigma_{t}^{2}}{2} \Delta \hat{\phi}{t}, \end{cases} \quad \text{s.t.} \quad \begin{cases} p{0}^{\ast} = \phi_{0}\hat{\phi}{0}, \ p{T}^{\ast} = \phi_{T} \hat{\phi}{T}, \end{cases} \end{align} $$ where the solution is given by: $$ \begin{align} \begin{cases} \phi{t}(\bm{x}) = \int_{\mathbb{R}^{d}} \mathbb{P}{T \mid t}^{\bm{0}}(\bm{y} \mid \bm{x}) \phi{T}(\bm{y}) \mathrm{d}\bm{y}, \ \hat{\phi}{t}(\bm{x}) = \int{\mathbb{R}^{d}} \mathbb{P}{t \mid 0}^{\bm{0}}(\bm{x} \mid \bm{y}) \hat{\phi}{0}(\bm{y}) \mathrm{d}\bm{y}, \end{cases} \quad \text{s.t.} \quad \begin{cases} \pi_{0}(\bm{x}) = \phi_{0}(\bm{x}) \hat{\phi}{0}(\bm{x}), \ \pi{T}(\bm{x}) = \phi_{T}(\bm{x}) \hat{\phi}{T}(\bm{x}). \end{cases} \end{align} $$ Here, the optimal control $\bm{u}^{\ast}$ is written in the Schrodinger potential: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma{t} \nabla \log{\phi_{t}(\bm{x})}. \end{align} $$

SOC-formulated DSB

The DSB formulation can be interpreted as a SOC formulation with the following costs: $$ \begin{align}\textstyle c(\bm{x},t)=0, \quad \Phi(\bm{x}) = \log{\frac{\hat{\phi}{T}(\bm{x})}{\pi{T}(\bm{x})}}. \end{align} $$ Substituting these yields the SOC formulation equivalent to the DSB formulation: $$ \begin{align} \inf_{\bm{u}} & \textstyle \quad D_{\text{KL}}\left( \mathbb{P}^{\bm{u}} \parallel \mathbb{P}^{\bm{0}} \right) + \mathbb{E}{\bm{X}{0:T}^{\bm{u}} \sim \mathbb{P}^{\bm{u}}} \left[ \log{\frac{\hat{\phi}{T}(\bm{X}{T}^{\bm{u}})}{\pi_{T}(\bm{X}{T}^{\bm{u}})}} \right], \ \text{s.t.} & \textstyle \quad \mathrm{d}\bm{X}{t}^{\bm{u}} = \left[ \bm{f}(\bm{X}{t}^{\bm{u}}, t) + \sigma{t} \bm{u}(\bm{X}{t}^{\bm{u}}, t) \right] \mathrm{d}t + \sigma{t} \mathrm{d} \bm{B}{t}, \bm{X}{0}^{\bm{u}} \sim \pi_{0}. \end{align} $$

DOT-formulated DSB

The DSB formulation can be interpreted as a DOT formulation with the following velocity field: $$ \begin{align}\textstyle \bm{v}(\bm{x}, t) = \bm{u}(\bm{x}, t) + \frac{\sigma_{t}}{2} \nabla\log{p_{t}(\bm{x})}. \end{align} $$ Reparameterizing the objective yields the DOT formulation equivalent to the DSB formulation: $$ \begin{align} \inf_{\bm{v}} & \textstyle \quad \int_{0}^{T} \int_{\mathbb{R}^{d}} \left[ \left( \frac{1}{2} \left\lVert \bm{v} \right\rVert^{2} + \frac{\sigma_{t}^{2}}{8} \left\lVert \nabla \log{p_{t}} \right\rVert^{2} - \frac{1}{2} \langle \nabla \log{p_{t}}, \bm{f} \rangle \right) \bm{p}{t}^{\bm{u}} \right] \mathrm{d}\bm{x} \mathrm{d}t, \ \text{s.t.} & \textstyle \quad \partial{t} p_{t}^{\bm{u}} = -\nabla \cdot \left{ \left[ \bm{f} + \sigma_{t} \bm{v} \right] p_{t}^{\bm{u}} \right}, p_{0}^{\bm{u}} = \pi_{0}, p_{T}^{\bm{u}} = \pi_{T}. \end{align} $$

DDPM as DSB

The standard continuous diffusion involving a Gaussian forward transition kernel: $$ \begin{align}\textstyle q(\bm{x}{t} \mid \bm{x}{0}) = \mathcal{N}\left( \bm{x}{t} ; \sqrt{\alpha{t}}\bm{x}{0}, (1 - \alpha{t}) \bm{I} \right), \quad\text{where } \alpha_{t} = \exp{\left( -\int_{0}^{t} \beta_{s} \mathrm{d}s \right)}, \end{align} $$ can be described by defining the reference dynamics $\mathbb{P}^{\bm{0}}$ as: $$ \begin{align}\textstyle \mathrm{d}\bm{X}{t} = \bm{f}(\bm{X}{t}, t)\mathrm{d}t + \sigma_{t}\mathrm{d}\bm{B}{t}, \quad\text{ where } \bm{f}(\bm{x}, t) = -\frac{1}{2}\beta{t}\bm{x}, ; \sigma_{t} = \sqrt{\beta_{t}}. \end{align} $$ By designing $\beta_{t} : [0,T] \rightarrow \mathbb{R}{\geq 0}$ such that $\mathbb{P}^{\bm{0}}$ approximately reaches $\pi{T} = \mathcal{N}(\bm{0},\bm{I})$ starting from $\pi_{0} = p_{\text{data}}$, the system can be interpreted as a DSB problem where the reference process is pre-aligned with the target boundary conditions. Thus, the optimal controls vanishes: $$ \begin{align} \bm{u}^{\ast}(\bm{x}, t) = \sigma_{t} \nabla \log{\phi_{t}(\bm{x})} = \sqrt{\beta_{t}} \left( \nabla\log{p_{t}^{\ast}}(\bm{x}) - \nabla\log{\hat{\phi}_{t}}(\bm{x}) \right) = \bm{0}. \end{align} $$

FM as DSB

The standard flow matching involving a Dirac delta transition kernel: $$ \begin{align}\textstyle q(\bm{x}{t} \mid \bm{x}{0}, \bm{x}{T}) = \lim{\sigma \to 0} \mathcal{N}\left( \bm{x}{t} ; \frac{(T-t)\bm{x}{0} + t\bm{x}{T}}{T}, \frac{t(T-t)\sigma^{2}}{T}\bm{I} \right), \end{align} $$ can be described by defining the reference dynamics $\mathbb{P}^{\bm{0}}$ as: $$ \begin{align}\textstyle \mathrm{d}\bm{X}{t} = \bm{f}(\bm{X}{t}, t)\mathrm{d}t + \sigma\mathrm{d}\bm{B}{t}, \quad\text{ where } \bm{f}(\bm{x}, t) = 0, ; \sigma \to 0. \end{align} $$ By taking $\sigma \to 0$, the DSB problem converges to a classical DOT problem, where the underlying stochastic particle dynamics collapse into purely deterministic trajectories. For a specific pair $(\bm{x}{0}, \bm{x}{T}) \sim \pi_{0,T}$, the conditional velocity field is simply the straight-line displacement: $\bm{u}^{\ast}(\bm{x}, t) = \frac{1}{T}(\bm{x}{T} - \bm{x}{0})$. With an independent coupling, i.e., $\pi_{0,T}(\bm{x}{0}, \bm{x}{T}) = \pi_{0}(\bm{x}{0}) \pi{T}(\bm{x}{T})$, the PF-ODE is given by: $$ \begin{align}\textstyle \mathrm{d}\bm{X}{t}^{\ast} = \bm{v}^{\ast}(\bm{X}{t}^{\ast}, t) \mathrm{d}t, \quad \bm{v}^{\ast}(\bm{x}, t) = \mathbb{E}{\pi_{0,T}} \left[ \frac{1}{T} (\bm{x}{T} - \bm{x}{0}) \mid \bm{X}{t}^{\ast}=\bm{x} \right]. \end{align} $$ Note that we have the following forward transition when $\pi{T} = \mathcal{N}(\bm{0}, \bm{I})$: $$ \begin{align}\textstyle q(\bm{x}{t} \mid \bm{x}{0}) = \mathcal{N}\left( \bm{x}{t} ; (1-\frac{t}{T})\bm{x}{0}, (\frac{t}{T})^{2}\bm{I} \right). \end{align} $$