Files
myWiki/concepts/stochastic-differential-equation.md

56 lines
1.7 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "随机微分方程 (Stochastic Differential Equation)"
created: 2026-06-17
updated: 2026-06-17
type: concept
tags: [mathematics, stochastic-processes, theory, probability]
sources: [raw/papers/tiwari-ticks-to-flows-2026.md]
confidence: high
---
# 随机微分方程 (Stochastic Differential Equation)
SDE 是描述**受随机噪声驱动的连续时间动态系统**的数学框架,是 [[ticks-to-flows|Tiwari et al. (2026)]] 论文的核心数学工具。
## 标准形式
```
dX_t = b(X_t) dt + σ(X_t) dW_t
```
- `b(X_t) dt`**漂移项**drift确定性的变化方向
- `σ(X_t) dW_t`**扩散项**diffusion随机波动
- `W_t`[[wiener-process|Wiener 过程]]Brownian motion
## Itô 积分
SDE 的解通过 [[ito-calculus|Itô 积分]] 定义:
```
X_t = X_0 + ∫_0^t b(X_l) dl + ∫_0^t σ(X_l) dW_l
```
在适当的条件Lipschitz 连续 + 线性增长)下,解在概率意义下存在且唯一。
## 在强化学习中的应用
在 [[continuous-time-rl|连续时间 RL]] 中SDE 用于建模:
1. **环境转移**`ds_t = (g(s_t) + h(s_t)a_t)dt + σ(s_t)dW_t`
2. **探索动力学**:同时包含策略随机性和环境随机性
3. **梯度时间动态**:描述参数更新如何改变状态分布
## 关键性质
- **鞅性质**:扩散项形成一个[[martingale-clt|鞅]],可用于 CLT 分析
- **Markov 性**:未来仅依赖当前状态
- **无穷小生成元**infinitesimal generatorL^π 刻画函数沿轨道的瞬时变化
## 参考
- [[wiener-process|维纳过程]]
- [[ito-calculus|Itô 微积分]]
- [[continuous-time-rl|连续时间 RL]]
- [[exploratory-dynamics|探索动力学]]
- [[ticks-to-flows|Ticks to Flows]]