---
title: "Cache Hit Ratio (CHR)"
created: 2026-05-11
updated: 2026-05-11
type: concept
tags: [cache, metrics, observability, system-design]
sources: [[prompt-caching-architecture]]
---

# Cache Hit Ratio (CHR)

## Definition

Cache Hit Ratio (CHR) is the core metric for the efficiency of [[prompt-caching|Prompt Caching]]:

CHR = (requests served from cache) / (total requests)

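
As a minimal sketch of the ratio defined above, assuming each request can be labelled as a cache hit or a miss, the counters below track both numbers and divide them. The class name `CacheCounters` and its methods are hypothetical and not part of any system named in this note.

```python
from dataclasses import dataclass


@dataclass
class CacheCounters:
    """Running request counts (hypothetical helper, for illustration only)."""

    hits: int = 0
    total: int = 0

    def record(self, cache_hit: bool) -> None:
        # Every request adds to the total; only cache hits add to hits.
        self.total += 1
        if cache_hit:
            self.hits += 1

    def hit_ratio(self) -> float:
        # Report 0.0 when no requests have been seen, to avoid dividing by zero.
        return self.hits / self.total if self.total else 0.0


counters = CacheCounters()
for was_hit in [True, True, False, True]:
    counters.record(was_hit)

print(f"CHR = {counters.hit_ratio():.2%}")
```

Returning 0.0 for an empty window is one possible convention; a monitoring pipeline might prefer to report "no data" instead so that idle periods do not look like failures.
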
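## Engineering Standards

- **CHR ≥ 95%**: healthy; the [[prompt-layering|prompt layering]] design is working
- **CHR < 95%**: raise an alert and investigate the [[cache-invalidation|cause of invalidation]] immediately
- In high-frequency systems such as [[meta-jctrader|Meta-JCTrader]], a low CHR translates directly into heavy billing losses and unacceptable latency
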
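
The 95% threshold above could be checked with something like the following sketch. The function name `classify_chr`, the status strings, and the separate "no-data" state are assumptions made for illustration; only the 0.95 boundary comes from this note.

```python
CHR_ALERT_THRESHOLD = 0.95  # boundary taken from the standard above


def classify_chr(hits: int, total: int,
                 threshold: float = CHR_ALERT_THRESHOLD) -> str:
    """Map raw counts to a status string (hypothetical helper)."""
    if total == 0:
        # No traffic in the window: neither healthy nor alerting.
        return "no-data"
    # At or above the threshold is healthy; anything below triggers an alert.
    return "healthy" if hits / total >= threshold else "alert"


for hits, total in [(97, 100), (95, 100), (940, 1000), (0, 0)]:
    print(hits, total, classify_chr(hits, total))
```

Evaluating the check over a fixed window of recent requests, rather than over all requests since startup, would let a sudden drop show up quickly; the window size is a design choice this note does not specify.
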
## Auxiliary Metrics

- **Invalidation Point Identification**: tracks the byte offset at which the cache was invalidated
- **Cost Efficiency Score**: compares token consumption with the cache off versus the cache on

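
Both auxiliary metrics can be sketched in a few lines. This note does not give exact formulas, so the code below rests on two assumptions: the invalidation point is taken to be the first UTF-8 byte at which two consecutive prompts differ, and the cost score is taken to be the fraction of tokens saved with the cache on. The function names are hypothetical.

```python
from typing import Optional


def first_divergence_offset(previous: str, current: str) -> Optional[int]:
    """Return the UTF-8 byte offset of the first difference, or None if identical."""
    a, b = previous.encode("utf-8"), current.encode("utf-8")
    for offset, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return offset
    # One prompt is a strict prefix of the other: they diverge where the shorter ends.
    return None if len(a) == len(b) else min(len(a), len(b))


def token_savings(tokens_cache_off: int, tokens_cache_on: int) -> float:
    """Fraction of tokens saved by caching (assumed definition of the score)."""
    if tokens_cache_off <= 0:
        raise ValueError("tokens_cache_off must be positive")
    return 1 - tokens_cache_on / tokens_cache_off


old_prompt = "SYSTEM: rules v1\nUSER: hi"
new_prompt = "SYSTEM: rules v2\nUSER: hi"
print(first_divergence_offset(old_prompt, new_prompt))
print(token_savings(1000, 250))
```

An early divergence offset suggests that volatile content sits near the start of the prompt, where it invalidates everything cached after it.
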
## Related Concepts

- [[prompt-caching|Prompt Caching]]
- [[cache-invalidation|Cache Invalidation]]
- [[cache-health-observability|Cache Health Observability]]