20260514: Add new content
33
concepts/cache-hit-ratio.md
Normal file
@@ -0,0 +1,33 @@
---
title: "Cache Hit Ratio (CHR)"
created: 2026-05-11
updated: 2026-05-11
type: concept
tags: [cache, metrics, observability, system-design]
sources: [[prompt-caching-architecture]]
---

# Cache Hit Ratio (CHR)

## Definition

Cache Hit Ratio is the core metric for measuring the efficiency of [[prompt-caching|Prompt Caching]]:

CHR = (number of requests that hit the cache) / (total number of requests)
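
The ratio above can be computed directly from two counters. The following is a minimal sketch, not part of the original note: the function name `cache_hit_ratio` and the choice to return `0.0` when no requests have been observed are assumptions made for illustration.

```python
def cache_hit_ratio(hits: int, total: int) -> float:
    """Return CHR = hits / total.

    Assumption: with zero observed requests the ratio is reported as 0.0
    rather than raising, so that dashboards always receive a number.
    """
    if hits < 0 or total < 0 or hits > total:
        raise ValueError("expected 0 <= hits <= total")
    if total == 0:
        return 0.0
    return hits / total


# Hypothetical counters: 962 cache hits out of 1000 requests.
print(f"{cache_hit_ratio(962, 1000):.1%}")  # prints 96.2%
```

Whether an empty window should count as 0.0, 1.0, or "no data" depends on how the alerting pipeline treats missing values; the sketch only picks one option.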
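
## Engineering standards

- **CHR ≥ 95%**: healthy; the [[prompt-layering|prompt layering]] design is working
- **CHR < 95%**: triggers an alert; investigate the [[cache-invalidation|invalidation cause]] immediately
- In high-frequency systems such as [[meta-jctrader|Meta-JCTrader]], a low CHR translates directly into heavy billing costs and unacceptable latency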
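
The 95% threshold above maps onto a simple health check. This is a sketch under stated assumptions: the constant `CHR_ALERT_THRESHOLD`, the function `chr_status`, and the status strings are hypothetical names, and how an alert is actually delivered is left out.

```python
CHR_ALERT_THRESHOLD = 0.95  # the 95% boundary from the standard above


def chr_status(chr_value: float, threshold: float = CHR_ALERT_THRESHOLD) -> str:
    """Classify a CHR reading as "healthy" (>= threshold) or "alert" (< threshold)."""
    if not 0.0 <= chr_value <= 1.0:
        raise ValueError("CHR must be a ratio between 0.0 and 1.0")
    return "healthy" if chr_value >= threshold else "alert"


print(chr_status(0.962))  # prints healthy
print(chr_status(0.90))   # prints alert
```

A reading of exactly 95% counts as healthy here, matching the "≥ 95%" wording of the first bullet.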

## Auxiliary metrics

- **Invalidation Point Identification**: tracks the byte offset at which the cache is invalidated
- **Cost Efficiency Score**: compares token consumption with the cache off versus the cache on
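
Both auxiliary metrics can be approximated in a few lines. The note does not define either one precisely, so the sketch below rests on assumptions: the invalidation point is taken to be the first byte at which two consecutive prompts differ in their UTF-8 encoding, and the cost efficiency score is taken to be the fraction of tokens saved with the cache on. The function names and sample values are hypothetical.

```python
from typing import Optional


def invalidation_offset(previous: str, current: str) -> Optional[int]:
    """Byte offset of the first difference between two UTF-8 encoded prompts.

    Returns None when the prompts are byte-identical. If one prompt is a
    strict prefix of the other, the offset is the length of the shorter one.
    """
    a, b = previous.encode("utf-8"), current.encode("utf-8")
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def cost_efficiency_score(tokens_cache_off: int, tokens_cache_on: int) -> float:
    """Fraction of tokens saved by caching (assumed definition, not from the note)."""
    if tokens_cache_off <= 0 or tokens_cache_on < 0:
        raise ValueError("expected tokens_cache_off > 0 and tokens_cache_on >= 0")
    return 1.0 - tokens_cache_on / tokens_cache_off


# Hypothetical prompts that differ only in the version marker.
print(invalidation_offset("system: v1\nuser: hi", "system: v2\nuser: hi"))  # prints 9
print(cost_efficiency_score(1000, 250))  # prints 0.75
```

An offset near the start of the prompt suggests that a supposedly stable prefix changed, which is the kind of cause the alerting rule asks to investigate.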

## Related concepts

- [[prompt-caching|Prompt Caching]]
- [[cache-invalidation|Cache Invalidation]]
- [[cache-health-observability|Cache Health Observability]]