20260514: Add new content

2026-05-14 13:54:52 +08:00
parent 56c4d3ef7c
commit b116710e4c
294 changed files with 10682 additions and 255 deletions


@@ -0,0 +1,33 @@
---
title: "Cache Hit Ratio (CHR)"
created: 2026-05-11
updated: 2026-05-11
type: concept
tags: [cache, metrics, observability, system-design]
sources: [[prompt-caching-architecture]]
---
# Cache Hit Ratio (CHR)
## Definition
Cache Hit Ratio is the core metric for measuring the efficiency of [[prompt-caching|Prompt Caching]]:
CHR = (number of requests served from cache) / (total number of requests)
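The definition above can be sketched in a few lines. This is a minimal illustration only: the function name `cache_hit_ratio` and the choice to return 0.0 when there are no requests are assumptions, not part of the original note.

```python
def cache_hit_ratio(hits: int, total: int) -> float:
    """Return hits / total, following the request-level definition of CHR.

    Returning 0.0 for an empty window is an assumed convention; a caller
    may prefer to skip the window instead.
    """
    if hits < 0 or total < 0 or hits > total:
        raise ValueError("expected 0 <= hits <= total")
    if total == 0:
        return 0.0
    return hits / total


# Example: 970 of 1000 requests were served from cache.
print(f"{cache_hit_ratio(970, 1000):.1%}")  # prints 97.0%
```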
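## Engineering Standards
- **CHR ≥ 95%**: healthy; the [[prompt-layering|prompt layering]] design is working
- **CHR < 95%**: triggers an alert; investigate the [[cache-invalidation|invalidation cause]] immediately
- In high-frequency systems such as [[meta-jctrader|Meta-JCTrader]], a low CHR translates directly into heavy billing losses and unacceptable latency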
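The 95% threshold above could be checked per monitoring window roughly as follows. The names `CHR_ALERT_THRESHOLD` and `chr_status` are hypothetical, and how a real system windows its requests or routes the alert is not specified in this note.

```python
CHR_ALERT_THRESHOLD = 0.95  # threshold taken from the standard above


def chr_status(hits: int, total: int,
               threshold: float = CHR_ALERT_THRESHOLD) -> str:
    """Classify one monitoring window as "healthy" or "alert".

    A window with no requests is rejected here (an assumed choice),
    since a ratio is not meaningful without traffic.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    ratio = hits / total
    return "healthy" if ratio >= threshold else "alert"


# Example windows: exactly at the threshold, and just below it.
for hits, total in [(95, 100), (949, 1000)]:
    print(hits, total, chr_status(hits, total))
```

A window that returns "alert" would be the point at which to start looking for the invalidation cause.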
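## Auxiliary Metrics
- **Invalidation Point Identification**: tracks the byte offset at which the cache is invalidated
- **Cost Efficiency Score**: compares token consumption with caching off (Cache-Off) versus on (Cache-On)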
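Both auxiliary metrics can be illustrated with a small sketch. The note does not define exact formulas, so two assumptions are made here: the invalidation point is taken to be the first byte at which two consecutive prompts differ in UTF-8, and the Cost Efficiency Score is taken to be the fraction of tokens saved, `1 - cache_on / cache_off`. The function names are hypothetical.

```python
from typing import Optional


def first_divergent_byte(previous: str, current: str) -> Optional[int]:
    """Return the UTF-8 byte offset where two prompts first differ.

    Returns None when the prompts are byte-identical. If one prompt is a
    strict prefix of the other, the offset is the length of the shorter one.
    """
    a = previous.encode("utf-8")
    b = current.encode("utf-8")
    for offset, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return offset
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def cost_efficiency_score(tokens_cache_off: int, tokens_cache_on: int) -> float:
    """Fraction of token consumption saved by caching (assumed definition)."""
    if tokens_cache_off <= 0:
        raise ValueError("tokens_cache_off must be positive")
    return 1 - tokens_cache_on / tokens_cache_off


# A one-character change in the system layer invalidates the prefix at byte 9.
print(first_divergent_byte("system: v1\nuser: hi", "system: v2\nuser: hi"))  # 9
print(cost_efficiency_score(1000, 250))  # 0.75
```

An early offset suggests that a supposedly stable layer of the prompt changed, while an offset near the end is the expected case for a new user turn.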
## Related Concepts
- [[prompt-caching|Prompt Caching]]
- [[cache-invalidation|Cache Invalidation]]
- [[cache-health-observability|Cache Health Observability]]