Files
myWiki/concepts/cache-hit-ratio.md

34 lines
1006 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Cache Hit Ratio (CHR)"
created: 2026-05-11
updated: 2026-05-11
type: concept
tags: [cache, metrics, observability, system-design]
sources: [[prompt-caching-architecture]]
---
# Cache Hit Ratio (CHR缓存命中率)
## 定义
Cache Hit Ratio 是衡量 [[prompt-caching|Prompt Caching]] 效率的核心指标:
CHR = (缓存命中的请求数) / (总请求数)
## 工程标准
- **CHR ≥ 95%**:健康状态,[[prompt-layering|提示分层]] 设计有效
- **CHR < 95%**触发告警需立即排查 [[cache-invalidation|失效原因]]
- [[meta-jctrader|Meta-JCTrader]] 等高频系统中 CHR 直接意味着高额账单损失和不可接受的延迟
## 辅助指标
- **Invalidation Point Identification**失效点的字节偏移追踪
- **Cost Efficiency Score**Cache-Off vs Cache-On Token 消耗对比
## 相关概念
- [[prompt-caching|Prompt Caching]]
- [[cache-invalidation|缓存失效]]
- [[cache-health-observability|缓存健康度可观测性]]