Files
myWiki/concepts/cache-hit-ratio.md

1006 B
Raw Permalink Blame History

title, created, updated, type, tags, sources
title created updated type tags sources
Cache Hit Ratio (CHR) 2026-05-11 2026-05-11 concept
cache
metrics
observability
system-design
prompt-caching-architecture

Cache Hit Ratio (CHR缓存命中率)

定义

Cache Hit Ratio 是衡量 prompt-caching 效率的核心指标:

CHR = (缓存命中的请求数) / (总请求数)

工程标准

辅助指标

  • Invalidation Point Identification:失效点的字节偏移追踪
  • Cost Efficiency ScoreCache-Off vs Cache-On 的 Token 消耗对比

相关概念