Files
myWiki/concepts/qlora.md

29 lines
780 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "QLoRA (量化低秩适配)"
created: 2025-06-02
updated: 2025-06-02
type: concept
tags: [qlora, fine-tuning, quantization, placeholder]
sources: []
---
# QLoRA
> Quantized Low-Rank AdaptationDettmers et al., NeurIPS 2023将 [[lora|LoRA]] 与 4-bit 量化结合,大幅降低 LLM 微调的内存需求。
## 核心机制
- **4-bit NormalFloat (NF4)** 量化:专为正态分布权重设计
- **双重量化**:进一步压缩量化常数
- **分页优化器**:处理内存峰值
## 在 One-Pass to Reason 中的应用
[[goru-one-pass-to-reason-2025]] 在 Qwen-3 系列4B/8B/32B上使用 QLoRA 进行实验rank=32α=644-bit NF4 量化。
## 相关
- [[lora]]
- [[goru-one-pass-to-reason-2025|One-Pass to Reason]]
- [[llama-factory]]