mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-31 02:02:21 +06:00
![]() * remove to restiction for 4-bit model * Update src/transformers/modeling_utils.py Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * bitsandbytes: prevent dtype casting while allowing device movement with .to or .cuda * quality fix * Improve warning message for .to() and .cuda() on bnb quantized models --------- Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> |
||
---|---|---|
.. | ||
aqlm_integration | ||
autoawq | ||
bnb | ||
eetq_integration | ||
fbgemm_fp8 | ||
ggml | ||
gptq | ||
hqq | ||
quanto_integration | ||
torchao_integration |