transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 05:10:06 +06:00

History

jiqing-feng 11c27dd331 Enable BNB multi-backend support (#31098 ) * enable cpu bnb path * fix style * fix code style * fix 4 bit path * Update src/transformers/utils/import_utils.py Co-authored-by: Aarni Koskela <akx@iki.fi> * add multi backend refactor tests * fix style * tweak 4bit quantizer + fix corresponding tests * tweak 8bit quantizer + try fixing corresponding tests * fix dequant bnb 8bit * account for Intel CPU in variability of expected outputs * enable cpu and xpu device map * further tweaks to account for Intel CPU * fix autocast to work with both cpu + cuda * fix comments * fix comments * switch to testing_utils.torch_device * allow for xpu in multi-gpu tests * fix tests 4bit for CPU NF4 * fix bug with is_torch_xpu_available needing to be called as func * avoid issue where test reports attr err due to other failure * fix formatting * fix typo from resolving of merge conflict * polish based on last PR review Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * fix CI * Update src/transformers/integrations/integration_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/integrations/integration_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix error log * fix error msg * add \n in error log * make quality * rm bnb cuda restriction in doc * cpu model don't need dispatch * fix doc * fix style * check cuda avaliable in testing * fix tests * Update docs/source/en/model_doc/chameleon.md Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update docs/source/en/model_doc/llava_next.md Co-authored-by: Aarni Koskela <akx@iki.fi> * Update tests/quantization/bnb/test_4bit.py Co-authored-by: Aarni Koskela <akx@iki.fi> * Update tests/quantization/bnb/test_4bit.py Co-authored-by: Aarni Koskela <akx@iki.fi> * fix doc * fix check multibackends * fix import sort * remove check torch in bnb * docs: update bitsandbytes references with multi-backend info * docs: fix small mistakes in bnb paragraph * run formatting * reveret bnb check * move bnb multi-backend check to import_utils * Update src/transformers/utils/import_utils.py Co-authored-by: Aarni Koskela <akx@iki.fi> * fix bnb check * minor fix for bnb * check lib first * fix code style * Revert "run formatting" This reverts commit `ac108c6d6b`. * fix format * give warning when bnb version is low and no cuda found] * fix device assignment check to be multi-device capable * address akx feedback on get_avlbl_dev fn * revert partially, as we don't want the function that public, as docs would be too much (enforced) --------- Co-authored-by: Aarni Koskela <akx@iki.fi> Co-authored-by: Titus von Koeller <9048635+Titus-von-Koeller@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>		2024-09-24 03:40:56 -06:00
..
internal	Add a static cache that offloads to the CPU or other device (#32161 )	2024-08-29 11:51:09 +02:00
main_classes	Decorator for easier tool building (#33439 )	2024-09-18 11:07:51 +02:00
model_doc	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
quantization	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
tasks	Add keypoint-detection task guide (#33274 )	2024-09-16 13:08:31 +02:00
_config.py	[#29174 ] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888 )	2024-04-08 14:21:16 +01:00
_redirects.yml	Docs / Quantization: Redirect deleted page (#31063 )	2024-05-28 18:29:22 +02:00
_toctree.yml	Granitemoe (#33207 )	2024-09-21 01:43:50 +02:00
accelerate.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
add_new_model.md	Remove add-new-model in favor of add-new-model-like (#30424 )	2024-04-24 09:38:18 +02:00
add_new_pipeline.md	add `push_to_hub` to pipeline (#29172 )	2024-04-16 15:34:04 +01:00
agents_advanced.md	Decorator for easier tool building (#33439 )	2024-09-18 11:07:51 +02:00
agents.md	Decorator for easier tool building (#33439 )	2024-09-18 11:07:51 +02:00
attention.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
autoclass_tutorial.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
benchmarks.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
bertology.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
big_models.md	[docs] Big model loading (#29920 )	2024-04-01 18:47:32 -07:00
chat_templating.md	Add explicit example for RAG chat templating (#33503 )	2024-09-17 16:08:05 +01:00
community.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
contributing.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
conversations.md	[docs] change temperature to a positive value (#32077 )	2024-07-23 17:47:51 +01:00
create_a_model.md	Enable HF pretrained backbones (#31145 )	2024-06-06 22:02:38 +01:00
custom_models.md	Updated the custom_models.md changed cross_entropy code (#33118 )	2024-08-26 13:15:43 +02:00
debugging.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
deepspeed.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
fast_tokenizers.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
fsdp.md	[docs] Trainer docs (#28145 )	2023-12-20 10:37:23 -08:00
generation_strategies.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
gguf.md	Add support for GGUF Phi-3 (#31844 )	2024-09-10 13:32:38 +02:00
glossary.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
hpo_train.md	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00
index.md	Granitemoe (#33207 )	2024-09-21 01:43:50 +02:00
installation.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
kv_cache.md	Cache: don't show warning in forward passes when `past_key_values` is None (#33541 )	2024-09-19 12:02:46 +01:00
llm_optims.md	Docs: add more cross-references to the KV cache docs (#33323 )	2024-09-06 10:22:00 +01:00
llm_tutorial_optimization.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
llm_tutorial.md	Add SynCode to llm_tutorial (#32884 )	2024-08-22 15:30:22 +02:00
model_memory_anatomy.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
model_sharing.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
model_summary.md	model_summary.md - Restore link to Harvard's Annotated Transformer. (#29702 )	2024-03-23 18:29:39 -07:00
multilingual.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
notebooks.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
pad_truncation.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
peft.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
perf_hardware.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
perf_infer_cpu.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
perf_infer_gpu_one.md	Sdpa dino v2 (#33403 )	2024-09-21 01:58:00 +01:00
perf_torch_compile.md	fix(docs): Fixed a link in docs (#32274 )	2024-07-29 10:50:43 +01:00
perf_train_cpu_many.md	Update the distributed CPU training on Kubernetes documentation (#32669 )	2024-08-14 09:36:43 -07:00
perf_train_cpu.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
perf_train_gpu_many.md	Update perf_train_gpu_many.md (#31451 )	2024-06-18 11:00:26 -07:00
perf_train_gpu_one.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
perf_train_special.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
perf_train_tpu_tf.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
performance.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
perplexity.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
philosophy.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
pipeline_tutorial.md	Docs: Fixed `whisper-large-v2` model link in docs (#32871 )	2024-08-19 09:50:35 -07:00
pipeline_webserver.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
pr_checks.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
preprocessing.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
quicktour.md	docs: fix broken link (#31370 )	2024-06-12 11:33:00 +01:00
run_scripts.md	[docs] refine the doc for `train with a script` (#33423 )	2024-09-12 10:16:12 -07:00
sagemaker.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
serialization.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
task_summary.md	More fixes for doctest (#30265 )	2024-04-16 11:58:55 +02:00
tasks_explained.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
testing.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
tf_xla.md	fix(docs): Fixed a link in docs (#32274 )	2024-07-29 10:50:43 +01:00
tflite.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
tiktoken.md	Support reading tiktoken tokenizer.model file (#31656 )	2024-09-06 14:24:02 +02:00
tokenizer_summary.md	[docs] Spanish translation of tokenizer_summary.md (#31154 )	2024-06-03 16:52:23 -07:00
torchscript.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
trainer.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
training.md	Added the necessay import of module (#30804 )	2024-05-14 18:45:06 +01:00
troubleshooting.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00