transformers/docs/source
JB (Don) dfa7b580e9
[BERT] Add support for sdpa (#28802)
* Adding SDPA support for BERT

* Using the proper input name for testing model input in inference()

* Adding documentation for SDPA in BERT model page

* Use the stable link for the documentation

* Adding a gate to only call .contiguous() for torch < 2.2.0

* Additions and fixes to the documentation

* Minor updates to documentation

* Adding extra requirements needed for the contiguous() bug

* Adding "Adapted from" in place of the "Copied from"

* Add benchmark speedup tables to the documentation

* Minor fixes to the documentation

* Use ClapText as a replacement for Bert in the Copied-From

* Some more fixes for the fix-copies references

* Overriding the test_eager_matches_sdpa_generate in bert tests to not load with low_cpu_mem_usage

[test all]

* Undo changes to separate test

* Refactored SDPA self attention code for KV projections

* Change use_sdpa to attn_implementation

* Fix test_sdpa_can_dispatch_on_flash by preparing input (required for MultipleChoice models)
2024-04-26 16:23:44 +01:00
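
The version gate mentioned in the commit message ("only call .contiguous() for torch < 2.2.0") could look roughly like the sketch below. This is an illustration, not the code from the PR: the helper names `parse_release` and `needs_contiguous` are hypothetical, and the real condition may include further requirements (the message mentions "extra requirements needed for the contiguous() bug") that are not modelled here.

```python
import re
from typing import Tuple


def parse_release(version: str) -> Tuple[int, ...]:
    """Return the leading numeric components of a version string.

    Local build tags and pre-release suffixes (e.g. '+cu118', 'a0') are
    ignored, so a 2.2.0 pre-release is treated like 2.2.0 itself.
    """
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def needs_contiguous(torch_version: str) -> bool:
    """Hypothetical gate: True when the version is older than 2.2.0."""
    return parse_release(torch_version) < (2, 2)
```

In real code the version string would come from `torch.__version__`, and the result would decide whether the query, key and value tensors are made contiguous before the attention call.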
de Remove add-new-model in favor of add-new-model-like (#30424) 2024-04-24 09:38:18 +02:00
en [BERT] Add support for sdpa (#28802) 2024-04-26 16:23:44 +01:00
es [docs] Spanish translation of pipeline_tutorial.md (#30252) 2024-04-25 12:18:06 -07:00
fr [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00
hi More fixes for doctest (#30265) 2024-04-16 11:58:55 +02:00
it Remove add-new-model in favor of add-new-model-like (#30424) 2024-04-24 09:38:18 +02:00
ja Remove add-new-model in favor of add-new-model-like (#30424) 2024-04-24 09:38:18 +02:00
ko Remove add-new-model in favor of add-new-model-like (#30424) 2024-04-24 09:38:18 +02:00
ms Remove old TF port docs (#30426) 2024-04-23 16:06:20 +01:00
pt 🚨🚨🚨Deprecate evaluation_strategy to eval_strategy🚨🚨🚨 (#30190) 2024-04-18 12:49:43 -04:00
te [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh Remove add-new-model in favor of add-new-model-like (#30424) 2024-04-24 09:38:18 +02:00
_config.py [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00