transformers/docs/source
Younes Belkada ae9a344cce
[Mistral] Add Flash Attention-2 support for mistral (#26464)
* add FA-2 support for mistral

* fixup

* add sliding windows

* fixing few nits

* v1 slicing cache - logits do not match

* add comment

* fix bugs

* more mem efficient

* add warning once

* add warning once

* oops

* fixup

* more comments

* copy

* add safety checker

* fixup

* Update src/transformers/models/mistral/modeling_mistral.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copied from

* up

* raise when padding side is right

* fixup

* add doc + few minor changes

* fixup

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-10-03 13:44:46 +02:00
..
de [i18n-DE] contribute chapter (#26481) 2023-10-02 09:56:40 -07:00
en [Mistral] Add Flash Attention-2 support for mistral (#26464) 2023-10-03 13:44:46 +02:00
es docs: update link huggingface map (#26077) 2023-09-11 12:57:04 +01:00
fr Fix typos (#25936) 2023-09-04 11:15:12 +01:00
it [Docs] Fix un-rendered images (#25561) 2023-08-17 12:08:11 +02:00
ja Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
ko 🌐 [i18n-KO] Translated tokenizer_summary.md to Korean (#26243) 2023-10-02 09:55:33 -07:00
ms Add BROS (#23190) 2023-09-14 18:02:37 +01:00
pt docs: update link huggingface map (#26077) 2023-09-11 12:57:04 +01:00
zh Fix small typo README.md (#25934) 2023-09-06 14:07:29 +01:00
_config.py Adding evaluate to the list of libraries required in generated notebooks (#20850) 2022-12-21 14:04:08 +01:00