transformers/tests/models/gemma2
Raushan Turganbay 7f552e28e0
Gemma2 and flash-attention (#32188)
* enable flash-attn & static cache

* this works, not the prev

* fix for sliding window layers

* not needed anymore
2024-07-31 10:33:38 +05:00
__init__.py Add gemma 2 (#31659) 2024-06-27 17:36:19 +02:00
test_modeling_gemma2.py Gemma2 and flash-attention (#32188) 2024-07-31 10:33:38 +05:00