transformers/tests/models/bamba
Garrett Goon 390f153469
Add padding-free to bamba (#35861)
* add seq_idx and fa kwargs

* update tests

* docs and grad ckpt support

* fmt

* better names

* test_raise_missing_padding_free_kwarg_errs

* + seq_idx in doc strings

* padding free training docs

* add link to pr plots

* raise err on attn_mask with padding free

* rm raising missing padding free err test

* BambaFlashAttentionKwargs

* run modular util for modular_granitemoehybrid.py
2025-05-20 17:13:59 +02:00
..
__init__.py Add the Bamba Model (#34982) 2024-12-18 20:18:17 +01:00
test_modeling_bamba.py Add padding-free to bamba (#35861) 2025-05-20 17:13:59 +02:00