transformers/tests/models/mega
Tanay Mehta b8def68934
Fix Mega chunking error when using decoder-only model (#25765)
* add: potential fix to mega chunking in decoder only model bug

* add: decoder with chunking test

* add: input_mask passed with input_ids
2023-09-05 21:50:14 +02:00
..
__init__.py Add Mega: Moving Average Equipped Gated Attention (#21766) 2023-03-24 08:17:27 -04:00
test_modeling_mega.py Fix Mega chunking error when using decoder-only model (#25765) 2023-09-05 21:50:14 +02:00