transformers/templates
Yih-Dar d3cb28886a
Not use -1e4 as attn mask (#17306)
* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-20 16:16:16 +02:00
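
A minimal sketch of the change this commit describes, assuming a standard 0/1 padding mask: masked positions get the most negative finite value of the working dtype (`torch.finfo(dtype).min`) rather than a fixed `-1e4` or `-inf`. The helper name `build_additive_mask` and the tensor shapes are hypothetical, chosen for illustration only, and are not taken from the repository.

```python
import torch


def build_additive_mask(attention_mask: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
    """Hypothetical helper: convert a 0/1 padding mask of shape (batch, seq_len)
    into an additive mask of shape (batch, 1, 1, seq_len)."""
    # Add singleton dims so the mask broadcasts over heads and query positions.
    extended = attention_mask[:, None, None, :].to(dtype)
    # 1 (keep) -> 0.0; 0 (pad) -> most negative finite value of `dtype`.
    return (1.0 - extended) * torch.finfo(dtype).min


mask = torch.tensor([[1, 1, 0]])              # last token is padding
additive = build_additive_mask(mask, torch.float32)

scores = torch.zeros(1, 1, 3, 3) + additive   # dummy attention scores
probs = scores.softmax(dim=-1)                # padded position gets zero weight
```

Because the fill value is finite, a fully masked row still yields finite softmax outputs, whereas `-inf` would produce NaN; and because it is tied to the dtype, it stays far below any realistic attention score, which a fixed `-1e4` does not guarantee.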
adding_a_missing_tokenization_test Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
adding_a_new_example_script Add examples telemetry (#17552) 2022-06-07 11:57:52 -04:00
adding_a_new_model Not use -1e4 as attn mask (#17306) 2022-06-20 16:16:16 +02:00