transformers/templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}
Yih-Dar d3cb28886a
Not use -1e4 as attn mask (#17306)
* Use torch.finfo(self.dtype).min

* for GPTNeoX

* for Albert

* For Splinter

* Update src/transformers/models/data2vec/modeling_data2vec_audio.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix -inf used in Bart-like models

* Fix a few remaining -inf

* more fix

* clean up

* For CLIP

* For FSMT

* clean up

* fix test

* Add dtype argument and use it for LayoutLMv3

* update FlaxLongT5Attention

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-06-20 16:16:16 +02:00
__init__.py Fix template init (#17163) 2022-05-10 15:24:23 -04:00
{{cookiecutter.lowercase_modelname}}.mdx Check the repo consistency in model templates test (#15141) 2022-01-14 04:52:38 -05:00
configuration_{{cookiecutter.lowercase_modelname}}.py Happy New Year! (#15094) 2022-01-10 12:05:57 -05:00
configuration.json Add template for adding flax models (#12441) 2021-09-01 09:49:03 +02:00
modeling_{{cookiecutter.lowercase_modelname}}.py Not use -1e4 as attn mask (#17306) 2022-06-20 16:16:16 +02:00
modeling_flax_{{cookiecutter.lowercase_modelname}}.py Fix obvious typos in flax decoder impl (#17279) 2022-05-16 13:08:04 +02:00
modeling_tf_{{cookiecutter.lowercase_modelname}}.py TF: BART compatible with XLA generation (#17479) 2022-06-20 11:07:46 +01:00
test_modeling_{{cookiecutter.lowercase_modelname}}.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_modeling_flax_{{cookiecutter.lowercase_modelname}}.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_modeling_tf_{{cookiecutter.lowercase_modelname}}.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
to_replace_{{cookiecutter.lowercase_modelname}}.py [WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) 2022-05-09 11:28:53 -04:00
tokenization_{{cookiecutter.lowercase_modelname}}.py fix the tokenizer_config.json file for the slow tokenizer when a fast version is available (#15319) 2022-02-01 16:48:25 +01:00
tokenization_fast_{{cookiecutter.lowercase_modelname}}.py Happy New Year! (#15094) 2022-01-10 12:05:57 -05:00