transformers/templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}
Younes Belkada 285a48011d
Fix gradient checkpointing + fp16 autocast for most models (#24247)
* fix gc bug

* continue PoC on OPT

* fixes

* 🤯

* fix tests

* remove pytest.mark

* fixup

* forward contrib credits from discussions

* forward contrib credits from discussions

* reverting changes on untouched files.

---------

Co-authored-by: zhaoqf123 <zhaoqf123@users.noreply.github.com>
Co-authored-by: 7eu7d7 <7eu7d7@users.noreply.github.com>
2023-06-21 17:04:59 +02:00
..
__init__.py Cleanup quality (#21493) 2023-02-07 12:27:31 -05:00
{{cookiecutter.lowercase_modelname}}.mdx Check the repo consistency in model templates test (#15141) 2022-01-14 04:52:38 -05:00
configuration_{{cookiecutter.lowercase_modelname}}.py Fix some docs what layerdrop does (#23691) 2023-05-23 14:50:40 -04:00
configuration.json Add template for adding flax models (#12441) 2021-09-01 09:49:03 +02:00
modeling_{{cookiecutter.lowercase_modelname}}.py Fix gradient checkpointing + fp16 autocast for most models (#24247) 2023-06-21 17:04:59 +02:00
modeling_flax_{{cookiecutter.lowercase_modelname}}.py Flax dtype-dependent numerical masking (#21197) 2023-01-19 16:43:42 +00:00
modeling_tf_{{cookiecutter.lowercase_modelname}}.py Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
test_modeling_{{cookiecutter.lowercase_modelname}}.py Generate: move generation_*.py src files into generation/*.py (#20096) 2022-11-09 15:34:08 +00:00
test_modeling_flax_{{cookiecutter.lowercase_modelname}}.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_modeling_tf_{{cookiecutter.lowercase_modelname}}.py TF: standardize test_model_common_attributes for language models (#23457) 2023-06-13 17:51:37 +01:00
to_replace_{{cookiecutter.lowercase_modelname}}.py [WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) 2022-05-09 11:28:53 -04:00
tokenization_{{cookiecutter.lowercase_modelname}}.py Add missing comment quotes (#17379) 2022-06-29 06:16:36 -04:00
tokenization_fast_{{cookiecutter.lowercase_modelname}}.py Happy New Year! (#15094) 2022-01-10 12:05:57 -05:00