transformers/templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}
Matt 814de8fac7
Overhaul TF serving signatures + dummy inputs (#23234)
* Let's try autodetecting serving sigs

* Don't clobber existing sigs

* Change shapes for multiplechoice models

* Make default dummy inputs smarter too

* Fix missing f-string

* Let's YOLO a serving output too

* Read __class__.__name__ properly

* Don't just pass naked lists in there and expect it to be okay

* Code cleanup

* Update default serving sig

* Clearer error messages

* Further updates to the default serving output

* make fixup

* Update the serving output a bit more

* Cleanups and renames, raise errors appropriately when we can't infer inputs

* More renames

* we're building in a functional context again, yolo

* import DUMMY_INPUTS from the right place

* Support cross-attention in the dummies

* Complete removal of dummy/serving overrides in BERT

* Complete removal of dummy/serving overrides in RoBERTa

* Obliterate lots and lots of serving sig and dummy overrides

* merge type hint changes

* Fix for token_type_ids with vocab_size 1

* Add missing property decorator

* Fix T5 and hopefully some models that take conv inputs

* More signature pruning

* Fix T5's signature

* Fix Wav2Vec2 signature

* Fix LongformerForMultipleChoice input signature

* Fix BLIP and LED

* Better default serving output error handling

* Fix BART dummies

* Fix dummies for cross-attention, esp encoder-decoder models

* Fix visionencoderdecoder signature

* Fix BLIP serving output

* Small tweak to BART dummies

* Cleanup the ugly parameter inspection line that I used in a few places

* committed a breakpoint again

* Move the text_dims check

* Remove blip_text serving_output

* Add decoder_input_ids to the default input sig

* Remove all the manual overrides for encoder-decoder model signatures

* Tweak longformer/led input sigs

* Tweak default serving output

* output.keys() -> output

* make fixup
2023-05-24 17:03:24 +01:00
__init__.py Cleanup quality (#21493) 2023-02-07 12:27:31 -05:00
{{cookiecutter.lowercase_modelname}}.mdx Check the repo consistency in model templates test (#15141) 2022-01-14 04:52:38 -05:00
configuration_{{cookiecutter.lowercase_modelname}}.py Fix some docs what layerdrop does (#23691) 2023-05-23 14:50:40 -04:00
configuration.json Add template for adding flax models (#12441) 2021-09-01 09:49:03 +02:00
modeling_{{cookiecutter.lowercase_modelname}}.py Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759) 2023-02-24 11:21:00 +00:00
modeling_flax_{{cookiecutter.lowercase_modelname}}.py Flax dtype-dependent numerical masking (#21197) 2023-01-19 16:43:42 +00:00
modeling_tf_{{cookiecutter.lowercase_modelname}}.py Overhaul TF serving signatures + dummy inputs (#23234) 2023-05-24 17:03:24 +01:00
test_modeling_{{cookiecutter.lowercase_modelname}}.py Generate: move generation_*.py src files into generation/*.py (#20096) 2022-11-09 15:34:08 +00:00
test_modeling_flax_{{cookiecutter.lowercase_modelname}}.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_modeling_tf_{{cookiecutter.lowercase_modelname}}.py 🚨🚨🚨 TF: Remove TFWrappedEmbeddings (breaking: TF embedding initialization updated for encoder-decoder models) (#19263) 2022-10-11 16:48:03 +01:00
to_replace_{{cookiecutter.lowercase_modelname}}.py [WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) 2022-05-09 11:28:53 -04:00
tokenization_{{cookiecutter.lowercase_modelname}}.py Add missing comment quotes (#17379) 2022-06-29 06:16:36 -04:00
tokenization_fast_{{cookiecutter.lowercase_modelname}}.py Happy New Year! (#15094) 2022-01-10 12:05:57 -05:00