Lysandre Debut
6587cf9f84
Patch *ForCausalLM model ( #9092 )
2020-12-14 00:39:55 -05:00
Julien Plu
51d9c569fa
Fix embeddings resizing in TF models ( #8657 )
...
* Resize the biases in same time than the embeddings
* Trigger CI
* Biases are not reset anymore
* Remove get_output_embeddings + better LM model detection in generation utils
* Apply style
* First test on BERT
* Update docstring + new name
* Apply the new resizing logic to all the models
* fix tests
* Apply style
* Update the template
* Fix naming
* Fix naming
* Apply style
* Apply style
* Remove unused import
* Revert get_output_embeddings
* Trigger CI
* Update num parameters
* Restore get_output_embeddings in TFPretrainedModel and add comments
* Style
* Add decoder resizing
* Style
* Fix tests
* Separate bias and decoder resize
* Fix tests
* Fix tests
* Apply style
* Add bias resizing in MPNet
* Trigger CI
* Apply style
2020-12-13 23:05:24 -05:00
Lysandre Debut
67ff1c314a
Templates overhaul 1 ( #8993 )
2020-12-08 18:00:07 -05:00
Sylvain Gugger
00aa9dbca2
Copyright ( #8970 )
...
* Add copyright everywhere missing
* Style
2020-12-07 18:36:34 -05:00
Julien Plu
dcd3046f98
Better booleans handling in the TF models ( #8777 )
...
* Apply on BERT and ALBERT
* Update TF Bart
* Add input processing to TF BART
* Add input processing for TF CTRL
* Add input processing to TF Distilbert
* Add input processing to TF DPR
* Add input processing to TF Electra
* Add deprecated arguments
* Add input processing to TF XLM
* Add input processing to TF Funnel
* Add input processing to TF GPT2
* Add input processing to TF Longformer
* Add input processing to TF Lxmert
* Apply style
* Add input processing to TF Mobilebert
* Add input processing to TF GPT
* Add input processing to TF Roberta
* Add input processing to TF T5
* Add input processing to TF TransfoXL
* Apply style
* Rebase on master
* Bug fix
* Retry to bugfix
* Retry bug fix
* Fix wrong model name
* Try another fix
* Fix BART
* Fix input precessing
* Apply style
* Put the deprecated warnings in the input processing function
* Remove the unused imports
* Raise an error when len(kwargs)>0
* test ModelOutput instead of TFBaseModelOutput
* Bug fix
* Address Patrick's comments
* Address Patrick's comments
* Address Sylvain's comments
* Add boolean processing for the inputs
* Apply style
* Missing optional
* Fix missing some input proc
* Update the template
* Fix missing inputs
* Missing input
* Fix args parameter
* Trigger CI
* Trigger CI
* Trigger CI
* Address Patrick's and Sylvain's comments
* Replace warn by warning
* Trigger CI
* Fix XLNET
* Fix detection
2020-12-04 09:08:29 -05:00
Patrick von Platen
443f67e887
[PyTorch] Refactor Resize Token Embeddings ( #8880 )
...
* fix resize tokens
* correct mobile_bert
* move embedding fix into modeling_utils.py
* refactor
* fix lm head resize
* refactor
* break lines to make sylvain happy
* add news tests
* fix typo
* improve test
* skip bart-like for now
* check if base_model = get(...) is necessary
* clean files
* improve test
* fix tests
* revert style templates
* Update templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
2020-12-02 19:19:50 +01:00
Julien Plu
29d4992453
New TF model inputs ( #8602 )
...
* Apply on BERT and ALBERT
* Update TF Bart
* Add input processing to TF BART
* Add input processing for TF CTRL
* Add input processing to TF Distilbert
* Add input processing to TF DPR
* Add input processing to TF Electra
* Add input processing for TF Flaubert
* Add deprecated arguments
* Add input processing to TF XLM
* remove unused imports
* Add input processing to TF Funnel
* Add input processing to TF GPT2
* Add input processing to TF Longformer
* Add input processing to TF Lxmert
* Apply style
* Add input processing to TF Mobilebert
* Add input processing to TF GPT
* Add input processing to TF Roberta
* Add input processing to TF T5
* Add input processing to TF TransfoXL
* Apply style
* Rebase on master
* Bug fix
* Retry to bugfix
* Retry bug fix
* Fix wrong model name
* Try another fix
* Fix BART
* Fix input precessing
* Apply style
* Put the deprecated warnings in the input processing function
* Remove the unused imports
* Raise an error when len(kwargs)>0
* test ModelOutput instead of TFBaseModelOutput
* Bug fix
* Address Patrick's comments
* Address Patrick's comments
* Address Sylvain's comments
* Add the new inputs in new Longformer models
* Update the template with the new input processing
* Remove useless assert
* Apply style
* Trigger CI
2020-11-24 13:55:00 -05:00
Stas Bekman
e84786aaa6
consistent ignore keys + make private ( #8737 )
...
* consistent ignore keys + make private
* style
* - authorized_missing_keys => _keys_to_ignore_on_load_missing
- authorized_unexpected_keys => _keys_to_ignore_on_load_unexpected
* move public doc of private attributes to private comment
2020-11-23 12:33:13 -08:00
Sylvain Gugger
dd52804f5f
Remove deprecated ( #8604 )
...
* Remove old deprecated arguments
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
* Remove needless imports
* Fix tests
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
2020-11-17 15:11:29 -05:00
Sylvain Gugger
36a19915ea
Fix model templates ( #8595 )
...
* First fixes
* Fix imports and add init
* Fix typo
* Move init to final dest
* Fix tokenization import
* More fixes
* Styling
2020-11-17 10:35:38 -05:00
Sylvain Gugger
c89bdfbe72
Reorganize repo ( #8580 )
...
* Put models in subfolders
* Styling
* Fix imports in tests
* More fixes in test imports
* Sneaky hidden imports
* Fix imports in doc files
* More sneaky imports
* Finish fixing tests
* Fix examples
* Fix path for copies
* More fixes for examples
* Fix dummy files
* More fixes for example
* More model import fixes
* Is this why you're unhappy GitHub?
* Fix imports in conver command
2020-11-16 21:43:42 -05:00
Sylvain Gugger
1073a2bde5
Switch return_dict
to True
by default. ( #8530 )
...
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Run on the real suite
* Fix slow tests
2020-11-16 11:43:00 -05:00
Lysandre Debut
826f04576f
Model templates encoder only ( #8509 )
...
* Model templates
* TensorFlow
* Remove pooler
* CI
* Tokenizer + Refactoring
* Encoder-Decoder
* Let's go testing
* Encoder-Decoder in TF
* Let's go testing in TF
* Documentation
* README
* Fixes
* Better names
* Style
* Update docs
* Choose to skip either TF or PT
* Code quality fixes
* Add to testing suite
* Update file path
* Cookiecutter path
* Update `transformers` path
* Handle rebasing
* Remove seq2seq from model templates
* Remove s2s config
* Apply Sylvain and Patrick comments
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Last fixes from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-11-13 11:59:30 -05:00