Julien Plu
1243ee7d0c
Full rework of the TF input/output embeddings and bias resizing ( #9193 )
...
* Start rework resizing
* Rework bias/decoder resizing
* Full resizing rework
* Full resizing rework
* Start to update the models with the new approach
* Finish to update the models
* Update all the tests
* Update the template
* Fix tests
* Fix tests
* Test a new approach
* Refactoring
* Refactoring
* Refactoring
* New rework
* Rework BART
* Rework bert+blenderbot
* Rework CTRL
* Rework Distilbert
* Rework DPR
* Rework Electra
* Rework Flaubert
* Rework Funnel
* Rework GPT2
* Rework Longformer
* Rework Lxmert
* Rework marian+mbart
* Rework mobilebert
* Rework mpnet
* Rework openai
* Rework pegasus
* Rework Roberta
* Rework T5
* Rework xlm+xlnet
* Rework template
* Fix TFT5EncoderOnly + DPRs
* Restore previous methods
* Fix Funnel
* Fix CTRL and TransforXL
* Apply style
* Apply Sylvain's comments
* Restore a test in DPR
* Address the comments
* Fix bug
* Apply style
* remove unused import
* Fix test
* Forgot a method
* missing test
* Trigger CI
* naming update
* Rebase
* Trigger CI
2021-01-11 06:27:28 -05:00
Julien Plu
812045adcc
New serving ( #9419 )
...
* Add a serving method
* Add albert
* Add serving for BERT and BART
* Add more models
* Finish the serving addition
* Temp fix
* Restore DPR
* Fix funnel attribute
* Fix attributes GPT2
* Fix OpenAIGPT attribute
* Fix T5 attributes
* Fix Bart attributes
* Fix TransfoXL attributes
* Add versioning
* better test
* Update template
* Fix Flaubert
* Fix T5
* Apply style
* Remove unused imports
* Deactivate extra parameters
* Remove too long test + saved_model default to False
* Ignore the saved model test for some models
* Fix some inputs
* Fix mpnet serving
* Trigger CI
* Address all comments
2021-01-07 11:48:49 +01:00
Patrick von Platen
eef66035a2
[PyTorch Bart] Split Bart into different models ( #9343 )
...
* first try
* remove old template
* finish bart
* finish mbart
* delete unnecessary line
* init pegasus
* save intermediate
* correct pegasus
* finish pegasus
* remove cookie cutter leftover
* add marian
* finish blenderbot
* replace in file
* correctly split blenderbot
* delete "old" folder
* correct "add statement"
* adapt config for tf comp
* correct configs for tf
* remove ipdb
* fix more stuff
* fix mbart
* push pegasus fix
* fix mbart
* more fixes
* fix research projects code
* finish docs for bart, mbart, and marian
* delete unnecessary file
* correct attn typo
* correct configs
* remove pegasus for seq class
* correct peg docs
* correct peg docs
* finish configs
* further improve docs
* add copied from statements to mbart
* fix copied from in mbart
* add copy statements to marian
* add copied from to marian
* add pegasus copied from
* finish pegasus
* finish copied from
* Apply suggestions from code review
* make style
* backward comp blenderbot
* apply lysandres and sylvains suggestions
* apply suggestions
* push last fixes
* fix docs
* fix tok tests
* fix imports code style
* fix doc
2021-01-05 22:00:05 +01:00
Patrick von Platen
cbe63949d7
Model Templates for Seq2Seq ( #9251 )
...
* adapt cookie cutter
* fix copy past statement
* delete copy statements for now
* remove unused import from template
* make doc rst
* correct config docstring
* correct training
* correct inputs processing tf enc dec
* make style
* adapt templates
* clean tabs
* correct tensor -> Tensor naming
* correct indent
* correct templates
* fix the test
* break lines to avoid > 119
* Apply suggestions from code review
2020-12-22 23:41:20 +01:00
Patrick von Platen
e9d77ccd5a
[EncoderDecoder] Make tests more aggressive ( #9256 )
...
* add tests
* make style and fix bart bug
* fix bart past key value edge case
* correct tf bart test
* fix gpt2 tf
* fix t5 test
2020-12-22 17:00:04 +01:00
Patrick von Platen
abc573f51a
[TF Bart] Refactor TFBart ( #9029 )
...
* reorder file
* delete unnecesarry function
* make style
* save intermediate
* fix attention masks
* correct tf bart past key values
* solve merge conflict bug
* correct tensor dims
* save intermediate tf
* change attn layer
* fix typo re-order past
* inputs_embeds
* make fix copies
* finish tests
* fix graph mode
* appyl lysandres suggestions
2020-12-15 17:31:28 +01:00
Julien Plu
df3f4d2aef
Fix T5 and BART for TF ( #9063 )
...
* Fix T5 for graphe compilation+execution
* Fix BART
* Fix import
* Fix naming
* fix attribute name
* Oops
* fix import
* fix tests
* fix tests
* Update test
* Add mising import
* Address Patrick's comments
* Style
* Address Patrick's comment
2020-12-14 18:47:00 +01:00
Julien Plu
51d9c569fa
Fix embeddings resizing in TF models ( #8657 )
...
* Resize the biases in same time than the embeddings
* Trigger CI
* Biases are not reset anymore
* Remove get_output_embeddings + better LM model detection in generation utils
* Apply style
* First test on BERT
* Update docstring + new name
* Apply the new resizing logic to all the models
* fix tests
* Apply style
* Update the template
* Fix naming
* Fix naming
* Apply style
* Apply style
* Remove unused import
* Revert get_output_embeddings
* Trigger CI
* Update num parameters
* Restore get_output_embeddings in TFPretrainedModel and add comments
* Style
* Add decoder resizing
* Style
* Fix tests
* Separate bias and decoder resize
* Fix tests
* Fix tests
* Apply style
* Add bias resizing in MPNet
* Trigger CI
* Apply style
2020-12-13 23:05:24 -05:00
Julien Plu
29d4992453
New TF model inputs ( #8602 )
...
* Apply on BERT and ALBERT
* Update TF Bart
* Add input processing to TF BART
* Add input processing for TF CTRL
* Add input processing to TF Distilbert
* Add input processing to TF DPR
* Add input processing to TF Electra
* Add input processing for TF Flaubert
* Add deprecated arguments
* Add input processing to TF XLM
* remove unused imports
* Add input processing to TF Funnel
* Add input processing to TF GPT2
* Add input processing to TF Longformer
* Add input processing to TF Lxmert
* Apply style
* Add input processing to TF Mobilebert
* Add input processing to TF GPT
* Add input processing to TF Roberta
* Add input processing to TF T5
* Add input processing to TF TransfoXL
* Apply style
* Rebase on master
* Bug fix
* Retry to bugfix
* Retry bug fix
* Fix wrong model name
* Try another fix
* Fix BART
* Fix input precessing
* Apply style
* Put the deprecated warnings in the input processing function
* Remove the unused imports
* Raise an error when len(kwargs)>0
* test ModelOutput instead of TFBaseModelOutput
* Bug fix
* Address Patrick's comments
* Address Patrick's comments
* Address Sylvain's comments
* Add the new inputs in new Longformer models
* Update the template with the new input processing
* Remove useless assert
* Apply style
* Trigger CI
2020-11-24 13:55:00 -05:00
Sylvain Gugger
c89bdfbe72
Reorganize repo ( #8580 )
...
* Put models in subfolders
* Styling
* Fix imports in tests
* More fixes in test imports
* Sneaky hidden imports
* Fix imports in doc files
* More sneaky imports
* Finish fixing tests
* Fix examples
* Fix path for copies
* More fixes for examples
* Fix dummy files
* More fixes for example
* More model import fixes
* Is this why you're unhappy GitHub?
* Fix imports in conver command
2020-11-16 21:43:42 -05:00
Sylvain Gugger
1073a2bde5
Switch return_dict
to True
by default. ( #8530 )
...
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Run on the real suite
* Fix slow tests
2020-11-16 11:43:00 -05:00
Sam Shleifer
566b083eb1
TFMarian, TFMbart, TFPegasus, TFBlenderbot ( #7987 )
...
* Start plumbing
* Marian close
* Small stubs for all children
* Fixed bart
* marian working
* pegasus test is good, but failing
* Checkin tests
* More model files
* Subtle marian, pegasus integration test failures
* Works well
* rm print
* boom boom
* Still failing model2doc
* merge master
* Equivalence test failing, all others fixed
* cleanup
* Fix embed_scale
* Cleanup marian pipeline test
* Undo extra changes
* Smaller delta
* Cleanup model testers
* undo delta
* fix tests import structure
* cross test decorator
* Cleaner set_weights
* Respect authorized_unexpected_keys
* No warnings
* No warnings
* style
* Nest tf import
* black
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* functional dropout
* fixup
* Fixup
* style_doc
* embs
* shape list
* delete slow force_token_id_to_be_generated func
* fixup
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-10-30 11:23:16 -04:00
Thomas Wolf
3a40cdf58d
[tests|tokenizers] Refactoring pipelines test backbone - Small tokenizers improvements - General tests speedups ( #7970 )
...
* WIP refactoring pipeline tests - switching to fast tokenizers
* fix dialog pipeline and fill-mask
* refactoring pipeline tests backbone
* make large tests slow
* fix tests (tf Bart inactive for now)
* fix doc...
* clean up for merge
* fixing tests - remove bart from summarization until there is TF
* fix quality and RAG
* Add new translation pipeline tests - fix JAX tests
* only slow for dialog
* Fixing the missing TF-BART imports in modeling_tf_auto
* spin out pipeline tests in separate CI job
* adding pipeline test to CI YAML
* add slow pipeline tests
* speed up tf and pt join test to avoid redoing all the standalone pt and tf tests
* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* Update src/transformers/pipelines.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/pipelines.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/testing_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add require_torch and require_tf in is_pt_tf_cross_test
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-10-23 15:58:19 +02:00
Sam Shleifer
829842159e
Add TFBartForConditionalGeneration ( #5411 )
...
* half done
* doc improvement
* Cp test file
* brokedn
* broken test
* undo some mess
* ckpt
* borked
* Halfway
* 6 passing
* boom boom
* Much progress but still 6
* boom boom
* merged master
* 10 passing
* boom boom
* Style
* no t5 changes
* 13 passing
* Integration test failing, but not gibberish
* Frustrated
* Merged master
* 4 fail
* 4 fail
* fix return_dict
* boom boom
* Still only 4
* prepare method
* prepare method
* before delete classif
* Skip tests to avoid adding boilerplate
* boom boom
* fast tests passing
* style
* boom boom
* Switch to supporting many input types
* remove FIXMENORM
* working
* Fixed past_key_values/decoder_cached_states confusion
* new broken test
* Fix attention mask kwarg name
* undo accidental
* Style and reviewers
* style
* Docs and common tests
* Cleaner assert messages
* copy docs
* style issues
* Sphinx fix
* Simplify caching logic
* test does not require torch
* copy _NoLayerEmbedTokens
* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update tests/test_modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Line length and dont document None
* Add pipeline test coverage
* assert msg
* At parity
* Assert messages
* mark slow
* Update compile test
* back in init
* Merge master
* Fix tests
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-10-21 13:10:16 +02:00