Sylvain Gugger
f13f1f8fb8
Test checkpointing ( #11682 )
...
* Add test and see where CI is unhappy
* Load with strict=False
2021-05-11 12:02:48 -04:00
Julien Plu
d9b286272c
Fix TF Roberta for mixed precision training ( #11675 )
2021-05-11 12:01:03 -04:00
Sylvain Gugger
a135f59536
Auto modelcard ( #11599 )
...
* Autogenerate model cards from the Trainer
* ModelCard deprecated
* Fix test
* Style
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Address review comments
* Quality
* With all metadata
* Metadata
* Post-merge conflict mess
* Data args and all examples
* Default license and languages when possible
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-05-11 11:30:34 -04:00
Matt
b3429ab678
Grammar and style edits for the frontpage README ( #11679 )
...
* Grammar and style edits for the frontpage README
* Going all-in on em-dashes because you only live once
* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-05-11 15:49:34 +01:00
nxznm
901153c61e
Fix docstring of description about input_ids ( #11672 )
2021-05-11 08:12:02 -04:00
Jonathan Chang
64232bc0df
Add --text_column to run_summarization_no_trainer ( #11673 )
2021-05-11 07:58:38 -04:00
Julien Plu
024cd19bb7
Add MacOS TF version ( #11674 )
...
Co-authored-by: Julien Plu <jplu@argos.local>
2021-05-11 05:42:21 -04:00
Pavel Soriano
9120ae7d66
Fixes NoneType exception when topk is larger than one coupled with a small context in the Question-Answering pipeline ( #11628 )
...
* added fix to decode function. added test to qa pipeline tests
* completed topk docstring
* fixed formatting with black
* applied style_doc to fix line length
2021-05-10 13:28:10 -04:00
Patrick von Platen
dcb0e61430
push ( #11667 )
2021-05-10 17:38:17 +01:00
Sylvain Gugger
05a930671f
Save scaler state dict when checkpointing ( #11663 )
2021-05-10 10:58:30 -04:00
Matt
ef8d32c5ea
Fix suggested by @bhadreshpsavani ( #11660 )
2021-05-10 14:28:04 +01:00
Vasudev Gupta
575c979144
Update community.md ( #11654 )
2021-05-10 09:48:21 +01:00
Tanmay Laud
f7f872955d
Big Bird Fast Tokenizer implementation ( #11075 )
...
* Added Big Bird Fast Tokenizer initial file
* style fixes
* flake fixes
* Added big bird fast tokenizer to init files
* Added big bird fast to Auto tokenization
* fix styles
* minor quality fixes
* Added initial test code
* Fix SpmConverter when precompiled_charsmap doesn't exist
* fixed post processor
* minor style fix
* minor fix input names
* Actually fix identity normalization
* style
* Added token type ids to fast tokenizer
* style
* flake fix
* fix copies
Co-authored-by: Anthony MOI <m.anthony.moi@gmail.com>
2021-05-10 03:01:23 -04:00
Bhavitvya Malik
80da304a0f
updated user permissions based on umask ( #11119 )
...
* updated user permissions based on umask
* updated user permissions based on umask
* changes as per suggestions
* minor changes
2021-05-10 02:45:29 -04:00
Quentin Lhoest
1a0b41781d
Update requirements.txt ( #11634 )
2021-05-10 11:19:52 +05:30
NielsRogge
f785c51692
Update code example ( #11631 )
...
* Update code example
* Code review
2021-05-10 11:18:43 +05:30
Tommy Chiang
7e406f4a65
[Examples] Fix invalid links after reorg ( #11650 )
2021-05-10 11:16:48 +05:30
Tommy Chiang
f2ffcaf49f
[Examples] Check key exists in datasets first ( #11503 )
2021-05-09 15:42:38 -04:00
Stas Bekman
ba0d50f214
[examples] fix sys.path in conftest.py ( #11636 )
...
* restore conftest.py
* fix conftest and make copies
* remove unneeded parts
* remove unwanted files
2021-05-07 14:44:22 -07:00
Stas Bekman
cd9b8d7efe
[self-push CI] sync with self-scheduled ( #11637 )
...
forgot to add the missing `libaio-dev` to this workflow
2021-05-07 14:06:33 -07:00
Lysandre Debut
da37eb8e43
Reduce to 1 worker and set timeout for GPU TF tests ( #11633 )
2021-05-07 11:55:20 -04:00
Lysandre Debut
39084ca663
Add the ImageClassificationPipeline ( #11598 )
...
* Add the ImageClassificationPipeline
* Code review
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
* Have `load_image` at the module level
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
2021-05-07 08:08:40 -04:00
Patrick von Platen
e7bff0aabe
make fix copy ( #11627 )
2021-05-07 07:48:51 -04:00
Vasudev Gupta
dc3f6758cf
Add BigBirdPegasus ( #10991 )
...
* init bigbird pegasus
* add debugging nb ; update config
* init conversion
* update conversion script
* complete conversion script
* init forward()
* complete forward()
* add tokenizer
* add some slow tests
* commit current
* fix copies
* add docs
* add conversion script for bigbird-roberta-summarization
* remove TODO
* small fixups
* correct tokenizer
* add bigbird core for now
* fix config
* fix more
* revert pegasus-tokenizer back
* make style
* everything working for pubmed; yayygit status
* complete tests finally
* remove bigbird pegasus tok
* correct tokenizer
* correct tests
* add tokenizer files
* finish make style
* fix test
* update
* make style
* fix tok utils base file
* make fix-copies
* clean a bit
* small update
* fix some suggestions
* add to readme
* fix a bit, clean tests
* fix more tests
* Update src/transformers/__init__.py
* Update src/transformers/__init__.py
* make fix-copies
* complete attn switching, auto-padding left
* make style
* fix auto-padding test
* make style
* fix batched attention tests
* put tolerance at 1e-1 for stand-alone decoder test
* fix docs
* fix tests
* correct slow tokenizer conversion
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* complete remaining suggestions
* fix test
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-05-07 09:27:43 +02:00
Jonathan Chang
6f40e31766
Fix comment in run_clm_no_trainer.py ( #11624 )
2021-05-07 12:32:30 +05:30
Sylvain Gugger
33fd83bc01
Fix RNG saves in distributed mode. ( #11620 )
...
* Fix RNG saves in distributed mode.
* Update src/transformers/trainer.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
2021-05-06 17:14:12 -04:00
Stas Bekman
619200cc42
[cuda ext tests] fixing tests ( #11619 )
...
* fixing tests
* cleanup
2021-05-06 13:35:28 -07:00
Patrick von Platen
44c5621db0
fix tests ( #11615 )
2021-05-06 20:42:51 +02:00
Sylvain Gugger
7eee950ac3
Re-styling in seq2seq attention ( #11613 )
2021-05-06 14:24:19 -04:00
Eldar Kurtic
cf409e5594
Fix docstring typo ( #11611 )
2021-05-06 17:09:28 +05:30
Vipul Raheja
f594090a93
fix typo in command ( #11605 )
2021-05-06 12:32:54 +05:30
Lysandre Debut
079557c1c5
Fix Python version ( #11607 )
2021-05-06 02:50:11 -04:00
baeseongsu
c1780ce7a4
fix head_mask for albert encoder part(AlbertTransformer
) ( #11596 )
...
* fix head mask for albert encoder part
* fix head_mask for albert encoder part
2021-05-06 02:18:02 -04:00
Mats Sjöberg
864c1dfe34
Accept tensorflow-rocm package when checking TF availability ( #11595 )
2021-05-05 14:44:29 -04:00
Patrick von Platen
3e3e41ae20
Pytorch - Lazy initialization of models ( #11471 )
...
* lazy_init_weights
* remove ipdb
* save int
* add necessary code
* remove unnecessary utils
* Update src/transformers/models/t5/modeling_t5.py
* clean
* add tests
* correct
* finish tests
* finish tests
* fix some more tests
* fix xlnet & transfo-xl
* fix more tests
* make sure tests are independent
* fix tests more
* finist tests
* final touches
* Update src/transformers/modeling_utils.py
* Apply suggestions from code review
* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* clean tests
* give arg positive name
* add more mock weights to xlnet
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
2021-05-05 17:22:20 +02:00
Lysandre
8fa8e19429
Skip Funnel test
2021-05-05 12:38:01 +02:00
Deepali
83e59d8e0b
add importlib_metadata and huggingface_hub as dependency in the conda recipe ( #11591 )
...
* add importlib_metadata as dependency (#11490 )
Co-authored-by: Deepali Chourasia <deepch23@us.ibm.com>
* add huggingface_hub dependency
Co-authored-by: Deepali Chourasia <deepch23@us.ibm.com>
2021-05-05 03:36:18 -04:00
Stas Bekman
bf0dfa98d3
copies need to be fixed too ( #11585 )
2021-05-05 03:35:15 -04:00
Stas Bekman
c065025c47
[trainer] document resume randomness ( #11588 )
...
* document resume randomness
* fix link
* reword
* fix
* reword
* style
2021-05-04 14:17:11 -07:00
Sylvain Gugger
6b241e0e3b
Reproducible checkpoint ( #11582 )
...
* Set generator in dataloader
* Use generator in all random samplers
* Checkpoint all RNG states
* Final version
* Quality
* Test
* Address review comments
* Quality
* Remove debug util
* Add python and numpy RNGs
* Split states in different files in distributed
* Quality
* local_rank for TPUs
* Only use generator when accepted
* Add test
* Set seed to avoid flakiness
* Make test less flaky
* Quality
2021-05-04 16:20:56 -04:00
Patrick Fernandes
0afe4a90f9
[Flax] Add Electra models ( #11426 )
...
* add electra model to flax
* Remove Electra Next Sentence Prediction model added by mistake
* fix parameter sharing and loosen equality threshold
* fix styling issues
* add mistaken removen imports
* fix electra table
* Add FlaxElectra to automodels and fixe docs
* fix issues pointed out the PR
* fix flax electra to comply with latest changes
* remove stale class
* add copied from
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-05-04 20:56:09 +02:00
Philipp Schmid
226e74b610
Removes SageMakerTrainer code but keeps class as wrapper ( #11587 )
...
* removed all old code
* make quality
2021-05-04 14:31:18 -04:00
Patrick von Platen
084a187da3
[FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py ( #11470 )
...
* add flax roberta
* make style
* correct initialiazation
* modify model to save weights
* fix copied from
* fix copied from
* correct some more code
* add more roberta models
* Apply suggestions from code review
* merge from master
* finish
* finish docs
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
2021-05-04 19:57:59 +02:00
Sylvain Gugger
2ce0fb84cc
Make quality scripts work when one backend is missing. ( #11573 )
...
* Make quality scripts work when one backend is missing.
* Check env variable is properly set
* Add default
* With print statements
* Fix typo
* Set env variable
* Remove debug code
2021-05-04 09:53:44 -04:00
Lysandre Debut
09b0bcfea9
Enable added tokens ( #11325 )
...
* Fix tests
* Reorganize
* Update tests/test_modeling_mobilebert.py
* Remove unnecessary addition
2021-05-04 08:13:57 -04:00
abhishek thakur
c40c7e213b
Add multi-class, multi-label and regression to transformers ( #11012 )
...
* add to bert
* review comments
* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* self.config.problem_type
* fix style
* fix
* fin
* fix
* update doc
* fix
* test
* Test more problem types
* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix
* remove
* fix
* quality
* make fix-copies
* remove test
Co-authored-by: abhishek thakur <abhishekkrthakur@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2021-05-04 02:23:40 -04:00
Stas Bekman
7c622482e8
fix resize_token_embeddings ( #11572 )
2021-05-03 13:12:06 -07:00
Sylvain Gugger
fe82b1bfa0
Update training tutorial ( #11533 )
...
* Update training tutorial
* Apply suggestions from code review
Co-authored-by: Hamel Husain <hamelsmu@github.com>
* Address review comments
* Update docs/source/training.rst
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* More review comments
* Last review comments
Co-authored-by: Hamel Husain <hamelsmu@github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-05-03 13:18:46 -04:00
Sylvain Gugger
f4c9a7e62e
Accumulate opt state dict on do_rank 0 ( #11481 )
2021-05-03 13:18:27 -04:00
Nicolas Patry
1e8e06862f
Fixes a useless warning. ( #11566 )
...
Fixes #11525
2021-05-03 18:48:13 +02:00