Sam Shleifer
9bdce3a4f9
[s2s] fix lockfile and peg distillation constants ( #7545 )
2020-10-02 15:58:14 -04:00
Sam Shleifer
de4d7b004a
[s2s] Adafactor support for builtin trainer ( #7522 )
2020-10-01 17:27:45 -04:00
Sam Shleifer
d3a9601a11
[s2s] trainer scripts: Remove --run_name, thanks sylvain! ( #7521 )
2020-10-01 17:18:47 -04:00
Sylvain Gugger
bdcc4b78a2
Fix seq2seq example test ( #7518 )
...
* Fix seq2seq example test
* Fix bad copy-paste
* Also save the state
2020-10-01 14:13:29 -04:00
Sylvain Gugger
29baa8fabe
Clean the Trainer state ( #7490 )
...
* Trainer should not modify its TrainingArguments
* Trainer should not modify its TrainingArguments
* Trainer should not modify its TrainingArguments
* Add test of resumed training
* Fixes
* Non multiGPU test
* Clean Trainer state
* Add more to the state
* Documentation
* One last test
* Make resume training test more complete
* Unwanted changes
2020-10-01 13:07:04 -04:00
Sam Shleifer
2a358f45ef
[s2s] fix nltk pytest race condition with FileLock ( #7515 )
2020-10-01 12:51:09 -04:00
Suraj Patil
72d363d979
[examples/s2s] clean up finetune_trainer ( #7509 )
2020-10-01 12:19:29 -04:00
Patrick von Platen
bd2621583b
fix data type ( #7513 )
2020-10-01 18:15:41 +02:00
Patrick von Platen
62f5ae68ec
[Seq2Seq] Fix a couple of bugs and clean examples ( #7474 )
...
* clean T5
* fix t5 tests
* fix index typo
* fix tf common test
* fix examples
* change positional ordering for Bart and FSTM
* add signature test
* clean docs and add tests
* add docs to encoder decoder
* clean docs
* correct two doc strings
* remove sig test for TF Elektra & Funnel
* fix tf t5 slow tests
* fix input_ids to inputs in tf
* Update src/transformers/modeling_bart.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_bart.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* implement lysandre results
* make style
* fix encoder decoder typo
* fix tf slow tests
* fix slow tests
* renaming
* remove unused input
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-10-01 17:38:50 +02:00
Muhammad Harris
a42f62d34f
Train T5 in Tensoflow 2 Community Notebook ( #7428 )
...
* t5 t5 community notebook added
* author link updated
* t5 t5 community notebook added
* author link updated
* new colab link updated
Co-authored-by: harris <muhammad.harris@visionx.io>
2020-10-01 16:54:29 +02:00
Kai Fricke
5fc3b5cba4
Fix Tune progress_reporter kwarg ( #7508 )
2020-10-01 10:34:31 -04:00
Kai Fricke
dabc85d1ba
Report Tune metrics in final evaluation ( #7507 )
2020-10-01 09:52:36 -04:00
Alexandr
9a92afb6d0
Update LayoutLM doc ( #7388 )
...
Co-authored-by: Alexandr Maslov <avmaslov3@gmail.com>
2020-10-01 09:11:42 -04:00
Julien Chaumond
e32390931d
[model_card] distilbert-base-german-cased
2020-10-01 09:08:49 -04:00
Julien Chaumond
9a4e163b58
[model_card] Fix metadata, adalbertojunior/PTT5-SMALL-SUM
2020-10-01 08:54:06 -04:00
Adalberto
8435e10e24
Create README.md ( #7299 )
...
* Create README.md
* language metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-10-01 08:52:28 -04:00
Martin Müller
d727432072
Update README.md ( #7459 )
2020-10-01 08:51:26 -04:00
allenyummy
664da5b077
Create README.md ( #7468 )
2020-10-01 08:50:26 -04:00
ahotrod
f745f61c99
Update README.md ( #7491 )
...
Model now fine-tuned on Transformers 3.1.0, previous out-of-date model was fine-tuned on Transformers 2.3.0.
2020-10-01 08:50:07 -04:00
Abed khooli
6ef7658c0a
Create README.md ( #7349 )
...
Model card for akhooli/personachat-arabic
2020-10-01 08:48:51 -04:00
Bayartsogt Yadamsuren
15ab3f049b
Creating readme for bert-base-mongolian-cased ( #7439 )
...
* Creating readme for bert-base-mongolian-cased
* Update model_cards/bayartsogt/bert-base-mongolian-cased/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-10-01 08:46:27 -04:00
Bayartsogt Yadamsuren
0c2b9fa831
creating readme for bert-base-mongolian-uncased ( #7440 )
2020-10-01 08:45:22 -04:00
Akshay Gupta
381443c096
Update README.md ( #7498 )
...
Making transformers readme more robust.
2020-10-01 07:42:07 -04:00
Lysandre Debut
85d2d8c920
Fix local_files_only for TF ( #6091 )
2020-10-01 05:06:02 -04:00
Sam Shleifer
9e80f972fb
Enable pegasus fp16 by clamping large activations ( #7243 )
...
* Clean clamp
* boom boom
* Take some other changes
* boom boom
* boom boom
* boom boom
* one chg
* fix test
* Use finfo
* style
2020-10-01 04:48:37 -04:00
Sylvain Gugger
be51c1039d
Add forgotten return_dict argument in the docs ( #7483 )
2020-10-01 04:41:29 -04:00
Sam Shleifer
48f23f92a8
[s2sTrainer] test + code cleanup ( #7467 )
2020-10-01 00:33:01 -04:00
Sam Shleifer
097049b81b
Distributed Trainer: 2 little fixes ( #7461 )
...
* reset model.config
* Update src/transformers/trainer.py
* use lower case tensor
* Just tensor change
2020-09-30 22:14:14 -04:00
Julien Chaumond
0acd1ffa09
[doc] rm Azure buttons as not implemented yet
2020-09-30 17:31:08 -04:00
Sam Shleifer
03e46c1de3
[s2s] fix kwargs style ( #7488 )
2020-09-30 17:00:06 -04:00
Sam Shleifer
6fe8a693eb
[s2s] Fix t5 warning for distributed eval ( #7487 )
2020-09-30 16:58:03 -04:00
Sylvain Gugger
4c6728460a
Bump isort version. ( #7484 )
2020-09-30 13:44:58 -04:00
Amanpreet Singh
c031d01023
Seq2SeqDataset: avoid passing src_lang everywhere ( #7470 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-30 13:27:48 -04:00
Suraj Patil
08939cfdf7
[s2strainer] fix eval dataset loading ( #7477 )
2020-09-30 12:39:13 -04:00
Sylvain Gugger
a97a73e0ee
Small QOL improvements to TrainingArguments ( #7475 )
...
* Small QOL improvements to TrainingArguments
* With the self.
2020-09-30 12:12:03 -04:00
Sylvain Gugger
dc7d2daa4c
Alphabetize model lists ( #7478 )
2020-09-30 10:43:58 -04:00
Sylvain Gugger
fdccf82e28
Remove config assumption in Trainer ( #7464 )
...
* Remove config assumption in Trainer
* Initialize for eval
2020-09-30 09:03:25 -04:00
François REMY
cc4eff8087
Make transformers install check positive ( #7473 )
...
When transformers is correctly installed, you should get a positive message ^_^
2020-09-30 07:44:40 -04:00
Pengcheng He
7a0cf0ec93
Add DeBERTa model ( #5929 )
...
* Add DeBERTa model
* Remove dependency of deberta
* Address comments
* Patch DeBERTa
Documentation
Style
* Add final tests
* Style
* Enable tests + nitpicks
* position IDs
* BERT -> DeBERTa
* Quality
* Style
* Tokenization
* Last updates.
* @patrickvonplaten's comments
* Not everything can be a copy
* Apply most of @sgugger's review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Last reviews
* DeBERTa -> Deberta
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-09-30 07:07:30 -04:00
Lysandre Debut
44a93c981f
Number of GPUs for multi-gpu ( #7472 )
2020-09-30 06:53:20 -04:00
Lysandre Debut
886ef35ce6
Fix LXMERT with DataParallel ( #7471 )
2020-09-30 06:41:24 -04:00
Lysandre
35e94c68df
Number of GPUs
2020-09-30 12:29:26 +02:00
Lysandre Debut
056723ad1d
Multi-GPU setup ( #7453 )
2020-09-30 05:53:34 -04:00
Sylvain Gugger
4ba248748f
Get a better error when check_copies fails ( #7457 )
...
* Get a better error when check_copies fails
* Fix tests
2020-09-30 10:05:14 +02:00
Sam Shleifer
bef0175168
remove codecov PR comments ( #7400 )
2020-09-29 15:16:43 -04:00
Sylvain Gugger
a1c2ef7bd0
Add documentation for v3.3.1
2020-09-29 14:31:43 -04:00
Sylvain Gugger
1ba08dc221
Release: v3.3.1
2020-09-29 14:17:34 -04:00
Sylvain Gugger
8546dc55c2
Fix Trainer tests in a multiGPU env ( #7458 )
2020-09-29 14:06:41 -04:00
Sylvain Gugger
d0fd7154c5
Catch import datasets common errors ( #7456 )
2020-09-29 13:42:09 -04:00
Sylvain Gugger
f1220c5fe2
Add a code of conduct ( #7433 )
2020-09-29 13:38:47 -04:00