Patrick von Platen
068e6b5edd
make files independent ( #8267 )
2020-11-03 21:13:33 +01:00
Patrick von Platen
3c682ea15c
[Examples] Allow EncoderDecoderModels to be trained with Seq2Seq ( #7809 )
...
* Make Seq2Seq Trainer more similar to Trainer
* fix typo
* fix seq2seq trainer
* remove from tests
* remove lock
* remove train files
* delete test files
* correct typo
* check at init
* make sure trainer is not slowed down on TPU
* correct isort
* remove use cache
* fix use cache
* add last use chache = false
2020-10-23 23:05:51 +02:00
Stas Bekman
8b38173398
[seq2seq testing] multigpu test run via subprocess ( #7281 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-10-21 17:20:53 -04:00
Suraj Patil
06a973fd2a
[s2s] configure lr_scheduler from command line ( #7641 )
2020-10-08 13:06:35 -04:00
Suraj Patil
99cb924bfb
[s2s] add config params like Dropout in Seq2SeqTrainingArguments ( #7532 )
2020-10-04 12:42:30 -04:00
Sam Shleifer
de4d7b004a
[s2s] Adafactor support for builtin trainer ( #7522 )
2020-10-01 17:27:45 -04:00
Sylvain Gugger
bdcc4b78a2
Fix seq2seq example test ( #7518 )
...
* Fix seq2seq example test
* Fix bad copy-paste
* Also save the state
2020-10-01 14:13:29 -04:00
Suraj Patil
72d363d979
[examples/s2s] clean up finetune_trainer ( #7509 )
2020-10-01 12:19:29 -04:00
Sam Shleifer
48f23f92a8
[s2sTrainer] test + code cleanup ( #7467 )
2020-10-01 00:33:01 -04:00
Suraj Patil
08939cfdf7
[s2strainer] fix eval dataset loading ( #7477 )
2020-09-30 12:39:13 -04:00
Suraj Patil
9e68d075a4
Seq2SeqTrainer ( #6769 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-24 18:46:58 -04:00