Sylvain Gugger
a0a027c2ed
Add DistributedSamplerWithLoop ( #10746 )
...
* Add DistributedSamplerWithLoop
* Fix typo
* Test and small fix
2021-03-16 11:22:39 -04:00
Sylvain Gugger
3ced9b3eb9
Check layer types for Optimizer construction ( #10598 )
...
* Check layer types for Optimizer construction
* Duplicate class
2021-03-08 16:40:11 -05:00
Sylvain Gugger
821d518e03
Revert "Tests"
...
This reverts commit b35e7b68ca
.
2021-03-08 16:05:55 -05:00
Sylvain Gugger
4196bfeda0
Revert "Style"
...
This reverts commit a8ec52efc2
.
2021-03-08 16:05:52 -05:00
Sylvain Gugger
a8ec52efc2
Style
2021-03-08 16:04:46 -05:00
Sylvain Gugger
b35e7b68ca
Tests
2021-03-08 16:04:30 -05:00
Sylvain Gugger
5f80c15ef5
Fix memory regression in Seq2Seq example ( #9713 )
...
* Fix memory regression in Seq2Seq example
* Fix test and properly deal with -100
* Easier condition with device safety
* Patch for MBartTokenzierFast
2021-01-21 12:05:46 -05:00
Sylvain Gugger
329fe2746a
Upstream (and rename) sortish sampler ( #9574 )
...
* Upstream (and rename) sortish sampler
* Use proper sampler
* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2021-01-14 10:38:14 -05:00
Sylvain Gugger
490b39e614
Seq2seq trainer ( #9241 )
...
* Add label smoothing in Trainer
* Add options for scheduler and Adafactor in Trainer
* Put Seq2SeqTrainer in the main lib
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Address review comments and adapt scripts
* Documentation
* Move test not using script to tests folder
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-12-22 11:33:44 -05:00
Sylvain Gugger
a1d1b332d0
Add predict step accumulation ( #7767 )
...
* Add eval_accumulation_step and clean distributed eval
* Add TPU test
* Add TPU stuff
* Fix arg name
* Fix Seq2SeqTrainer
* Fix total_size
* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Doc and add test to TPU
* Add unit test
* Adapt name
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-10-14 11:41:45 -04:00