transformers/docs/source
Cheng Li c83fbc5f2d
[Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed (#10464)
* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* update

* make init_deepspeed support config dict

* fix docstring formatting

* clean up trainer's comments

* add new tests

* fix type

* composit argparse doesn't work

* style

* add a new test, rename others

* document new functionality

* complete tests, add docs

* style

* correct level

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add new methods to the doc

* must tell DS we are using a non-native optimizer

* add protection against cpu_offload + HF optimizer combo

* fix the cli overrides

* sync docs + tests

* restore AdamW

* better docs

* need new version

* no longer needed

* remove outdate information

* refactor duplicated code

Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-03-16 15:51:09 -07:00
..
_static Docs for v4.4.1 2021-03-16 15:41:49 -04:00
imgs [Templates] Add template "call-for-model" markdown and "call-for-big-bird" markdown (#9921) 2021-02-05 15:47:54 +03:00
internal Adding new parameter to generate: max_time. (#9846) 2021-03-12 10:11:50 +01:00
main_classes [Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed (#10464) 2021-03-16 15:51:09 -07:00
model_doc fix M2M100 example (#10745) 2021-03-16 20:20:00 +05:30
add_new_model.rst Add new model docs (#9667) 2021-02-01 17:55:10 +03:00
benchmarks.rst Make doc styler detect lists on rst (#9488) 2021-01-11 08:53:41 -05:00
bertology.rst Fix documentation links always pointing to master. (#9217) 2021-01-05 06:18:48 -05:00
community.md Add new community notebook - Blenderbot (#10126) 2021-02-11 12:53:40 +03:00
conf.py Development on v4.5.0dev0 2021-03-16 11:41:15 -04:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst Fix broken links in the converting tf ckpt document (#9791) 2021-01-26 03:37:57 -05:00
custom_datasets.rst Fix None in add_token_positions - issue #10210 (#10374) 2021-02-25 09:18:33 -07:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
favicon.ico Adding usage examples for common tasks (#2850) 2020-02-25 13:48:24 -05:00
glossary.rst Adds terms to Glossary (#10443) 2021-02-28 08:27:54 -05:00
index.rst Release v4.4.0 2021-03-16 11:33:35 -04:00
installation.md split seq2seq script into summarization & translation (#10611) 2021-03-15 09:11:42 -04:00
migration.md Copyright (#8970) 2020-12-07 18:36:34 -05:00
model_sharing.rst [doc] nested markup is invalid in rst (#9898) 2021-01-30 09:59:19 -05:00
model_summary.rst ConvBERT Model (#9717) 2021-01-27 03:20:09 -05:00
multilingual.rst Fix documentation links always pointing to master. (#9217) 2021-01-05 06:18:48 -05:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
perplexity.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
philosophy.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
preprocessing.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
pretrained_models.rst Add m2m100 (#10236) 2021-03-06 22:14:16 +05:30
quicktour.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
serialization.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
task_summary.rst split seq2seq script into summarization & translation (#10611) 2021-03-15 09:11:42 -04:00
testing.rst Fix documentation links always pointing to master. (#9217) 2021-01-05 06:18:48 -05:00
tokenizer_summary.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
training.rst [trainer] deepspeed integration (#9211) 2021-01-12 19:05:18 -08:00