transformers/docs/source/main_classes
Stas Bekman 580dd87c55
[Deepspeed] add support for bf16 mode (#14569)
* [WIP] add support for bf16 mode

* prep for bf16

* prep for bf16

* fix; zero2/bf16 is ok

* check bf16 is available

* test fixes

* enable zero3_bf16

* config files

* docs

* split stage_dtype; merge back to non-dtype-specific config file

* fix doc

* cleanup

* cleanup

* bfloat16 => bf16 to match the PR changes

* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/

* test fixes/skipping

* move

* fix

* Update docs/source/main_classes/deepspeed.mdx

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* backticks

* cleanup

* cleanup

* cleanup

* new version

* add note about grad accum in bf16

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-03-11 17:53:53 -08:00

| File | Last commit | Date |
| --- | --- | --- |
| callback.mdx | 📝 Add codecarbon callback to docs (#15563) | 2022-02-08 14:10:53 -05:00 |
| configuration.mdx | Convert rst files (#14888) | 2021-12-22 16:14:35 -05:00 |
| data_collator.mdx | Convert rst files (#14888) | 2021-12-22 16:14:35 -05:00 |
| deepspeed.mdx | [Deepspeed] add support for bf16 mode (#14569) | 2022-03-11 17:53:53 -08:00 |
| feature_extractor.mdx | Convert rst files (#14888) | 2021-12-22 16:14:35 -05:00 |
| keras_callbacks.mdx | Add proper documentation for Keras callbacks (#15374) | 2022-01-27 10:51:38 -05:00 |
| logging.mdx | Usage examples for logger (#15657) | 2022-02-16 10:15:13 +01:00 |
| model.mdx | [Docs] Improve PyTorch, Flax generate API (#15988) | 2022-03-10 11:54:45 +01:00 |
| onnx.mdx | Add ONNX configuration classes to docs (#15121) | 2022-01-12 16:33:32 +01:00 |
| optimizer_schedules.mdx | Convert rst files (#14888) | 2021-12-22 16:14:35 -05:00 |
| output.mdx | Doc styler examples (#14953) | 2021-12-27 19:07:46 -05:00 |
| pipelines.mdx | Adding ZeroShotImageClassificationPipeline (#12119) | 2022-02-23 09:41:42 +01:00 |
| processors.mdx | TF text classification examples (#15704) | 2022-02-21 17:17:59 +00:00 |
| text_generation.mdx | [Docs] Improve PyTorch, Flax generate API (#15988) | 2022-03-10 11:54:45 +01:00 |
| tokenizer.mdx | Convert rst files (#14888) | 2021-12-22 16:14:35 -05:00 |
| trainer.mdx | [trainer docs] document how to select specific gpus (#15551) | 2022-02-09 10:12:29 -08:00 |