Mirror of https://github.com/huggingface/transformers.git, synced 2025-07-15 02:28:24 +06:00
Latest commit (squashed):

* [WIP] add support for bf16 mode
* prep for bf16
* prep for bf16
* fix; zero2/bf16 is ok
* check bf16 is available
* test fixes
* enable zero3_bf16
* config files
* docs
* split stage_dtype; merge back to non-dtype-specific config file
* fix doc
* cleanup
* cleanup
* bfloat16 => bf16 to match the PR changes
* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
* test fixes/skipping
* move
* fix
* Update docs/source/main_classes/deepspeed.mdx (Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>)
* backticks
* cleanup
* cleanup
* cleanup
* new version
* add note about grad accum in bf16

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
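The squashed commits above track the addition of bf16 support to the DeepSpeed integration (enabling ZeRO stage 2 and stage 3 with bf16, and renaming the fp16-specific save options to 16-bit generic ones). As a hedged illustration only, not the exact config file from this PR, a minimal DeepSpeed config fragment enabling bf16 under ZeRO stage 3 looks roughly like this; the `"auto"` value is the convention the HF Trainer integration uses to fill a setting in from its own command-line arguments:

```json
{
  "bf16": {
    "enabled": "auto"
  },
  "zero_optimization": {
    "stage": 3
  }
}
```

With such a config passed via `--deepspeed ds_config.json`, the Trainer resolves `"auto"` from the `--bf16` flag, so the same config file can serve both bf16 and non-bf16 runs.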
* callback.mdx
* configuration.mdx
* data_collator.mdx
* deepspeed.mdx
* feature_extractor.mdx
* keras_callbacks.mdx
* logging.mdx
* model.mdx
* onnx.mdx
* optimizer_schedules.mdx
* output.mdx
* pipelines.mdx
* processors.mdx
* text_generation.mdx
* tokenizer.mdx
* trainer.mdx