mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-13 09:40:06 +06:00
![]() * Initial work * More or less finished with first draft * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix randomly initialized weights * Update src/transformers/modeling_utils.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Rename DeepSpeed folder to temporarily fix the test issue? * Revert to try if Accelerate fix works * Use latest Accelerate release * Quality and fixes * Style * Quality * Add doc * Test + fix * More blocks Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> |
||
---|---|---|
.. | ||
callback.mdx | ||
configuration.mdx | ||
data_collator.mdx | ||
deepspeed.mdx | ||
feature_extractor.mdx | ||
keras_callbacks.mdx | ||
logging.mdx | ||
model.mdx | ||
onnx.mdx | ||
optimizer_schedules.mdx | ||
output.mdx | ||
pipelines.mdx | ||
processors.mdx | ||
text_generation.mdx | ||
tokenizer.mdx | ||
trainer.mdx |