transformers/docs/source/en/main_classes
Marc Sun e03a9cc0cd
Modify device_map behavior when loading a model using from_pretrained (#23922)
* Modify device map behavior for 4/8 bits model

* Remove device_map arg for training 4/8 bit model

* Remove index

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add Exceptions

* Modify comment

* Fix formatting

* Get current device with accelerate

* Revert "Get current device with accelerate"

This reverts commit 46f0079910.

* Fix Exception

* Modify quantization doc

* Fix error

---------

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-06-01 13:21:22 -04:00
agent.mdx Add local agent (#23438) 2023-05-18 11:09:55 -04:00
callback.mdx Adds a FlyteCallback (#23759) 2023-05-30 10:08:07 -04:00
configuration.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
data_collator.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
deepspeed.mdx [deepspeed zero3] need generate(synced_gpus=True, ...) (#22242) 2023-03-22 12:18:57 -07:00
feature_extractor.mdx Update feature extractor docs (#18324) 2022-07-27 15:32:57 -05:00
image_processor.mdx AutoImageProcessor (#20111) 2022-11-08 19:54:41 +00:00
keras_callbacks.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
logging.mdx logging documentation update (#17174) 2022-05-16 16:47:28 -04:00
model.mdx Generate: move generation_*.py src files into generation/*.py (#20096) 2022-11-09 15:34:08 +00:00
onnx.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
optimizer_schedules.mdx Add inverse sqrt learning rate scheduler (#21495) 2023-02-07 15:00:50 -05:00
output.mdx Fix typo ; Update output.mdx (#23227) 2023-05-09 09:19:38 -04:00
pipelines.mdx [Pipeline] Add zero shot audio classificatoin pipeline (#21600) 2023-02-27 11:43:44 +01:00
processors.mdx Update doc examples feature extractor -> image processor (#20501) 2022-11-30 14:50:55 +00:00
quantization.mdx Modify device_map behavior when loading a model using from_pretrained (#23922) 2023-06-01 13:21:22 -04:00
text_generation.mdx Generate: basic token streaming (#22449) 2023-03-30 12:00:12 +01:00
tokenizer.mdx documentation: some minor clean up (#16850) 2022-04-26 16:56:08 -04:00
trainer.mdx Update trainer.mdx class_weights example (#23787) 2023-05-26 08:36:33 -04:00