transformers/docs/source/en/main_classes
Zach Mueller e0dfd7bcaf
Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771)
* 1,100%!

* Clean

* Don't touch DS

* Experiment with dtype allocation

* skip test_load_save_without_tied_weights test

* A little faster

* Include proper upscaling?

* Fixup tests

* Potentially skip?

* Let's see if this fixes git history

* Maintain new dtype

* Fin

* Rm hook idea for now

* New approach, see what breaks

* stage

* Clean

* Stash

* Should be fin now, just need to mark failing models

* Clean up

* Simplify

* Deal with weird models

* Enc/Dec

* Skip w/ reason

* Adjust test

* Fix test

* one more test

* Keep experimenting

* Fix ref

* TO REMOVE: testing feedback CI

* Right push

* Update tests/utils/test_modeling_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* disable

* Add new func

* Test nits from Amy

* Update src/transformers/modeling_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Adjust comment

* Adjust comment on skip

* make private

* Fin

* Should be a not flag

* Clarify and rename test

---------

Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-07-16 09:32:01 -04:00
| File | Last commit | Date |
| --- | --- | --- |
| agent.md | Reboot Agents (#30387) | 2024-05-07 12:59:49 +02:00 |
| backbones.md | [Doc] Fix docbuilder - make BackboneMixin and BackboneConfigMixin importable from utils. (#29002) | 2024-02-14 10:29:22 +00:00 |
| callback.md | Update CometCallback to allow reusing of the running experiment (#31366) | 2024-07-05 08:13:46 +02:00 |
| configuration.md | Migrate doc files to Markdown. (#24376) | 2023-06-20 18:07:47 -04:00 |
| data_collator.md | Migrate doc files to Markdown. (#24376) | 2023-06-20 18:07:47 -04:00 |
| deepspeed.md | [docs] DeepSpeed (#28542) | 2024-01-24 08:31:28 -08:00 |
| feature_extractor.md | Fixed typos (#26810) | 2023-10-16 09:52:29 +02:00 |
| image_processor.md | Fast image processor (#28847) | 2024-06-11 15:47:38 +01:00 |
| keras_callbacks.md | Migrate doc files to Markdown. (#24376) | 2023-06-20 18:07:47 -04:00 |
| logging.md | Warnings controlled by logger level (#26527) | 2023-10-12 10:48:38 +02:00 |
| model.md | Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771) | 2024-07-16 09:32:01 -04:00 |
| onnx.md | Migrate doc files to Markdown. (#24376) | 2023-06-20 18:07:47 -04:00 |
| optimizer_schedules.md | Add WSD scheduler (#30231) | 2024-04-25 12:07:21 +01:00 |
| output.md | Update all references to canonical models (#29001) | 2024-02-16 08:16:58 +01:00 |
| pipelines.md | Allow FP16 or other precision inference for Pipelines (#31342) | 2024-07-05 17:21:50 +01:00 |
| processors.md | [docs] fixed links with 404 (#27327) | 2023-11-06 19:45:03 +00:00 |
| quantization.md | Add HQQ quantization support (#29637) | 2024-05-02 17:51:49 +01:00 |
| text_generation.md | Add Watermarking LogitsProcessor and WatermarkDetector (#29676) | 2024-05-14 13:31:39 +05:00 |
| tokenizer.md | [PretrainedTokenizer] add some of the most important functions to the doc (#27313) | 2023-11-06 15:11:00 +01:00 |
| trainer.md | [docs] Trainer docs (#28145) | 2023-12-20 10:37:23 -08:00 |