transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-23 06:20:22 +06:00

Zach Mueller e0dfd7bcaf Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771)	2024-07-16 09:32:01 -04:00

* 1,100%!
* Clean
* Don't touch DS
* Experiment with dtype allocation
* skip test_load_save_without_tied_weights test
* A little faster
* Include proper upscaling?
* Fixup tests
* Potentially skip?
* Let's see if this fixes git history
* Maintain new dtype
* Fin
* Rm hook idea for now
* New approach, see what breaks
* stage
* Clean
* Stash
* Should be fin now, just need to mark failing models
* Clean up
* Simplify
* Deal with weird models
* Enc/Dec
* Skip w/ reason
* Adjust test
* Fix test
* one more test
* Keep experimenting
* Fix ref
* TO REMOVE: testing feedback CI
* Right push
* Update tests/utils/test_modeling_utils.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* disable
* Add new func
* Test nits from Amy
* Update src/transformers/modeling_utils.py (Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>)
* Adjust comment
* Adjust comment on skip
* make private
* Fin
* Should be a not flag
* Clarify and rename test

Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

agent.md	Reboot Agents (#30387)	2024-05-07 12:59:49 +02:00
backbones.md	[`Doc`] Fix docbuilder - make `BackboneMixin` and `BackboneConfigMixin` importable from `utils`. (#29002)	2024-02-14 10:29:22 +00:00
callback.md	Update CometCallback to allow reusing of the running experiment (#31366)	2024-07-05 08:13:46 +02:00
configuration.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
data_collator.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
deepspeed.md	[docs] DeepSpeed (#28542)	2024-01-24 08:31:28 -08:00
feature_extractor.md	Fixed typos (#26810)	2023-10-16 09:52:29 +02:00
image_processor.md	Fast image processor (#28847)	2024-06-11 15:47:38 +01:00
keras_callbacks.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
logging.md	Warnings controlled by logger level (#26527)	2023-10-12 10:48:38 +02:00
model.md	Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771)	2024-07-16 09:32:01 -04:00
onnx.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
optimizer_schedules.md	Add WSD scheduler (#30231)	2024-04-25 12:07:21 +01:00
output.md	Update all references to canonical models (#29001)	2024-02-16 08:16:58 +01:00
pipelines.md	Allow FP16 or other precision inference for Pipelines (#31342)	2024-07-05 17:21:50 +01:00
processors.md	[docs] fixed links with 404 (#27327)	2023-11-06 19:45:03 +00:00
quantization.md	Add HQQ quantization support (#29637)	2024-05-02 17:51:49 +01:00
text_generation.md	Add Watermarking LogitsProcessor and WatermarkDetector (#29676)	2024-05-14 13:31:39 +05:00
tokenizer.md	[`PretrainedTokenizer`] add some of the most important functions to the doc (#27313)	2023-11-06 15:11:00 +01:00
trainer.md	[docs] Trainer docs (#28145)	2023-12-20 10:37:23 -08:00