transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 01:02:25 +06:00

Zach Mueller e0dfd7bcaf Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771)		2024-07-16 09:32:01 -04:00

* 1,100%!
* Clean
* Don't touch DS
* Experiment with dtype allocation
* skip test_load_save_without_tied_weights test
* A little faster
* Include proper upscaling?
* Fixup tests
* Potentially skip?
* Let's see if this fixes git history
* Maintain new dtype
* Fin
* Rm hook idea for now
* New approach, see what breaks
* stage
* Clean
* Stash
* Should be fin now, just need to mark failing models
* Clean up
* Simplify
* Deal with weird models
* Enc/Dec
* Skip w/ reason
* Adjust test
* Fix test
* one more test
* Keep experimenting
* Fix ref
* TO REMOVE: testing feedback CI
* Right push
* Update tests/utils/test_modeling_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* disable
* Add new func
* Test nits from Amy
* Update src/transformers/modeling_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Adjust comment
* Adjust comment on skip
* make private
* Fin
* Should be a not flag
* Clarify and rename test

---------

Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
__init__.py	Move test model folders (#17034)	2022-05-03 14:42:02 +02:00
test_modeling_bart.py	Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771)	2024-07-16 09:32:01 -04:00
test_modeling_flax_bart.py	Update quality tooling for formatting (#21480)	2023-02-06 18:10:56 -05:00
test_modeling_tf_bart.py	Remove ConversationalPipeline and Conversation object (#31165)	2024-06-07 17:50:18 +01:00
test_tokenization_bart.py	Skip tests properly (#31308)	2024-06-26 21:59:08 +01:00