transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-12 09:10:05 +06:00

Matt acfb714bdf Improve TF weight loading, especially PT crossloading (#21792)	2023-02-28 18:41:34 +00:00

* First commit for the improved PT-TF weight loading
* Remove workarounds from TFEncoderDecoder tests
* Allow a custom weight renaming function in from_pretrained and use that to clean up EncoderDecoder
* make fixup
* First attempt at visionencoderdecoder
* Disable tensorfloat32 in tests to get consistent outputs
* Quick fix to tf_vision_encoder_decoder tests
* make fixup
* Update Blenderbot tests
* Remove unused arg in modeling_tf_opt
* load_tf_sharded_weights had strict=True! This meant transfer learning was impossible, so I'm setting it to False.
* Support prefixes when loading sharded TF checkpoints
* make fixup
* Add test to load sharded models with a weight prefix
* Fix sharded weight loading test
* Add a test for transfer from a sharded checkpoint
* make fixup
* Add test to check that crossloading from PT with a prefix works
* Refactor from_pretrained in the encoderdecoder classes
* Refactor from_pretrained in the encoderdecoder classes
* missmatched -> mismatched
* Explicitly check for None
* No comments showing my very impressive and attractive knowledge of Py3.9+
* Disable TF32 across all TF tests
__init__.py	Move test model folders (#17034)	2022-05-03 14:42:02 +02:00
test_modeling_flax_vision_encoder_decoder.py	Update quality tooling for formatting (#21480)	2023-02-06 18:10:56 -05:00
test_modeling_tf_vision_encoder_decoder.py	Improve TF weight loading, especially PT crossloading (#21792)	2023-02-28 18:41:34 +00:00
test_modeling_vision_encoder_decoder.py	Update quality tooling for formatting (#21480)	2023-02-06 18:10:56 -05:00