transformers/tests
Sylvain Gugger 56f50590d5
Use Accelerate in from_pretrained for big model inference (#17341)
* Initial work

* More or less finished with first draft

* Update src/transformers/modeling_utils.py

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix randomly initialized weights

* Update src/transformers/modeling_utils.py

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comments

* Rename DeepSpeed folder to temporarily fix the test issue?

* Revert to try if Accelerate fix works

* Use latest Accelerate release

* Quality and fixes

* Style

* Quality

* Add doc

* Test + fix

* More blocks

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2022-05-23 14:32:21 -04:00
benchmark [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
deepspeed missing file (#17164) 2022-05-10 10:19:50 -07:00
extended Update self-push workflow (#17177) 2022-05-13 16:28:00 +02:00
fixtures add a warning in SpmConverter for sentencepiece's model using the byte fallback feature (#16629) 2022-04-11 11:06:10 +02:00
generation [Generation] Fix Transition probs (#17311) 2022-05-19 22:17:02 +02:00
models Traced models serialization and torchscripting fix (#17206) 2022-05-23 17:50:40 +02:00
onnx Add onnx export cuda support (#17183) 2022-05-18 17:52:13 +02:00
optimization [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
pipelines Adding batch_size test to QA pipeline. (#17330) 2022-05-19 14:28:12 -04:00
sagemaker Black preview (#17217) 2022-05-12 16:25:55 -04:00
tokenization [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
trainer [tests] fix copy-n-paste error (#17312) 2022-05-18 16:00:47 -07:00
utils Fix tests of mixed precision now that experimental is deprecated (#17300) 2022-05-17 14:14:17 +01:00
__init__.py GPU text generation: Moved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
test_configuration_common.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
test_feature_extraction_common.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_modeling_common.py Use Accelerate in from_pretrained for big model inference (#17341) 2022-05-23 14:32:21 -04:00
test_modeling_flax_common.py [Flax] improve large model init and loading (#16148) 2022-04-19 14:19:55 +02:00
test_modeling_tf_common.py Rewrite TensorFlow train_step and test_step (#17057) 2022-05-17 14:36:23 +01:00
test_sequence_feature_extraction_common.py Some tests misusing assertTrue for comparisons fix (#16771) 2022-04-19 14:44:08 +02:00
test_tokenization_common.py Black preview (#17217) 2022-05-12 16:25:55 -04:00