transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

History

Younes Belkada 4a51075a96 `bitsandbytes` - `Linear8bitLt` integration into `transformers` models (#17901 ) * first commit * correct replace function * add final changes - works like charm! - cannot implement tests yet - tested * clean up a bit * add bitsandbytes dependencies * working version - added import function - added bitsandbytes utils file * small fix * small fix - fix import issue * fix import issues * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit - move bitsandbytes utils to utils - change comments on functions * reformat docstring - reformat docstring on init_empty_weights_8bit * Update src/transformers/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * revert bad formatting * change to bitsandbytes * refactor a bit - remove init8bit since it is useless * more refactoring - fixed init empty weights issue - added threshold param * small hack to make it work * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * revmoe the small hack * modify utils file * make style + refactor a bit * create correctly device map * add correct dtype for device map creation * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions - remove with torch.grad - do not rely on Python bool magic! * add docstring - add docstring for new kwargs * add docstring - comment `replace_8bit_linear` function - fix weird formatting * - added more documentation - added new utility function for memory footprint tracking - colab demo to add * few modifs - typo doc - force cast into float16 when load_in_8bit is enabled * added colab link * add test architecture + docstring a bit * refactor a bit testing class * make style + refactor a bit * enhance checks - add more checks - start writing saving test * clean up a bit * male style * add more details on doc * add more tests - still needs to fix 2 tests * replace by "or" - could not fix it from GitHub GUI Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit testing code + add readme * make style * fix import issue * Update src/transformers/modeling_utils.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * add few comments * add more doctring + make style * more docstring * raise error when loaded in 8bit * make style * add warning if loaded on CPU * add small sanity check * fix small comment * add bitsandbytes on dockerfile * Improve documentation - improve documentation from comments * add few comments * slow tests pass on the VM but not on the CI VM * Fix merge conflict * make style * another test should pass on a multi gpu setup * fix bad import in testing file * Fix slow tests - remove dummy batches - no more CUDA illegal memory errors * odify dockerfile * Update docs/source/en/main_classes/model.mdx * Update Dockerfile * Update model.mdx * Update Dockerfile * Apply suggestions from code review * few modifications - lm head can stay on disk/cpu - change model name so that test pass * change test value - change test value to the correct output - torch bmm changed to baddmm in bloom modeling when merging * modify installation guidelines * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace `n`by `name` * merge `load_in_8bit` and `low_cpu_mem_usage` * first try - keep the lm head in full precision * better check - check the attribute `base_model_prefix` instead of computing the number of parameters * added more tests * Update src/transformers/utils/bitsandbytes.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Merge branch 'integration-8bit' of https://github.com/younesbelkada/transformers into integration-8bit * improve documentation - fix typos for installation - change title in the documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>		2022-08-10 09:13:36 +02:00
..
test_module	Custom pipeline (#18079 )	2022-07-19 12:02:35 +02:00
tf_ops	Check TF ops for ONNX compliance (#10025 )	2021-02-15 07:55:10 -05:00
check_config_docstrings.py	Add OWL-ViT model for zero-shot object detection (#17938 )	2022-07-22 13:35:32 +03:00
check_copies.py	Fix template for new models in README (#18182 )	2022-07-18 16:01:51 +02:00
check_doc_toc.py	Split model list on modality (#18328 )	2022-08-01 11:10:20 -05:00
check_dummies.py	Add a TF in-graph tokenizer for BERT (#17701 )	2022-06-27 12:06:21 +01:00
check_inits.py	Make check_init script more robust and clean inits (#17408 )	2022-05-25 07:23:56 -04:00
check_repo.py	Clean up hub (#18497 )	2022-08-08 08:48:10 -04:00
check_table.py	Fix some typos. (#17560 )	2022-07-11 05:00:13 -04:00
check_tf_ops.py	Check TF ops for ONNX compliance (#10025 )	2021-02-15 07:55:10 -05:00
custom_init_isort.py	explicitly set utf8 for Windows (#17664 )	2022-06-13 08:05:45 -04:00
documentation_tests.txt	[VideoMAE] Add model to doc tests (#18523 )	2022-08-08 19:28:51 +02:00
download_glue_data.py	Raise exceptions instead of asserts (#13907 )	2021-10-07 12:44:23 +05:30
get_modified_files.py	Updates the default branch from master to main (#16326 )	2022-03-23 03:46:59 -04:00
notification_service_doc_tests.py	explicitly set utf8 for Windows (#17664 )	2022-06-13 08:05:45 -04:00
notification_service.py	Add machine type in the artifact of Examples directory job (#18459 )	2022-08-04 18:52:01 +02:00
past_ci_versions.py	Add PyTorch 1.11 to past CI (#18302 )	2022-07-26 15:47:23 +02:00
prepare_for_doc_test.py	Add a check regarding the number of occurrences of ``` (#18389 )	2022-08-01 14:23:02 +02:00
print_env.py	Print more library versions in CI (#17384 )	2022-06-02 10:24:16 +02:00
release.py	Clean README in post release job as well. (#17519 )	2022-06-02 07:44:03 -04:00
sort_auto_mappings.py	Automatically sort auto mappings (#17250 )	2022-05-16 13:24:20 -04:00
tests_fetcher.py	`bitsandbytes` - `Linear8bitLt` integration into `transformers` models (#17901 )	2022-08-10 09:13:36 +02:00
update_metadata.py	Replace commit sha by commit url for update jobs (#14852 )	2021-12-21 11:17:11 -05:00