
Testing mixed int8 quantization

Hardware requirements

I am using a setup of 2 NVIDIA Tesla T4 GPUs (15GB each).

Virtual envs

conda create --name int8-testing python==3.8
git clone https://github.com/younesbelkada/transformers.git && cd transformers && git checkout integration-8bit
pip install -e ".[dev]"
pip install -i https://test.pypi.org/simple/ bitsandbytes
pip install git+https://github.com/huggingface/accelerate.git@e0212893ea6098cc0a7a3c7a6eb286a9104214c1
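To sanity-check the environment, a minimal sketch like the one below should load a model in 8-bit; the checkpoint facebook/opt-350m is only an example, not the one used by the tests:

from transformers import AutoModelForCausalLM

model_name = "facebook/opt-350m"  # example checkpoint, any small causal LM works
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", load_in_8bit=True)

# Once dispatched to the GPU, the quantized linear layers should hold int8 weights
print({p.dtype for p in model.parameters()})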

Troubleshooting

If the setup above does not work, try creating the environment as follows (installing PyTorch from conda):

conda create --name int8-testing python==3.8
pip install -i https://test.pypi.org/simple/ bitsandbytes
conda install pytorch torchvision torchaudio -c pytorch
git clone https://github.com/younesbelkada/transformers.git && cd transformers && git checkout integration-8bit
pip install -e ".[dev]"
pip install git+https://github.com/huggingface/accelerate.git@b52b793ea8bac108ba61192eead3cf11ca02433d

Check driver settings:

nvcc --version
ls -l $CONDA_PREFIX/lib/libcudart.so
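You can also confirm from Python that PyTorch sees the GPUs and the CUDA runtime; a quick sketch:

import torch

print(torch.version.cuda)         # CUDA runtime version PyTorch was built against
print(torch.cuda.is_available())  # should print True
print(torch.cuda.device_count())  # should match your number of GPUs (2 in this setup)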

Recurrent bugs

Sometimes you have to run a "dummy" inference pass first when dealing with a multi-GPU setup. Check out the test_multi_gpu_loading and test_pipeline functions in test_mixed_int8.py for examples; a sketch is shown below.
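A hedged sketch of what such a dummy pass could look like (the checkpoint and prompts are placeholders, see the tests for the exact models they use):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloom-1b7"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", load_in_8bit=True)

# Dummy pass: run one forward on a throwaway input so all devices are initialized
dummy = tokenizer("dummy input", return_tensors="pt").to(0)
with torch.no_grad():
    _ = model(**dummy)

# Real generation afterwards
inputs = tokenizer("Hello my name is", return_tensors="pt").to(0)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=10)[0], skip_special_tokens=True))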