Mohamed Mekkouri
b262680af4
Add Bitnet model ( #37742 )
...
* Adding BitNet b1.58 Model
* Add testing code for BitNet
* Fix format issues
* Fix docstring format issues
* Fix docstring
* Fix docstring
* Fix: weight back to uint8
* Fix
* Fix format issues
* Remove copy comments
* Add model link to the docstring
* Fix: set tie_word_embeddings default to false
* Update
* Generate modeling file
* Change config name for automatically generating modeling file.
* Generate modeling file
* Fix class name
* Change testing branch
* Remove unused param
* Fix config docstring
* Add docstring for BitNetQuantConfig.
* Fix docstring
* Update docs/source/en/model_doc/bitnet.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/model_doc/bitnet.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update bitnet config
* Update explanation between online and offline mode
* Remove space
* revert changes
* more revert
* spaces
* update
* fix-copies
* doc fix
* fix minor nits
* empty
* small nit
* empty
---------
Co-authored-by: Shuming Ma <shumingma@pku.edu.cn>
Co-authored-by: shumingma <shmingm@gmail.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2025-04-28 15:08:46 +02:00
cyyever
1e6b546ea6
Use Python 3.9 syntax in tests ( #37343 )
...
Signed-off-by: cyy <cyyever@outlook.com>
2025-04-08 14:12:08 +02:00
Mohamed Mekkouri
4e6b19cd95
Fix : BitNet tests ( #34895 )
...
* fix_tests_bitnet
* fix format
2024-11-25 16:47:14 +01:00
Mohamed Mekkouri
54be2d7ae8
Bitnet test fix to avoid using gated model ( #34863 )
...
small test fix
2024-11-22 17:18:49 +01:00
Mohamed Mekkouri
36d410dab6
FEAT : Adding BitNet quantization method to HFQuantizer ( #33410 )
...
* rebasing changes
* fixing style
* adding some doc to functions
* remove bitblas
* change dtype
* fixing check_code_quality
* fixing import order
* adding doc to tree
* Small update on BitLinear
* adding some tests
* sorting imports
* small update
* reformatting
* reformatting
* reformatting with ruff
* adding assert
* changes after review
* update disk offloading
* adapting after review
* Update after review
* add is_serializable back
* fixing style
* adding serialization test
* make style
* small updates after review
2024-10-09 17:51:41 +02:00