Yao Matrix
|
fb82a98717
|
enable large_gpu and torchao cases on XPU (#38355)
* cohere2 done
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
* enable torchao cases on XPU
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* rename
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* fix comments
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
|
2025-05-28 10:30:16 +02:00 |
|
Yao Matrix
|
34c1e29cdd
|
enable autoround cases on XPU (#38167)
* enable autoround cases on XPU
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
|
2025-05-16 09:08:35 +00:00 |
|
co63oc
|
d5fa7d2d19
|
Fix typos in strings and comments (#37799)
|
2025-04-28 11:39:11 +01:00 |
|
Wenhua Cheng
|
b3492ff9f7
|
Add AutoRound quantization support (#37393)
* add auto-round support
* Update src/transformers/quantizers/auto.py
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
* fix style issue
Signed-off-by: wenhuach <wenhuach87@gmail.com>
* tiny change
* tiny change
* refine ut and doc
* revert unnecessary change
* tiny change
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* fix doc issue
* Update tests/quantization/autoround/test_auto_round.py
* fix comments
* Update tests/quantization/autoround/test_auto_round.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/autoround/test_auto_round.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* update doc
* Update src/transformers/quantizers/quantizer_auto_round.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* update
* update
* fix
* try to fix style issue
* Update src/transformers/quantizers/auto.py
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* update
* fix style issue
* update doc
* update doc
* Refine the doc
* refine doc
* revert one change
* set sym to True by default
* Enhance the unit test's robustness.
* update
* add torch dtype
* tiny change
* add awq convert test
* fix typo
* update
* fix packing format issue
* use one gpu
---------
Signed-off-by: wenhuach <wenhuach87@gmail.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Shen, Haihao <haihao.shen@intel.com>
|
2025-04-22 13:56:54 +02:00 |
|