Yao Matrix
89542fb81c
enable more test cases on xpu (#38572)
* enable glm4 integration cases on XPU, set xpu expectation for blip2
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
* more
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* fix style
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* refine wording
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* refine test case names
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* run
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* add gemma2 and chameleon
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* fix review comments
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
---------
Signed-off-by: Matrix YAO <matrix.yao@intel.com>
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
2025-06-06 09:29:51 +02:00
Mohamed Mekkouri
9a962dd9ed
Add tearDown method to Quark to solve OOM issues (#38234)
fix
2025-05-21 14:26:44 +02:00
co63oc
d5fa7d2d19
Fix typos in strings and comments (#37799)
2025-04-28 11:39:11 +01:00
Mohamed Mekkouri
38c406844e
Fixing quantization tests (#37650)
* fix
* style
* add capability check
2025-04-22 13:59:57 +02:00
Mohamed Mekkouri
bb2a44ad4b
Fix Quark quantization config (#37578)
fix
2025-04-18 07:23:39 +02:00
Mohamed Mekkouri
d228f50acc
Fixing gated repo issues (#37463)
using unsloth model
2025-04-14 17:19:10 +02:00
Bowen Bao
6cef03ba66
[Regression] Fix Quark quantized model loading after refactorization (#37407)
2025-04-11 13:43:36 +02:00
Mohamed Mekkouri
9c0c323e12
Fix require_read_token (#37422)
* nit
* fix
* fix
2025-04-10 17:01:40 +02:00
Mohamed Mekkouri
5ae9b2cac0
Quark Quantization gated repo (#37412)
* fix
* empty commit
* empty
* nit
* fix maybe ?
2025-04-10 14:57:15 +02:00
cyyever
1e6b546ea6
Use Python 3.9 syntax in tests (#37343)
Signed-off-by: cyy <cyyever@outlook.com>
2025-04-08 14:12:08 +02:00
fxmarty-amd
1a374799ce
Support loading Quark quantized models in Transformers (#36372)
* add quark quantizer
* add quark doc
* clean up doc
* fix tests
* make style
* more style fixes
* cleanup imports
* cleaning
* precise install
* Update docs/source/en/quantization/quark.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/quark_integration/test_quark.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/utils/quantization_config.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* remove import guard as suggested
* update copyright headers
* add quark to transformers-quantization-latest-gpu Dockerfile
* make tests pass on transformers main + quark==0.7
* add missing F8_E4M3 and F8_E5M2 keys from str_to_torch_dtype
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Bowen Bao <bowenbao@amd.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-03-20 15:40:51 +01:00