Bowen Bao
|
6cef03ba66
|
[Regression] Fix Quark quantized model loading after refactorization (#37407)
|
2025-04-11 13:43:36 +02:00 |
|
Mohamed Mekkouri
|
9c0c323e12
|
Fix require_read_token (#37422)
* nit
* fix
* fix
|
2025-04-10 17:01:40 +02:00 |
|
Mohamed Mekkouri
|
5ae9b2cac0
|
Quark Quantization gated repo (#37412)
* fix
* empty commit
* empty
* nit
* fix maybe ?
|
2025-04-10 14:57:15 +02:00 |
|
cyyever
|
1e6b546ea6
|
Use Python 3.9 syntax in tests (#37343)
Signed-off-by: cyy <cyyever@outlook.com>
|
2025-04-08 14:12:08 +02:00 |
|
fxmarty-amd
|
1a374799ce
|
Support loading Quark quantized models in Transformers (#36372)
* add quark quantizer
* add quark doc
* clean up doc
* fix tests
* make style
* more style fixes
* cleanup imports
* cleaning
* precise install
* Update docs/source/en/quantization/quark.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/quark_integration/test_quark.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/utils/quantization_config.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* remove import guard as suggested
* update copyright headers
* add quark to transformers-quantization-latest-gpu Dockerfile
* make tests pass on transformers main + quark==0.7
* add missing F8_E4M3 and F8_E5M2 keys from str_to_torch_dtype
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Bowen Bao <bowenbao@amd.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
|
2025-03-20 15:40:51 +01:00 |
|