transformers/tests/utils
jiqing-feng 387663e571
Enable gptqmodel (#35012)
* gptqmodel

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update readme

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* gptqmodel need use checkpoint_format (#1)

* gptqmodel need use checkpoint_format

* fix quantize

* Update quantization_config.py

* Update quantization_config.py

* Update quantization_config.py

---------

Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai>
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>

* Revert quantizer_gptq.py (#2)

* revert quantizer_gptq.py change

* pass **kwargs

* limit gptqmodel and optimum version

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix warning

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix version check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* revert unrelated changes

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* enable gptqmodel tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix requires gptq

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Fix Transformer compat (#3)

* revert quantizer_gptq.py change

* pass **kwargs

* add meta info

* cleanup

* cleanup

* Update quantization_config.py

* hf_select_quant_linear pass checkpoint_format and meta

* fix GPTQTestCUDA

* Update test_gptq.py

* gptqmodel.hf_select_quant_linear() now does not select ExllamaV2

* cleanup

* add backend

* cleanup

* cleanup

* no need check exllama version

* Update quantization_config.py

* lower checkpoint_format and backend

* check none

* cleanup

* Update quantization_config.py

* fix self.use_exllama == False

* spell

* fix unittest

* fix unittest

---------

Co-authored-by: LRL <lrl@lbx.dev>
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format again

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update gptqmodel version (#6)

* update gptqmodel version

* update gptqmodel version

* fix unit test (#5)

* update gptqmodel version

* update gptqmodel version

* "not self.use_exllama" is not equivalent to "self.use_exllama==False"

* fix unittest

* update gptqmodel version

* backend is loading_attributes (#7)

* fix format and tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix memory check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix device mismatch

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix result check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* update tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* review: update docs (#10)

* review: update docs (#12)

* review: update docs

* fix typo

* update tests for gptqmodel

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update document (#9)

* update overview.md

* cleanup

* Update overview.md

* Update overview.md

* Update overview.md

* update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

---------

Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>

* typo

* doc note for asymmetric quant

* typo with apple silicon(e)

* typo for marlin

* column name revert: review

* doc rocm support

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: LRL-ModelCloud <165116337+LRL-ModelCloud@users.noreply.github.com>
Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai>
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>
Co-authored-by: ZX-ModelCloud <165115237+ZX-ModelCloud@users.noreply.github.com>
Co-authored-by: LRL <lrl@lbx.dev>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-01-15 14:22:49 +01:00
import_structures Import structure & first three model refactors (#31329) 2024-09-10 11:10:53 +02:00
__init__.py [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
test_activations_tf.py TF: Add sigmoid activation function (#16819) 2022-04-19 16:13:08 +01:00
test_activations.py Add the GeLU activation from pytorch with the tanh approximation (#21345) 2023-02-02 09:33:04 -05:00
test_add_new_model_like.py fix: Fixed failing tests in tests/utils/test_add_new_model_like.py (#32678) 2024-08-14 12:06:17 +01:00
test_audio_utils.py Remove trust_remote_code when loading Libri Dummy (#31748) 2024-07-23 14:54:38 +08:00
test_backbone_utils.py 🚨 out_indices always a list (#30941) 2024-05-22 15:23:04 +01:00
test_cache_utils.py Enable gptqmodel (#35012) 2025-01-15 14:22:49 +01:00
test_chat_template_utils.py Make tool JSON schemas consistent (#31756) 2024-07-02 20:00:42 +01:00
test_cli.py Forbid PretrainedConfig from saving generate parameters; Update deprecations in generate-related code 🧹 (#32659) 2024-08-23 11:12:53 +01:00
test_configuration_utils.py Fix flaky Hub CI (test_trainer.py) (#35062) 2024-12-05 17:02:27 +01:00
test_convert_slow_tokenizer.py Revert error back into warning for byte fallback conversion. (#22607) 2023-04-06 14:00:29 +02:00
test_deprecation.py Decorators for deprecation and named arguments validation (#30799) 2024-06-10 12:35:10 +01:00
test_doc_samples.py Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
test_dynamic_module_utils.py Fix the regex in get_imports to support multiline try blocks and excepts with specific exception types (#23725) 2023-05-24 15:40:19 -04:00
test_feature_extraction_utils.py Fix flaky Hub CI (test_trainer.py) (#35062) 2024-12-05 17:02:27 +01:00
test_file_utils.py Inheritance-based framework detection (#21784) 2023-02-27 15:31:55 +00:00
test_generic.py Decorators for deprecation and named arguments validation (#30799) 2024-06-10 12:35:10 +01:00
test_hf_argparser.py HfArgumentParser: allow for hyphenated field names in long-options (#33990) 2024-10-10 11:58:26 +02:00
test_hub_utils.py Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016) 2024-05-29 11:55:43 +01:00
test_image_processing_utils.py Fix flaky Hub CI (test_trainer.py) (#35062) 2024-12-05 17:02:27 +01:00
test_image_utils.py fix: Replaced deprecated mktemp() function (#32123) 2024-07-22 14:13:39 +01:00
test_import_structure.py Fix some missing tests in circleci (#33559) 2024-09-20 20:58:51 +02:00
test_logging.py Fix flaky test for log level (#21776) 2023-02-28 16:24:14 -05:00
test_model_card.py Automatically add transformers tag to the modelcard (#32623) 2024-08-13 07:59:01 +02:00
test_model_output.py Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
test_modeling_flax_utils.py Fix flaky Hub CI (test_trainer.py) (#35062) 2024-12-05 17:02:27 +01:00
test_modeling_rope_utils.py More model refactoring! (#35359) 2025-01-09 11:09:09 +01:00
test_modeling_tf_core.py Add tf_keras imports to prepare for Keras 3 (#28588) 2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py Fix flaky Hub CI (test_trainer.py) (#35062) 2024-12-05 17:02:27 +01:00
test_modeling_utils.py Enable different torch dtype in sub models (#34873) 2025-01-13 13:42:08 +01:00
test_offline.py Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016) 2024-05-29 11:55:43 +01:00
test_processing_utils.py Uniformize kwargs for Pixtral processor (#33521) 2024-09-17 14:44:27 -04:00
test_skip_decorators.py Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
test_tokenization_utils.py Fix flaky Hub CI (test_trainer.py) (#35062) 2024-12-05 17:02:27 +01:00
test_versions_utils.py improve dev setup comments and hints (#28495) 2024-01-15 18:36:40 +00:00
tiny_model_summary.json Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00