transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

jiqing-feng 387663e571 Enable gptqmodel (#35012)	2025-01-15 14:22:49 +01:00
..
import_structures	Import structure & first three model refactors (#31329)	2024-09-10 11:10:53 +02:00
__init__.py	[Test refactor 1/5] Per-folder tests reorganization (#15725)	2022-02-23 15:46:28 -05:00
test_activations_tf.py	TF: Add sigmoid activation function (#16819)	2022-04-19 16:13:08 +01:00
test_activations.py	Add the GeLU activation from pytorch with the tanh approximation (#21345)	2023-02-02 09:33:04 -05:00
test_add_new_model_like.py	fix: Fixed failing tests in `tests/utils/test_add_new_model_like.py` (#32678)	2024-08-14 12:06:17 +01:00
test_audio_utils.py	Remove `trust_remote_code` when loading Libri Dummy (#31748)	2024-07-23 14:54:38 +08:00
test_backbone_utils.py	🚨 out_indices always a list (#30941)	2024-05-22 15:23:04 +01:00
test_cache_utils.py	Enable gptqmodel (#35012)	2025-01-15 14:22:49 +01:00
test_chat_template_utils.py	Make tool JSON schemas consistent (#31756)	2024-07-02 20:00:42 +01:00
test_cli.py	Forbid `PretrainedConfig` from saving `generate` parameters; Update deprecations in `generate`-related code 🧹 (#32659)	2024-08-23 11:12:53 +01:00
test_configuration_utils.py	Fix flaky Hub CI (`test_trainer.py`) (#35062)	2024-12-05 17:02:27 +01:00
test_convert_slow_tokenizer.py	Revert error back into warning for byte fallback conversion. (#22607)	2023-04-06 14:00:29 +02:00
test_deprecation.py	Decorators for deprecation and named arguments validation (#30799)	2024-06-10 12:35:10 +01:00
test_doc_samples.py	Skip tests properly (#31308)	2024-06-26 21:59:08 +01:00
test_dynamic_module_utils.py	Fix the regex in `get_imports` to support multiline try blocks and excepts with specific exception types (#23725)	2023-05-24 15:40:19 -04:00
test_feature_extraction_utils.py	Fix flaky Hub CI (`test_trainer.py`) (#35062)	2024-12-05 17:02:27 +01:00
test_file_utils.py	Inheritance-based framework detection (#21784)	2023-02-27 15:31:55 +00:00
test_generic.py	Decorators for deprecation and named arguments validation (#30799)	2024-06-10 12:35:10 +01:00
test_hf_argparser.py	HfArgumentParser: allow for hyhenated field names in long-options (#33990)	2024-10-10 11:58:26 +02:00
test_hub_utils.py	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016)	2024-05-29 11:55:43 +01:00
test_image_processing_utils.py	Fix flaky Hub CI (`test_trainer.py`) (#35062)	2024-12-05 17:02:27 +01:00
test_image_utils.py	fix: Replaced deprecated `mktemp()` function (#32123)	2024-07-22 14:13:39 +01:00
test_import_structure.py	Fix some missing tests in circleci (#33559)	2024-09-20 20:58:51 +02:00
test_logging.py	Fix flaky test for log level (#21776)	2023-02-28 16:24:14 -05:00
test_model_card.py	Automatically add `transformers` tag to the modelcard (#32623)	2024-08-13 07:59:01 +02:00
test_model_output.py	Skip tests properly (#31308)	2024-06-26 21:59:08 +01:00
test_modeling_flax_utils.py	Fix flaky Hub CI (`test_trainer.py`) (#35062)	2024-12-05 17:02:27 +01:00
test_modeling_rope_utils.py	More model refactoring! (#35359)	2025-01-09 11:09:09 +01:00
test_modeling_tf_core.py	Add tf_keras imports to prepare for Keras 3 (#28588)	2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py	Fix flaky Hub CI (`test_trainer.py`) (#35062)	2024-12-05 17:02:27 +01:00
test_modeling_utils.py	Enable different torch dtype in sub models (#34873)	2025-01-13 13:42:08 +01:00
test_offline.py	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016)	2024-05-29 11:55:43 +01:00
test_processing_utils.py	Uniformize kwargs for Pixtral processor (#33521)	2024-09-17 14:44:27 -04:00
test_skip_decorators.py	Update quality tooling for formatting (#21480)	2023-02-06 18:10:56 -05:00
test_tokenization_utils.py	Fix flaky Hub CI (`test_trainer.py`) (#35062)	2024-12-05 17:02:27 +01:00
test_versions_utils.py	improve dev setup comments and hints (#28495)	2024-01-15 18:36:40 +00:00
tiny_model_summary.json	Add image text to text pipeline (#34170)	2024-10-31 15:48:11 -04:00