transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-15 02:28:24 +06:00

History

Marc Sun 55db70c63d GPTQ integration (#25062 ) * GTPQ integration * Add tests for gptq * support for more quantization model * fix style * typo * fix method * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add dataclass and fix quantization_method * fix doc * Update tests/quantization/gptq/test_gptq.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * modify dataclass * add gtpqconfig import * fix typo * fix tests * remove dataset as req arg * remove tokenizer import * add offload cpu quantization test * fix check dataset * modify dockerfile * protect trainer * style * test for config * add more log * overwrite torch_dtype * draft doc * modify quantization_config docstring * fix class name in docstring * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * more warning * fix 8bit kwargs tests * peft compatibility * remove var * fix is_gptq_quantized * remove is_gptq_quantized * fix wrap * Update src/transformers/modeling_utils.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * add exllama * skip test * overwrite float16 * style * fix skip test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix docsting formatting * add doc * better test --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>		2023-08-10 16:06:29 -04:00
..
internal	Generate: add SequenceBiasLogitsProcessor (#24334 )	2023-06-21 11:14:41 +01:00
main_classes	GPTQ integration (#25062 )	2023-08-10 16:06:29 -04:00
model_doc	Generate: generation config validation fixes in docs (#25405 )	2023-08-09 13:07:11 +01:00
tasks	VQA task guide (#25244 )	2023-08-09 08:29:06 -04:00
_config.py	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
_toctree.yml	VQA task guide (#25244 )	2023-08-09 08:29:06 -04:00
accelerate.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
add_new_model.md	Document check copies (#25291 )	2023-08-04 14:56:29 +02:00
add_new_pipeline.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
add_tensorflow_model.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
attention.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
autoclass_tutorial.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
benchmarks.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
bertology.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
big_models.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
community.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
contributing.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
create_a_model.md	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
custom_models.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
custom_tools.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
debugging.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
fast_tokenizers.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
generation_strategies.md	Generate: `group_beam_search` requires `diversity_penalty>0.0` (#24456 )	2023-06-27 10:46:39 +01:00
glossary.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
hpo_train.md	Update RayTune doc link for Hyperparameter tuning (#24422 )	2023-06-22 10:38:01 -04:00
index.md	Add bloom flax (#25094 )	2023-07-27 18:24:56 +01:00
installation.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
llm_tutorial.md	Docs: introduction to generation with LLMs (#25240 )	2023-08-09 11:09:20 +01:00
model_memory_anatomy.md	[docs] Performance docs tidy up, part 1 (#23963 )	2023-07-24 08:57:24 -04:00
model_sharing.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
model_summary.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
multilingual.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
notebooks.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
pad_truncation.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_hardware.md	🌐 [i18n-KO] Translated `perf_hardware.md` to Korean (#24966 )	2023-07-25 07:44:24 -04:00
perf_infer_cpu.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_infer_gpu_many.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_infer_gpu_one.md	fix: add TOC anchor link (#25066 )	2023-07-25 08:02:33 -04:00
perf_infer_special.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_torch_compile.md	Fix rendering for `torch.compile()` docs (#25432 )	2023-08-10 13:25:00 +02:00
perf_train_cpu_many.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_cpu.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_gpu_many.md	deprecate `sharded_ddp` training argument (#24825 )	2023-07-17 06:57:42 -04:00
perf_train_gpu_one.md	Set `TF32` flag for PyTorch cuDNN backend (#25075 )	2023-07-25 08:04:48 -04:00
perf_train_special.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_tpu_tf.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_tpu.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
performance.md	[docs] Performance docs tidy up, part 1 (#23963 )	2023-07-24 08:57:24 -04:00
perplexity.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
philosophy.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
pipeline_tutorial.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
pipeline_webserver.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
pr_checks.md	Document check copies (#25291 )	2023-08-04 14:56:29 +02:00
preprocessing.md	Removal of deprecated vision methods and specify deprecation versions (#24570 )	2023-06-29 15:09:51 +01:00
quicktour.md	🌐 [i18n-KO] Fixed Korean and English `quicktour.md` (#24664 )	2023-07-21 08:19:28 -04:00
run_scripts.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
sagemaker.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
serialization.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
task_summary.md	Fix doctest (#25031 )	2023-07-25 22:10:06 +02:00
tasks_explained.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
testing.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
tf_xla.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
tflite.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
tokenizer_summary.md	Fix typo: Roberta -> RoBERTa (#25302 )	2023-08-03 14:17:30 -07:00
torchscript.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
training.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
transformers_agents.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
troubleshooting.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00