transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-15 10:38:23 +06:00

Shijie 19e6e80e10 support qwen2-vl (#32318 ) * support-qwen2-vl * tidy * tidy * tidy * tidy * tidy * tidy * tidy * hyphen->underscore * make style * add-flash2-tipd * delete-tokenize=False * remove-image_processor-in-init-file * add-qwen2_vl-in-MODEL_FOR_VISION_2_SEQ_MAPPING_NAMES * format-doct * support-Qwen2VLVisionConfig * remove-standardize_cache_format * fix-letter-varaibles * remove-torch-in-image-processor * remove-useless-docstring * fix-one-letter-varaible-name * change-block-name * default-quick-gelu-in-vision * remove-useless-doc * use-preimplemented-flash-forward * fix-doc * fix-image-processing-doc * fix-apply-rotary-embed * fix-flash-attn-sliding-window * refactor * remove-default_template * remove-reorder_cache * simple-get-rope_deltas * update-prepare_inputs_for_generation * update-attention-mask * update-rotary_seq_len * remove-state * kv_seq_length * remove-warning * _supports_static_cache * remove-legacy-cache * refactor * fix-replace * mrope-section-doc * code-quality * code-quality * polish-doc * fix-image-processing-test * update readme * Update qwen2_vl.md * fix-test * Update qwen2_vl.md * nit * processor-kwargs * hard-code-norm_layer * code-quality * discard-pixel-values-in-gen * fix-inconsistent-error-msg * unify-image-video * hidden_act * add-docstring * vision-encode-as-PreTrainedModel * pixel-to-target-dtype * update doc and low memoryvit * format * format * channel-foramt * fix vit_flashatt * format * inherit-Qwen2VLPreTrainedModel * simplify * format-test * remove-one-line-func-in-image-processing * avoid-one-line-reshape * simplify-rotary_seq_len * avoid-single-letter-variable * no-for-loop-sdpa * avoid-single-letter-variable * remove-one-line-reshape * remove-one-line-reshape * remove-no-rope-in-vit-logic * default-mrope * add-copied-from * more-docs-for-mrope * polish-doc * comment-and-link * polish-doc * single-letter-variables * simplify-image-processing * video->images * kv_seq_len-update * vision-rope-on-the-fly * 
vision-eager-attention * change-processor-order --------- Co-authored-by: baishuai <baishuai.bs@alibaba-inc.com> Co-authored-by: ShuaiBai623 <43326198+ShuaiBai623@users.noreply.github.com>		2024-08-26 15:16:44 +02:00
internal	Forbid `PretrainedConfig` from saving `generate` parameters; Update deprecations in `generate`-related code 🧹 (#32659 )	2024-08-23 11:12:53 +01:00
main_classes	Add TorchAOHfQuantizer (#32306 )	2024-08-14 16:14:24 +02:00
model_doc	support qwen2-vl (#32318 )	2024-08-26 15:16:44 +02:00
quantization	Add TorchAOHfQuantizer (#32306 )	2024-08-14 16:14:24 +02:00
tasks	[docs] Translation guide (#32547 )	2024-08-08 13:43:14 -07:00
_config.py	[#29174 ] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888 )	2024-04-08 14:21:16 +01:00
_redirects.yml	Docs / Quantization: Redirect deleted page (#31063 )	2024-05-28 18:29:22 +02:00
_toctree.yml	support qwen2-vl (#32318 )	2024-08-26 15:16:44 +02:00
accelerate.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
add_new_model.md	Remove add-new-model in favor of add-new-model-like (#30424 )	2024-04-24 09:38:18 +02:00
add_new_pipeline.md	add `push_to_hub` to pipeline (#29172 )	2024-04-16 15:34:04 +01:00
agents.md	Agents use grammar (#31735 )	2024-08-07 11:42:52 +02:00
attention.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
autoclass_tutorial.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
benchmarks.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
bertology.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
big_models.md	[docs] Big model loading (#29920 )	2024-04-01 18:47:32 -07:00
chat_templating.md	Update Jinja docs with new functions and general cleanup (#33097 )	2024-08-23 17:40:06 +01:00
community.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
contributing.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
conversations.md	[docs] change temperature to a positive value (#32077 )	2024-07-23 17:47:51 +01:00
create_a_model.md	Enable HF pretrained backbones (#31145 )	2024-06-06 22:02:38 +01:00
custom_models.md	Updated the custom_models.md changed cross_entropy code (#33118 )	2024-08-26 13:15:43 +02:00
debugging.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
deepspeed.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
fast_tokenizers.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
fsdp.md	[docs] Trainer docs (#28145 )	2023-12-20 10:37:23 -08:00
generation_strategies.md	Docs: alert for the possibility of manipulating logits (#32467 )	2024-08-07 16:34:46 +01:00
gguf.md	Add Qwen2 GGUF loading support (#31175 )	2024-06-03 14:55:10 +01:00
glossary.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
hpo_train.md	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00
index.md	support qwen2-vl (#32318 )	2024-08-26 15:16:44 +02:00
installation.md	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016 )	2024-05-29 11:55:43 +01:00
kv_cache.md	Cache: create docs (#32150 )	2024-08-06 10:24:19 +05:00
llm_optims.md	Cache: use `batch_size` instead of `max_batch_size` (#32657 )	2024-08-16 11:48:45 +01:00
llm_tutorial_optimization.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
llm_tutorial.md	Add SynCode to llm_tutorial (#32884 )	2024-08-22 15:30:22 +02:00
model_memory_anatomy.md	🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190 )	2024-04-18 12:49:43 -04:00
model_sharing.md	Docs: formatting nits (#32247 )	2024-07-30 15:49:14 +01:00
model_summary.md	model_summary.md - Restore link to Harvard's Annotated Transformer. (#29702 )	2024-03-23 18:29:39 -07:00
multilingual.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
notebooks.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
pad_truncation.md	[Doc] Spanish translation of pad_truncation.md (#27890 )	2023-12-08 10:32:18 -08:00
peft.md	Docs / Quantization: Replace all occurences of `load_in_8bit` with bnb config (#31136 )	2024-05-30 16:47:35 +02:00
perf_hardware.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
perf_infer_cpu.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
perf_infer_gpu_one.md	support qwen2-vl (#32318 )	2024-08-26 15:16:44 +02:00
perf_torch_compile.md	fix(docs): Fixed a link in docs (#32274 )	2024-07-29 10:50:43 +01:00
perf_train_cpu_many.md	Update the distributed CPU training on Kubernetes documentation (#32669 )	2024-08-14 09:36:43 -07:00
perf_train_cpu.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
perf_train_gpu_many.md	Update perf_train_gpu_many.md (#31451 )	2024-06-18 11:00:26 -07:00
perf_train_gpu_one.md	Add torch_empty_cache_steps to TrainingArguments (#31546 )	2024-07-04 13:20:49 -04:00
perf_train_special.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
perf_train_tpu_tf.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
performance.md	[docs] Update CPU/GPU inference docs (#26881 )	2023-10-31 09:44:51 -07:00
perplexity.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
philosophy.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
pipeline_tutorial.md	Docs: Fixed `whisper-large-v2` model link in docs (#32871 )	2024-08-19 09:50:35 -07:00
pipeline_webserver.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
pr_checks.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
preprocessing.md	chore: remove duplicate words (#31853 )	2024-07-09 10:38:29 +01:00
quicktour.md	docs: fix broken link (#31370 )	2024-06-12 11:33:00 +01:00
run_scripts.md	Fix broken link to Transformers notebooks (#30512 )	2024-04-29 10:57:51 +01:00
sagemaker.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
serialization.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
task_summary.md	More fixes for doctest (#30265 )	2024-04-16 11:58:55 +02:00
tasks_explained.md	[docs] Spanish translation of tasks_explained.md (#29224 )	2024-02-26 08:18:15 -08:00
testing.md	Docs: Fixed WhisperModel.forward’s docstring link (#32498 )	2024-08-07 11:01:33 -07:00
tf_xla.md	fix(docs): Fixed a link in docs (#32274 )	2024-07-29 10:50:43 +01:00
tflite.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
tokenizer_summary.md	[docs] Spanish translation of tokenizer_summary.md (#31154 )	2024-06-03 16:52:23 -07:00
torchscript.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
trainer.md	Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer (#32860 )	2024-08-23 13:20:49 +02:00
training.md	Added the necessay import of module (#30804 )	2024-05-14 18:45:06 +01:00
troubleshooting.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00