transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 22:00:09 +06:00

History

Yoni Gozlan 203e27059b Add image text to text pipeline (#34170 ) * Standardize image-text-to-text-models-output add post_process_image_text_to_text to chameleon and cleanup Fix legacy kwarg behavior and deprecation warning add post_process_image_text_to_text to qwen2_vl and llava_onevision Add post_process_image_text_to_text to idefics3, mllama, pixtral processor * nit var name post_process_image_text_to_text udop * nit fix deprecation warnings * Add image-text-to-text pipeline * add support for image url in chat template for pipeline * Reformat to be fully compatible with chat templates * Add tests chat template * Fix imports and tests * Add pipeline tag * change logic handling of single prompt ans multiple images * add pipeline mapping to models * fix batched inference * fix tests * Add manual batching for preprocessing * Fix outputs with nested images * Add support for all common processing kwargs * Add default padding when multiple text inputs (batch size>1) * nit change version deprecation warning * Add support for text only inference * add chat_template warnings * Add pipeline tests and add copied from post process function * Fix batched pipeline tests * nit * Fix pipeline tests blip2 * remove unnecessary max_new_tokens * revert processing kosmos2 and remove unnecessary max_new_tokens * fix pipeline tests idefics * Force try loading processor if pipeline supports it * revert load_processor change * hardcode loading only processor * remove unnecessary try except * skip imagetexttotext tests for kosmos2 as tiny model causes problems * Make code clearer * Address review comments * remove preprocessing logic from pipeline * fix fuyu * add BC resize fuyu * Move post_process_image_text_to_text to ProcessorMixin * add guard in post_process * fix zero shot object detection pipeline * add support for generator input in pipeline * nit * change default image-text-to-text model to llava onevision * fix owlv2 size dict * Change legacy deprecation warning to only show when True		2024-10-31 15:48:11 -04:00
..
agent.md	Reboot Agents (#30387 )	2024-05-07 12:59:49 +02:00
callback.md	Update CometCallback to allow reusing of the running experiment (#31366 )	2024-07-05 08:13:46 +02:00
configuration.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
data_collator.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
deepspeed.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
feature_extractor.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
image_processor.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
keras_callbacks.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
logging.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
model.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
onnx.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
optimizer_schedules.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
output.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
pipelines.md	Add image text to text pipeline (#34170 )	2024-10-31 15:48:11 -04:00
processors.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
quantization.md	Docs / Quantization: Replace all occurences of `load_in_8bit` with bnb config (#31136 )	2024-05-30 16:47:35 +02:00
text_generation.md	Generate: inner decoding methods are no longer public (#29437 )	2024-03-05 10:27:36 +00:00
tokenizer.md	translation main-class files to chinese (#27588 )	2023-11-27 12:36:37 -08:00
trainer.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00