transformers/tests/pipelines
Yoni Gozlan 203e27059b
Add image text to text pipeline (#34170)
* Standardize image-text-to-text-models-output

add post_process_image_text_to_text to chameleon and cleanup

Fix legacy kwarg behavior and deprecation warning

add post_process_image_text_to_text to qwen2_vl and llava_onevision

Add post_process_image_text_to_text to idefics3, mllama, pixtral processor

* nit var name post_process_image_text_to_text udop

* nit fix deprecation warnings

* Add image-text-to-text pipeline

* add support for image url in chat template for pipeline

* Reformat to be fully compatible with chat templates

* Add tests chat template

* Fix imports and tests

* Add pipeline tag

* change logic handling of single prompt ans multiple images

* add pipeline mapping to models

* fix batched inference

* fix tests

* Add manual batching for preprocessing

* Fix outputs with nested images

* Add support for all common processing kwargs

* Add default padding when multiple text inputs (batch size>1)

* nit change version deprecation warning

* Add support for text only inference

* add chat_template warnings

* Add pipeline tests and add copied from post process function

* Fix batched pipeline tests

* nit

* Fix pipeline tests blip2

* remove unnecessary max_new_tokens

* revert processing kosmos2 and remove unnecessary max_new_tokens

* fix pipeline tests idefics

* Force try loading processor if pipeline supports it

* revert load_processor change

* hardcode loading only processor

* remove unnecessary try except

* skip imagetexttotext tests for kosmos2 as tiny model causes problems

* Make code clearer

* Address review comments

* remove preprocessing logic from pipeline

* fix fuyu

* add BC resize fuyu

* Move post_process_image_text_to_text to ProcessorMixin

* add guard in post_process

* fix zero shot object detection pipeline

* add support for generator input in pipeline

* nit

* change default image-text-to-text model to llava onevision

* fix owlv2 size dict

* Change legacy deprecation warning to only show when True
2024-10-31 15:48:11 -04:00
..
__init__.py [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
test_pipelines_audio_classification.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_automatic_speech_recognition.py Add option for running ffmpeg_microphone_live as a background process (#32838) 2024-10-22 15:56:41 +02:00
test_pipelines_common.py Pipeline: no side-effects on model.config and model.generation_config 🔫 (#33480) 2024-09-18 15:43:06 +01:00
test_pipelines_depth_estimation.py Add post_process_depth_estimation to image processors and support ZoeDepth's inference intricacies (#32550) 2024-10-22 15:50:54 +02:00
test_pipelines_document_question_answering.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_feature_extraction.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_fill_mask.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_image_classification.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_image_feature_extraction.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_image_segmentation.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_image_text_to_text.py Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
test_pipelines_image_to_image.py Allow FP16 or other precision inference for Pipelines (#31342) 2024-07-05 17:21:50 +01:00
test_pipelines_image_to_text.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_mask_generation.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_object_detection.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_question_answering.py enable QA bf16 pipeline (#34483) 2024-10-31 12:55:53 +00:00
test_pipelines_summarization.py Avoid check expected exception when it is on CUDA (#34408) 2024-10-25 17:14:07 +02:00
test_pipelines_table_question_answering.py Allow FP16 or other precision inference for Pipelines (#31342) 2024-07-05 17:21:50 +01:00
test_pipelines_text_classification.py Fix default behaviour in TextClassificationPipeline for regression problem type (#34066) 2024-10-15 13:06:20 +01:00
test_pipelines_text_generation.py Avoid check expected exception when it is on CUDA (#34408) 2024-10-25 17:14:07 +02:00
test_pipelines_text_to_audio.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_text2text_generation.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_token_classification.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_translation.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_video_classification.py Sync video classification pipeline with huggingface_hub spec (#34288) 2024-10-22 13:33:49 +01:00
test_pipelines_visual_question_answering.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00
test_pipelines_zero_shot_audio_classification.py Allow FP16 or other precision inference for Pipelines (#31342) 2024-07-05 17:21:50 +01:00
test_pipelines_zero_shot_image_classification.py Image pipelines spec compliance (#33899) 2024-10-08 13:34:28 +01:00
test_pipelines_zero_shot_object_detection.py Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
test_pipelines_zero_shot.py Make pipeline able to load processor (#32514) 2024-10-09 16:46:11 +01:00