transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

History

Yoni Gozlan 203e27059b Add image text to text pipeline (#34170 ) * Standardize image-text-to-text-models-output add post_process_image_text_to_text to chameleon and cleanup Fix legacy kwarg behavior and deprecation warning add post_process_image_text_to_text to qwen2_vl and llava_onevision Add post_process_image_text_to_text to idefics3, mllama, pixtral processor * nit var name post_process_image_text_to_text udop * nit fix deprecation warnings * Add image-text-to-text pipeline * add support for image url in chat template for pipeline * Reformat to be fully compatible with chat templates * Add tests chat template * Fix imports and tests * Add pipeline tag * change logic handling of single prompt ans multiple images * add pipeline mapping to models * fix batched inference * fix tests * Add manual batching for preprocessing * Fix outputs with nested images * Add support for all common processing kwargs * Add default padding when multiple text inputs (batch size>1) * nit change version deprecation warning * Add support for text only inference * add chat_template warnings * Add pipeline tests and add copied from post process function * Fix batched pipeline tests * nit * Fix pipeline tests blip2 * remove unnecessary max_new_tokens * revert processing kosmos2 and remove unnecessary max_new_tokens * fix pipeline tests idefics * Force try loading processor if pipeline supports it * revert load_processor change * hardcode loading only processor * remove unnecessary try except * skip imagetexttotext tests for kosmos2 as tiny model causes problems * Make code clearer * Address review comments * remove preprocessing logic from pipeline * fix fuyu * add BC resize fuyu * Move post_process_image_text_to_text to ProcessorMixin * add guard in post_process * fix zero shot object detection pipeline * add support for generator input in pipeline * nit * change default image-text-to-text model to llava onevision * fix owlv2 size dict * Change legacy deprecation warning to only show when True		2024-10-31 15:48:11 -04:00
..
__init__.py	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
test_pipelines_audio_classification.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_automatic_speech_recognition.py	Add option for running ffmpeg_microphone_live as a background process (#32838 )	2024-10-22 15:56:41 +02:00
test_pipelines_common.py	Pipeline: no side-effects on `model.config` and `model.generation_config` 🔫 (#33480 )	2024-09-18 15:43:06 +01:00
test_pipelines_depth_estimation.py	Add post_process_depth_estimation to image processors and support ZoeDepth's inference intricacies (#32550 )	2024-10-22 15:50:54 +02:00
test_pipelines_document_question_answering.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_feature_extraction.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_fill_mask.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_image_classification.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_image_feature_extraction.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_image_segmentation.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_image_text_to_text.py	Add image text to text pipeline (#34170 )	2024-10-31 15:48:11 -04:00
test_pipelines_image_to_image.py	Allow FP16 or other precision inference for Pipelines (#31342 )	2024-07-05 17:21:50 +01:00
test_pipelines_image_to_text.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_mask_generation.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_object_detection.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_question_answering.py	enable QA bf16 pipeline (#34483 )	2024-10-31 12:55:53 +00:00
test_pipelines_summarization.py	Avoid check expected exception when it is on CUDA (#34408 )	2024-10-25 17:14:07 +02:00
test_pipelines_table_question_answering.py	Allow FP16 or other precision inference for Pipelines (#31342 )	2024-07-05 17:21:50 +01:00
test_pipelines_text_classification.py	Fix default behaviour in TextClassificationPipeline for regression problem type (#34066 )	2024-10-15 13:06:20 +01:00
test_pipelines_text_generation.py	Avoid check expected exception when it is on CUDA (#34408 )	2024-10-25 17:14:07 +02:00
test_pipelines_text_to_audio.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_text2text_generation.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_token_classification.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_translation.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_video_classification.py	Sync video classification pipeline with huggingface_hub spec (#34288 )	2024-10-22 13:33:49 +01:00
test_pipelines_visual_question_answering.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00
test_pipelines_zero_shot_audio_classification.py	Allow FP16 or other precision inference for Pipelines (#31342 )	2024-07-05 17:21:50 +01:00
test_pipelines_zero_shot_image_classification.py	Image pipelines spec compliance (#33899 )	2024-10-08 13:34:28 +01:00
test_pipelines_zero_shot_object_detection.py	Add image text to text pipeline (#34170 )	2024-10-31 15:48:11 -04:00
test_pipelines_zero_shot.py	Make `pipeline` able to load `processor` (#32514 )	2024-10-09 16:46:11 +01:00