transformers/tests/pipelines
NielsRogge 9c6f7485a6
Add GIT (GenerativeImage2Text) (#20295)
* First draft

* Make model instantiation work

* Fix copied from statement

* More fixes

* Add correct output head

* Improve configuration

* Add conversion script

* Improve conversion script

* Remove token_type_ids

* Fix conversion of projection layers

* Convert all weights

* Use cats image

* Make logits match

* Generate caption on cats image

* Add GITProcessor

* Update conversion script

* Add support for more checkpoints

* Fix conversion script

* Add initial tests

* Remove cross-attention

* More improvements

* Remove is_decoder

* Improve model tests

* Improve tests

* Improve model outputs

* Fix model outputs equivalence

* Fix more tests

* Remove unused code

* Use generate to generate text, no use of cache for now

* Use generate more appropriately

* Fix config tests

* Fix style

* Add support for use_cache

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Fix style

* Fix GIT vision encoder

* Update README

* Fix integration test

* Set bos and eos token ids

* Improve docs

* Improve code

* Add support for provided attention_mask

* Add copied from statement

* Fix gradient checkpointing test

* Set model_input_names

* Investigate model_input_names

* Remove script

* Fix model inputs

* Fix docstring

* Rename GIT to Git

* Support more models

* Add support for textvqa model

* Add video support

* Extend conversion script for video

* Add support for large variant

* Add support for more models

* Fix config archive map

* Update integration test

* Fix README

* Fix CLIP mean and std

* Update processor

* Fix use_cache for video, thanks @gante

* Remove print statements

* Remove assertion

* Add processor tests

* Fix model_input_names

* Use Auto API for processor

* Fix processor tests

* Fix integration test

* Fix pipeline test

* Make tests faster

* Update conversion script

* Update conversion script

* Convert more checkpoints

* Update conversion script

* Fix typo

* Update docstrings

* Improve code snippets

* Fix doc tests

* Add more code examples

* Fix doc tests

* Add integration tests

* Fix unused variable

* revert

* Add GIT to Japanese README

Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-01-03 14:17:18 +01:00
__init__.py [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
test_pipelines_audio_classification.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_automatic_speech_recognition.py Add generate kwargs to AutomaticSpeechRecognitionPipeline (#20952) 2022-12-31 01:13:28 -05:00
test_pipelines_common.py Implement Roberta PreLayerNorm (#20305) 2022-12-19 09:30:17 +01:00
test_pipelines_conversational.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_depth_estimation.py Add depth estimation pipeline (#18618) 2022-10-12 08:54:20 -04:00
test_pipelines_document_question_answering.py A few CI fixes for DocumentQuestionAnsweringPipeline (#19584) 2022-10-17 15:35:27 +02:00
test_pipelines_feature_extraction.py [Pipeline] skip feature extraction test if in IMAGE_PROCESSOR_MAPPING (#20790) 2022-12-16 12:46:58 +01:00
test_pipelines_fill_mask.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_image_classification.py Specify TF framework explicitly in more pipeline tests (#19748) 2022-10-19 16:24:03 +02:00
test_pipelines_image_segmentation.py Make ImageSegmentationPipelineTests less flaky (#20147) 2022-11-15 09:14:55 +01:00
test_pipelines_image_to_text.py fix image2test args forwarding (#19648) 2022-10-24 09:49:24 -04:00
test_pipelines_object_detection.py Fix object detection2 (#20798) 2022-12-16 13:25:36 +01:00
test_pipelines_question_answering.py Clean up deprecation warnings (#19654) 2022-10-18 13:34:47 -04:00
test_pipelines_summarization.py Change a logic in pipeline test regarding TF (#20710) 2022-12-13 13:42:36 +01:00
test_pipelines_table_question_answering.py Add require_torch to 2 pipeline tests (#20585) 2022-12-05 16:06:39 +01:00
test_pipelines_text_classification.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_text_generation.py Add GIT (GenerativeImage2Text) (#20295) 2023-01-03 14:17:18 +01:00
test_pipelines_text2text_generation.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_token_classification.py TF port of ESM (#19587) 2022-10-17 14:16:16 +01:00
test_pipelines_translation.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_video_classification.py Add video classification pipeline (#20151) 2022-12-08 16:22:43 -05:00
test_pipelines_visual_question_answering.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_zero_shot_image_classification.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_zero_shot_object_detection.py [Proposal] Breaking change zero-shot-object-detection for improved consistency. (#20280) 2022-11-18 15:57:28 +01:00
test_pipelines_zero_shot.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00