transformers/docs/source/en/tasks
Pablo Montalvo caa0ff0bf1
Add fuyu model (#26911)
* initial commit

* add processor, add fuyu naming

* add draft processor

* fix processor

* remove dropout to fix loading of weights

* add image processing fixes from Pedro

* fix

* fix processor

* add basic processing fuyu test

* add documentation and TODO

* address comments, add tests, add doc

* replace assert with torch asserts

* add Mixins and fix tests

* clean imports

* add model tester, clean imports

* fix embedding test

* add updated tests from pre-release model

* Processor: return input_ids used for inference

* separate processing and model tests

* relax test tolerance for embeddings

* add test for logit comparison

* make sure fuyu image processor is imported in the init

* fix formattingh

* more formatting issues

* and more

* fixups

* remove some stuff

* nits

* update init

* remove the fuyu file

* Update integration test with release model

* Update conversion script.

The projection is not used, as confirmed by the authors.

* improve geenration

* Remove duplicate function

* Trickle down patches to model call

* processing fuyu updates

* remove things

* fix prepare_inputs_for_generation to fix generate()

* remove model_input

* update

* add generation tests

* nits

* draft leverage automodel and autoconfig

* nits

* fix dtype patch

* address comments, update READMEs and doc, include tests

* add working processing test, remove refs to subsequences

* add tests, remove Sequence classification

* processing

* update

* update the conversion script

* more processing cleanup

* safe import

* take out ModelTesterMixin for early release

* more cl;eanup

* more cleanup

* more cleanup

* and more

* register a buffer

* nits

* add postprocessing of generate output

* nits

* updates

* add one working test

* fix test

* make fixup works

* fixup

* Arthur's updates

* nits

* update

* update

* fix processor

* update tests

* passe more fixups

* fix

* nits

* don't import torch

* skip fuyu config for now

* fixup done

* fixup

* update

* oups

* nits

* Use input embeddings

* no buffer

* update

* styling processing fuyu

* fix test

* update licence

* protect torch import

* fixup and update not doctested

* kwargs should be passed

* udpates

* update the impofixuprts in the test

* protect import

* protecting imports

* protect imports in type checking

* add testing decorators

* protect top level import structure

* fix typo

* fix check init

* move requires_backend to functions

* Imports

* Protect types

---------

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-10-18 15:24:11 -07:00
..
asr.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
audio_classification.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
document_question_answering.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
idefics.md fix typos in idefics.md (#26648) 2023-10-09 12:18:02 +02:00
image_captioning.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
image_classification.md Pvt model (#24720) 2023-07-24 15:34:19 +01:00
image_to_image.md Image-to-Image Task Guide (#26595) 2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md Knowledge distillation for vision guide (#25619) 2023-10-18 04:42:32 -07:00
language_modeling.md Add fuyu model (#26911) 2023-10-18 15:24:11 -07:00
masked_language_modeling.md Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
monocular_depth_estimation.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
multiple_choice.md Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
object_detection.md [doc] fixed indices in obj detection example (#26343) 2023-09-22 10:29:27 -04:00
prompting.md [docs] LLM prompting guide (#26274) 2023-10-12 08:48:01 -04:00
question_answering.md [MPT] Add MosaicML's MPT model to transformers (#24629) 2023-07-25 14:32:40 +02:00
semantic_segmentation.md Update semantic_segmentation.md (#26419) 2023-09-27 11:51:44 +02:00
sequence_classification.md [Mistral] Mistral-7B-v0.1 support (#26447) 2023-09-27 18:30:46 +02:00
summarization.md [Umt5] Add google's umt5 to transformers (#24477) 2023-07-03 07:38:21 +02:00
text-to-speech.md [docs] Updates to TTS task guide with regards to the new TTS pipeline (#26095) 2023-09-12 11:29:06 -04:00
token_classification.md Add BROS (#23190) 2023-09-14 18:02:37 +01:00
translation.md [Umt5] Add google's umt5 to transformers (#24477) 2023-07-03 07:38:21 +02:00
video_classification.md Add ViViT (#22518) 2023-07-11 14:04:04 +01:00
visual_question_answering.md VQA task guide (#25244) 2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md [docs] Fix model reference in zero shot image classification example (#26206) 2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00