transformers/docs/source/ja/model_doc
Yoni Gozlan fa56dcc2ab
Refactoring of ImageProcessorFast (#35069)
* add init and base image processing functions

* add add_fast_image_processor to transformers-cli

* add working fast image processor clip

* add fast image processor to doc, working tests

* remove "to be implemented" SigLip

* fix unprotected import

* fix unprotected vision import

* update ViTImageProcessorFast

* increase threshold slow fast ewuivalence

* add fast img blip

* add fast class in tests with cli

* improve cli

* add fast image processor convnext

* add LlavaPatchingMixin and fast image processor for llava_next and llava_onevision

* add device kwarg to ImagesKwargs for fast processing on cuda

* cleanup

* fix unprotected import

* group images by sizes and add batch processing

* Add batch equivalence tests, skip when center_crop is used

* cleanup

* update init and cli

* fix-copies

* refactor convnext, cleanup base

* fix

* remove patching mixins, add piped torchvision transforms for ViT

* fix unbatched processing

* fix f strings

* protect imports

* change llava onevision to class transforms (test)

* fix convnext

* improve formatting (following Pavel review)

* fix handling device arg

* improve cli

* fix

* fix inits

* Add distinction between preprocess and _preprocess, and support for arbitrary kwargs through valid_extra_kwargs

* uniformize qwen2_vl fast

* fix docstrings

* add add fast image processor llava

* remove min_pixels max_pixels from accepted size

* nit

* nit

* refactor fast image processors docstrings

* cleanup and remove fast class transforms

* update add fast image processor transformers cli

* cleanup docstring

* uniformize pixtral fast and  make _process_image explicit

* fix prepare image structure llava next/onevision

* Use typed kwargs instead of explicit args

* nit fix import Unpack

* clearly separate pops and gets in base preprocess. Use explicit typed kwargs

* make qwen2_vl preprocess arguments hashable
2025-02-04 17:52:31 -05:00
..
albert.md
align.md
altclip.md
audio-spectrogram-transformer.md
auto.md
autoformer.md
bark.md
bart.md
barthez.md
bartpho.md
beit.md
bert-generation.md
bert-japanese.md
bert.md
bertweet.md
big_bird.md
bigbird_pegasus.md
biogpt.md
bit.md
blenderbot-small.md
blenderbot.md
blip-2.md
blip.md Refactoring of ImageProcessorFast (#35069) 2025-02-04 17:52:31 -05:00
bloom.md
bort.md
bridgetower.md
bros.md
byt5.md
camembert.md
canine.md
chinese_clip.md
clap.md
clip.md Refactoring of ImageProcessorFast (#35069) 2025-02-04 17:52:31 -05:00
clipseg.md
clvp.md
code_llama.md
codegen.md
conditional_detr.md
convbert.md
convnext.md Refactoring of ImageProcessorFast (#35069) 2025-02-04 17:52:31 -05:00
convnextv2.md
cpm.md
cpmant.md
ctrl.md
cvt.md
data2vec.md
deberta-v2.md
deberta.md
decision_transformer.md
deformable_detr.md
deit.md Refactoring of ImageProcessorFast (#35069) 2025-02-04 17:52:31 -05:00
deplot.md
deta.md
detr.md
dialogpt.md
dinat.md