Younes Belkada
a1d4563f7a
accelerate
support for OwlViT
(#20411 )
...
* `accelerate` support for `OwlViT`
- added `accelerate` support
- added slow `fp16` tests
* apply suggestions
2022-11-25 11:20:44 +01:00
Alara Dirik
a00b7e85ea
Adds image-guided object detection support to OWL-ViT ( #20136 )
...
Adds image-guided object detection method to OwlViTForObjectDetection class as described in the original paper. One-shot/ image-guided object detection enables users to use a query image to search for similar objects in the input image.
Co-Authored-By: Dhruv Karan k4r4n.dhruv@gmail.com
2022-11-16 09:07:46 +03:00
NielsRogge
cbbeca3d17
[OWL-ViT] Make model consistent with CLIP ( #20144 )
...
* Apply fix
* Fix test
* Remove another argument which is not used
* Fix pipeline test
* Add argument back, add deprecation warning
* Add warning add other location
* Use warnings instead
* Add num_channels to config
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
2022-11-11 11:36:17 +01:00
Yih-Dar
2d02178e5c
Allow passing arguments to model testers for CLIP-like models ( #20044 )
...
* POC
* For more CLIP-like models
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-11-04 18:01:41 +01:00
Alara Dirik
f28f240828
fix owlvit tests, update docstring examples ( #18586 )
2022-08-11 19:10:25 +03:00
Sylvain Gugger
3f0707b2fe
Skip broken tests
2022-08-11 09:33:41 -04:00
Alara Dirik
9caf68a638
Owlvit test fixes ( #18303 )
...
* fix owlvit test assertion errors
* fix gpu test error
* remove redundant lines
* fix styling
2022-07-27 17:26:27 +03:00
Alara Dirik
12d66b4701
Add OWL-ViT model for zero-shot object detection ( #17938 )
...
* add owlvit model skeleton
* add class and box predictor heads
* convert modified flax clip to pytorch
* fix box and class predictors
* add OwlViTImageTextEmbedder
* convert class and box head checkpoints
* convert image text embedder checkpoints
* add object detection head
* fix bugs
* update conversion script
* update conversion script
* fix q,v,k,out weight conversion conversion
* add owlvit object detection output
* fix bug in image embedder
* fix bugs in text embedder
* fix positional embeddings
* fix bug in inference mode vision pooling
* update docs, init tokenizer and processor files
* support batch processing
* add OwlViTProcessor
* remove merge conflicts
* readd owlvit imports
* fix bug in OwlViTProcessor imports
* fix bugs in processor
* update docs
* fix bugs in processor
* update owlvit docs
* add OwlViTFeatureExtractor
* style changes, add postprocess method to feature extractor
* add feature extractor and processor tests
* add object detection tests
* update conversion script
* update config paths
* update config paths
* fix configuration paths and bugs
* fix bugs in OwlViT tests
* add import checks to processor
* fix docs and minor issues
* fix docs and minor issues
* fix bugs and issues
* fix bugs and issues
* fix bugs and issues
* fix bugs and issues
* update docs and examples
* fix bugs and issues
* update conversion script, fix positional embeddings
* process 2D input ids, update tests
* fix style and quality issues
* update docs
* update docs and imports
* update OWL-ViT index.md
* fix bug in OwlViT feature ext tests
* fix code examples, return_dict by default
* return_dict by default
* minor fixes, add tests to processor
* small fixes
* add output_attentions arg to main model
* fix bugs
* remove output_hidden_states arg from main model
* update self.config variables
* add option to return last_hidden_states
* fix bug in config variables
* fix copied from statements
* fix small issues and bugs
* fix bugs
* fix bugs, support greyscale images
* run fixup
* update repo name
* merge OwlViTImageTextEmbedder with obj detection head
* fix merge conflict
* fix merge conflict
* make fixup
* fix bugs
* fix bugs
* add additional processor test
2022-07-22 13:35:32 +03:00