Raushan Turganbay
1cfcbfcab8
[VLMs] fix flash-attention tests ( #37603 )
...
* fix one test
* fa2 ln test
* remove keys from config recursively
* fix
* fixup
2025-04-24 11:48:11 +02:00
Joao Gante
362fa37da2
[test] update test_past_key_values_format ( #37614 )
...
allow custom shapes
2025-04-22 11:07:34 +01:00
cyyever
1e6b546ea6
Use Python 3.9 syntax in tests ( #37343 )
...
Signed-off-by: cyy <cyyever@outlook.com>
2025-04-08 14:12:08 +02:00
Matt
2d46a08b63
Purge unused ModelTester code ( #37085 )
...
* Purge correctly this time
* Remove more methods from recent PRs
* make fixup
2025-04-03 17:48:35 +01:00
Raushan Turganbay
0c78ef6cd3
🔴 VLM: compile compatibility ( #35724 )
...
* llavas
* add more models
* fix `compile_forward` test for all models
* fix copies
* make style
* also doesn't support cache class
* fix some tests
* not copied from
* ci green?
* fix tests
* fix copies
* fix tests
* check with `numel` and remove `item`
* fix copies
* fix copies
* Update src/transformers/models/cohere2/modeling_cohere2.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* opt remove cross attn
* gemma2
* fixup
* fixup
* fix newly added test
* maybe fixed?
* green please?
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2025-02-14 15:23:49 +01:00
Joao Gante
62c7ea0201
CI: avoid human error, automatically infer generative models ( #33212 )
...
* tmp commit
* move tests to the right class
* remove ALL all_generative_model_classes = ...
* skip tf roberta
* skip InstructBlipForConditionalGenerationDecoderOnlyTest
* videollava
* reduce diff
* reduce diff
* remove on vlms
* fix a few more
* manual rebase bits
* more manual rebase
* remove all manual generative model class test entries
* fix up to ernie
* a few more removals
* handle remaining cases
* recurrent gemma
* it's better here
* make fixup
* tf idefics is broken
* tf bert + generate is broken
* don't touch tf :()
* don't touch tf :(
* make fixup
* better comments for test skips
* revert tf changes
* remove empty line removal
* one more
* missing one
2025-02-13 16:27:11 +01:00
Yoni Gozlan
2b46943195
Add GOT-OCR 2.0 to Transformers ( #34721 )
...
* init modular got_ocr2
* Get correct got_ocr architecture
* add processing
* run modular with processing
* add working inference
* apply modular
* Refactor and fix style
* Refactor, cleanup, fix style
* fix init order
* Fix docs
* add base modeling tests
* fix style and consistency
* rename doc file
* fix repo consistency
* fix inference with box
* add image processing and support for crop_to_multi_page
* Fix batch inference
* add tests
* fixup
* fix slow test
* fix docstrings
* Add model doc
* update to new init
* fix input autocast pixel_values dtype
* update doc
* move doc to multimodal
* Reformat crop_image_to_patches and add docstrings
* Fix example in forward docstring
* Address Pablo review
* [run slow] got_ocr2
* remove defaults defined twice
* apply modular
* add torch_device to integration tests
* update modular
* follow-up Pavel review
* add device variable in doc
* fix doc multi-page
* Force eager attention for vision encoder to avoid attn implementation conflict
* revert qwen2vl doc changes
* use Qwen2ForCausalLM instead of Qwen2Model
* make fixup
* refactor gotocr2 to llava style
* uniformize function names and reduce checks
* final nits
* fix pixel_values dtype error
* change checkpoint names
* fix modular
2025-01-31 11:28:13 -05:00