Pablo Montalvo
a5bb528471
Fix signatures for processing kwargs (#35105)
* add conversion script
* remove pg2 refs
* fixup style
* small update
* get correct scaling
* add back missing bos
* fix missing config keys
* might revert this pos_embeddings
* fixup 9b config
* fix 9b
* fixup 9b conversion for good + add back num_hidden_layers
* add correct query scaling for 2b, 9b, 27b
* fixup 27b conversion
* Additional variant: 27b-896
* Use CPU for conversion to reduce GPU RAM requirements
* fix causal mask generation + formatting
* fix in-training causal mask generation edge case
* trigger CI
* update config
* update config
* update config
* update config
* update config
* update config
* update config
* update config
* update config
* move conversion file to main model dir
* handle multi-images + bos token
* address comments for input ids
* revert ci fixes
* [run-slow] paligemma
* fix
* [run-slow] paligemma
* skip end 2 end
* [run-slow] paligemma
---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-12-05 18:15:48 +01:00
Yoni Gozlan
62e8c759c3
rename all test_processing_*.py to test_processor_*.py (#33878)
* rename all test_processing_*.py to test_processor_*.py and fix duplicate test processor paligemma
* fix copies
* fix broken tests
* fix-copies
* fix test processor bridgetower
2024-10-02 16:43:43 +02:00
Yoni Gozlan
c0c6815dc9
Add support for args to ProcessorMixin for backward compatibility (#33479)
* add check and prepare args for BC to ProcessorMixin, improve ProcessorTesterMixin
* change size and crop_size in processor kwargs tests to do_rescale and rescale_factor
* remove unnecessary llava processor kwargs test overwrite
* nit
* change data_arg_name to input_name
* Remove unnecessary test override
* Remove unnecessary tests Paligemma
* Move test_prepare_and_validate_optional_call_args to TesterMixin, add docstring
2024-09-20 11:40:59 -04:00
Yoni Gozlan
f111d5b783
Uniformize kwargs for Paligemma processor and update docs (#33571)
* Uniformize paligemma processor
* nit
2024-09-19 14:14:06 -04:00