BakerBunker
|
4b8c6d4cf8
|
Add Qwen2.5-Omni (#36752)
* Add qwen2.5-omni
* Remove einops dependency
* Add torchdiffeq dependency
* Sort init
* Add torchdiffeq to extras['diffeq']
* Fix repo consistency
* use cached_file
* del odeint
* renew pytest
* format
* Remove torchdiffeq
* format
* fixed batch infer bug
* Change positional_embedding to parameter
* Change default speaker
* Config revision
* Use modular & code clean
* code clean
* decouple padding with model & code cleaning
* sort init
* fix
* fix
* Second code review
* fix
* fix
* rename vars to full name + some comments
* update pytest
* Code clean & fix
* fix
* style
* more clean up
* fixup
* smaller vision model in tests
* fix processor test
* deflake a bit the tests (still flaky though)
* de-flake tests finally + add generation mixin
* final nits i hope
* make sure processor tests are complete
* replace with Qwen2_5OmniForConditionalGeneration
* fix tests after updating ckpt
* fix typos when cleaning, also we can't change ckpt
* fixup
* images and videos kwargs for processor
* thinker and talker loadable from hub ckpt
* address comments and update tests after rebase
* fixup
* skip for now
* fixup
* fixup
* remove torch dependency in processors
---------
Co-authored-by: lvyuanjun.lyj <lvyuanjun.lyj@alibaba-inc.con>
Co-authored-by: feizi.wx <feizi.wx@alibaba-inc.com>
Co-authored-by: raushan <raushan@huggingface.co>
|
2025-04-14 12:36:41 +02:00 |
|