Raushan Turganbay
43df47d8e7
Llava Onevision: add model ( #32673 )
...
* working version
* fix copies
* update
* tests
* update docs
* codestyle
* add more tests
* add returns for docs
* clean up
* Update src/transformers/models/llava_onevision/processing_llava_onevision.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* updates
* codestyle
* style
* shouldn't be reversed
* [run-slow] llava_onevision
* [run-slow] llava_onevision
* add pooling in videos
* [run-slow] llava_onevision
* num-logits-to-keep
* [run-slow] llava_onevision
* [run-slow] llava_onevision
* Update tests/test_modeling_common.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* video matched orig impl
* fix tests
* chat template was modified
* Update docs/source/en/model_doc/llava_onevision.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add more info in the doc page
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-09-05 14:43:20 +05:00
Yoni Gozlan
9230d78e76
Add validate images and text inputs order util for processors and test_processing_utils ( #33285 )
...
* Add validate images and test processing utils
* Remove encoded text from possible inputs in tests
* Removed encoded inputs as valid in processing_utils
* change text input check to be recursive
* change text check to all element of lists and not just the first one in recursive checks
2024-09-04 13:50:31 -04:00
Matthew Douglas
b3909989d3
Fix excessive CPU memory usage with FSDP and cpu_ram_efficient_loading ( #33154 )
2024-09-04 18:37:54 +02:00
Yoach Lacombe
a1faf22f2c
[BUG] fix upper nltk version ( #33301 )
...
fix upper nltk version
2024-09-04 18:28:08 +02:00
Aymeric Roucher
cfd92c64f5
Add new documentation page for advanced agent usage ( #33265 )
...
* Add new documentation page for advanced agent usage
2024-09-04 18:19:54 +02:00
Matt
01c8c6c419
Add a warning to the chat template docs about the tool_calls format ( #33277 )
...
* Add a warning to the chat template docs
* Add a warning to the chat template docs
* Add a warning to the chat template docs
2024-09-04 17:13:34 +01:00
Aymeric Roucher
2cb543db77
Multi agents with manager ( #32687 )
...
* Add Multi agents with a hierarchical system
2024-09-04 17:30:54 +02:00
amyeroberts
d2dcff96f8
[InstructBLIP] qformer_tokenizer is required input ( #33222 )
...
* [InstructBLIP] qformer_tokenizer is required input
* Bit safer
* Add to instructblipvideo processor
* Fix up
* Use video inputs
* Update tests/models/instructblipvideo/test_processor_instructblipvideo.py
2024-09-04 16:18:06 +01:00
dependabot[bot]
5731dc8dd8
Bump cryptography from 42.0.0 to 43.0.1 in /examples/research_projects/decision_transformer ( #33286 )
...
Bump cryptography in /examples/research_projects/decision_transformer
Bumps [cryptography](https://github.com/pyca/cryptography ) from 42.0.0 to 43.0.1.
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst )
- [Commits](https://github.com/pyca/cryptography/compare/42.0.0...43.0.1 )
---
updated-dependencies:
- dependency-name: cryptography
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-04 17:13:18 +02:00
Alex Sherstinsky
122ded0a11
Bugfix/alexsherstinsky/fix none check for attention factor in rope scaling 2024 08 28 0 ( #33188 )
...
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
* Fixing a bug in the way "attention_factor" is validated in ROPE utilities.
2024-09-04 17:01:12 +02:00
Yih-Dar
178cb6bb1c
wait 15m before SSH into runner workflow stops ( #33300 )
...
15m
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2024-09-04 16:20:56 +02:00
laurentd-lunit
d703477265
[fix] LlavaNextProcessor '_get_unpadded_features' method ( #33263 )
...
* [fix] LlavaNextProcessor '_get_unpadded_features' method
* [tests] add test_image_token_filling
* [chore] style + comment
* [minor] improve readability
* [chore] run make fix-copies
2024-09-04 17:41:51 +05:00
Joao Gante
d750b509fc
Config: unified logic to retrieve text config ( #33219 )
2024-09-04 12:03:30 +01:00
Raushan Turganbay
ebbe8d8014
Cache docs: update ( #32929 )
...
* some changes
* more updates
* fix cache copy
* nits
* nits
* add tests
2024-09-04 15:05:31 +05:00
Raushan Turganbay
35f72ebf47
Fix: multigpu training ( #33271 )
...
fix
2024-09-04 15:01:08 +05:00
Niklas Muennighoff
ecd61c6286
Add OLMoE ( #32406 )
...
* Add OLMoE
* Add OLMoE
* Updates
* Make norm optional; add keys
* Add output
* Add
* Fix dtype
* Fix eos config
* Update
* Add OLMoE
* Fix OLMoE path
* Format
* Format
* Rmv copy statement
* Rmv copy statement
* Format
* Add copies
* Cp rotary
* Fix aming
* Fix naming
* Update RoPE integration; num_logits_to_keep; Add copy statements
* Add eps to config
* Format
* Add aux loss
* Adapt router_aux_loss_coef
* Update md
* Adapt
* adapt tests
2024-09-03 18:43:12 +02:00
Joao Gante
d6534f996b
Repo checks: check documented methods exist ( #32320 )
2024-09-03 17:40:27 +01:00
Arthur
979d24e7fd
fix the parallel number of CI nodes when it is smaller than number of tests ( #33276 )
...
* fix the parallel number
* this?
* keep it simple
* woups
* nit
* style
* fix param name
* fix
* fix dtype
* yups
* ???
* ??
* this?
* ????
* no default flow style
* ??
* print config
* ????
* there we go!
* documentation
* update
* remove unwanted file
2024-09-03 16:53:21 +02:00
Zach Mueller
6b7d64ac1c
Only disallow DeepSpeed Zero-3 for auto bs finder ( #31731 )
...
* Only disallow DeepSpeed
* Clean
* DeepSpeed!
* Add a test for deepspeed
2024-09-03 09:16:28 -04:00
Omar Salman
03c12d0d63
Add sdpa support for Albert ( #32092 )
...
* Add sdpa support for Albert
* [run_slow] albert
* Add benchmarks and PR suggestion
* Fix quality
* Fix
* [run_slow] albert
2024-09-03 14:01:00 +01:00
dependabot[bot]
e969d884a6
Bump opencv-python from 4.4.0.42 to 4.8.1.78 in /examples/research_projects/visual_bert ( #33251 )
...
Bump opencv-python in /examples/research_projects/visual_bert
Bumps [opencv-python](https://github.com/opencv/opencv-python ) from 4.4.0.42 to 4.8.1.78.
- [Release notes](https://github.com/opencv/opencv-python/releases )
- [Commits](https://github.com/opencv/opencv-python/commits )
---
updated-dependencies:
- dependency-name: opencv-python
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-03 14:32:23 +02:00
Matt
0d86727354
Update chat template docs to remove Blenderbot ( #33254 )
...
* Update docs to remove obsolete Blenderbot
* Remove another reference to Blenderbot
2024-09-03 12:18:04 +01:00
Isotr0py
edeca4387c
🚨 Support dequantization for most GGML types ( #32625 )
...
* use gguf internal dequantize
* add Q5_0 test
* add iq1 test
* add remained test
* remove duplicated test
* update docs
* add gguf version limit
* make style
* update gguf import catch
* revert vocab_size patch
* make style
* use GGUF_MIN_VERSION everywhere
2024-09-03 12:58:14 +02:00
Yoach Lacombe
979f4774f6
Fix Bark saving ( #33266 )
2024-09-03 10:57:59 +02:00
Raushan Turganbay
7ed9789e21
Fix: `num_logits_to_keep` in composite models ( #33168 )
...
* fix
* paligemma
2024-09-03 13:48:45 +05:00
Arthur
566302686a
remove torch input dependant control flow ( #33245 )
2024-09-03 07:41:14 +02:00
ZM
cff06aac6f
Fix: use `torch.from_numpy()` to create tensors for np.ndarrays ( #33201 )
...
use torch.from_numpy for np.ndarrays
2024-09-02 17:45:55 +01:00
Sergio Paniego Blanco
28952248b1
Fixed typo repeated word in DETR docs ( #33250 )
2024-09-02 17:19:18 +02:00
Marc Sun
9ea1eacd11
remove to restriction for 4-bit model ( #33122 )
...
* remove to restriction for 4-bit model
* Update src/transformers/modeling_utils.py
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
* bitsandbytes: prevent dtype casting while allowing device movement with .to or .cuda
* quality fix
* Improve warning message for .to() and .cuda() on bnb quantized models
---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
2024-09-02 16:28:50 +02:00
Joao Gante
97c0f45b9c
Generate: fix assistant in different device ( #33257 )
2024-09-02 14:37:49 +01:00
Matt
52a0213755
Add assistant prefill for chat templates and TextGenerationPipeline ( #33198 )
...
* Add assistant prefill to chat templates
* Add assistant prefill to pipeline
* Add assistant prefill to pipeline
* Tweak another test that ended in assistant message
* Update tests that ended in assistant messages
* Update tests that ended in assistant messages
* Replace assistant_prefill with continue_final_message
* Allow passing continue_final_message to pipeline
* Small fixup
* Add continue_final_message as a pipeline kwarg
* Update docstrings
* Move repos to hf-internal-testing!
* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Lysandre Debut <hi@lysand.re>
* Add explanatory comment
* make fixup
* Update chat templating docs to explain continue_last_message
---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
2024-09-02 13:23:47 +01:00
dependabot[bot]
2d37085817
Bump opencv-python from 4.4.0.42 to 4.8.1.78 in /examples/research_projects/lxmert ( #33227 )
...
Bump opencv-python in /examples/research_projects/lxmert
Bumps [opencv-python](https://github.com/opencv/opencv-python ) from 4.4.0.42 to 4.8.1.78.
- [Release notes](https://github.com/opencv/opencv-python/releases )
- [Commits](https://github.com/opencv/opencv-python/commits )
---
updated-dependencies:
- dependency-name: opencv-python
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-02 13:40:49 +02:00
Jeongseok Kang
963ed98bed
docs: Replace package abbreviations with full name(`bitsandbytes`) in docstrings ( #33230 )
...
* docs: Provide fullname for `bitsandbytes` package
* docs: Provide fullname for `bitsandbytes` package (2)
2024-09-02 13:40:34 +02:00
Ankush
409fcfdfcc
Fix: Suppressed 'use_reentrant=False' warning ( #33208 )
...
Co-authored-by: Ankush <ankush13r>
2024-09-02 10:16:07 +02:00
Aymeric Roucher
1ca9ff5c91
Add duckduckgo search tool ( #32882 )
...
* Add duckduckgo search tool
2024-09-02 09:56:20 +02:00
NielsRogge
b9bc691e8d
Add GraniteRMSNorm ( #33177 )
...
* Add GraniteRMSNorm
* [run_slow] granite
2024-09-02 09:39:39 +02:00
Merve Noyan
2e3f8f7474
Add video text to text docs ( #33164 )
...
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2024-09-01 12:06:31 +03:00
Joao Gante
eb5b968c5d
Generate: throw warning when `return_dict_in_generate` is False but should be True ( #33146 )
2024-08-31 10:47:08 +01:00
Joao Gante
746104ba6f
Test fetcher: missing return on filtered tests; don't write empty files ( #33224 )
...
* missing return
* skip files without contents
* test 2
* dbg
* dbg
* how about this?
2024-08-31 00:41:52 +02:00
Arthur
51e6526b38
Fix red amin ( #33220 )
...
* fix
* oups
* oups
* proper fix
* forget about that
* arf
* ish
2024-08-30 18:49:23 +01:00
Yijun Lee
db70426854
🌐 [i18n-KO] Translated `llm_optims.md` to Korean ( #32325 )
...
* docs: ko: llm_optims.md
* feat: nmt draft
* fix toc title
* fix: manual edits
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: HyunJi Shin <74661937+shinhyunji36@users.noreply.github.com>
* Update docs/source/ko/llm_optims.md
Co-authored-by: HyunJi Shin <74661937+shinhyunji36@users.noreply.github.com>
* Update llm_optims.md
* fix: resolve suggestions
* fix: resolve suggestions
* Apply suggestions from code review
fix: resolve suggestions
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
---------
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: HyunJi Shin <74661937+shinhyunji36@users.noreply.github.com>
2024-08-30 09:52:41 -07:00
Aymeric Roucher
c79bfc71b8
Create local Transformers Engine ( #33218 )
...
* Create local Transformers Engine
2024-08-30 18:22:27 +02:00
Arthur
b017a9eb11
Refactor CI: more explicit ( #30674 )
...
* don't run custom when not needed?
* update test fetcher filtering
* fixup and updates
* update
* update
* reduce burden
* nit
* nit
* mising comma
* this?
* this?
* more parallelism
* more
* nit for real parallelism on tf and torch examples
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update to make it more custom
* update to make it more custom
* update to make it more custom
* update to make it more custom
* update
* update
* update
* update
* update
* update
* use correct path
* fix path to test files and examples
* filter-tests
* filter?
* filter?
* filter?
* nits
* fix naming of the artifacts to be pushed
* list vs files
* list vs files
* fixup
* fix list of all tests
* fix the install steps
* fix the install steps
* fix the config
* fix the config
* only split if needed
* only split if needed
* extend should fix it
* extend should fix it
* arg
* arg
* update
* update
* run tests
* run tests
* run tests
* more nits
* update
* update
* update
* update
* update
* update
* update
* simpler way to show the test, reduces the complexity of the generated config
* simpler way to show the test, reduces the complexity of the generated config
* style
* oups
* oups
* fix import errors
* skip some tests for now
* update doctestjob
* more parallelism
* fixup
* test only the test in examples
* test only the test in examples
* nits
* from Arthur
* fix generated congi
* update
* update
* show tests
* oups
* oups
* fix torch job for now
* use single upload setp
* oups
* fu**k
* fix
* nit
* update
* nit
* fix
* fixes
* [test-all]
* add generate marker and generate job
* oups
* torch job runs not generate tests
* let repo utils test all utils
* UPdate
* styling
* fix repo utils test
* more parallel please
* don't test
* update
* bit more verbose sir
* more
* hub were skipped
* split by classname
* revert
* maybe?
* Amazing catch
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
* fix
* update
* update
* maybe non capturing
* manual convert?
* pass artifacts as parameters as otherwise the config is too long
* artifact.json
* store output
* might not be safe?
* my token
* mmm?
* use CI job IS
* can't get a proper id?
* ups
* build num
* update
* echo url
* this?
* this!
* fix
* wget
* ish
* dang
* udpdate
* there we go
* update
* update
* pass all
* not .txt
* update
* fetcg
* fix naming
* fix
* up
* update
* update
* ??
* update
* more updates
* update
* more
* skip
* oups
* pr documentation tests are currently created differently
* update
* hmmmm
* oups
* curl -L
* update
* ????
* nit
* mmmm
* ish
* ouf
* update
* ish
* update
* update
* updatea
* nit
* nit
* up
* oups
* documentation_test fix
* test hub tests everything, just marker
* update
* fix
* test_hub is the only annoying one now
* tf threads?
* oups
* not sure what is happening?
* fix?
* just use folder for stating hub
* I am getting fucking annoyed
* fix the test?
* update
* uupdate
* ?
* fixes
* add comment!
* nit
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
2024-08-30 18:17:25 +02:00
Matt
38d58a4427
Fix local repos with remote code not registering for pipelines ( #33100 )
...
* Extremely experimental fix!
* Try removing the clause entirely
* Add test
* make fixup
* stash commit
* Remove breakpoint
* Add anti-regression test
* make fixup
* Move repos to hf-internal-testing!
2024-08-30 16:56:22 +01:00
Matt
fbff27623a
Add warning for stop string edge case ( #33169 )
...
* Add warning for edge case
* make fixup
2024-08-30 16:26:26 +01:00
Julien Denize
e259d6d1e0
Add missing quotes in modeling_llava_next_video.py ( #33214 )
2024-08-30 15:39:23 +02:00
dependabot[bot]
9a6956baab
Bump torch from 1.13.1 to 2.2.0 in /examples/research_projects/decision_transformer ( #33215 )
...
Bump torch in /examples/research_projects/decision_transformer
Bumps [torch](https://github.com/pytorch/pytorch ) from 1.13.1 to 2.2.0.
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/compare/v1.13.1...v2.2.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-08-30 15:38:53 +02:00
dependabot[bot]
4987463de7
Bump torch from 1.13.1 to 2.2.0 in /examples/research_projects/codeparrot ( #33173 )
...
Bump torch in /examples/research_projects/codeparrot
Bumps [torch](https://github.com/pytorch/pytorch ) from 1.13.1 to 2.2.0.
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/compare/v1.13.1...v2.2.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-08-30 15:23:35 +02:00
Joao Gante
b127fb8fdc
Pipeline: fix bad generation kwargs docs ( #33205 )
...
fix link
2024-08-30 14:14:42 +02:00
Arthur
c409cd8177
use a single for loop ( #33148 )
...
* use a single for loop
* oups
* fixup
* fix typo
2024-08-29 15:55:02 +02:00