Yih-Dar
cf32c94135
Run all tests if circleci/create_circleci_config.py
is modified ( #27413 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-09 22:01:06 +01:00
Yih-Dar
740cd93590
Fix Owlv2
checkpoint name and a default value in Owlv2VisionConfig
( #27402 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-09 21:39:03 +01:00
Yoach Lacombe
51a98c40ee
remove failing tests and clean FE files ( #27414 )
...
* remove failing tests and clean FE files
* remove same similar text from tvlt
2023-11-09 18:35:42 +00:00
Lucain
e38348ae8f
Fix RequestCounter to make it more future-proof ( #27406 )
...
* Fix RequestCounter to make it more future-proof
* code quality
2023-11-09 18:53:26 +01:00
Yih-Dar
c8b6052ff6
Final fix of the accelerate installation issue ( #27408 )
...
* fix
* [test-all] commit
* fix
* [test-all] commit
* [test-all] commit
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-09 18:52:29 +01:00
Zach Mueller
c5037b459e
Use editable install for git deps ( #27404 )
...
* Use editable install
* Full command
2023-11-09 10:20:12 -05:00
Yih-Dar
cf2a3f37bf
Fix fuyu checkpoint repo in FuyuConfig
( #27399 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-09 15:47:46 +01:00
Yih-Dar
3258ff9330
use pytest.mark
directly ( #27390 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-09 13:32:54 +01:00
Dave Berenbaum
791ec370d1
Adds dvclive callback ( #27352 )
...
* dvclive trainer callback
* style fixes
* dvclive link fixes
2023-11-09 12:19:31 +00:00
Hz, Ji
c5d7754b11
device-agnostic deepspeed testing ( #27342 )
2023-11-09 12:34:13 +01:00
amyeroberts
9999b73968
Skip failing cache call tests ( #27393 )
...
* Skip failing cache call tests
* Fixup
2023-11-09 11:03:37 +00:00
Yih-Dar
bc086a2516
Put doctest options back to pyproject.toml
( #27366 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-09 11:50:19 +01:00
Zach Mueller
e9adb0c9cf
Change thresh in test ( #27378 )
...
Change thresh
2023-11-09 04:44:36 -05:00
Arthur
085ea7e56c
[CodeLlamaTokenizer
] Nit, update __init__ to make sure the AddedTokens are not normalized because they are special ( #27359 )
...
* make sure tokens are properly initialized for codellama slow
* add m ore pretrained models
* style
* test more tokenizers checkpoints
2023-11-09 10:15:10 +01:00
Sourab Mangrulkar
7ecd229ba4
Smangrul/fix failing ds ci tests ( #27358 )
...
* fix failing DeepSpeed CI tests due to `safetensors` being default
* debug
* remove debug statements
* resolve comments
* Update test_deepspeed.py
2023-11-09 11:47:24 +05:30
jiaqiw09
ced9fd86f5
translate debugging.md to chinese ( #27374 )
...
* update
* update
2023-11-08 14:04:06 -08:00
Sergii Dymchenko
0e402e1478
Update deprecated torch.range
in test_modeling_ibert.py
( #27355 )
...
* Update deprecated torch.range
* Remove comment
2023-11-08 20:58:36 +01:00
Yoach Lacombe
a5bee89c9d
Add Flash Attention 2 support to Bark ( #27364 )
...
* change handmade attention mask to _prepare_4d_attention_mask
* add flashattention2 support in Bark
* add flashattention2 tests on BarkSemanticModel
* make style
* fix flashattention and tests + make style
* fix memory leak and allow Bark to pass flash attention to sub-models
* make style
* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* remove unecessary code from tests + justify overriding
* Update tests/models/bark/test_modeling_bark.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* make style
---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-11-08 17:06:35 +00:00
jiaqiw09
ef71673616
translate big_models.md and performance.md to chinese ( #27334 )
...
* translate performance.md
* tranlsate performance.md and big_models.md
* update translation
* update review
2023-11-08 08:48:46 -08:00
Yih-Dar
bd8f45b167
Fix tiny model script: not using from_pt=True
( #27372 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-08 17:15:57 +01:00
Sanchit Gandhi
7b175cfaa7
[Flax Whisper] large-v3 compatibility ( #27360 )
2023-11-08 15:11:38 +00:00
Zach Mueller
845aa832b7
Remove unused param from example script tests ( #27354 )
...
Unused param
2023-11-08 09:07:32 -05:00
Mert Yanık
eb30a49b20
Translate index.md to Turkish ( #27093 )
...
* Add index.md for tukish language
* Fix index.md (huggingface/transformers#27088 )
* Add 'tr' to additional files
* Update docs/source/tr/_toctree.yml
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update index.md
---------
Co-authored-by: Mert Yanık <mert.yanik@lcwaikiki.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-11-08 08:35:20 -05:00
Sanchit Gandhi
f16ff0f07e
MusicGen Update ( #27084 )
...
* [MusicGen] Add stereo model
* safe serialization
* Update src/transformers/models/musicgen/modeling_musicgen.py
* split over 2 lines
* fix slow tests on cuda
2023-11-08 13:26:02 +00:00
Yih-Dar
5ef650b0ae
Fix Kosmos-2
device issue ( #27346 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-08 14:14:45 +01:00
Zach Mueller
efa57cb234
Fix example tests from failing ( #27353 )
...
* Fix example tests from failing
* CHange thresh
2023-11-08 07:45:21 -05:00
Hz, Ji
b6dbfee0a2
moving example of benchmarking to legacy dir ( #27337 )
...
move example of benchmarking to legacy
2023-11-08 09:27:37 +01:00
Yoach Lacombe
be74b2ead6
Add numpy alternative to FE using torchaudio ( #26339 )
...
* add audio_utils usage in the FE of SpeechToText
* clean unecessary parameters of AudioSpectrogramTransformer FE
* add audio_utils usage in AST
* add serialization tests and function to FEs
* make style
* remove use_torchaudio and move to_dict to FE
* test audio_utils usage
* make style and fix import (remove torchaudio dependency import)
* fix torch dependency for jax and tensor tests
* fix typo
* clean tests with suggestions
* add lines to test if is_speech_availble is False
2023-11-08 07:39:37 +00:00
jiaqiw09
e264745051
translate model_sharing.md and llm_tutorial.md to chinese ( #27283 )
...
* translate model_sharing.md
* translate llm_tutorial.md to chiense
* update wrong translation
* update _torctree.yml
* update typos
* update
2023-11-07 15:34:33 -08:00
九是否随意的称呼
f213d5dd8c
translate the en tokenizer_summary.md to Chinese ( #27291 )
...
* translate the en tokenizer_summary.md to Chinese
* revise WordPiece
* add to source/zh/_toctree.yml
2023-11-07 15:31:51 -08:00
Plemeur
7e1eff7600
Allow scheduler parameters ( #26480 )
...
* Allow for scheduler kwargs
* Formatting
* Arguments checks, passing the tests
* Black failed somehow
---------
Co-authored-by: Pierre <pierre@avatarin.com>
2023-11-07 21:40:00 +00:00
Yoach Lacombe
ac5d4cf6de
FIx Bark batching feature ( #27271 )
...
* fix bark batching
* make style
* add tests and make style
2023-11-07 18:32:00 +00:00
Arthur
8f840edd31
[Whisper
] Nit converting the tokenizer ( #27349 )
...
* `nospeech` instead of `nocaption` for the no speech token
* oups
2023-11-07 18:43:26 +01:00
Susnato Dhar
cc9f27bb1e
Remove padding_masks from gpt_bigcode
. ( #27348 )
...
Update modeling_gpt_bigcode.py
2023-11-07 17:24:43 +00:00
Folco Bertini Baldassini
8c91f15ae5
Resolve AttributeError by utilizing device calculation at the start of the forward function ( #27347 )
...
This commit addresses the 'NoneType' object AttributeError within the IdeficsModel forward function. Previously, the 'device' attribute was accessed directly from input_ids, resulting in a potential 'NoneType' error. Now, the device is properly calculated at the beginning of the forward function and utilized consistently throughout, ensuring the 'image_hidden_states' are derived from the correct device. This modification enables smoother processing and compatibility, ensuring the correct device attribution for 'image_encoder_embeddings' in the IdeficsModel forward pass.
2023-11-07 16:26:15 +00:00
Chi
9459d821d1
Remove a redundant variable. ( #27288 )
...
* Removed the redundant SiLUActivation class and now use nn.functional.silu directly.
* I apologize for adding torch.functional.silu. I have replaced it with nn.SiLU.
* Remove redundant variable in feature_extraction file
2023-11-07 15:57:48 +00:00
Arthur
88832c01c8
[Whisper
] Add conversion script for the tokenizer ( #27338 )
...
* draft
* updates
* full conversion taken from `https://gist.github.com/xenova/a452a6474428de0182b17605a98631ee `
* psuh
* nits
* updates
* more nits
* Add co author
Co-authored-by: Joshua Lochner <admin@xenova.com>
* fixup
* cleanup
* styling
* add proper path
* update
* nits
* don't push the exit
* clean
* update whisper doc
* don't error out if tiktoken is not here
* make sure we are BC with conversion
* nit
* Update docs/source/en/model_doc/whisper.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* merge and update
* update markdwon
* Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
---------
Co-authored-by: Joshua Lochner <admin@xenova.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-11-07 15:07:55 +01:00
Susnato Dhar
0ded281557
[FA2
] Add flash attention for GPT-Neo
( #26486 )
...
* added flash attention for gpt-neo
* small change
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* readme updated
* .
* changes
* removed padding_mask
* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-11-07 13:54:01 +00:00
Xabier de Zuazo
606d90845f
Fix Whisper Conversion Script: Correct decoder_attention_heads and _download function ( #26834 )
...
* Fix error in convert_openai_to_hf.py: "_download() missing 1 required positional argument: root"
* Fix error in convert_openai_to_hf.py: "TypeError: byte indices must be integers or slices, not str"
* Fix decoder_attention_heads value in convert_openai_to_hf.py.
Correct the assignment for `decoder_attention_heads` in the conversion script for the Whisper model.
* Black reformat convert_openai_to_hf.py file.
* Fix Whisper model configuration defaults (for Tiny).
- Correct encoder/decoder layers and attention heads count.
- Update model width (`d_model`) to 384.
* Add docstring to the convert_openai_to_hf.py script with a doctest
* Add shebang and +x permission to the convert_openai_to_hf.py
* convert_openai_to_hf.py: reuse the read model_bytes in the _download() function
* Move convert_openai_to_hf.py doctest example to whisper.md
* whisper.md: Add an inference example to the Conversion section.
* whisper.md: remove `model.config.forced_decoder_ids` from examples (deprecated)
* whisper.md: Remove "## Format Conversion" section; not used by users
* whisper.md: Use librispeech_asr_dummy dataset and load_dataset()
2023-11-07 13:39:42 +01:00
Joao Gante
90b4adc1f1
Generate: skip tests on unsupported models instead of passing ( #27265 )
2023-11-07 12:08:28 +00:00
Younes Belkada
26d8d5f211
Fix autoawq docker image ( #27339 )
...
* Update Dockerfile
* Update docker/transformers-all-latest-gpu/Dockerfile
2023-11-07 11:21:04 +01:00
Sanchit Gandhi
da7ea9a4e3
[Whisper] Block language/task args for English-only ( #27322 )
...
* [Whisper] Block language/task args for English-only
* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-11-07 10:04:23 +00:00
Maria Khalusova
9beb2737d7
[docs] fixed links with 404 ( #27327 )
...
* fixed links with 404
* make style
2023-11-06 19:45:03 +00:00
Yih-Dar
1b20e2bb42
Fix Kosmos2Processor
batch mode ( #27323 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-06 19:05:50 +01:00
Iker García-Ferrero
a6e0d5a219
Fix VideoMAEforPretrained dtype error ( #27296 )
...
* Fix dtype error
* Fix mean and std dtype
* make style
2023-11-06 17:20:06 +00:00
Akshay Chintalapati
e9dbd39263
Update sequence_classification.md ( #27281 )
...
I'm adding accelerate as one of the libraries to install because otherwise when running the Trainer, the model errorr out with the error.
ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1`: Please run `pip install transformers[torch]` or `pip install accelerate -U`
Further context:
1. I've tried this across different environments so I believe that the environment is not the issue.
2. I had the latest transformers library version running.
3. Typically even after install accelerate and import it, it wouldn't resolve the issue until I restart the notebook and try again.
2023-11-06 14:21:48 +00:00
Arthur
147f774671
[PretrainedTokenizer
] add some of the most important functions to the doc ( #27313 )
2023-11-06 15:11:00 +01:00
Hz, Ji
1ffc4dee5b
enable memory tracker metrics for npu ( #27280 )
2023-11-06 13:44:21 +00:00
Pingzhi Li
d7dcfa8917
Remove an unexpected argument for FlaxResNetBasicLayerCollection ( #27272 )
...
Remove unexpected argument for FlaxResNetBasicLayerCollection
2023-11-06 12:16:03 +00:00
Yih-Dar
eef7ea98c3
Update doctest workflow file ( #27306 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-11-06 11:27:48 +01:00