Yih-Dar
e26c6f03be
Fix Wav2Vec2
CI OOM ( #24190 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-12 11:39:04 +02:00
Yih-Dar
8f093fb799
Avoid OOM in doctest CI ( #24139 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-10 09:47:38 +02:00
Stas Bekman
0d217f428f
[tests] fix bitsandbytes import issue ( #24151 )
...
fix bitsandbytes import issue
2023-06-09 21:53:11 -07:00
Lysandre Debut
deff5979fe
Tool types ( #24032 )
...
* Tool types
* Tests + fixes
* Isolate types
* Oops
* Review comments + docs
* Tests + docs
* soundfile -> vision
2023-06-09 13:34:07 -04:00
Freddie Vargus
061580c82c
Fix typo in streamers.py ( #24144 )
2023-06-09 17:27:46 +01:00
LiamSwayne
12bb853ccd
[documentation] grammatical fixes in image_classification.mdx ( #24141 )
...
Update image_classification.mdx
2023-06-09 16:59:44 +01:00
Yih-Dar
d0d1632958
Fix Pipeline CI OOM issue ( #24124 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-09 16:49:02 +02:00
Arthur
a7501f6fc6
[BlenderBotSmall] Update doc example ( #24092 )
...
* small tokenizer uses `__start__` and `__end__`
* fix PR doctest
2023-06-09 16:31:57 +02:00
Arthur
5af3a1aa48
[lamaTokenizerFast] Update documentation ( #24132 )
...
* Update documentation
* nits
2023-06-09 16:30:20 +02:00
Younes Belkada
62fe753325
[SAM
] Fix sam slow test ( #24140 )
...
* fix sam test
* update pipeline typehint
2023-06-09 16:22:09 +02:00
Yih-Dar
847b47c0ee
Fix XGLM OOM on CI ( #24123 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-09 15:20:59 +02:00
Yih-Dar
b8fe259f16
Fix SAM OOM issue on CI ( #24125 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-09 15:07:08 +02:00
Yih-Dar
707023d155
Fix TF Rag OOM issue ( #24122 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-09 15:03:11 +02:00
Sourab Mangrulkar
f2b918356c
fix bugs with trainer ( #24134 )
...
* fix the deepspeed test failures
* apex fix
* FSDP save ckpt fix
* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-06-09 17:54:53 +05:30
Joao Gante
be10092e63
Generate: PT's top_p
enforces min_tokens_to_keep
when it is 1
( #24111 )
2023-06-09 13:20:05 +01:00
Matt
03585f3734
Correctly build models and import call_context for older TF versions ( #24138 )
2023-06-09 13:11:01 +01:00
Younes Belkada
a6d05d55f6
[bnb
] Fix bnb config json serialization ( #24137 )
...
* fix bnb config json serialization
* forward contrib credits from discussions
---------
Co-authored-by: Andrechang <Andrechang@users.noreply.github.com>
2023-06-09 13:41:14 +02:00
Elliott Wang
e2972dffdd
PLAM => PaLM ( #24129 )
2023-06-09 12:32:16 +01:00
Arthur
535542d38d
[Lllama] Update tokenization code to ensure parsing of the special tokens [core] ( #24042 )
...
* preventllama fast from returning token type ids
* remove type hints
* normalised False
2023-06-09 09:36:19 +02:00
Yih-Dar
2e2088f24b
Avoid GPT-2
daily CI job OOM (in TF tests) ( #24106 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-08 18:21:09 +02:00
Serge Panev
9322c24476
Fix typo in Llama docstrings ( #24020 )
...
* Fix typo in Llama docstrings
Signed-off-by: Serge Panev <spanev@nvidia.com>
* Update
Signed-off-by: Serge Panev <spanev@nvidia.com>
* make style
Signed-off-by: Serge Panev <spanev@nvidia.com>
---------
Signed-off-by: Serge Panev <spanev@nvidia.com>
2023-06-08 17:19:07 +01:00
Radamés Ajna
a73883ae9e
add trust_remote_code option to CLI download cmd ( #24097 )
...
* add trust_remote_code option
* require_torch
2023-06-08 11:13:57 -04:00
Younes Belkada
8b169142f8
[GPT2
] Add correct keys on _keys_to_ignore_on_load_unexpected
on all child classes of GPT2PreTrainedModel
( #24113 )
...
* add correct keys on `_keys_to_ignore_on_load_unexpected`
* oops
2023-06-08 10:21:42 -04:00
Marc Sun
71a114d3e0
fix get_keys_to_not_convert function ( #24095 )
...
* fix get_keys_to_not_convert funct
* Fix style
2023-06-08 10:14:27 -04:00
Sylvain Gugger
8c5f306719
Update the pin on Accelerate ( #24110 )
2023-06-08 10:11:01 -04:00
Younes Belkada
2200bf7a45
[Trainer
] Correct behavior of _load_best_model
for PEFT models ( #24103 )
...
* v1
* some refactor
- add ST format as well
* fix
* add `ADAPTER_WEIGHTS_NAME` & `ADAPTER_SAFE_WEIGHTS_NAME`
2023-06-08 15:38:30 +02:00
Sourab Mangrulkar
0f23605094
reset accelerate env variables after each test ( #24107 )
2023-06-08 09:19:07 -04:00
Sadra Barikbin
5fa0a1b23b
Fix a tiny typo in WhisperForConditionalGeneration::generate
docstring ( #24045 )
2023-06-08 13:54:56 +01:00
Sylvain Gugger
ba695c1efd
v4.31.0.dev0
2023-06-07 16:49:00 -04:00
Sylvain Gugger
c3572e6bfb
Add AzureOpenAiAgent ( #24058 )
...
* Add AzureOpenAiAgent
* quality
* Update src/transformers/tools/agents.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2023-06-07 16:34:53 -04:00
Zachary Mueller
5eb3d3c702
Up pinned accelerate version ( #24089 )
...
* Min accelerate
* Also min version
* Min accelerate
* Also min version
* To different minor version
* Empty
2023-06-07 16:21:51 -04:00
Sourab Mangrulkar
d1c039e398
fix accelerator prepare during eval only mode ( #24014 )
...
* fix mixed precision prep during eval only mode
* update to address comments
* update to reflect the changes in accelerate
2023-06-08 01:03:13 +05:30
Sylvain Gugger
2c887cf8e0
Do not prepare lr scheduler as it as the right number of steps ( #24088 )
...
* Do not prepare lr scheduler as it as the right number of steps
* Trigger CI
* Trigger CI
* Trigger CI
* Add fake comment
* Remove fake comment
* Trigger CI please!
2023-06-07 15:31:32 -04:00
Sourab Mangrulkar
12298cb65c
fix executable batch size issue ( #24067 )
...
* fix executable batch size issue
* fix
* undo
2023-06-07 22:08:04 +05:30
Mishig
ef010071ee
Update delete_doc_comment_trigger.yml ( #24084 )
...
fix base workflow name
2023-06-07 17:55:48 +02:00
Sylvain Gugger
89b00eef94
Fix expected value in tests of the test fetcher ( #24077 )
...
* Fix expected value in tests of the test fetcher
* Fix trigger for repo util tests
2023-06-07 11:38:56 -04:00
Mishig
5c9394b54c
[doc build] Use secrets ( #24079 )
2023-06-07 17:33:39 +02:00
Matt
1fc832b454
Make the TF dummies even smaller ( #24071 )
...
* Let's see if we can use the smallest possible dummies
* Make GPT-2's dummies a little longer
* Just use (1,2) as the default shape
* Update other dummies in sync
* Correct imports for Keras 2.13
* Shrink the Wav2Vec2 dummies
2023-06-07 16:23:05 +01:00
Yih-Dar
092c14c37d
Be nice to TF ( #24076 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-07 16:18:13 +02:00
Younes Belkada
4795219228
[bnb
] Fix bnb skip modules ( #24043 )
...
* fix skip modules test
* oops
* address comments
2023-06-07 15:27:46 +02:00
Michael Benayoun
a1160185ff
Fix is_optimum_neuron_available
( #23961 )
...
Fix is_optimum_neuron_available
2023-06-07 09:13:01 -04:00
Younes Belkada
6b548129b1
[Hub
] Add safe_serialization
in push_to_hub ( #24074 )
...
add `safe_serialization` in push_to_hub
2023-06-07 09:07:33 -04:00
Younes Belkada
6daf7c311b
Support PEFT models when saving the model using trainer ( #24073 )
...
* support PEFT models when saving the model using trainer
* fixup
2023-06-07 14:30:55 +02:00
YangLiu
1e4a7737ed
Add support for non-rust implemented tokenization for __getitem__
method. ( #24039 )
...
* Add support for non-rust implemented tokenization for `__getitem__` method.
* Update for error message on adding new sub-branch for `__item__` method.
---------
Co-authored-by: liuyang17 <liuyang17@zhihu.com>
2023-06-07 12:29:19 +01:00
Patrick von Platen
52972e70c7
[Wav2Vec2] Fix torch srcipt ( #24062 )
...
* [Wav2Vec2] Fix torch srcipt
* fix more
2023-06-07 07:27:07 -04:00
Joao Gante
612b2a1a6d
Generate: increase left-padding test atol ( #23448 )
...
increase atol
2023-06-07 11:56:57 +01:00
Sylvain Gugger
f1660d7e23
Remote code improvements ( #23959 )
...
* Fix model load when it has both code on the Hub and locally
* Add input check with timeout
* Add tests
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
* Some non-saved stuff
* Add feature extractors
* Add image processor
* Add model
* Add processor and tokenizer
* Reduce timeout
---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2023-06-06 14:31:14 -04:00
Sylvain Gugger
60825f2c6e
Fix device placement for model-parallelism in generate for encoder/de… ( #24025 )
...
* Fix device placement for model-parallelism in generate for encoder/decoders
* Remove debug statements
2023-06-06 14:30:59 -04:00
Yih-Dar
02d255db26
bring back filtered_test_list_cross_tests.txt
( #24055 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-06 19:35:24 +02:00
Edward Z. Yang
bc9ecef942
Use new parametrization based weight norm if available ( #24030 )
...
* Use new parametrization based weight norm if available
See https://github.com/pytorch/pytorch/pull/103001
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
* handle copies
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
* black
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
---------
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
2023-06-06 13:34:57 -04:00