Yih-Dar
ff4c0fc7d2
Tiny fix for check_self_hosted_runner.py
( #24052 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-06 18:17:41 +02:00
amyeroberts
a717e0318c
Add TimmBackbone model ( #22619 )
...
* Add test_backbone for convnext
* Add TimmBackbone model
* Add check for backbone type
* Tidying up - config checks
* Update convnextv2
* Tidy up
* Fix indices & clearer comment
* Exceptions for config checks
* Correclty update config for tests
* Safer imports
* Safer safer imports
* Fix where decorators go
* Update import logic and backbone tests
* More import fixes
* Fixup
* Only import all_models if torch available
* Fix kwarg updates in from_pretrained & main rebase
* Tidy up
* Add tests for AutoBackbone
* Tidy up
* Fix import error
* Fix up
* Install nattan in doc_test_job
* Revert back to setting self._out_xxx directly
* Bug fix - out_indices mapping from out_features
* Fix tests
* Dont accept output_loading_info for Timm models
* Set out_xxx and don't remap
* Use smaller checkpoint for test
* Don't remap timm indices - check out_indices based on stage names
* Skip test as it's n/a
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Cleaner imports / spelling is hard
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-06-06 17:11:30 +01:00
Sylvain Gugger
b8935980a2
Modification of one text example file should trigger said test ( #24051 )
2023-06-06 12:02:56 -04:00
Tom Aarsen
02fe3af275
Prevent ZeroDivisionError on trainer.evaluate
if model and dataset are tiny ( #24049 )
...
Prevent ZeroDivisionError if evaluation is too quick
2023-06-06 11:31:05 -04:00
Roy Hvaara
d924390d5b
Use TruncatedNormal from Keras initializers ( #24036 )
...
Co-authored-by: Andrey Voynov <avoin@google.com>
2023-06-06 14:51:44 +01:00
Nicolas Patry
c2e3fa0b2a
Fixing single candidate_label return. ( #24023 )
2023-06-06 15:26:10 +02:00
Marc Sun
6307312dfc
Add check for tied parameters ( #24029 )
...
* Add check for tied parameters
* Fix style
* fix style
* Fix versioning
* Change if to elif
2023-06-06 09:12:46 -04:00
Wonhyeong Seo
7da3ce04a6
🌐 [i18n-KO] Translated bertology.mdx
to Korean ( #23968 )
...
* docs: ko: `bertology.mdx`
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-06-06 09:08:45 -04:00
Wonhyeong Seo
c938597657
🌐 [i18n-KO] Translated language-modeling.mdx
( #23969 )
...
* docs: ko: `language_modeling.mdx`
* feat: nmt draft
* fix: manual edits
* fix: add inline toc
* fix: typo in toc_tree.yml
* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
2023-06-06 09:08:26 -04:00
Yih-Dar
7631db0fdc
Pin deepspeed
to 0.9.2
for now ( #24024 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-05 20:00:28 +02:00
Yih-Dar
17846646f2
Fix MobileViTV2
checkpoint name ( #24018 )
...
* fix
* fix
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-06-05 18:12:45 +02:00
Hyeonseo Yun
649ffbf575
🌐 [i18n-KO] Translated tasks_explained.mdx
to Korean ( #23844 )
...
* docs: ko: tasks_explained.mdx
* feat: nmt and manual edit `tasks_explained.mdx`
* revised: resolve suggestions task_explained.mdx
* fixed: added draft of reference docs
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
* revised: resolve suggestions(voca, spell check) task_explained.mdx
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* revised: remove duplicate sentence in task_explained.mdx
* fixed: remove draft of reference docs
- I think it will be confusing in the translation process.
- This issue is included in #23971 .
---------
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
2023-06-05 12:02:03 -04:00
Brian Yu
2872f9671b
TensorBoard callback no longer adds hparams ( #23999 )
...
tensorboard callback no longer adds hparams
2023-06-05 11:53:45 -04:00
Jungwoo Park
44bd590a29
Pix2Struct: fix wrong broadcast axis of attention mask in visual encoder ( #23976 )
...
* fix wrong broadcast axis of attention mask in visual encoder
* fix slow tests
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
2023-06-05 11:47:29 -04:00
Yessen Kanapin
7824fa431e
expose safe_serialization argument in the pipeline API ( #23775 )
...
expose safe_serialization argument of PreTrainedModel and TFPreTrainedModel in the save_pretrained of the pipeline api
Co-authored-by: Yessen Kanapin <yessen@deepinfra.com>
2023-06-05 11:19:58 -04:00
Bearnardd
b4919cb520
Auto tokenizer registration ( #23965 )
...
add check loop over extra content
2023-06-05 11:10:47 -04:00
Yih-Dar
b143019005
Update README.md ( #24022 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-05 17:08:15 +02:00
Yih-Dar
5176dc2310
Skip test_multi_gpu_data_parallel_forward
for MobileViTV2ModelTest
( #24017 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-06-05 16:29:32 +02:00
Sourab Mangrulkar
460b844360
fix trainer slow tests related to hyperparam search ( #24011 )
...
* fix trainer slow tests
* commit 2
2023-06-05 17:58:10 +05:30
Kaede Fujisaki
3c3108972a
Fix typo in doc comment of BitsAndBytesConfig ( #23978 )
2023-06-05 12:10:31 +01:00
dependabot[bot]
539e2281cd
Bump cryptography from 39.0.1 to 41.0.0 in /examples/research_projects/decision_transformer ( #23964 )
...
Bump cryptography in /examples/research_projects/decision_transformer
Bumps [cryptography](https://github.com/pyca/cryptography ) from 39.0.1 to 41.0.0.
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst )
- [Commits](https://github.com/pyca/cryptography/compare/39.0.1...41.0.0 )
---
updated-dependencies:
- dependency-name: cryptography
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-06-02 16:23:44 -04:00
Eli Simhayev
bacaab1629
Added time-series blogs to the models ( #23857 )
...
* added blogs to docs
* removed new-line
2023-06-02 12:32:34 -04:00
Matt
167a0d8f87
Add an option to reduce compile() console spam ( #23938 )
...
* Add an option to reduce compile() console spam
* Add annotations to the example scripts
* Add notes to the quicktour docs as well
* minor fix
2023-06-02 15:28:52 +01:00
Sanchit Gandhi
c9cf337772
[Whisper Tokenizer] Skip special tokens when decoding with timestamps ( #23945 )
2023-06-02 16:26:59 +02:00
Claudius Kienle
8940d315aa
Trainer: fixed evaluate raising KeyError
for ReduceLROnPlateau ( #23952 )
...
Trainer: fixed KeyError on evaluate for ReduceLROnPlateau
Co-authored-by: Claudius Kienle <claudius.kienle@artiminds.com>
2023-06-02 08:53:48 -04:00
Kihoon Son
2fdba73a99
🌐 [i18n-KO] Translated object_detection.mdx to Korean ( #23164 )
...
* translated object_detection.mdx
Co-Authored-By: Hyeonseo Yun <0525_hhgus@naver.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: simso <3035487+simso@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
---------
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: simso <3035487+simso@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
2023-06-02 07:43:55 -04:00
Patrick von Platen
dcb5e18c9e
add new mms functions to doc ( #23954 )
2023-06-02 11:35:52 +01:00
Shehan Munasinghe
07c54413ac
Add MobileViTv2 ( #22820 )
...
* generated code from add-new-model-like
* Add code for modeling, config, and weight conversion
* add tests for image-classification, update modeling and config
* add code, tests for semantic-segmentation
* make style, make quality, make fix-copies
* make fix-copies
* Update modeling_mobilevitv2.py
fix bugs
* Update _toctree.yml
* update modeling, config
fix bugs
* Edit docs - fix bug MobileViTv2v2 -> MobileViTv2
* Update mobilevitv2.mdx
* update docstrings
* Update configuration_mobilevitv2.py
make style
* Update convert_mlcvnets_to_pytorch.py
remove unused options
* Update convert_mlcvnets_to_pytorch.py
make style
* Add suggestions from code review
Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* make style, make quality
* Add suggestions from code review
Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Add suggestions from code review
Remove MobileViTv2ImageProcessor
Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* make style
* Add suggestions from code review
Rename MobileViTv2 -> MobileViTV2
Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Add suggestions from code review
Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update modeling_mobilevitv2.py
make style
* Update serialization.mdx
* Update modeling_mobilevitv2.py
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-06-02 10:37:02 +01:00
Patrick von Platen
5dfd407b37
[MMS] Scaling Speech Technology to 1,000+ Languages | Add attention adapter to Wav2Vec2 ( #23813 )
...
* add fine-tuned with adapter layer
* Add set_target_lang to tokenizer
* Implement load adapter
* add tests
* make style
* Apply suggestions from code review
* Update src/transformers/models/wav2vec2/tokenization_wav2vec2.py
* make fix-copies
* Apply suggestions from code review
* make fix-copies
* make style again
* mkae style again
* fix doc string
* Update tests/models/wav2vec2/test_tokenization_wav2vec2.py
* Apply suggestions from code review
* fix
* Correct wav2vec2 adapter
* mkae style
* Update src/transformers/models/wav2vec2/modeling_wav2vec2.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* add more nice docs
* finish
* finish
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Apply suggestions from code review
* all finish
---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-06-02 10:30:24 +01:00
wasupandceacar
f49a3453ca
Fix ReduceLROnPlateau
object has no attribute 'get_last_lr' ( #23944 )
...
* Fix 'ReduceLROnPlateau' object has no attribute 'get_last_lr'
* fix style
2023-06-01 16:10:52 -04:00
Kashif Rasul
c62b01d0b0
use _make_causal_mask in clip/vit models ( #23942 )
...
use _make_causal_mask in clip models
2023-06-01 16:10:24 -04:00
Marc Sun
e03a9cc0cd
Modify device_map behavior when loading a model using from_pretrained ( #23922 )
...
* Modify device map behavior for 4/8 bits model
* Remove device_map arg for training 4/8 bit model
* Remove index
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Add Exceptions
* Modify comment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix formatting
* Get current device with accelerate
* Revert "Get current device with accelerate"
This reverts commit 46f0079910
.
* Fix Exception
* Modify quantization doc
* Fix error
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-06-01 13:21:22 -04:00
Brendon Soong
d1fa349e78
#23675 Registering Malay language ( #23689 )
...
* #23675 Registering Malay language
* removing untranslated files
* some translate
* more updates to toctree
* inc index
* additional translations for toctree
* translations of more sections
* removing untranslated file
* translated index.mdx to malay
2023-06-01 13:17:27 -04:00
Lysandre Debut
dc67da0182
Revert "Update stale.yml to use HuggingFaceBot" ( #23943 )
...
Revert "Update stale.yml to use HuggingFaceBot (#23941 )"
This reverts commit 5929f86ebb
.
2023-06-01 11:58:11 -04:00
Matt
8088ca4185
Make TF ESM inv_freq non-trainable like PyTorch ( #23940 )
...
Make TF inv_freq non-trainable like PyTorch
2023-06-01 16:15:00 +01:00
Lysandre Debut
5929f86ebb
Update stale.yml to use HuggingFaceBot ( #23941 )
2023-06-01 10:54:50 -04:00
Adam Lewis
857d4e1c87
rename DocumentQuestionAnsweringTool parameter input to match docstring ( #23939 )
...
rename encode input to match docstring
2023-06-01 10:54:01 -04:00
Sylvain Gugger
9193188276
Pin rhoknp ( #23937 )
2023-06-01 10:25:43 -04:00
Sheon Han
af2c36793f
Fix doc string nits ( #23929 )
2023-06-01 10:10:15 -04:00
fxmarty
9a35a7b9e1
Effectively allow encoder_outputs
input to be a tuple in pix2struct ( #23932 )
...
consistentcy
2023-06-01 09:07:57 -04:00
Sanchit Gandhi
9603ef890a
[Flax Whisper] Update decode docstring ( #23908 )
2023-06-01 14:36:45 +02:00
Sylvain Gugger
fabe17a726
Skip device placement for past key values in decoder models ( #23919 )
2023-05-31 15:32:21 -04:00
NielsRogge
6affd9cd7c
[PushToHub] Make it possible to upload folders ( #23920 )
...
Add first draft
2023-05-31 15:31:28 -04:00
Sylvain Gugger
4aa13224a5
Update the update metadata job to use upload_folder ( #23917 )
2023-05-31 14:10:14 -04:00
Sylvain Gugger
3ff443a6d9
Re-enable squad test ( #23912 )
...
* Re-enable squad test
* [all-test]
* [all-test] Fix all test command
* Fix the all-test
2023-05-31 13:44:26 -04:00
Sourab Mangrulkar
d13021e35f
remove the extra accelerator.prepare
( #23914 )
...
remove the extra `accelerator.prepare` that slipped in with multiple update from main 😅
2023-05-31 23:04:55 +05:30
amyeroberts
c608b8fc93
Bug fix - flip_channel_order for channels first images ( #23701 )
...
Bug fix - flip_channel_order for channels_first
2023-05-31 17:12:27 +01:00
Sylvain Gugger
0b3d092f63
Empty circleci config ( #23913 )
...
* Try easy first
* Add an empty job
* Fix name
* Fix method
2023-05-31 12:02:05 -04:00
amyeroberts
8714b964ee
Raise error if loss can't be calculated - ViT MIM ( #23872 )
...
Raise error if loss can't be calculated
2023-05-31 17:01:53 +01:00
Hari
404d925384
add conditional statement for auxiliary loss calculation ( #23899 )
...
* add conditional statement for auxiliary loss calculation
* fix style and copies
2023-05-31 16:40:23 +01:00