transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Yih-Dar	ff4c0fc7d2	Tiny fix for `check_self_hosted_runner.py` (#24052 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-06 18:17:41 +02:00
amyeroberts	a717e0318c	Add TimmBackbone model (#22619 ) * Add test_backbone for convnext * Add TimmBackbone model * Add check for backbone type * Tidying up - config checks * Update convnextv2 * Tidy up * Fix indices & clearer comment * Exceptions for config checks * Correclty update config for tests * Safer imports * Safer safer imports * Fix where decorators go * Update import logic and backbone tests * More import fixes * Fixup * Only import all_models if torch available * Fix kwarg updates in from_pretrained & main rebase * Tidy up * Add tests for AutoBackbone * Tidy up * Fix import error * Fix up * Install nattan in doc_test_job * Revert back to setting self._out_xxx directly * Bug fix - out_indices mapping from out_features * Fix tests * Dont accept output_loading_info for Timm models * Set out_xxx and don't remap * Use smaller checkpoint for test * Don't remap timm indices - check out_indices based on stage names * Skip test as it's n/a * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Cleaner imports / spelling is hard --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-06 17:11:30 +01:00
Sylvain Gugger	b8935980a2	Modification of one text example file should trigger said test (#24051 )	2023-06-06 12:02:56 -04:00
Tom Aarsen	02fe3af275	Prevent ZeroDivisionError on `trainer.evaluate` if model and dataset are tiny (#24049 ) Prevent ZeroDivisionError if evaluation is too quick	2023-06-06 11:31:05 -04:00
Roy Hvaara	d924390d5b	Use TruncatedNormal from Keras initializers (#24036 ) Co-authored-by: Andrey Voynov <avoin@google.com>	2023-06-06 14:51:44 +01:00
Nicolas Patry	c2e3fa0b2a	Fixing single candidate_label return. (#24023 )	2023-06-06 15:26:10 +02:00
Marc Sun	6307312dfc	Add check for tied parameters (#24029 ) * Add check for tied parameters * Fix style * fix style * Fix versioning * Change if to elif	2023-06-06 09:12:46 -04:00
Wonhyeong Seo	7da3ce04a6	🌐 [i18n-KO] Translated `bertology.mdx` to Korean (#23968 ) * docs: ko: `bertology.mdx` * feat: nmt draft * fix: manual edits * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-06-06 09:08:45 -04:00
Wonhyeong Seo	c938597657	🌐 [i18n-KO] Translated `language-modeling.mdx` (#23969 ) * docs: ko: `language_modeling.mdx` * feat: nmt draft * fix: manual edits * fix: add inline toc * fix: typo in toc_tree.yml * fix: resolve suggestions Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> --------- Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-06-06 09:08:26 -04:00
Yih-Dar	7631db0fdc	Pin `deepspeed` to `0.9.2` for now (#24024 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-05 20:00:28 +02:00
Yih-Dar	17846646f2	Fix `MobileViTV2` checkpoint name (#24018 ) * fix * fix * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-05 18:12:45 +02:00
Hyeonseo Yun	649ffbf575	🌐 [i18n-KO] Translated `tasks_explained.mdx` to Korean (#23844 ) * docs: ko: tasks_explained.mdx * feat: nmt and manual edit `tasks_explained.mdx` * revised: resolve suggestions task_explained.mdx * fixed: added draft of reference docs Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> * revised: resolve suggestions(voca, spell check) task_explained.mdx Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * revised: remove duplicate sentence in task_explained.mdx * fixed: remove draft of reference docs - I think it will be confusing in the translation process. - This issue is included in #23971. --------- Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-06-05 12:02:03 -04:00
Brian Yu	2872f9671b	TensorBoard callback no longer adds hparams (#23999 ) tensorboard callback no longer adds hparams	2023-06-05 11:53:45 -04:00
Jungwoo Park	44bd590a29	Pix2Struct: fix wrong broadcast axis of attention mask in visual encoder (#23976 ) * fix wrong broadcast axis of attention mask in visual encoder * fix slow tests --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-06-05 11:47:29 -04:00
Yessen Kanapin	7824fa431e	expose safe_serialization argument in the pipeline API (#23775 ) expose safe_serialization argument of PreTrainedModel and TFPreTrainedModel in the save_pretrained of the pipeline api Co-authored-by: Yessen Kanapin <yessen@deepinfra.com>	2023-06-05 11:19:58 -04:00
Bearnardd	b4919cb520	Auto tokenizer registration (#23965 ) add check loop over extra content	2023-06-05 11:10:47 -04:00
Yih-Dar	b143019005	Update README.md (#24022 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-05 17:08:15 +02:00
Yih-Dar	5176dc2310	Skip `test_multi_gpu_data_parallel_forward` for `MobileViTV2ModelTest` (#24017 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-05 16:29:32 +02:00
Sourab Mangrulkar	460b844360	fix trainer slow tests related to hyperparam search (#24011 ) * fix trainer slow tests * commit 2	2023-06-05 17:58:10 +05:30
Kaede Fujisaki	3c3108972a	Fix typo in doc comment of BitsAndBytesConfig (#23978 )	2023-06-05 12:10:31 +01:00
dependabot[bot]	539e2281cd	Bump cryptography from 39.0.1 to 41.0.0 in /examples/research_projects/decision_transformer (#23964 ) Bump cryptography in /examples/research_projects/decision_transformer Bumps [cryptography](https://github.com/pyca/cryptography) from 39.0.1 to 41.0.0. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/39.0.1...41.0.0) --- updated-dependencies: - dependency-name: cryptography dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-06-02 16:23:44 -04:00
Eli Simhayev	bacaab1629	Added time-series blogs to the models (#23857 ) * added blogs to docs * removed new-line	2023-06-02 12:32:34 -04:00
Matt	167a0d8f87	Add an option to reduce compile() console spam (#23938 ) * Add an option to reduce compile() console spam * Add annotations to the example scripts * Add notes to the quicktour docs as well * minor fix	2023-06-02 15:28:52 +01:00
Sanchit Gandhi	c9cf337772	[Whisper Tokenizer] Skip special tokens when decoding with timestamps (#23945 )	2023-06-02 16:26:59 +02:00
Claudius Kienle	8940d315aa	Trainer: fixed evaluate raising `KeyError` for ReduceLROnPlateau (#23952 ) Trainer: fixed KeyError on evaluate for ReduceLROnPlateau Co-authored-by: Claudius Kienle <claudius.kienle@artiminds.com>	2023-06-02 08:53:48 -04:00
Kihoon Son	2fdba73a99	🌐 [i18n-KO] Translated object_detection.mdx to Korean (#23164 ) * translated object_detection.mdx Co-Authored-By: Hyeonseo Yun <0525_hhgus@naver.com> Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> Co-Authored-By: simso <3035487+simso@users.noreply.github.com> Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com> Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com> Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> --------- Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: simso <3035487+simso@users.noreply.github.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-06-02 07:43:55 -04:00
Patrick von Platen	dcb5e18c9e	add new mms functions to doc (#23954 )	2023-06-02 11:35:52 +01:00
Shehan Munasinghe	07c54413ac	Add MobileViTv2 (#22820 ) * generated code from add-new-model-like * Add code for modeling, config, and weight conversion * add tests for image-classification, update modeling and config * add code, tests for semantic-segmentation * make style, make quality, make fix-copies * make fix-copies * Update modeling_mobilevitv2.py fix bugs * Update _toctree.yml * update modeling, config fix bugs * Edit docs - fix bug MobileViTv2v2 -> MobileViTv2 * Update mobilevitv2.mdx * update docstrings * Update configuration_mobilevitv2.py make style * Update convert_mlcvnets_to_pytorch.py remove unused options * Update convert_mlcvnets_to_pytorch.py make style * Add suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * make style, make quality * Add suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add suggestions from code review Remove MobileViTv2ImageProcessor Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * make style * Add suggestions from code review Rename MobileViTv2 -> MobileViTV2 Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update modeling_mobilevitv2.py make style * Update serialization.mdx * Update modeling_mobilevitv2.py --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-02 10:37:02 +01:00
Patrick von Platen	5dfd407b37	[MMS] Scaling Speech Technology to 1,000+ Languages \| Add attention adapter to Wav2Vec2 (#23813 ) * add fine-tuned with adapter layer * Add set_target_lang to tokenizer * Implement load adapter * add tests * make style * Apply suggestions from code review * Update src/transformers/models/wav2vec2/tokenization_wav2vec2.py * make fix-copies * Apply suggestions from code review * make fix-copies * make style again * mkae style again * fix doc string * Update tests/models/wav2vec2/test_tokenization_wav2vec2.py * Apply suggestions from code review * fix * Correct wav2vec2 adapter * mkae style * Update src/transformers/models/wav2vec2/modeling_wav2vec2.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * add more nice docs * finish * finish * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review * all finish --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-02 10:30:24 +01:00
wasupandceacar	f49a3453ca	Fix `ReduceLROnPlateau` object has no attribute 'get_last_lr' (#23944 ) * Fix 'ReduceLROnPlateau' object has no attribute 'get_last_lr' * fix style	2023-06-01 16:10:52 -04:00
Kashif Rasul	c62b01d0b0	use _make_causal_mask in clip/vit models (#23942 ) use _make_causal_mask in clip models	2023-06-01 16:10:24 -04:00
Marc Sun	e03a9cc0cd	Modify device_map behavior when loading a model using from_pretrained (#23922 ) * Modify device map behavior for 4/8 bits model * Remove device_map arg for training 4/8 bit model * Remove index Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add Exceptions * Modify comment Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix formatting * Get current device with accelerate * Revert "Get current device with accelerate" This reverts commit `46f0079910`. * Fix Exception * Modify quantization doc * Fix error Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-01 13:21:22 -04:00
Brendon Soong	d1fa349e78	#23675 Registering Malay language (#23689 ) * #23675 Registering Malay language * removing untranslated files * some translate * more updates to toctree * inc index * additional translations for toctree * translations of more sections * removing untranslated file * translated index.mdx to malay	2023-06-01 13:17:27 -04:00
Lysandre Debut	dc67da0182	Revert "Update stale.yml to use HuggingFaceBot" (#23943 ) Revert "Update stale.yml to use HuggingFaceBot (#23941)" This reverts commit `5929f86ebb`.	2023-06-01 11:58:11 -04:00
Matt	8088ca4185	Make TF ESM inv_freq non-trainable like PyTorch (#23940 ) Make TF inv_freq non-trainable like PyTorch	2023-06-01 16:15:00 +01:00
Lysandre Debut	5929f86ebb	Update stale.yml to use HuggingFaceBot (#23941 )	2023-06-01 10:54:50 -04:00
Adam Lewis	857d4e1c87	rename DocumentQuestionAnsweringTool parameter input to match docstring (#23939 ) rename encode input to match docstring	2023-06-01 10:54:01 -04:00
Sylvain Gugger	9193188276	Pin rhoknp (#23937 )	2023-06-01 10:25:43 -04:00
Sheon Han	af2c36793f	Fix doc string nits (#23929 )	2023-06-01 10:10:15 -04:00
fxmarty	9a35a7b9e1	Effectively allow `encoder_outputs` input to be a tuple in pix2struct (#23932 ) consistentcy	2023-06-01 09:07:57 -04:00
Sanchit Gandhi	9603ef890a	[Flax Whisper] Update decode docstring (#23908 )	2023-06-01 14:36:45 +02:00
Sylvain Gugger	fabe17a726	Skip device placement for past key values in decoder models (#23919 )	2023-05-31 15:32:21 -04:00
NielsRogge	6affd9cd7c	[PushToHub] Make it possible to upload folders (#23920 ) Add first draft	2023-05-31 15:31:28 -04:00
Sylvain Gugger	4aa13224a5	Update the update metadata job to use upload_folder (#23917 )	2023-05-31 14:10:14 -04:00
Sylvain Gugger	3ff443a6d9	Re-enable squad test (#23912 ) * Re-enable squad test * [all-test] * [all-test] Fix all test command * Fix the all-test	2023-05-31 13:44:26 -04:00
Sourab Mangrulkar	d13021e35f	remove the extra `accelerator.prepare` (#23914 ) remove the extra `accelerator.prepare` that slipped in with multiple update from main 😅	2023-05-31 23:04:55 +05:30
amyeroberts	c608b8fc93	Bug fix - flip_channel_order for channels first images (#23701 ) Bug fix - flip_channel_order for channels_first	2023-05-31 17:12:27 +01:00
Sylvain Gugger	0b3d092f63	Empty circleci config (#23913 ) * Try easy first * Add an empty job * Fix name * Fix method	2023-05-31 12:02:05 -04:00
amyeroberts	8714b964ee	Raise error if loss can't be calculated - ViT MIM (#23872 ) Raise error if loss can't be calculated	2023-05-31 17:01:53 +01:00
Hari	404d925384	add conditional statement for auxiliary loss calculation (#23899 ) * add conditional statement for auxiliary loss calculation * fix style and copies	2023-05-31 16:40:23 +01:00

1 2 3 4 5 ...

13076 Commits