transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Marc Sun	fa6107c97e	modify context length for GPTQ + version bump (#25899 ) * add new arg for gptq * add tests * add min version autogptq * fix order * skip test * fix * Update src/transformers/modeling_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix style * change model path --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-09-06 11:45:47 -04:00
Matt	300d6a4a62	Remove Falcon from undocumented list (#26008 ) Remove falcon from undocumented list	2023-09-06 15:49:04 +01:00
Harheem Kim	fa522d8d7b	🌐[i18n-KO] Translated `llm_tutorial.md` to Korean (#25791 ) * docs: ko: llm_tutoroal.md * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions * fix: resolve suggestions	2023-09-06 07:40:03 -07:00
zspo	3e203f92be	Fix small typo README.md (#25934 ) * fix some samll bugs in readme * Update docs/README.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-06 14:07:29 +01:00
Matt	842e99f1b9	TF-OPT attention mask fixes (#25238 ) * stash commit * More OPT updates * Update src/transformers/models/opt/modeling_tf_opt.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-06 13:37:27 +01:00
Lysandre Debut	f6301b9a13	Falcon: fix revision propagation (#26006 ) * Fix revision propagation * Cleaner	2023-09-06 07:21:00 -04:00
Nino Risteski	f6295c6c53	Update README.md (#26003 ) fixed a typo	2023-09-06 10:55:11 +01:00
tju_skywalker	172f42c512	save space when converting hf model to megatron model. (#25950 ) * fix convert megatron model too large * fix convert megatron model too large	2023-09-05 16:47:48 -04:00
Tanay Mehta	b8def68934	Fix Mega chunking error when using decoder-only model (#25765 ) * add: potential fix to mega chunking in decoder only model bug * add: decoder with chunking test * add: input_mask passed with input_ids	2023-09-05 21:50:14 +02:00
Arthur	4fa0aff21e	[`VITS`] tokenizer integration test: fix revision did not exist (#25996 ) * revision did not exist * correct revision	2023-09-05 21:21:33 +02:00
Arthur	d0354e5e86	[`CI`] Fix red CI and ERROR failed should show (#25995 ) * start with error too * fix ? * start with nit * one more path * use `job_name` * mark pipeline test as slow	2023-09-05 20:16:00 +02:00
Injin Paek	6206f599e1	Add LLaMA resources (#25859 ) * docs: feat: model resources for llama * fix: resolve suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-09-05 10:50:08 -07:00
Sanchit Gandhi	8d518013ef	[Wav2Vec2 Conformer] Fix inference float16 (#25985 ) * [Wav2Vec2 Conformer] Fix inference float16 * fix test * fix test more * clean pipe test	2023-09-05 18:26:06 +01:00
Sourab Mangrulkar	6bc517ccd4	deepspeed resume from ckpt fixes and adding support for deepspeed optimizer and HF scheduler (#25863 ) * Add support for deepspeed optimizer and HF scheduler * fix bug * fix the import * fix issue with deepspeed scheduler saving for hf optim + hf scheduler scenario * fix loading of hf scheduler when loading deepspeed checkpoint * fix import of `DeepSpeedSchedulerWrapper` * add tests * add the comment and skip the failing tests * address comment	2023-09-05 22:31:20 +05:30
raghavanone	1110b565d6	Add TFDebertaV2ForMultipleChoice (#25932 ) * Add TFDebertaV2ForMultipleChoice * Import newer model in main init * Fix import issues * Fix copies * Add doc * Fix tests * Fix copies * Fix docstring	2023-09-05 17:13:06 +01:00
andreeahedes	da1af21dbb	PegasusX add _no_split_modules (#25933 ) * no_split_modules * no_split_modules * inputs_embeds+pos same device * update _no_split_modules * update _no_split_modules	2023-09-05 16:34:34 +01:00
Abhilash Majumder	70a98024b1	Patch with accelerate xpu (#25714 ) * patch with accelerate xpu * patch with accelerate xpu * formatting * fix tests * revert ruff unrelated fixes * revert ruff unrelated fixes * revert ruff unrelated fixes * fix test * review fixes * review fixes * black fixed * review commits * review commits * style fix * use pytorch_utils * revert markuplm test	2023-09-05 15:41:42 +01:00
Yih-Dar	aa5c94d38d	Show failed tests on CircleCI layout in a better way (#25895 ) * update * update * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 15:49:33 +02:00
Joao Gante	9a70d6e56f	Trainer: delegate default generation values to `generation_config` (#25987 )	2023-09-05 14:47:00 +01:00
Sahel Sharify	aea761499f	Update training_args.py to remove the runtime error (#25920 ) This cl iterates through a list of keys rather than dict items while updating the dict elements. Fixes the following error: File "..../transformers/training_args.py", line 1544, in post_init: `for k, v in self.fsdp_config.items():` RuntimeError: dictionary keys changed during iteration	2023-09-05 12:43:51 +01:00
Traun Leyden	7011cd8667	Update RAG README.md with correct path to examples/seq2seq (#25953 ) Update README.md with correct path to examples/seq2seq	2023-09-05 12:31:59 +01:00
Julien Chaumond	6316ce8d27	[doc] Always call it Agents for consistency (#25958 )	2023-09-05 12:27:20 +01:00
Yih-Dar	391f26459a	Use main in conversion script (#25973 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 13:04:49 +02:00
Kai	6f125aaa48	fix typo (#25981 ) rename doanloading to downloading	2023-09-05 11:13:06 +01:00
Susnato Dhar	52a46dc57b	Add `Pop2Piano` space demo. (#25975 ) Update pop2piano.md	2023-09-05 11:07:02 +01:00
Huazhong Ji	1cc3bc22fe	nn.Identity is not required to be compatible with PyTorch < 1.1.0 as the minimum PyTorch version we currently support is 1.10.0 (#25974 ) nn.Identity is not required to be compatible with PyTorch < 1.1.0 as the minimum PyTorch version we currently support is 1.10.0	2023-09-05 11:37:54 +02:00
Yih-Dar	fbbe1b8a40	Fix `test_load_img_url_timeout` (#25976 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 11:34:28 +02:00
Yih-Dar	feec56959a	Fix Detr CI (#25972 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-05 11:19:56 +02:00
Susnato Dhar	404ff8fc17	Fix typo (#25966 ) * Update feature_extraction_clap.py * changed all lenght to length	2023-09-05 10:12:25 +02:00
Lysandre	d8e13b3e04	v4.34.dev.0	2023-09-04 15:12:11 -04:00
Younes Belkada	49b69fe0d4	[`Falcon`] Remove SDPA for falcon to support earlier versions of PyTorch (< 2.0) (#25947 ) * remove SDPA for falcon * revert previous behaviour and add warning * nit * Update src/transformers/models/falcon/modeling_falcon.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/falcon/modeling_falcon.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-09-04 14:34:04 -04:00
Lysandre Debut	22a69f1d7d	Put Falcon back (#25960 ) * Put Falcon back * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update test --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-09-04 14:17:09 -04:00
David Reguera	040c4613c2	Add type hints for tf models final batch (#25883 ) * Add missing type hints and consistency to `RegNet` models * Add missing type hints and consistency to `TFSamModel` * Add missing type hints to `TFSegformerDecodeHead` * Add missing type hints and consistency to `TransfoXL` family models * Add missing type hints and consistency to `TFWav2Vec2ForSequenceClassification` * Add type hints to `TFXLMModel` * Fix linter * Revert the type hints for `RegNet` to python 3.8 compliant * Remove the redundant np.ndarray type hint.	2023-09-04 18:16:10 +01:00
Yih-Dar	44d2c199f6	Fix smart check (#25955 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-04 18:54:34 +02:00
Lysandre Debut	3a479672ea	Fix failing test (#25963 )	2023-09-04 12:53:50 -04:00
Matt	034bc5d26a	Add proper Falcon docs and conversion script (#25954 ) * Add proper Falcon docs and conversion script * Autodetect the decoder architecture instead of using an arg * Update docs now that we can autodetect * Fix doc error * Add doc to toctree * Quick doc update	2023-09-04 17:18:34 +01:00
Sanchit Gandhi	d750eff627	[VITS] Fix init test (#25945 ) * [VITS] Fix init test * add flaky decorator * style * max attempts Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * style --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2023-09-04 17:09:26 +01:00
Nino Risteski	7cd01d4e38	Update README.md (#25922 ) fixed a typo	2023-09-04 16:11:00 +02:00
Omar Sanseviero	bfb1895e33	Import deepspeed utilities from integrations (#25919 ) Follow up from #25599	2023-09-04 14:03:48 +01:00
Sanchit Gandhi	eb984418e2	[VITS] Handle deprecated weight norm (#25946 )	2023-09-04 11:54:03 +01:00
Sanchit Gandhi	f435003e0c	[MMS] Fix pip install in docs (#25949 )	2023-09-04 11:53:41 +01:00
Nino Risteski	604a6c51ae	Update README.md (#25941 ) fixed a typo	2023-09-04 11:28:21 +01:00
Nino Risteski	d4407a3bd1	Update autoclass_tutorial.md (#25929 ) fixed typos	2023-09-04 11:16:49 +01:00
Nino Risteski	51e1e8120b	Update community.md (#25928 ) fixed a few typos	2023-09-04 11:16:34 +01:00
omahs	0f0e1a2c2b	Fix typos (#25936 ) * fix typo * fix typo * fix typo * fix typos * fix typos * fix typo * fix typo * fix typo * fix typos * fix typo * fix typo * fix typo * fix typos * fix typos	2023-09-04 11:15:12 +01:00
Yih-Dar	b1d475f6d2	Skip offload tests for `ViTDet` (#25913 ) * update * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-04 11:35:39 +02:00
ydshieh	ab8cba824e	CI: hotfix (skip VitsModelTest::test_initialization)	2023-09-04 09:06:11 +02:00
Nino Risteski	0afa5071bd	Update model_memory_anatomy.md (#25896 ) typo fixes	2023-09-01 12:27:01 -07:00
Arthur	a4dd53d88e	Update-llama-code (#25826 ) * some bug fixes * updates * Update code_llama.md Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> * Add co author Co-authored-by: pcuenca <pedro@latenitesoft.com> * add a test * fixup * nits * some updates * fix-coies * adress comments * nits * nits * fix docsting * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update * add int for https://huggingface.co/spaces/hf-accelerate/model-memory-usage --------- Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by: pcuenca <pedro@latenitesoft.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-01 20:40:40 +02:00
Sanchit Gandhi	3587769c08	[VITS] Only trigger tokenizer warning for uroman (#25915 )	2023-09-01 19:27:01 +01:00


13924 Commits
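
Commit aea761499f (#25920) in the log above describes iterating over a list of keys instead of dict items while updating the dict, to avoid `RuntimeError: dictionary keys changed during iteration`. A minimal Python sketch of that pattern follows. The function name `strip_key_prefix`, the `fsdp_` prefix, and the sample keys are hypothetical and are not taken from `training_args.py`.

```python
def strip_key_prefix(config: dict, prefix: str) -> dict:
    """Rename keys in place by dropping `prefix` (hypothetical helper)."""
    # Snapshot the keys first: list(...) copies them, so the loop no longer
    # reads from the dict while the body below removes and inserts keys.
    for key in list(config.keys()):
        if key.startswith(prefix):
            # pop() removes the old key and returns its value, which is
            # then stored under the shortened key.
            config[key[len(prefix):]] = config.pop(key)
    return config


# Hypothetical sample values, for illustration only.
example = {"fsdp_min_num_params": 0, "xla": False}
strip_key_prefix(example, "fsdp_")
```

Looping over `config.items()` directly while renaming keys is the unsafe variant: the dict is being modified while its own iterator is still active, which CPython can detect and report with the error quoted in the commit message.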