Marc Sun
fa6107c97e
modify context length for GPTQ + version bump ( #25899 )
...
* add new arg for gptq
* add tests
* add min version autogptq
* fix order
* skip test
* fix
* Update src/transformers/modeling_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix style
* change model path
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-09-06 11:45:47 -04:00
Matt
300d6a4a62
Remove Falcon from undocumented list ( #26008 )
...
Remove falcon from undocumented list
2023-09-06 15:49:04 +01:00
Harheem Kim
fa522d8d7b
🌐 [i18n-KO] Translated llm_tutorial.md
to Korean ( #25791 )
...
* docs: ko: llm_tutoroal.md
* feat: chatgpt draft
* fix: manual edits
* fix: resolve suggestions
* fix: resolve suggestions
2023-09-06 07:40:03 -07:00
zspo
3e203f92be
Fix small typo README.md ( #25934 )
...
* fix some samll bugs in readme
* Update docs/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-09-06 14:07:29 +01:00
Matt
842e99f1b9
TF-OPT attention mask fixes ( #25238 )
...
* stash commit
* More OPT updates
* Update src/transformers/models/opt/modeling_tf_opt.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-09-06 13:37:27 +01:00
Lysandre Debut
f6301b9a13
Falcon: fix revision propagation ( #26006 )
...
* Fix revision propagation
* Cleaner
2023-09-06 07:21:00 -04:00
Nino Risteski
f6295c6c53
Update README.md ( #26003 )
...
fixed a typo
2023-09-06 10:55:11 +01:00
tju_skywalker
172f42c512
save space when converting hf model to megatron model. ( #25950 )
...
* fix convert megatron model too large
* fix convert megatron model too large
2023-09-05 16:47:48 -04:00
Tanay Mehta
b8def68934
Fix Mega chunking error when using decoder-only model ( #25765 )
...
* add: potential fix to mega chunking in decoder only model bug
* add: decoder with chunking test
* add: input_mask passed with input_ids
2023-09-05 21:50:14 +02:00
Arthur
4fa0aff21e
[VITS
] tokenizer integration test: fix revision did not exist ( #25996 )
...
* revision did not exist
* correct revision
2023-09-05 21:21:33 +02:00
Arthur
d0354e5e86
[CI
] Fix red CI and ERROR failed should show ( #25995 )
...
* start with error too
* fix ?
* start with nit
* one more path
* use `job_name`
* mark pipeline test as slow
2023-09-05 20:16:00 +02:00
Injin Paek
6206f599e1
Add LLaMA resources ( #25859 )
...
* docs: feat: model resources for llama
* fix: resolve suggestion
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-09-05 10:50:08 -07:00
Sanchit Gandhi
8d518013ef
[Wav2Vec2 Conformer] Fix inference float16 ( #25985 )
...
* [Wav2Vec2 Conformer] Fix inference float16
* fix test
* fix test more
* clean pipe test
2023-09-05 18:26:06 +01:00
Sourab Mangrulkar
6bc517ccd4
deepspeed resume from ckpt fixes and adding support for deepspeed optimizer and HF scheduler ( #25863 )
...
* Add support for deepspeed optimizer and HF scheduler
* fix bug
* fix the import
* fix issue with deepspeed scheduler saving for hf optim + hf scheduler scenario
* fix loading of hf scheduler when loading deepspeed checkpoint
* fix import of `DeepSpeedSchedulerWrapper`
* add tests
* add the comment and skip the failing tests
* address comment
2023-09-05 22:31:20 +05:30
raghavanone
1110b565d6
Add TFDebertaV2ForMultipleChoice ( #25932 )
...
* Add TFDebertaV2ForMultipleChoice
* Import newer model in main init
* Fix import issues
* Fix copies
* Add doc
* Fix tests
* Fix copies
* Fix docstring
2023-09-05 17:13:06 +01:00
andreeahedes
da1af21dbb
PegasusX add _no_split_modules ( #25933 )
...
* no_split_modules
* no_split_modules
* inputs_embeds+pos same device
* update _no_split_modules
* update _no_split_modules
2023-09-05 16:34:34 +01:00
Abhilash Majumder
70a98024b1
Patch with accelerate xpu ( #25714 )
...
* patch with accelerate xpu
* patch with accelerate xpu
* formatting
* fix tests
* revert ruff unrelated fixes
* revert ruff unrelated fixes
* revert ruff unrelated fixes
* fix test
* review fixes
* review fixes
* black fixed
* review commits
* review commits
* style fix
* use pytorch_utils
* revert markuplm test
2023-09-05 15:41:42 +01:00
Yih-Dar
aa5c94d38d
Show failed tests on CircleCI layout in a better way ( #25895 )
...
* update
* update
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-05 15:49:33 +02:00
Joao Gante
9a70d6e56f
Trainer: delegate default generation values to generation_config
( #25987 )
2023-09-05 14:47:00 +01:00
Sahel Sharify
aea761499f
Update training_args.py to remove the runtime error ( #25920 )
...
This cl iterates through a list of keys rather than dict items while updating the dict elements. Fixes the following error:
File "..../transformers/training_args.py", line 1544, in post_init
for k, v in self.fsdp_config.items():
RuntimeError: dictionary keys changed during iteration
2023-09-05 12:43:51 +01:00
Traun Leyden
7011cd8667
Update RAG README.md with correct path to examples/seq2seq ( #25953 )
...
Update README.md with correct path to examples/seq2seq
2023-09-05 12:31:59 +01:00
Julien Chaumond
6316ce8d27
[doc] Always call it Agents for consistency ( #25958 )
2023-09-05 12:27:20 +01:00
Yih-Dar
391f26459a
Use main in conversion script ( #25973 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-05 13:04:49 +02:00
Kai
6f125aaa48
fix typo ( #25981 )
...
rename doanloading to downloading
2023-09-05 11:13:06 +01:00
Susnato Dhar
52a46dc57b
Add Pop2Piano
space demo. ( #25975 )
...
Update pop2piano.md
2023-09-05 11:07:02 +01:00
Huazhong Ji
1cc3bc22fe
nn.Identity is not required to be compatible with PyTorch < 1.1.0 as the minimum PyTorch version we currently support is 1.10.0 ( #25974 )
...
nn.Identity is not required to be compatible with PyTorch < 1.1.0 as the
minimum PyTorch version we currently support is 1.10.0
2023-09-05 11:37:54 +02:00
Yih-Dar
fbbe1b8a40
Fix test_load_img_url_timeout
( #25976 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-05 11:34:28 +02:00
Yih-Dar
feec56959a
Fix Detr CI ( #25972 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-05 11:19:56 +02:00
Susnato Dhar
404ff8fc17
Fix typo ( #25966 )
...
* Update feature_extraction_clap.py
* changed all lenght to length
2023-09-05 10:12:25 +02:00
Lysandre
d8e13b3e04
v4.34.dev.0
2023-09-04 15:12:11 -04:00
Younes Belkada
49b69fe0d4
[Falcon
] Remove SDPA for falcon to support earlier versions of PyTorch (< 2.0) ( #25947 )
...
* remove SDPA for falcon
* revert previous behaviour and add warning
* nit
* Update src/transformers/models/falcon/modeling_falcon.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Update src/transformers/models/falcon/modeling_falcon.py
---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Lysandre Debut <hi@lysand.re>
2023-09-04 14:34:04 -04:00
Lysandre Debut
22a69f1d7d
Put Falcon back ( #25960 )
...
* Put Falcon back
* Update src/transformers/models/auto/configuration_auto.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update test
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-09-04 14:17:09 -04:00
David Reguera
040c4613c2
Add type hints for tf models final batch ( #25883 )
...
* Add missing type hints and consistency to `RegNet` models
* Add missing type hints and consistency to `TFSamModel`
* Add missing type hints to `TFSegformerDecodeHead`
* Add missing type hints and consistency to `TransfoXL` family models
* Add missing type hints and consistency to `TFWav2Vec2ForSequenceClassification`
* Add type hints to `TFXLMModel`
* Fix linter
* Revert the type hints for `RegNet` to python 3.8 compliant
* Remove the redundant np.ndarray type hint.
2023-09-04 18:16:10 +01:00
Yih-Dar
44d2c199f6
Fix smart check ( #25955 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-04 18:54:34 +02:00
Lysandre Debut
3a479672ea
Fix failing test ( #25963 )
2023-09-04 12:53:50 -04:00
Matt
034bc5d26a
Add proper Falcon docs and conversion script ( #25954 )
...
* Add proper Falcon docs and conversion script
* Autodetect the decoder architecture instead of using an arg
* Update docs now that we can autodetect
* Fix doc error
* Add doc to toctree
* Quick doc update
2023-09-04 17:18:34 +01:00
Sanchit Gandhi
d750eff627
[VITS] Fix init test ( #25945 )
...
* [VITS] Fix init test
* add flaky decorator
* style
* max attempts
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
* style
---------
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
2023-09-04 17:09:26 +01:00
Nino Risteski
7cd01d4e38
Update README.md ( #25922 )
...
fixed a typo
2023-09-04 16:11:00 +02:00
Omar Sanseviero
bfb1895e33
Import deepspeed utilities from integrations ( #25919 )
...
Follow up from #25599
2023-09-04 14:03:48 +01:00
Sanchit Gandhi
eb984418e2
[VITS] Handle deprecated weight norm ( #25946 )
2023-09-04 11:54:03 +01:00
Sanchit Gandhi
f435003e0c
[MMS] Fix pip install in docs ( #25949 )
2023-09-04 11:53:41 +01:00
Nino Risteski
604a6c51ae
Update README.md ( #25941 )
...
fixed a typo
2023-09-04 11:28:21 +01:00
Nino Risteski
d4407a3bd1
Update autoclass_tutorial.md ( #25929 )
...
fixed typos
2023-09-04 11:16:49 +01:00
Nino Risteski
51e1e8120b
Update community.md ( #25928 )
...
fixed a few typos
2023-09-04 11:16:34 +01:00
omahs
0f0e1a2c2b
Fix typos ( #25936 )
...
* fix typo
* fix typo
* fix typo
* fix typos
* fix typos
* fix typo
* fix typo
* fix typo
* fix typos
* fix typo
* fix typo
* fix typo
* fix typos
* fix typos
2023-09-04 11:15:12 +01:00
Yih-Dar
b1d475f6d2
Skip offload tests for ViTDet
( #25913 )
...
* update
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-04 11:35:39 +02:00
ydshieh
ab8cba824e
CI: hotfix (skip VitsModelTest::test_initialization)
2023-09-04 09:06:11 +02:00
Nino Risteski
0afa5071bd
Update model_memory_anatomy.md ( #25896 )
...
typo fixes
2023-09-01 12:27:01 -07:00
Arthur
a4dd53d88e
Update-llama-code ( #25826 )
...
* some bug fixes
* updates
* Update code_llama.md
Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>
* Add co author
Co-authored-by: pcuenca <pedro@latenitesoft.com>
* add a test
* fixup
* nits
* some updates
* fix-coies
* adress comments
* nits
* nits
* fix docsting
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* update
* add int for https://huggingface.co/spaces/hf-accelerate/model-memory-usage
---------
Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>
Co-authored-by: pcuenca <pedro@latenitesoft.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-09-01 20:40:40 +02:00
Sanchit Gandhi
3587769c08
[VITS] Only trigger tokenizer warning for uroman ( #25915 )
2023-09-01 19:27:01 +01:00