Daniel Levenson
4e1522d65a
Fix typo in mega.mdx ( #22998 )
...
MegaConfiig -> MegaConfig
2023-04-25 17:58:45 -04:00
Wonhyeong Seo
d95045717e
🌐 [i18n-KO] Translated serialization.mdx
to Korean ( #22806 )
...
docs: ko: serialization.mdx
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-04-25 12:38:51 -04:00
Younes Belkada
a0ae2310ec
[DocTest
] Fix correct checkpoint ( #22988 )
...
fix pipeline issue
2023-04-25 15:20:36 +02:00
Lingepumpe
5427250351
Avoid invalid escape sequences, use raw strings ( #22936 )
...
* Avoid invalid escape sequences, use raw strings
* Integrate PR feedback
2023-04-25 09:17:56 -04:00
Jari Van Melckebeke
81c1910c86
fixed small typo in code example ( #22982 )
...
fixed typo in code example
fixed a really small typo in the docs of single gpu inference
2023-04-25 08:56:21 -04:00
AleksanderWWW
0a570dbd2e
Neptune fix bug init run ( #22836 )
...
* [neptune] fix checkpoint bug with relative out_dir
* update imports
* reformat with black
* check neptune without imports
* fix typing-related issue
* run black on code
* use os.path.sep instead of raw \
* simplify imports and remove type annotation
* make ruff happy
* apply review suggestions
* replace run with with_id kwarg to run
* update imports to avoid deprecation warnings for the latest client
---------
Co-authored-by: kshitij12345 <kshitijkalambarkar@gmail.com>
2023-04-25 08:51:05 -04:00
Younes Belkada
d4d628462f
[SAM
] Add sam doc ( #22984 )
...
* add sam doc
* fixes
* multiple fixes
2023-04-25 14:00:27 +02:00
Nayeon Han
f0f5e28f82
🌐 [i18n-KO] Fixed tasks/masked_language_modeling.mdx
( #22965 )
...
fix: docs: missing newline before code block
2023-04-25 09:59:17 +02:00
Yih-Dar
60f9649653
Fix DeepSpeed
CI job link in Past CI ( #22967 )
...
* Fix job link
* fix artifact name logic
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-25 09:52:19 +02:00
Yih-Dar
073baf7f22
Install accelerete@main
in PyTorch Past CI jobs ( #22963 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-24 21:19:06 +02:00
Joao Gante
e4a97f82bf
Generate: assisted generation with sample (take 2) ( #22949 )
...
* temperature controls speed
2023-04-24 19:54:55 +01:00
Gabriel Yang
7701716efc
🌐 [i18n-KO] translate create_a_model
doc to Korean ( #22754 )
...
docs: ko: translates create_a_model.mdx
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-04-24 13:02:19 -04:00
amyeroberts
8f20e61c85
Update feature selection in to_tf_dataset ( #21935 )
...
* Update feature selection
* Check compatibility with datasets version
* Checkout from datasets main
2023-04-24 17:34:30 +01:00
Matt
345a1371d8
Fix TF example in quicktour ( #22960 )
...
* Fix TF example in quicktour
* Fix model.fit() and the dataset section too
2023-04-24 17:25:13 +01:00
othertea
503e8c8b32
fix ValueError message in LlamaAttention ( #22966 )
2023-04-24 12:02:05 -04:00
Nicolas Patry
6e32959329
Reverting Deta cloning mecanism. ( #22656 )
...
* Fixed the revert by making sure that even the regexp can cover all
duplicates.
* Code simplification using hash.
* Fixing the `ident`.
* Fixing ignoring patterened duplicate names.
* Using `accelerate@find_tied_parameters` for from_pretrained
This is more correct there, since it handles meta device seemlessly
and we don't need to handle "non-duplicate" tensors (slices of each
other).
* Protecting accelerate.
* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-04-24 11:24:35 -04:00
Nayeon Han
d6f1da6b71
🌐 [i18n-KO] Translated run_scripts.mdx
to Korean ( #22793 )
...
docs: ko: `run_scripts` to Korean
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-04-24 10:18:20 -04:00
Lucain
74c55ab9e5
Prepare tests for hfh 0.14 ( #22958 )
...
* Test hf_hub 0.14.0rc1
* fix mocked tests
* package version
---------
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
Co-authored-by: testbot <lucainp@hf.co>
2023-04-24 09:31:50 -04:00
hanrui1sensetime
69f2d5386b
[Fix Bugs] Fix keys in _load_pretrained_model
( #22947 )
...
fix transformers keys
2023-04-24 09:28:51 -04:00
Connor Boyle
b5f06d6c59
Raise error if stride
is too high in TokenClassificationPipeline
( #22942 )
...
* Raise error if `stride` is too high
* Clarify use of `stride`
2023-04-24 09:27:49 -04:00
Yih-Dar
3f6a4b5bd7
Decorate test_codegen_sample_max_time
as flaky ( #22953 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-24 15:27:31 +02:00
fxmarty
edb6d950cb
Add an attribute to disable custom kernels in deformable detr in order to make the model ONNX exportable ( #22918 )
...
* add disable kernel option
* add comment
* fix copies
* add disable_custom_kernels to config
* Update src/transformers/models/deta/modeling_deta.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/models/deta/modeling_deta.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/models/deta/modeling_deta.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* style
* fix
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-04-24 09:27:03 -04:00
Sohyun Sim
84097f6d38
🌐 [i18n-KO] Translated tasks/summarization.mdx
to Korean ( #22783 )
...
docs: ko: tasks/summarization.mdx
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>
2023-04-24 09:03:02 -04:00
Nayeon Han
093be36f6c
🌐 [i18n-KO] Translated tasks/masked_language_modeling.mdx
to Korean ( #22838 )
...
docs: ko: `tasks/masked_language_modeling.mdx` to Korean
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-04-24 09:02:21 -04:00
Yih-Dar
975159bb61
Update tiny models and a few fixes ( #22928 )
...
* run_check_tiny_models
* update summary
* update mixin
* update pipeline_model_mapping
* update pipeline_model_mapping
* Update for gpt_bigcode
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-24 14:45:22 +02:00
Joao Gante
2fbd6df81c
Generate: Add exception path for Donut ( #22955 )
2023-04-24 13:05:55 +01:00
Arthur
df017c3ccc
[CLAP] Doc nits ( #22957 )
...
clap nits
2023-04-24 14:00:29 +02:00
Hyeonseo Yun
137eb8e663
[i18n-KO] Translated accelerate.mdx
to Korean ( #22830 )
...
* docs: ko: init: accelerate.mdx
* docs: ko: translated: accelerate.mdx
* docs: ko: revised: natural expression accelerate.mdx
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
* docs: ko: revised: natural expression2 accelerate.mdx
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
---------
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
2023-04-24 07:49:05 -04:00
NielsRogge
3d3204c025
Add FocalNet ( #21532 )
...
Adds FocalNet by Microsoft to transformers
---------
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: alaradirik <alaradirik@gmail.com>
2023-04-23 20:03:05 +03:00
SUSHMANTH REDDY
d04ec99bec
vilt_model ( #22930 )
2023-04-21 20:01:25 -04:00
hamid mohammadi
4d10de55b4
Feature to convert videomae huge and small finetuned on kinetics and ssv2 added to the videomae to pytorch converter ( #22788 )
...
* Feature to convert videomae huge finetuned kinetics and videomae small finetuned kinetics and ssv2 added to videomae to pytorch converter
* Reformat convert_videomae_to_pytorch using black
* Value exception added for the possible videomae model architectures
2023-04-21 16:13:06 -04:00
Arthur
7579a52b55
Small sam patch ( #22920 )
...
* patch
* add test
* move tests
* cover more cases (will fail nw update the code)
* style
* fix
* Update src/transformers/models/sam/image_processing_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/models/sam/image_processing_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add better check
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
2023-04-21 21:41:18 +02:00
Yih-Dar
5166c30e29
Fix a minor bug in CI slack report ( #22906 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-21 20:36:35 +02:00
Connor Henderson
b950c38565
tests: Fix flaky test for NLLB-MoE ( #22880 )
...
* add test update and docs edits
* docs edit suggestion
2023-04-21 17:09:40 +01:00
Wing Lian
d00997e66c
ddp fixes for training ( #22874 )
...
ddp fixes for stable lm training
2023-04-21 11:42:02 -04:00
Arthur
eddf9eeca0
[CI] clap patch fusion test values ( #22922 )
...
* patch test with values
* lower tol
2023-04-21 11:22:07 -04:00
Matt
5600e6f3ba
Hardcode GELU as the intermediate activation for ESM ( #22892 )
...
* Hardcode GELU as the intermediate activation for ESM
* Sneak a quick fix to the weight tying in too
* Make the call to gelu explicit
2023-04-21 16:10:10 +01:00
Roy Hvaara
874c7caf19
Remove broken test_data symlink in legacy s2s examples ( #22876 )
2023-04-21 15:35:42 +01:00
SeongBeomLEE
587a19c725
fix: GPTNeoX half inference error ( #22888 )
...
* fix: half inference error
norm_factor is still torch.float32 after using model.half
So I changed it to register_buffer so I can change it to torch.float16 after using model.half
* fix: Added a variable "persistent=False"
* run make style
2023-04-21 10:23:53 -04:00
fxmarty
3d852da2db
Expose AutoModelForMaskGeneration ( #22910 )
...
* expose
* style
* add dummy object
* amazed by the quality of transformers CI
2023-04-21 10:04:45 -04:00
fxmarty
75444551c0
Make sam ONNX exportable ( #22915 )
...
* fix code not exportable
* fix
* Update src/transformers/models/sam/modeling_sam.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-04-21 09:54:30 -04:00
Nathan Fradet
d03d8c720f
Fix: Seq2SeqTrainingArgs overriding to_dict for GenerationConfig json support ( #22919 )
...
* Seq2SeqTrainingArgs overriding to_dict for GenerationConfig json support
* seq2seqTrainingArgs to_dict calling super method before handling genconf
2023-04-21 09:53:24 -04:00
Yusong Wu
64ec802e50
fix bug of CLAP dataloader ( #22674 )
...
fix bug of CLAP: https://github.com/LAION-AI/CLAP/issues/62
2023-04-21 09:41:29 -04:00
Alara Dirik
3db2e40422
Update Swin MIM output class ( #22893 )
...
Updates Swin MIM output class to match other masked image modeling outputs
2023-04-21 16:38:32 +03:00
Yih-Dar
1e1cb6f8e5
Fix FillMaskPipelineTests
( #22894 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-21 15:16:45 +02:00
Lei Li
9fdf158aa0
Add inputs_embeds functionality when generating with GPT-Neox ( #22916 )
...
* support gpt neox generate with inputs embeds
* Update src/transformers/models/gpt_neox/modeling_gpt_neox.py
great thx for the suggestion!
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
---------
Co-authored-by: Lei Li <tobiaslee@qq.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2023-04-21 12:51:28 +01:00
Matthijs Hollemans
ec93b895c1
fix CLAP integration tests ( #22834 )
...
* integration tests were not being run
* add tests for short input waveform
* rewrite test for long input
* even more betterer
* my bad
* oh boy
2023-04-21 11:04:15 +01:00
Yih-Dar
3080fb714f
Fix Slack report for Nightly CI and Past CI ( #22901 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-21 11:23:16 +02:00
Yih-Dar
435abb22cb
Fix counting in Slack report for some jobs ( #22913 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-21 11:22:23 +02:00
SUSHMANTH REDDY
aab14120d4
Moved labels to enable parallelism pipeline in Luke model ( #22909 )
2023-04-21 10:19:15 +01:00