transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-18 03:58:25 +06:00

Author	SHA1	Message	Date
Daniel Levenson	4e1522d65a	Fix typo in mega.mdx (#22998 ) MegaConfiig -> MegaConfig	2023-04-25 17:58:45 -04:00
Wonhyeong Seo	d95045717e	🌐 [i18n-KO] Translated `serialization.mdx` to Korean (#22806 ) docs: ko: serialization.mdx Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-04-25 12:38:51 -04:00
Younes Belkada	a0ae2310ec	[`DocTest`] Fix correct checkpoint (#22988 ) fix pipeline issue	2023-04-25 15:20:36 +02:00
Lingepumpe	5427250351	Avoid invalid escape sequences, use raw strings (#22936 ) * Avoid invalid escape sequences, use raw strings * Integrate PR feedback	2023-04-25 09:17:56 -04:00
Jari Van Melckebeke	81c1910c86	fixed small typo in code example (#22982 ) fixed typo in code example fixed a really small typo in the docs of single gpu inference	2023-04-25 08:56:21 -04:00
AleksanderWWW	0a570dbd2e	Neptune fix bug init run (#22836 ) * [neptune] fix checkpoint bug with relative out_dir * update imports * reformat with black * check neptune without imports * fix typing-related issue * run black on code * use os.path.sep instead of raw \ * simplify imports and remove type annotation * make ruff happy * apply review suggestions * replace run with with_id kwarg to run * update imports to avoid deprecation warnings for the latest client --------- Co-authored-by: kshitij12345 <kshitijkalambarkar@gmail.com>	2023-04-25 08:51:05 -04:00
Younes Belkada	d4d628462f	[`SAM`] Add sam doc (#22984 ) * add sam doc * fixes * multiple fixes	2023-04-25 14:00:27 +02:00
Nayeon Han	f0f5e28f82	🌐 [i18n-KO] Fixed `tasks/masked_language_modeling.mdx` (#22965 ) fix: docs: missing newline before code block	2023-04-25 09:59:17 +02:00
Yih-Dar	60f9649653	Fix `DeepSpeed` CI job link in Past CI (#22967 ) * Fix job link * fix artifact name logic --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-25 09:52:19 +02:00
Yih-Dar	073baf7f22	Install `accelerete@main` in PyTorch Past CI jobs (#22963 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-24 21:19:06 +02:00
Joao Gante	e4a97f82bf	Generate: assisted generation with sample (take 2) (#22949 ) * temperature controls speed	2023-04-24 19:54:55 +01:00
Gabriel Yang	7701716efc	🌐 [i18n-KO] translate `create_a_model` doc to Korean (#22754 ) docs: ko: translates create_a_model.mdx Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-04-24 13:02:19 -04:00
amyeroberts	8f20e61c85	Update feature selection in to_tf_dataset (#21935 ) * Update feature selection * Check compatibility with datasets version * Checkout from datasets main	2023-04-24 17:34:30 +01:00
Matt	345a1371d8	Fix TF example in quicktour (#22960 ) * Fix TF example in quicktour * Fix model.fit() and the dataset section too	2023-04-24 17:25:13 +01:00
othertea	503e8c8b32	fix ValueError message in LlamaAttention (#22966 )	2023-04-24 12:02:05 -04:00
Nicolas Patry	6e32959329	Reverting Deta cloning mecanism. (#22656 ) * Fixed the revert by making sure that even the regexp can cover all duplicates. * Code simplification using hash. * Fixing the `ident`. * Fixing ignoring patterened duplicate names. * Using `accelerate@find_tied_parameters` for from_pretrained This is more correct there, since it handles meta device seemlessly and we don't need to handle "non-duplicate" tensors (slices of each other). * Protecting accelerate. * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-24 11:24:35 -04:00
Nayeon Han	d6f1da6b71	🌐 [i18n-KO] Translated `run_scripts.mdx` to Korean (#22793 ) docs: ko: `run_scripts` to Korean Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-04-24 10:18:20 -04:00
Lucain	74c55ab9e5	Prepare tests for hfh 0.14 (#22958 ) * Test hf_hub 0.14.0rc1 * fix mocked tests * package version --------- Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com> Co-authored-by: testbot <lucainp@hf.co>	2023-04-24 09:31:50 -04:00
hanrui1sensetime	69f2d5386b	[Fix Bugs] Fix keys in `_load_pretrained_model` (#22947 ) fix transformers keys	2023-04-24 09:28:51 -04:00
Connor Boyle	b5f06d6c59	Raise error if `stride` is too high in `TokenClassificationPipeline` (#22942 ) * Raise error if `stride` is too high * Clarify use of `stride`	2023-04-24 09:27:49 -04:00
Yih-Dar	3f6a4b5bd7	Decorate `test_codegen_sample_max_time` as flaky (#22953 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-24 15:27:31 +02:00
fxmarty	edb6d950cb	Add an attribute to disable custom kernels in deformable detr in order to make the model ONNX exportable (#22918 ) * add disable kernel option * add comment * fix copies * add disable_custom_kernels to config * Update src/transformers/models/deta/modeling_deta.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/deta/modeling_deta.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/deta/modeling_deta.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * style * fix --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-04-24 09:27:03 -04:00
Sohyun Sim	84097f6d38	🌐 [i18n-KO] Translated `tasks/summarization.mdx` to Korean (#22783 ) docs: ko: tasks/summarization.mdx Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>	2023-04-24 09:03:02 -04:00
Nayeon Han	093be36f6c	🌐 [i18n-KO] Translated `tasks/masked_language_modeling.mdx` to Korean (#22838 ) docs: ko: `tasks/masked_language_modeling.mdx` to Korean Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-04-24 09:02:21 -04:00
Yih-Dar	975159bb61	Update tiny models and a few fixes (#22928 ) * run_check_tiny_models * update summary * update mixin * update pipeline_model_mapping * update pipeline_model_mapping * Update for gpt_bigcode --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-24 14:45:22 +02:00
Joao Gante	2fbd6df81c	Generate: Add exception path for Donut (#22955 )	2023-04-24 13:05:55 +01:00
Arthur	df017c3ccc	[CLAP] Doc nits (#22957 ) clap nits	2023-04-24 14:00:29 +02:00
Hyeonseo Yun	137eb8e663	[i18n-KO] Translated `accelerate.mdx` to Korean (#22830 ) * docs: ko: init: accelerate.mdx * docs: ko: translated: accelerate.mdx * docs: ko: revised: natural expression accelerate.mdx Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com> * docs: ko: revised: natural expression2 accelerate.mdx Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> --------- Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-04-24 07:49:05 -04:00
NielsRogge	3d3204c025	Add FocalNet (#21532 ) Adds FocalNet by Microsoft to transformers --------- Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: alaradirik <alaradirik@gmail.com>	2023-04-23 20:03:05 +03:00
SUSHMANTH REDDY	d04ec99bec	vilt_model (#22930 )	2023-04-21 20:01:25 -04:00
hamid mohammadi	4d10de55b4	Feature to convert videomae huge and small finetuned on kinetics and ssv2 added to the videomae to pytorch converter (#22788 ) * Feature to convert videomae huge finetuned kinetics and videomae small finetuned kinetics and ssv2 added to videomae to pytorch converter * Reformat convert_videomae_to_pytorch using black * Value exception added for the possible videomae model architectures	2023-04-21 16:13:06 -04:00
Arthur	7579a52b55	Small sam patch (#22920 ) * patch * add test * move tests * cover more cases (will fail nw update the code) * style * fix * Update src/transformers/models/sam/image_processing_sam.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/sam/image_processing_sam.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add better check --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-04-21 21:41:18 +02:00
Yih-Dar	5166c30e29	Fix a minor bug in CI slack report (#22906 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-21 20:36:35 +02:00
Connor Henderson	b950c38565	tests: Fix flaky test for NLLB-MoE (#22880 ) * add test update and docs edits * docs edit suggestion	2023-04-21 17:09:40 +01:00
Wing Lian	d00997e66c	ddp fixes for training (#22874 ) ddp fixes for stable lm training	2023-04-21 11:42:02 -04:00
Arthur	eddf9eeca0	[CI] clap patch fusion test values (#22922 ) * patch test with values * lower tol	2023-04-21 11:22:07 -04:00
Matt	5600e6f3ba	Hardcode GELU as the intermediate activation for ESM (#22892 ) * Hardcode GELU as the intermediate activation for ESM * Sneak a quick fix to the weight tying in too * Make the call to gelu explicit	2023-04-21 16:10:10 +01:00
Roy Hvaara	874c7caf19	Remove broken test_data symlink in legacy s2s examples (#22876 )	2023-04-21 15:35:42 +01:00
SeongBeomLEE	587a19c725	fix: GPTNeoX half inference error (#22888 ) * fix: half inference error norm_factor is still torch.float32 after using model.half So I changed it to register_buffer so I can change it to torch.float16 after using model.half * fix: Added a variable "persistent=False" * run make style	2023-04-21 10:23:53 -04:00
fxmarty	3d852da2db	Expose AutoModelForMaskGeneration (#22910 ) * expose * style * add dummy object * amazed by the quality of transformers CI	2023-04-21 10:04:45 -04:00
fxmarty	75444551c0	Make sam ONNX exportable (#22915 ) * fix code not exportable * fix * Update src/transformers/models/sam/modeling_sam.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-21 09:54:30 -04:00
Nathan Fradet	d03d8c720f	Fix: Seq2SeqTrainingArgs overriding to_dict for GenerationConfig json support (#22919 ) * Seq2SeqTrainingArgs overriding to_dict for GenerationConfig json support * seq2seqTrainingArgs to_dict calling super method before handling genconf	2023-04-21 09:53:24 -04:00
Yusong Wu	64ec802e50	fix bug of CLAP dataloader (#22674 ) fix bug of CLAP: https://github.com/LAION-AI/CLAP/issues/62	2023-04-21 09:41:29 -04:00
Alara Dirik	3db2e40422	Update Swin MIM output class (#22893 ) Updates Swin MIM output class to match other masked image modeling outputs	2023-04-21 16:38:32 +03:00
Yih-Dar	1e1cb6f8e5	Fix `FillMaskPipelineTests` (#22894 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-21 15:16:45 +02:00
Lei Li	9fdf158aa0	Add inputs_embeds functionality when generating with GPT-Neox (#22916 ) * support gpt neox generate with inputs embeds * Update src/transformers/models/gpt_neox/modeling_gpt_neox.py great thx for the suggestion! Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> --------- Co-authored-by: Lei Li <tobiaslee@qq.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2023-04-21 12:51:28 +01:00
Matthijs Hollemans	ec93b895c1	fix CLAP integration tests (#22834 ) * integration tests were not being run * add tests for short input waveform * rewrite test for long input * even more betterer * my bad * oh boy	2023-04-21 11:04:15 +01:00
Yih-Dar	3080fb714f	Fix Slack report for Nightly CI and Past CI (#22901 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-21 11:23:16 +02:00
Yih-Dar	435abb22cb	Fix counting in Slack report for some jobs (#22913 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-21 11:22:23 +02:00
SUSHMANTH REDDY	aab14120d4	Moved labels to enable parallelism pipeline in Luke model (#22909 )	2023-04-21 10:19:15 +01:00

... 46 47 48 49 50 ...

15053 Commits