transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Zachary Mueller	febe42b5da	Update no_trainer scripts with new Accelerate functionalities (#16617 ) Adds logging and save/loading to the Accelerate scripts Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-04-06 15:29:32 -04:00
Sylvain Gugger	10c15d2d1e	Allow the same config in the auto mapping (#16631 )	2022-04-06 14:21:15 -04:00
Anmol Joshi	8ac9b82724	Added Annotations for PyTorch models (#16619 ) * Update modeling_mpnet.py * Update modeling_ctrl.py * formatting * Formatting * Formatting * annotated FSMT * Added annotations for LED * Added Annotations for M2M * Added annotations for nystromformer * Added annotations for OpenAI * Added annotations for RAG * Removed unused imports * fix isort errors * Removed inputs_embeds docstring, corrected original * flake8 fixes * doc-builder fixes	2022-04-06 14:12:01 -04:00
Joao Gante	3f43d824b9	TF generate refactor - Beam Search (#16374 ) * refactor TF beam search * refactored generate can now properly use attention masks * add force bos/eos logit processors	2022-04-06 18:19:34 +01:00
Stas Bekman	4d10083539	[modeling_utils] rearrange text (#16632 )	2022-04-06 09:35:42 -07:00
Lysandre Debut	a180efe7fd	Dev version	2022-04-06 11:08:12 -04:00
Sylvain Gugger	b9bf91a970	Revert "Allow the same config in the auto mapping" This reverts commit `b1a7dfe099`.	2022-04-06 09:58:13 -04:00
Sylvain Gugger	b1a7dfe099	Allow the same config in the auto mapping	2022-04-06 09:57:47 -04:00
Yih-Dar	2aef4cfe58	Fix TFTransfoXLLMHeadModel outputs (#16590 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-06 15:42:15 +02:00
Sanchit Gandhi	8d57c424e0	[FlaxSpeechEncoderDecoderModel] More Rigorous PT-Flax Equivalence Tests (#16589 )	2022-04-06 15:33:32 +02:00
Patrick von Platen	c65633156b	[Speech2Text Doc] Fix docs (#16611 ) * [Speech2Text Doc] Fix docs * apply ydshiehs suggestions	2022-04-06 14:19:00 +02:00
Stas Bekman	fb3d0df454	typo (#16621 )	2022-04-06 07:28:17 -04:00
Yih-Dar	ae6a7a763b	Use CLIP model config to set some kwargs for components (#16609 ) * Use CLIP model's config for some fields (if specified) instead of those of vision & text components. Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-06 12:15:09 +02:00
Suraj Patil	47c5c05932	don't load state_dict twice when using low_cpu_mem_usage in from_pretrained (#16602 )	2022-04-06 11:43:02 +02:00
Suraj Patil	a2b7d19bd7	Fix seq2seq doc tests (#16606 ) * fix bart and mbart * add ckpt names as variables * fix mbart * fix plbart * use varibale for ckot name	2022-04-06 11:32:39 +02:00
Patrick von Platen	0bf18643f4	[Minds14] Correct quicktour (#16626 )	2022-04-06 11:27:11 +02:00
Jun	d55fcbcc50	fix default num_attention_heads in segformer doc (#16612 )	2022-04-06 09:51:58 +02:00
Anmol Joshi	b18dfd95e1	added type hints to CTRL pytorch (#16593 ) * Completed documentation of CTRL * Missing optional None * Added return types * updated imports * Update modeling_ctrl.py	2022-04-05 16:55:01 -04:00
Sylvain Gugger	208f4c109a	Quality	2022-04-05 14:12:01 -04:00
Steven Liu	f553c3ce4c	Update summary of the tasks (#16528 ) * 📝 add image/vision classification and asr * 🖍 minor formatting fixes * Fixed a typo in legacy seq2seq_trainer.py (#16531) * Add ONNX export for BeiT (#16498) * Add beit onnx conversion support * Updated docs * Added cross reference to ViT ONNX config * call on_train_end when trial is pruned (#16536) * Type hints added (#16529) * Fix Bart type hints (#16297) * Add type hints to PLBart PyTorch * Remove pending merge conflicts * Fix PLBart Type Hints * Add changes from review * Add VisualBert type hints (#16544) * Adding missing type hints for mBART model (PyTorch) (#16429) * added type hints for mbart tensorflow tf implementation * Adding missing type hints for mBART model Tensorflow Implementation model added with missing type hints * Missing Type hints - correction For TF model * Code fixup using make quality tests * Hint types - typo error * make fix-copies and make fixup * type hints * updated files * type hints update * making dependent modesls coherent Co-authored-by: matt <rocketknight1@gmail.com> * Remove MBart subclass of XLMRoberta in tokenzier docs (#16546) * Remove MBart subclass of XLMRoberta in tokenzier * Fix style * Copy docs from MBart50 tokenizer * Use random_attention_mask for TF tests (#16517) * use random_attention_mask for TF tests * Fix for TFCLIP test (for now). Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Improve code example (#16450) Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home> * Pin tokenizers version <0.13 (#16539) * Pin tokenizers version <0.13 * Style * Add code samples for TF speech models (#16494) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * [FlaxSpeechEncoderDecoder] Fix dtype bug (#16581) * [FlaxSpeechEncoderDecoder] Fix dtype bug * more fixes * Making the impossible to connect error actually report the right URL. (#16446) * Fix flax import in __init__.py: modeling_xglm -> modeling_flax_xglm (#16556) * Add utility to find model labels (#16526) * Add utility to find model labels * Use it in the Trainer * Update src/transformers/utils/generic.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Quality Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Enable doc in Spanish (#16518) * Reorganize doc for multilingual support * Fix style * Style * Toc trees * Adapt templates * Add use_auth to load_datasets for private datasets to PT and TF examples (#16521) * fix formatting and remove use_auth * Add use_auth_token to Flax examples * add a test checking the format of `convert_tokens_to_string`'s output (#16540) * add new tests * add comment to overridden tests * TF: Finalize `unpack_inputs`-related changes (#16499) * Add unpack_inputs to remaining models * removed kwargs to `call()` in TF models * fix TF T5 tests * [SpeechEncoderDecoderModel] Correct Encoder Last Hidden State Output (#16586) * initialize the default rank set on TrainerState (#16530) * initialize the default rank set on TrainerState * fix style * Trigger doc build * Fix CI: test_inference_for_pretraining in ViTMAEModelTest (#16591) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * add a template to add missing tokenization test (#16553) * add a template to add missing tokenization test * add cookiecutter setting * improve doc * Update templates/adding_a_missing_tokenization_test/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * made _load_pretrained_model_low_mem static + bug fix (#16548) * handle torch_dtype in low cpu mem usage (#16580) * [Doctests] Correct filenaming (#16599) * [Doctests] Correct filenaming * improve quicktour * make style * Adding new train_step logic to make things less confusing for users (#15994) * Adding new train_step logic to make things less confusing for users * DO NOT ASK WHY WE NEED THAT SUBCLASS * Metrics now working, at least for single-output models with type annotations! * Updates and TODOs for the new train_step * Make fixup * Temporary test workaround until T5 has types * Temporary test workaround until T5 has types * I think this actually works! Needs a lot of tests though * MAke style/quality * Revert changes to T5 tests * Deleting the aforementioned unmentionable subclass * Deleting the aforementioned unmentionable subclass * Adding a Keras API test * Style fixes * Removing unneeded TODO and comments * Update test_step too * Stop trying to compute metrics with the dummy_loss, patch up test * Make style * make fixup * Docstring cleanup * make fixup * make fixup * Stop expanding 1D input tensors when using dummy loss * Adjust T5 test given the new compile() * make fixup * Skipping test for convnext * Removing old T5-specific Keras test now that we have a common one * make fixup * make fixup * Only skip convnext test on CPU * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Avoiding TF import issues * make fixup * Update compile() to support TF 2.3 * Skipping model.fit() on template classes for now * Skipping model.fit() on template class tests for now * Replace ad-hoc solution with find_labels * make fixup Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Adding missing type hints for BigBird model (#16555) * added type hints for mbart tensorflow tf implementation * Adding missing type hints for mBART model Tensorflow Implementation model added with missing type hints * Missing Type hints - correction For TF model * Code fixup using make quality tests * Hint types - typo error * make fix-copies and make fixup * type hints * updated files * type hints update * making dependent modesls coherent * Type hints for BigBird * removing typos Co-authored-by: matt <rocketknight1@gmail.com> * [deepspeed] fix typo, adjust config name (#16597) * 🖍 apply feedback Co-authored-by: Cathy <815244047@qq.com> Co-authored-by: Jim Rohrer <jrohrer1@gmail.com> Co-authored-by: Ferdinand Schlatt <fschlatt@gmail.com> Co-authored-by: Dahlbomii <101373053+Dahlbomii@users.noreply.github.com> Co-authored-by: Gunjan Chhablani <chhablani.gunjan@gmail.com> Co-authored-by: Rishav Chandra Varma <rishavchandra.v16@iiits.in> Co-authored-by: matt <rocketknight1@gmail.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Daniel Stancl <46073029+stancld@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Karim Foda <35491698+KMFODA@users.noreply.github.com> Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> Co-authored-by: Joao Gante <joao@huggingface.co> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Andres Codas <andrescodas@users.noreply.github.com> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com> Co-authored-by: Francesco Saverio Zuppichini <francesco.zuppichini@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2022-04-05 12:48:42 -05:00
Stas Bekman	23fc4cba0d	[benchmark tool] trainer-benchmark.py (#14934 ) * [benchmark tool] trainer-benchmark.py * improve * massive rework/expansion * fix * mucho improved * improved * fix prefix * fix * fix diff calculation * address suggestions	2022-04-05 10:27:29 -07:00
John Giorgi	b33ab4eb59	Add global_attention_mask to gen_kwargs (#16485 ) If global_attention_mask is found in the models inputs (used by certain models, like LED) in the prediction_step method of Seq2SeqTrainer, it is added to the gen_kwargs, which are passed to model.decode(). This allows us to properly set the global attention when decoding.	2022-04-05 13:05:27 -04:00
Stas Bekman	9fd5e6bbe6	[deepspeed] fix typo, adjust config name (#16597 )	2022-04-05 08:13:12 -07:00
Rishav Chandra Varma	367558b90d	Adding missing type hints for BigBird model (#16555 ) * added type hints for mbart tensorflow tf implementation * Adding missing type hints for mBART model Tensorflow Implementation model added with missing type hints * Missing Type hints - correction For TF model * Code fixup using make quality tests * Hint types - typo error * make fix-copies and make fixup * type hints * updated files * type hints update * making dependent modesls coherent * Type hints for BigBird * removing typos Co-authored-by: matt <rocketknight1@gmail.com>	2022-04-05 14:50:45 +01:00
Matt	4354005291	Adding new train_step logic to make things less confusing for users (#15994 ) * Adding new train_step logic to make things less confusing for users * DO NOT ASK WHY WE NEED THAT SUBCLASS * Metrics now working, at least for single-output models with type annotations! * Updates and TODOs for the new train_step * Make fixup * Temporary test workaround until T5 has types * Temporary test workaround until T5 has types * I think this actually works! Needs a lot of tests though * MAke style/quality * Revert changes to T5 tests * Deleting the aforementioned unmentionable subclass * Deleting the aforementioned unmentionable subclass * Adding a Keras API test * Style fixes * Removing unneeded TODO and comments * Update test_step too * Stop trying to compute metrics with the dummy_loss, patch up test * Make style * make fixup * Docstring cleanup * make fixup * make fixup * Stop expanding 1D input tensors when using dummy loss * Adjust T5 test given the new compile() * make fixup * Skipping test for convnext * Removing old T5-specific Keras test now that we have a common one * make fixup * make fixup * Only skip convnext test on CPU * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Avoiding TF import issues * make fixup * Update compile() to support TF 2.3 * Skipping model.fit() on template classes for now * Skipping model.fit() on template class tests for now * Replace ad-hoc solution with find_labels * make fixup Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-04-05 14:23:27 +01:00
Patrick von Platen	7ccacdf10f	[Doctests] Correct filenaming (#16599 ) * [Doctests] Correct filenaming * improve quicktour * make style	2022-04-05 14:15:02 +02:00
Suraj Patil	21decb7731	handle torch_dtype in low cpu mem usage (#16580 )	2022-04-05 12:26:03 +02:00
Francesco Saverio Zuppichini	8bf6d28c10	made _load_pretrained_model_low_mem static + bug fix (#16548 )	2022-04-05 11:56:36 +02:00
SaulLu	02214cb3cc	add a template to add missing tokenization test (#16553 ) * add a template to add missing tokenization test * add cookiecutter setting * improve doc * Update templates/adding_a_missing_tokenization_test/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-04-05 10:50:22 +02:00
Yih-Dar	765bafb8e4	Fix CI: test_inference_for_pretraining in ViTMAEModelTest (#16591 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-05 10:00:03 +02:00
Sylvain Gugger	104c065277	Trigger doc build	2022-04-04 14:06:49 -04:00
Andres Codas	1cd2e21d1b	initialize the default rank set on TrainerState (#16530 ) * initialize the default rank set on TrainerState * fix style	2022-04-04 12:20:26 -04:00
Sanchit Gandhi	6f9d8dc156	[SpeechEncoderDecoderModel] Correct Encoder Last Hidden State Output (#16586 )	2022-04-04 17:50:56 +02:00
Joao Gante	dad5ca83b2	TF: Finalize `unpack_inputs`-related changes (#16499 ) * Add unpack_inputs to remaining models * removed kwargs to `call()` in TF models * fix TF T5 tests	2022-04-04 16:37:33 +01:00
SaulLu	be9474bd35	add a test checking the format of `convert_tokens_to_string`'s output (#16540 ) * add new tests * add comment to overridden tests	2022-04-04 16:57:24 +02:00
Karim Foda	24a85cca61	Add use_auth to load_datasets for private datasets to PT and TF examples (#16521 ) * fix formatting and remove use_auth * Add use_auth_token to Flax examples	2022-04-04 10:27:45 -04:00
Sylvain Gugger	b9a768b3ff	Enable doc in Spanish (#16518 ) * Reorganize doc for multilingual support * Fix style * Style * Toc trees * Adapt templates	2022-04-04 10:25:46 -04:00
Sylvain Gugger	3951b9f390	Add utility to find model labels (#16526 ) * Add utility to find model labels * Use it in the Trainer * Update src/transformers/utils/generic.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Quality Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2022-04-04 10:06:57 -04:00
Daniel Stancl	ec4da72fe9	Fix flax import in __init__.py: modeling_xglm -> modeling_flax_xglm (#16556 )	2022-04-04 14:54:25 +02:00
Nicolas Patry	013a7dbe3d	Making the impossible to connect error actually report the right URL. (#16446 )	2022-04-04 14:26:23 +02:00
Patrick von Platen	ad0cba08ea	[FlaxSpeechEncoderDecoder] Fix dtype bug (#16581 ) * [FlaxSpeechEncoderDecoder] Fix dtype bug * more fixes	2022-04-04 13:53:54 +02:00
Yih-Dar	60d27b1f15	Add code samples for TF speech models (#16494 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-01 17:54:01 +02:00
Lysandre Debut	53a4d6b115	Pin tokenizers version <0.13 (#16539 ) * Pin tokenizers version <0.13 * Style	2022-04-01 11:53:18 -04:00
NielsRogge	61ee26a892	Improve code example (#16450 ) Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>	2022-04-01 17:19:36 +02:00
Yih-Dar	2199382dfd	Use random_attention_mask for TF tests (#16517 ) * use random_attention_mask for TF tests * Fix for TFCLIP test (for now). Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-01 16:53:07 +02:00
Gunjan Chhablani	823dbf8a41	Remove MBart subclass of XLMRoberta in tokenzier docs (#16546 ) * Remove MBart subclass of XLMRoberta in tokenzier * Fix style * Copy docs from MBart50 tokenizer	2022-04-01 16:39:28 +02:00
Rishav Chandra Varma	5fe06b9bdd	Adding missing type hints for mBART model (PyTorch) (#16429 ) * added type hints for mbart tensorflow tf implementation * Adding missing type hints for mBART model Tensorflow Implementation model added with missing type hints * Missing Type hints - correction For TF model * Code fixup using make quality tests * Hint types - typo error * make fix-copies and make fixup * type hints * updated files * type hints update * making dependent modesls coherent Co-authored-by: matt <rocketknight1@gmail.com>	2022-04-01 15:21:26 +01:00
Gunjan Chhablani	9947dd077c	Add VisualBert type hints (#16544 )	2022-04-01 15:02:58 +01:00
Gunjan Chhablani	59a9c83e40	Fix Bart type hints (#16297 ) * Add type hints to PLBart PyTorch * Remove pending merge conflicts * Fix PLBart Type Hints * Add changes from review	2022-04-01 14:50:22 +01:00
Dahlbomii	afc5a1ea3a	Type hints added (#16529 )	2022-04-01 14:27:41 +01:00

1 2 3 4 5 ...

9491 Commits