* First draft
* Update self-attention of RoBERTa as a proposal
* Improve conversion script
* Add TrOCR decoder-only model
* More improvements
* Make forward pass with pretrained weights work
* More improvements
* Some more improvements
* More improvements
* Make conversion work
* Clean up print statements
* Add documentation, processor
* Add test files
* Small improvements
* Some more improvements
* Make fix-copies, improve docs
* Make all vision encoder decoder model tests pass
* Make conversion script support other models
* Update URL for OCR image
* Update conversion script
* Fix style & quality
* Add support for the large-printed model
* Fix some issues
* Add print statement for debugging
* Add print statements for debugging
* Add possible fix for sinusoidal embedding
* Further debugging
* Potential fix v2
* Add more print statements for debugging
* Add more print statements for debugging
* Debug more
* Comment out print statements
* Make conversion of large printed model possible, address review comments
* Make it possible to convert the stage1 checkpoints
* Clean up code, apply suggestions from code review
* Apply suggestions from code review, use Microsoft models in tests
* Rename encoder_hidden_size to cross_attention_hidden_size
* Improve docs
* Add cross attentions to TFGPT2Model
* Add TFEncoderDecoderModel
* Add TFBaseModelOutputWithPoolingAndCrossAttentions
* Add cross attentions to TFBertModel
* Fix past or past_key_values argument issue
* Fix generation
* Fix save and load
* Add some checks and comments
* Clean the code that deals with past keys/values
* Add kwargs to processing_inputs
* Add serving_output to TFEncoderDecoderModel
* Some cleaning + fix use_cache value issue
* Fix tests + add bert2bert/bert2gpt2 tests
* Fix more tests
* Ignore crossattention.bias when loading GPT2 weights into TFGPT2
* Fix return_dict_in_generate in tf generation
* Fix is_token_logit_eos_token bug in tf generation
* Finalize the tests after fixing some bugs
* Fix another is_token_logit_eos_token bug in tf generation
* Add/Update docs
* Add TFBertEncoderDecoderModelTest
* Clean test script
* Add TFEncoderDecoderModel to the library
* Add cross attentions to TFRobertaModel
* Add TFRobertaEncoderDecoderModelTest
* make style
* Change the way of position_ids computation
* bug fix
* Fix copies in tf_albert
* Remove some copied from and apply some fix-copies
* Remove some copied-from comments
* Add cross attentions to some other TF models
* Remove encoder_hidden_states from TFLayoutLMModel.call for now
* Make style
* Fix TFRemBertForCausalLM
* Revert the change to longformer + Remove copies
* Revert the change to albert and convbert + Remove copies
* make quality
* make style
* Add TFRembertEncoderDecoderModelTest
* make quality and fix-copies
* test TFRobertaForCausalLM
* Fixes for failed tests
* Fixes for failed tests
* fix more tests
* Fixes for failed tests
* Fix Auto mapping order
* Fix TFRemBertEncoder return value
* fix tf_rembert
* Check copies are OK
* Fix "TFBaseModelOutputWithPastAndCrossAttentions is not defined" error
* Add TFEncoderDecoderModelSaveLoadTests
* fix tf weight loading
* check the change of use_cache
* Revert the change
* Add missing test_for_causal_lm for TFRobertaModelTest
* Try cleaning past
* fix _reorder_cache
* Revert some files to original versions
* Keep as many copies as possible
* Apply suggested changes - Use raise ValueError instead of assert
* Move import to top
* Fix wrong require_torch
* Replace more assert by raise ValueError
* Add test_pt_tf_model_equivalence (the test won't pass for now)
* add test for loading/saving
* finish
* finish
* Remove test_pt_tf_model_equivalence
* Update tf modeling template
* Remove pooling, added in the prev. commit, from MainLayer
* Update tf modeling test template
* Move inputs["use_cache"] = False to modeling_tf_utils.py
* Fix torch.Tensor in the comment
* fix use_cache
* Fix missing use_cache in ElectraConfig
* Add a note to from_pretrained
* Fix style
* Change test_encoder_decoder_save_load_from_encoder_decoder_from_pt
* Fix TFMLP (in TFGPT2) activation issue
* Fix None past_key_values value in serving_output
* Don't call get_encoderdecoder_model in TFEncoderDecoderModelTest.test_configuration_tie until we have a TF checkpoint on Hub
* Apply review suggestions - style for cross_attns in serving_output
* Apply review suggestions - change assert + docstrings
* break the error message to respect the char limit
* deprecate the argument past
* fix docstring style
* Update the encoder-decoder rst file
* fix Unknown interpreted text role "method"
* fix typo
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* adapt wav2vec2
* add example
* add files
* adapt
* remove bogus file
* Apply suggestions from code review
* adapt files more
* upload changes
* del old files
* up
* up
* up
* up
* up
* correct gradient checkpointing
* add readme
* finish
* finish
* up
* more fixes
* up
* up
* add demo run to readme
* up
* Replace all assert by ValueError in src/transformers/models/electra
* Reformat with black to pass check_code_quality test
* Change some asserts to ValueError in modeling_bert & modeling_tf_albert
* Change some asserts in multiple models
* Change assertions in multiple models to ValueError in order to pass the check_code_style test and models template test
* Black reformat
* Change some more asserts in multiple models
* Change assert to ValueError in modeling_layoutlm.py to fix copy error in code_style_check
* Add proper message to ValueError in modeling_tf_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Simplify logic in models/bert/modeling_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Add ValueError message to models/convbert/modeling_tf_convbert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Add error message for ValueError to modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Simplify logic in models/tapas/modeling_tapas.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Simplify logic in models/electra/modeling_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Add ValueError message in src/transformers/models/bert/modeling_tf_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Simplify logic in src/transformers/models/rembert/modeling_rembert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Simplify logic in src/transformers/models/albert/modeling_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* #12789 Replace assert statements with exceptions
* fix-copies: made copy changes to utils_qa.py in examples/pytorch/question-answering and examples/tensorflow/question-answering
* minor refactor for clarity