transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

Author	SHA1	Message	Date
Julien Chaumond	eed31db948	[hf_api] delete deprecated methods and tests (#10159 ) * [hf_api] delete deprecated methods and tests cc @lhoestq * Update test_hf_api.py	2021-02-12 15:35:06 -05:00
Patrick von Platen	495c157d6f	[Wav2Vec2] Improve Tokenizer & Model for batched inference (#10117 ) * save intermediate * finish batch the same as fairseq * add normalization * fix batched input * add better comment * Update src/transformers/models/wav2vec2/modeling_wav2vec2.py * add nice docstring * add tokenizer tests * make all slow tests pass * finish PR * correct import	2021-02-11 15:40:54 +03:00
Suraj Patil	c130e67dce	remove adjust_logits_during_generation method (#10087 ) * add forced logits processors * delete adjust_logits method * add forced_eos_token_id argument in config * add tests for forced logits processors * update gen utils tests * add forced option to tf generate * remove adjust_logits method from tf models * update adjust_logits for marian * delete _force_token_id_to_be_generated method * style * import warnings * pass max_length to _get_logits_processor * set forced_eos_token_id to None * set forced attributes in conf utils * typo * fix rag generate * add forced_eos_token_id in rag config * remove force_bos_token_to_be_generated from BartConfig * remove _force_token_ids_generation from FSMT * nit * fix negative constant * apply suggestions from code review	2021-02-10 22:39:09 +05:30
Julien Plu	22a32cf485	Fix TF LED/Longformer attentions computation (#10007 ) * Fix test * Remove commented test * Fix name * Apply style * Fix check copies * Remove prints * Restore boolean * Fix reshape	2021-02-10 10:58:37 -05:00
Lysandre Debut	0d8e554d42	Line endings should be LF across repo and not CRLF (#10119 )	2021-02-10 10:50:00 -05:00
abhishek thakur	480a9d6ba0	Fix TFConvBertModelIntegrationTest::test_inference_masked_lm Test (#10104 )	2021-02-09 20:22:54 +01:00
Daniel Stancl	e7381c4596	Add head_mask and decoder_head_mask to TF LED (#9988 ) * Add head masking to TF LED * Add head_mask to Longformer + one doc piece to LED * Fix integration tests	2021-02-09 11:45:18 -05:00
Patrick von Platen	b972125ced	Deprecate Wav2Vec2ForMaskedLM and add Wav2Vec2ForCTC (#10089 ) * add wav2vec2CTC and deprecate for maskedlm * remove from docs	2021-02-09 03:49:02 -05:00
sandip	263fac71a2	Integration test for electra model (#10073 )	2021-02-08 15:42:25 -05:00
demSd	3b7e612a5e	Implementing the test integration of BertGeneration (#9990 ) * claiming this issue * Integration test for BertGeneration(Encoder and Decoder) * fix code quality	2021-02-08 08:22:19 -05:00
Patrick von Platen	9e795eac88	fix bert2bert test (#10063 )	2021-02-08 16:04:28 +03:00
Julien Plu	31563e056d	Restore TF embeddings and attention layers to their previous version (#9890 ) * Refacto BERT * Restore all the concerned models * Remove print * Update template * Apply Sylvain's and Morgan's comments * Fix cast * Put the cast inside call * Remove cond in ebds * Fix funnel * Restore previous dot product (attention_scores) computation * Add ConvBERT and BART * Make all the S2S models ONNX compliant * Fix test * Fix check copies	2021-02-08 14:36:30 +03:00
Julien Plu	8bb52bd240	Disable temporarily too slow tests (Longformer/LED) (#10062 ) * Disable temporarily too slow tests * Fix style * Fix template	2021-02-08 12:32:31 +01:00
Nicolas Patry	b1aa4982cd	Cleaning up `ConversationalPipeline` to support more than DialoGPT. (#10002 ) * Cleaning up `ConversationalPipeline` to support more than DialoGPT. Currently ConversationalPipeline was heavily biased towards DialoGPT ,which is the default model for this pipeline. This PR proposes changes to put back the modifications specific to DialoGPT into tokenizer-specific behavior wherever possible, by creating `_build_conversation_input_ids` function that takes conversation as input, and returns a list of ints corresponding to the tokens. It feels natural to put here because all models have probably different strategies to build input_ids from the full conversation and it's the tokenizer's job to transform strings into tokens (and vice-versa) If `_build_conversation_input_ids` is missing, previous behavior is used so we don't break anything so far (except for blenderbot where it's a fix). This PR also contains a fix for too long inputs. There used to be dead code for trying to limit the size of incoming input. The introduced fixed is that we limit within `_build_conversation_input_ids` to `tokenizer.model_max_length`. It corresponds to the intent of the removed dead code and is actually better because it corresponds to `model_max_length` which is different from `max_length` (which is a default parameter for `generate`). - Removed `history` logic from the Conversation as it's not relevant anymore because tokenization logic has been moved to tokenizer. And tokenizer cannot save any cache, and conversation cannot know what is relevant or not. Also it's not usable from `blenderbot` because the input_ids are not append only (EOS tokens is always at the end). - Added `iter_texts` method on `Conversation` because all the code was literred with some form of this iteration of past/generated_responses. * Removing torch mention in types. * Adding type checking to `_build_conversation_input_ids`. * Fixing import in strings.	2021-02-08 14:29:07 +03:00
Patrick von Platen	9a0399e18d	fix bart tests (#10060 )	2021-02-08 13:25:09 +03:00
Lysandre Debut	d51302cca0	Fix slow dpr test (#10059 ) * Correct cast to device * Comment back the slow test	2021-02-08 04:43:25 -05:00
sandip	12e44af5d3	Integration test for FlauBert (#10022 )	2021-02-08 04:36:50 -05:00
Nicolas Patry	d5888ef0ab	Hotfixing tests (blenderbot decoderonly tests, also need to remove (#10003 ) `encoder_no_repeat_ngram_size` from their config.	2021-02-04 11:41:34 -05:00
Nicolas Patry	aeb18b9224	Adding new `encoder_no_repeat_ngram_size` to `generate`. (#9984 ) Adding new `encoder_no_repeat_ngram_size` to `generate`. Blenderbot results seemed off compared to original ParlAI script: `https://parl.ai/projects/recipes/`. Notably the model seems to repeat a lot what was said during the conversation. The actual problem was that `no_repeat_ngram_size` actually applies to the `encoder_input_ids` but HF's `no_repeat_ngram_size` applies to the previously generated ids (within the decoder). The history conversation of blenderbot is within the `encoder` part so that explains why HF's implementation had the repetitions. This fix was focused on blenderbot not small and added tests for those because they are quite different in configuration. This change includes: - Adding a new EncoderNoRepeatLogitProcessor. - Adding 1 new arg to `generate` (`encoder_no_repeat_ngram_size`) - Adding 1 new config parameter `encoder_no_repeat_ngram_size`. - Adding 2 tests, one for the pipeline (high level, inputs exhibited repeat behavior, one low level for EncoderNoRepeatLogitProcessor) - Factored NoRepeatLogitProcessor so that logic could be reused. Further work: - Blenderbot conversational pipeline still does not behave correctly as they way input is prepared within the pipeline is still incorrect (follow up PR) - Blenderbot allows the bot to have personas, which is done by prepending "your personna: XXXX" to the input, this could be explored too in a follow up PR. @patrickvonplaten @LysandreJik * Update src/transformers/generation_logits_process.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/configuration_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Doc quality. * Fixing test. * Last fixes. * Fixing to account for batch_size. * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/generation_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-02-04 15:00:18 +01:00
Daniel Hug	804cd185d8	Added Integration testing for DistilBert model from issue #9948 ' (#9995 )	2021-02-04 04:24:59 -05:00
demSd	00031785a8	BartForCausalLM analogs to `ProphetNetForCausalLM` (#9128 ) * initiliaze bart4causalLM * create BartDecoderWrapper, setters/getters * delete spaces * forward and additional methods * update cache function, loss function, remove ngram* params in data class. * add bartcausallm, bartdecoder testing * correct bart for causal lm * remove at * add mbart as well * up * fix typo * up * correct * add pegasusforcausallm * add blenderbotforcausallm * add blenderbotsmallforcausallm * add marianforcausallm * add test for MarianForCausalLM * add Pegasus test * add BlenderbotSmall test * add blenderbot test * fix a fail * fix an import fail * a fix * fix * Update modeling_pegasus.py * fix models * fix inputs_embeds setting getter * adapt tests * correct repo utils check * finish test improvement * fix tf models as well * make style * make fix-copies * fix copies * run all tests * last changes * fix all tests Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-02-04 11:56:12 +03:00
sandip	2f06f2bcd6	Alber model integration testing added (#9980 )	2021-02-03 11:41:10 -05:00
sandip	75fd00fb25	Integration test added for TF MPnet (#9979 )	2021-02-03 11:39:40 -05:00
sandip	ce08043f7a	Integration test for mobilebert (#9978 )	2021-02-03 11:36:45 -05:00
sandip	1486205d23	TF DistilBERT integration tests (#9975 ) * TF DistilBERT integration test * Update test_modeling_tf_distilbert.py	2021-02-03 09:51:00 -05:00
sandip	f2d5c04e1f	Added integration tests for TensorFlow implementation of the ALBERT model (#9976 ) * TF Albert integration test * TF Alber integration test added	2021-02-03 09:49:18 -05:00
Julien Plu	3f77c26d74	Fix Longformer and LED (#9942 ) * Fix Longformer and LED * Add a test for graph execution with inputs_embeds * Apply style	2021-02-03 12:26:32 +01:00
Daniel Stancl	71bdc076dd	Add head_mask and decoder_head_mask to PyTorch LED (#9856 ) * Add {decoder_,}head_mask to LED * Fix create_custom_forward signatue in encoder * Add head_mask to longformer * Add head_mask to longformer to fix dependencies of LED on Longformer. * Not working yet * Add mising one input in longofrmer_modeling.py * make fix-copies	2021-02-02 11:06:52 -08:00
Patrick von Platen	d6217fb30c	Wav2Vec2 (#9659 ) * add raw scaffold * implement feat extract layers * make style * remove + * correctly convert weights * make feat extractor work * make feature extraction proj work * run forward pass * finish forward pass * Succesful decoding example * remove unused files * more changes * add wav2vec tokenizer * add new structure * fix run forward * add other layer norm architecture * finish 2nd structure * add model tests * finish tests for tok and model * clean-up * make style * finish docstring for model and config * make style * correct docstring * correct tests * change checkpoints to fairseq * fix examples * finish wav2vec2 * make style * apply sylvains suggestions * apply lysandres suggestions * change print to log.info * re-add assert statement * add input_values as required input name * finish wav2vec2 tokenizer * Update tests/test_tokenization_wav2vec2.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * apply sylvains suggestions Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-02-02 15:52:10 +03:00
Lysandre Debut	1809de5165	ALBERT Tokenizer integration test (#9943 ) * ALBERT Tokenizer integration test * Batching * Style	2021-02-02 04:39:33 -05:00
Patrick von Platen	538b3b4607	[Tokenizer Utils Base] Make pad function more flexible (#9928 ) * change tokenizer requirement * split line * Correct typo from list to str * improve style * make other function pretty as well * add comment * correct typo * add new test * pass tests for tok without padding token * Apply suggestions from code review	2021-02-02 10:35:27 +03:00
Daniel Stancl	0c6c0afc0e	Add head_mask and decoder_head_mask to FSMT (#9819 ) * Add {decoder_,}head_mask to fsmt_modeling.py * Enable test_headmasking and some changes to docs * Remove test_head_masking flag from fsmt test file Remove test_head_masking flag from test_modeling_fsmt.py since test_head_masking is set to be True by default (thus it is redundant to store). * Merge master and remove test_head_masking = True * Rebase necessary due to an update of jaxlib * Remove test_head_masking=True in tests/test_modeling_fsmt.py as it is redundant.	2021-02-01 09:30:21 +03:00
Julien Plu	fdcde144d8	Add XLA test (#9848 )	2021-01-29 11:25:03 +01:00
Nicolas Patry	c2d0ffec8c	Adding a new `return_full_text` parameter to TextGenerationPipeline. (#9852 ) * Adding a new `return_full_text` parameter to TextGenerationPipeline. For text-generation, it's sometimes used as prompting text. In that context, prefixing `generated_text` with the actual input forces the caller to take an extra step to remove it. The proposed change adds a new parameter (for backward compatibility). `return_full_text` that enables the caller to prevent adding the prefix. * Doc quality.	2021-01-29 10:27:32 +01:00
Daniel Stancl	4c3ae89ad3	Remove redundant `test_head_masking = True` flags in test files (#9858 ) * Remove redundant test_head_masking = True flags * Remove all redundant test_head_masking flags in PyTorch test_modeling_* files * Make test_head_masking = True as a default choice in test_modeling_tf_commong.py * Remove all redundant test_head_masking flags in TensorFlow test_modeling_tf_* files * Put back test_head_masking=False fot TFT5 models	2021-01-28 10:09:13 -05:00
Sylvain Gugger	b4e559cfa1	Deprecate model_path in Trainer.train (#9854 )	2021-01-28 08:32:46 -05:00
Nicolas Patry	b936582f71	Fixing flaky conversational test + flag it as a pipeline test. (#9837 )	2021-01-28 10:19:55 +01:00
Stefan Schweter	5ed5a54684	ADD BORT (#9813 ) * tests: add integration tests for new Bort model * bort: add conversion script from Gluonnlp to Transformers 🚀 * bort: minor cleanup (BORT -> Bort) * add docs * make fix-copies * clean doc a bit * correct docs * Update docs/source/model_doc/bort.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/bort.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * correct dialogpt doc * correct link * Update docs/source/model_doc/bort.rst * Update docs/source/model_doc/dialogpt.rst Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-01-27 21:25:11 +03:00
Stas Bekman	7c6d63298f	[traner] fix --lr_scheduler_type choices (#9800 ) * fix --lr_scheduler_type choices * rewrite to fix for all enum-based cl args * cleanup * adjust test * style * Proposal that should work * Remove needless code * Fix test Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-01-27 10:12:15 -05:00
Sylvain Gugger	893120facc	Allow --arg Value for booleans in HfArgumentParser (#9823 ) * Allow --arg Value for booleans in HfArgumentParser * Update last test * Better error message	2021-01-27 09:31:42 -05:00
Sylvain Gugger	35d55b7b84	When resuming training from checkpoint, Trainer loads model (#9818 ) * Whenresuming training from checkpoint, Trainer loads model * Finish cleaning tests * Address review comment * Use global_step from state	2021-01-27 09:31:18 -05:00
Nicolas Patry	285c6262a8	Adding a test to prevent late failure in the Table question answering (#9808 ) pipeline. - If table is empty then the line that contain `answer[0]` will fail. - This PR add a check to prevent `answer[0]`. - Also adds an early check for presence of `table` and `query` to prevent late failure and give better error message. - Adds a few tests to make sure these errors are correctly raised.	2021-01-27 04:10:53 -05:00
Julien Plu	2c891c156d	Add a test for mixed precision (#9806 )	2021-01-27 03:36:49 -05:00
abhishek thakur	f617490e71	ConvBERT Model (#9717 ) * finalize convbert * finalize convbert * fix * fix * fix * push * fix * tf image patches * fix torch model * tf tests * conversion * everything aligned * remove print * tf tests * fix tf * make tf tests pass * everything works * fix init * fix * special treatment for sepconv1d * style * 🙏🏽 * add doc and cleanup * add electra test again * fix doc * fix doc again * fix doc again * Update src/transformers/modeling_tf_pytorch_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/conv_bert/configuration_conv_bert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update docs/source/model_doc/conv_bert.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/conv_bert/configuration_conv_bert.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * conv_bert -> convbert * more fixes from review * add conversion script * dont use pretrained embed * unused config * suggestions from julien * some more fixes * p -> param * fix copyright * fix doc * Update src/transformers/models/convbert/configuration_convbert.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * comments from reviews * fix-copies * fix style * revert shape_list Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-01-27 03:20:09 -05:00
Nicolas Patry	781e4b1384	Adding `skip_special_tokens=True` to FillMaskPipeline (#9783 ) * We most likely don't want special tokens in this output. * Adding `skip_special_tokens=True` to FillMaskPipeline - It's backward incompatible. - It makes for sense for pipelines to remove references to special_tokens (all of the other pipelines do that). - Keeping special tokens makes it hard for users to actually remove them because all models have different tokens (<s>, <cls>, [CLS], ....) * Fixing `token_str` in the same vein, and actually fix the tests too !	2021-01-26 10:06:28 +01:00
Daniel Stancl	1867d9a8d7	Add head_mask/decoder_head_mask for TF BART models (#9639 ) * Add head_mask/decoder_head_mask for TF BART models * Add head_mask and decoder_head_mask input arguments for TF BART-based models as a TF counterpart to the PR #9569 * Add test_headmasking functionality to tests/test_modeling_tf_common.py * TODO: Add a test to verify that we can get a gradient back for importance score computation * Remove redundant #TODO note Remove redundant #TODO note from tests/test_modeling_tf_common.py * Fix assertions * Make style * Fix ...Model input args and adjust one new test * Add back head_mask and decoder_head_mask to BART-based ...Model after the last commit * Remove head_mask ande decoder_head_mask from input_dict in TF test_train_pipeline_custom_model as these two have different shape than other input args (Necessary for passing this test) * Revert adding global_rng in test_modeling_tf_common.py	2021-01-26 03:50:00 -05:00
Patrick von Platen	d94cc2f904	[Flaky Generation Tests] Make sure that no early stopping is happening for beam search (#9794 ) * fix ci * fix ci * renaming * fix dup line	2021-01-26 03:21:44 -05:00
Stas Bekman	fac7cfb16a	[fsmt] onnx triu workaround (#9738 ) * onnx triu workaround * style * working this time * add test * more efficient version	2021-01-25 08:57:37 -05:00
Julien Plu	a449ffcbd2	Fix test (#9755 )	2021-01-22 17:40:16 +01:00
Julien Plu	d7c31abf38	Fix some TF slow tests (#9728 ) * Fix saved model tests + fix a graph issue in longformer * Apply style	2021-01-22 14:50:46 +01:00

1 2 3 4 5 ...

832 Commits