transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-19 20:48:22 +06:00

Author	SHA1	Message	Date
Julien Plu	34df26ec3a	Making TF OpenAI GPT model compliant with AMP and XLA (#10261 ) * Fix AMP and XLA * Remove useless var	2021-02-19 09:33:25 -05:00
Julien Plu	3e116ed331	Making TF TransfoXL model compliant with AMP (#10264 ) * Fix AMP * Apply style * Remove unused import	2021-02-19 06:58:07 -05:00
Julien Plu	86caeb7636	Fix XLA and AMP (#10262 )	2021-02-19 06:57:16 -05:00
Julien Plu	3d72d47f09	Making TF MPNet model compliant with XLA (#10260 ) * Fix XLA * Rework cast * Apply style	2021-02-19 06:56:41 -05:00
Julien Plu	fb56bf2584	Making TF MobileBert model compliant with AMP (#10259 ) * Fix AMP * Trigger CI * Rework cast	2021-02-19 06:55:25 -05:00
Julien Plu	2fc6284f04	Making TF Lxmert model compliant with AMP (#10257 ) * Fix AMP * Rework cast * Apply style	2021-02-19 06:54:14 -05:00
Stas Bekman	4eddc459a9	[trainer] implement support for full fp16 in evaluation/predict (#10268 ) * implement --fp16_full_eval * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * style * add test Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-02-18 17:02:35 -08:00
Stas Bekman	d9a81fc0c5	fix func signature (#10271 )	2021-02-18 16:44:42 -08:00
Stas Bekman	97e688bc22	[Trainer] memory tracker metrics (#10225 ) * memory tracker metrics * go back to eval for somewhat consistency * handle no-gpu case * deal with stackable eval calls * restore callback order * style * simplify the API * add test * docs * consistently use eval_ prefix * improve docs * Update src/transformers/trainer_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * rename method * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-02-18 09:27:32 -08:00
Julien Plu	2acae50a0c	Reduce the time spent for the TF slow tests (#10152 ) * rework savedmodel slow test * Improve savedmodel tests * Remove useless content	2021-02-18 15:52:57 +01:00
Julien Plu	14ed3b978e	Fix AMP (#10216 )	2021-02-18 06:29:43 -05:00
Julien Plu	bdf1669e3f	Making TF GPT2 compliant with XLA and AMP (#10230 ) * Fix XLA and AMP * Fix AMP and XLA * Apply style * Apply Patrick's comment	2021-02-18 09:36:01 +01:00
Julien Plu	7246785a67	Make TF CTRL compliant with XLA and AMP (#10209 ) * Fix XLA and AMP * Apply style * Remove useless cast	2021-02-17 18:54:15 +01:00
Julien Plu	fdb2351ebb	Making TF XLM-like models XLA and AMP compliant (#10211 ) * Fix Flaubert and XLM * Remove useless cast * Tiny fix * Tiny fix	2021-02-17 18:02:48 +01:00
Julien Plu	83d803ba02	Making TF BART-like models XLA and AMP compliant (#10191 ) * Update BART * Update Blenderbot * Update BlenderbotSmall * Update Marian * Update MBart * Update MBart * Update Pegasus * Update template * Fix Marian and Pegasus * Apply style * Default initializer * Default initializer * Default initializer * Remove int32 casts * Fix template * Remove more cast	2021-02-17 17:48:56 +01:00
Daniel Stancl	8d79e5ca49	Fix head masking for TFT5 (#9877 ) * Fix head_mask and decoder_head_mask in TFT5 models * Enable test_headmasking both fot TFT5 tester and TFT5EncoderOnly tester Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2021-02-17 19:00:09 +03:00
Sylvain Gugger	7169d1ea7b	Store FLOS as floats to avoid overflow. (#10213 )	2021-02-16 11:15:15 -05:00
Julien Plu	5c2d66a2f5	Unlock XLA test for convbert (#10207 )	2021-02-16 07:59:41 -05:00
Lysandre Debut	8cbd0bd137	Specify dataset dtype (#10195 ) Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com> Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>	2021-02-15 12:57:17 -05:00
Julien Plu	31b0560ab4	Add AMP for Albert (#10141 )	2021-02-15 17:18:33 +01:00
Suraj Patil	6fc940ed09	Add mBART-50 (#10154 ) * add tokenizer for mBART-50 * update tokenizers * make src_lang and tgt_lang optional * update tokenizer test * add setter * update docs * update conversion script * update docs * update conversion script * update tokenizer * update test * update docs * doc * address Sylvain's suggestions * fix test * fix formatting * nits	2021-02-15 20:58:54 +05:30
Julien Plu	c8d3fa0dfd	Check TF ops for ONNX compliance (#10025 ) * Add check-ops script * Finish to implement check_tf_ops and start the test * Make the test mandatory only for BERT * Update tf_ops folder * Remove useless classes * Add the ONNX test for GPT2 and BART * Add a onnxruntime slow test + better opset flexibility * Fix test + apply style * fix tests * Switch min opset from 12 to 10 * Update src/transformers/file_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Fix GPT2 * Remove extra shape_list usage * Fix GPT2 * Address Morgan's comments Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-02-15 07:55:10 -05:00
Nicolas Patry	900daec24e	Fixing NER pipeline for list inputs. (#10184 ) Fixes #10168	2021-02-15 06:22:45 -05:00
Nicolas Patry	c9837a0d27	Conversion from slow to fast for BPE spm vocabs contained an error. (#10120 ) * Conversion from slow to fast for BPE spm vocabs contained an error. - There is only 1 test currently (tokenizers + slow) that used the modified path and it's reformer, which does not contain any ids modification so the bug was silent for now. - The real issue is that vocab variable was overloaded by SentencePieceExtractor, leading to Slow specific vocab oddities to be completely ignored - The bug was reported here https://github.com/huggingface/transformers/issues/9518 - Ran the complete tokenization test suite with slow without error (`RUN_SLOW=1 pytest -sv tests/test_tokenization_`) Remove rebase error. * Adding the fixture.	2021-02-13 08:24:53 -05:00
Julien Chaumond	641f418e10	[hf_api] delete deprecated methods and tests (2)	2021-02-12 21:46:17 +01:00
Julien Chaumond	eed31db948	[hf_api] delete deprecated methods and tests (#10159 ) * [hf_api] delete deprecated methods and tests cc @lhoestq * Update test_hf_api.py	2021-02-12 15:35:06 -05:00
Patrick von Platen	495c157d6f	[Wav2Vec2] Improve Tokenizer & Model for batched inference (#10117 ) * save intermediate * finish batch the same as fairseq * add normalization * fix batched input * add better comment * Update src/transformers/models/wav2vec2/modeling_wav2vec2.py * add nice docstring * add tokenizer tests * make all slow tests pass * finish PR * correct import	2021-02-11 15:40:54 +03:00
Suraj Patil	c130e67dce	remove adjust_logits_during_generation method (#10087 ) * add forced logits processors * delete adjust_logits method * add forced_eos_token_id argument in config * add tests for forced logits processors * update gen utils tests * add forced option to tf generate * remove adjust_logits method from tf models * update adjust_logits for marian * delete _force_token_id_to_be_generated method * style * import warnings * pass max_length to _get_logits_processor * set forced_eos_token_id to None * set forced attributes in conf utils * typo * fix rag generate * add forced_eos_token_id in rag config * remove force_bos_token_to_be_generated from BartConfig * remove _force_token_ids_generation from FSMT * nit * fix negative constant * apply suggestions from code review	2021-02-10 22:39:09 +05:30
Julien Plu	22a32cf485	Fix TF LED/Longformer attentions computation (#10007 ) * Fix test * Remove commented test * Fix name * Apply style * Fix check copies * Remove prints * Restore boolean * Fix reshape	2021-02-10 10:58:37 -05:00
Lysandre Debut	0d8e554d42	Line endings should be LF across repo and not CRLF (#10119 )	2021-02-10 10:50:00 -05:00
abhishek thakur	480a9d6ba0	Fix TFConvBertModelIntegrationTest::test_inference_masked_lm Test (#10104 )	2021-02-09 20:22:54 +01:00
Daniel Stancl	e7381c4596	Add head_mask and decoder_head_mask to TF LED (#9988 ) * Add head masking to TF LED * Add head_mask to Longformer + one doc piece to LED * Fix integration tests	2021-02-09 11:45:18 -05:00
Patrick von Platen	b972125ced	Deprecate Wav2Vec2ForMaskedLM and add Wav2Vec2ForCTC (#10089 ) * add wav2vec2CTC and deprecate for maskedlm * remove from docs	2021-02-09 03:49:02 -05:00
sandip	263fac71a2	Integration test for electra model (#10073 )	2021-02-08 15:42:25 -05:00
demSd	3b7e612a5e	Implementing the test integration of BertGeneration (#9990 ) * claiming this issue * Integration test for BertGeneration(Encoder and Decoder) * fix code quality	2021-02-08 08:22:19 -05:00
Patrick von Platen	9e795eac88	fix bert2bert test (#10063 )	2021-02-08 16:04:28 +03:00
Julien Plu	31563e056d	Restore TF embeddings and attention layers to their previous version (#9890 ) * Refacto BERT * Restore all the concerned models * Remove print * Update template * Apply Sylvain's and Morgan's comments * Fix cast * Put the cast inside call * Remove cond in ebds * Fix funnel * Restore previous dot product (attention_scores) computation * Add ConvBERT and BART * Make all the S2S models ONNX compliant * Fix test * Fix check copies	2021-02-08 14:36:30 +03:00
Julien Plu	8bb52bd240	Disable temporarily too slow tests (Longformer/LED) (#10062 ) * Disable temporarily too slow tests * Fix style * Fix template	2021-02-08 12:32:31 +01:00
Nicolas Patry	b1aa4982cd	Cleaning up `ConversationalPipeline` to support more than DialoGPT. (#10002 ) * Cleaning up `ConversationalPipeline` to support more than DialoGPT. Currently ConversationalPipeline was heavily biased towards DialoGPT ,which is the default model for this pipeline. This PR proposes changes to put back the modifications specific to DialoGPT into tokenizer-specific behavior wherever possible, by creating `_build_conversation_input_ids` function that takes conversation as input, and returns a list of ints corresponding to the tokens. It feels natural to put here because all models have probably different strategies to build input_ids from the full conversation and it's the tokenizer's job to transform strings into tokens (and vice-versa) If `_build_conversation_input_ids` is missing, previous behavior is used so we don't break anything so far (except for blenderbot where it's a fix). This PR also contains a fix for too long inputs. There used to be dead code for trying to limit the size of incoming input. The introduced fixed is that we limit within `_build_conversation_input_ids` to `tokenizer.model_max_length`. It corresponds to the intent of the removed dead code and is actually better because it corresponds to `model_max_length` which is different from `max_length` (which is a default parameter for `generate`). - Removed `history` logic from the Conversation as it's not relevant anymore because tokenization logic has been moved to tokenizer. And tokenizer cannot save any cache, and conversation cannot know what is relevant or not. Also it's not usable from `blenderbot` because the input_ids are not append only (EOS tokens is always at the end). - Added `iter_texts` method on `Conversation` because all the code was literred with some form of this iteration of past/generated_responses. * Removing torch mention in types. * Adding type checking to `_build_conversation_input_ids`. * Fixing import in strings.	2021-02-08 14:29:07 +03:00
Patrick von Platen	9a0399e18d	fix bart tests (#10060 )	2021-02-08 13:25:09 +03:00
Lysandre Debut	d51302cca0	Fix slow dpr test (#10059 ) * Correct cast to device * Comment back the slow test	2021-02-08 04:43:25 -05:00
sandip	12e44af5d3	Integration test for FlauBert (#10022 )	2021-02-08 04:36:50 -05:00
Nicolas Patry	d5888ef0ab	Hotfixing tests (blenderbot decoderonly tests, also need to remove (#10003 ) `encoder_no_repeat_ngram_size` from their config.	2021-02-04 11:41:34 -05:00
Nicolas Patry	aeb18b9224	Adding new `encoder_no_repeat_ngram_size` to `generate`. (#9984 ) Adding new `encoder_no_repeat_ngram_size` to `generate`. Blenderbot results seemed off compared to original ParlAI script: `https://parl.ai/projects/recipes/`. Notably the model seems to repeat a lot what was said during the conversation. The actual problem was that `no_repeat_ngram_size` actually applies to the `encoder_input_ids` but HF's `no_repeat_ngram_size` applies to the previously generated ids (within the decoder). The history conversation of blenderbot is within the `encoder` part so that explains why HF's implementation had the repetitions. This fix was focused on blenderbot not small and added tests for those because they are quite different in configuration. This change includes: - Adding a new EncoderNoRepeatLogitProcessor. - Adding 1 new arg to `generate` (`encoder_no_repeat_ngram_size`) - Adding 1 new config parameter `encoder_no_repeat_ngram_size`. - Adding 2 tests, one for the pipeline (high level, inputs exhibited repeat behavior, one low level for EncoderNoRepeatLogitProcessor) - Factored NoRepeatLogitProcessor so that logic could be reused. Further work: - Blenderbot conversational pipeline still does not behave correctly as they way input is prepared within the pipeline is still incorrect (follow up PR) - Blenderbot allows the bot to have personas, which is done by prepending "your personna: XXXX" to the input, this could be explored too in a follow up PR. @patrickvonplaten @LysandreJik * Update src/transformers/generation_logits_process.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/configuration_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Doc quality. * Fixing test. * Last fixes. * Fixing to account for batch_size. * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/generation_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-02-04 15:00:18 +01:00
Daniel Hug	804cd185d8	Added Integration testing for DistilBert model from issue #9948 ' (#9995 )	2021-02-04 04:24:59 -05:00
demSd	00031785a8	BartForCausalLM analogs to `ProphetNetForCausalLM` (#9128 ) * initiliaze bart4causalLM * create BartDecoderWrapper, setters/getters * delete spaces * forward and additional methods * update cache function, loss function, remove ngram* params in data class. * add bartcausallm, bartdecoder testing * correct bart for causal lm * remove at * add mbart as well * up * fix typo * up * correct * add pegasusforcausallm * add blenderbotforcausallm * add blenderbotsmallforcausallm * add marianforcausallm * add test for MarianForCausalLM * add Pegasus test * add BlenderbotSmall test * add blenderbot test * fix a fail * fix an import fail * a fix * fix * Update modeling_pegasus.py * fix models * fix inputs_embeds setting getter * adapt tests * correct repo utils check * finish test improvement * fix tf models as well * make style * make fix-copies * fix copies * run all tests * last changes * fix all tests Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-02-04 11:56:12 +03:00
sandip	2f06f2bcd6	Alber model integration testing added (#9980 )	2021-02-03 11:41:10 -05:00
sandip	75fd00fb25	Integration test added for TF MPnet (#9979 )	2021-02-03 11:39:40 -05:00
sandip	ce08043f7a	Integration test for mobilebert (#9978 )	2021-02-03 11:36:45 -05:00
sandip	1486205d23	TF DistilBERT integration tests (#9975 ) * TF DistilBERT integration test * Update test_modeling_tf_distilbert.py	2021-02-03 09:51:00 -05:00

1 2 3 4 5 ...

857 Commits