transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 22:30:09 +06:00

Author	SHA1	Message	Date
Julien Chaumond	448c467256	Fix: unpin flake8 and fix cs errors (#4367) * Fix: unpin flake8 and fix cs errors * Ok we still need to quote those	2020-05-14 13:14:26 -04:00
Sam Shleifer	9a687ebb77	[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290)	2020-05-13 17:29:41 -04:00
Julien Chaumond	241759101e	(v2) Improvements to the wandb integration (#4324) * Improvements to the wandb integration * small reorg + no global necessary * feat(trainer): log epoch and final metrics * Simplify logging a bit * Fixup * Fix crash when just running eval Co-authored-by: Chris Van Pelt <vanpelt@gmail.com> Co-authored-by: Boris Dayma <boris.dayma@gmail.com>	2020-05-12 21:52:01 -04:00
Julien Chaumond	4bf5042240	Fix BART tests on GPU (#4298)	2020-05-12 09:11:50 -04:00
Sam Shleifer	3487be75ef	[Marian] documentation and AutoModel support (#4152) - MarianSentencepieceTokenizer -> MarianTokenizer - Start using unk token. - add docs page - add better generation params to MarianConfig - more conversion utilities	2020-05-10 13:54:57 -04:00
Patrick von Platen	cf08830c28	[Pipeline, Generation] tf generation pipeline bug (#4217) * fix PR * move tests to correct place	2020-05-08 08:30:05 -04:00
Jared T Nielsen	8bf7312654	Add AlbertForPreTraining and TFAlbertForPreTraining models. (#4057) * Add AlbertForPreTraining and TFAlbertForPreTraining models. * PyTorch conversion * TensorFlow conversion * style Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-05-07 19:44:51 -04:00
Julien Chaumond	0ae96ff8a7	BIG Reorganize examples (#4213) * Created using Colaboratory * [examples] reorganize files * remove run_tpu_glue.py as superseded by TPU support in Trainer * Bugfix: int, not tuple * move files around	2020-05-07 13:48:44 -04:00
Funtowicz Morgan	0a6cbea0a5	Rewritten batch support in pipelines. (#4154 ) * Rewritten batch support in pipelines. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix imports sorting 🔧 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Set pad_to_max_length=True by default on Pipeline. * Set pad_to_max_length=False for generation pipelines. Most of generation models doesn't have padding token. * Address @joeddav review comment: Uniformized args. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> Address @joeddav review comment: Uniformized *args (second). Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-05-07 09:52:40 -04:00
Patrick von Platen	dca34695d0	Reformer (#3351 ) * first copy & past commit from Bert and morgans LSH code * add easy way to compare to trax original code * translate most of function * make trax lsh self attention deterministic with numpy seed + copy paste code * add same config * add same config * make layer init work * implemented hash_vectors function for lsh attention * continue reformer translation * hf LSHSelfAttentionLayer gives same output as trax layer * refactor code * refactor code * refactor code * refactor * refactor + add reformer config * delete bogus file * split reformer attention layer into two layers * save intermediate step * save intermediate step * make test work * add complete reformer block layer * finish reformer layer * implement causal and self mask * clean reformer test and refactor code * fix merge conflicts * fix merge conflicts * update init * fix device for GPU * fix chunk length init for tests * include morgans optimization * improve memory a bit * improve comment * factorize num_buckets * better testing parameters * make whole model work * make lm model work * add t5 copy paste tokenizer * add chunking feed forward * clean config * add improved assert statements * make tokenizer work * improve test * correct typo * extend config * add complexer test * add new axial position embeddings * add local block attention layer * clean tests * refactor * better testing * save intermediate progress * clean test file * make shorter input length work for model * allow variable input length * refactor * make forward pass for pretrained model work * add generation possibility * finish dropout and init * make style * refactor * add first version of RevNet Layers * make forward pass work and add convert file * make uploaded model forward pass work * make uploaded model forward pass work * refactor code * add namedtuples and cache buckets * correct head masks * refactor * made reformer more flexible * make style * remove set max length * add 
attention masks * fix up tests * fix lsh attention mask * make random seed optional for the moment * improve memory in reformer * add tests * make style * make sure masks work correctly * detach gradients * save intermediate * correct backprob through gather * make style * change back num hashes * rename to labels * fix rotation shape * fix detach * update * fix trainer * fix backward dropout * make reformer more flexible * fix conflict * fix * fix * add tests for fixed seed in reformer layer * fix trainer typo * fix typo in activations * add fp16 tests * add fp16 training * support fp16 * correct gradient bug in reformer * add fast gelu * re-add dropout for embedding dropout * better naming * better naming * renaming * finalize test branch * finalize tests * add more tests * finish tests * fix * fix type trainer * fix fp16 tests * fix tests * fix tests * fix tests * fix issue with dropout * fix dropout seeds * correct random seed on gpu * finalize random seed for dropout * finalize random seed for dropout * remove duplicate line * correct half precision bug * make style * refactor * refactor * docstring * remove sinusoidal position encodings for reformer * move chunking to modeling_utils * make style * clean config * make style * fix tests * fix auto tests * pretrained models * fix docstring * update conversion file * Update pretrained_models.rst * fix rst * fix rst * update copyright * fix test path * fix test path * fix small issue in test * include reformer in generation tests * add docs for axial position encoding * finish docs * Update convert_reformer_trax_checkpoint_to_pytorch.py * remove isort * include sams comments * remove wrong comment in utils * correct typos * fix typo * Update reformer.rst * applied morgans optimization * make style * make gpu compatible * remove bogus file * big test refactor * add example for chunking * fix typo * add to README	2020-05-07 10:17:01 +02:00
Julien Plu	aad50151f3	TF version of the trainer (#4017 ) * First commit to add a TF version of the trainer. * Make the TF trainer closer to what looks the PT trainer * Refactoring common code between the PT and TF trainer into an util file. * Some bugfix + better similarity with the PT trainer * Add missing class in transformers init * Bugfix over prediction + use classification report instead of simple metrics * Fix name error * Fix optimization tests + style * Apply style * Several bugfix for multi-gpu training * Apply style * Apply style * Add glue example for the TF trainer * Several bugix + address the reviews * Fix on the TF training args file * Add a debug mode * Bugfix in utils_ner.py when segment_ids is None * Apply style * Apply style * Add TPU strategy * Fix selection strategy	2020-05-06 12:56:52 -04:00
Lysandre Debut	79b1c6966b	Pytorch 1.5.0 (#3973) * Standard deviation can no longer be set to 0 * Remove torch pinned version * 9th instead of 10th, silly me	2020-05-05 10:23:01 -04:00
Patrick von Platen	8e67573a64	[EncoderDecoder Tests] Improve tests (#4046) * Hoist bert model tester for patric * indent * make tests work * Update tests/test_modeling_bert.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: sshleifer <sshleifer@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-04 02:18:36 +02:00
Sam Shleifer	18db92dd9a	[testing] add timeout_decorator (#3543)	2020-05-01 09:05:47 -04:00
Julien Chaumond	f39217a5ec	[tests] Light cleanup of tempfile in tests/	2020-04-30 22:30:15 -04:00
Julien Chaumond	f54dc3f4d5	[ci] Load pretrained models into the default (long-lived) cache There's an inconsistency right now where: - we load some models into CACHE_DIR - and some models in the default cache - and often, in both for the same models When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth. I'd rather always use the default cache	2020-04-30 22:30:15 -04:00
Julien Chaumond	ab90353f1a	[cli] {login, upload, s3} display more helpful error messages	2020-04-30 12:51:06 -04:00
Julien Chaumond	452dd0e4d9	[ci] Align test_hf_api.py with API change	2020-04-30 12:06:01 -04:00
Sam Shleifer	2c77842887	[Fix common tests on GPU] send model, ids to torch_device (#4014)	2020-04-29 09:47:20 -04:00
Sam Shleifer	847e7f3379	MarianMTModel.from_pretrained('Helsinki-NLP/opus-marian-en-de') (#3908) Co-Authored-By: Stefan Schweter <stefan@schweter.it>	2020-04-28 18:22:37 -04:00
Patrick von Platen	fa49b9afea	Clean Encoder-Decoder models with Bart/T5-like API and add generate possibility (#3383 ) * change encoder decoder style to bart & t5 style * make encoder decoder generation dummy work for bert * make style * clean init config in encoder decoder * add tests for encoder decoder models * refactor and add last tests * refactor and add last tests * fix attn masks for bert encoder decoder * make style * refactor prepare inputs for Bert * refactor * finish encoder decoder * correct typo * add docstring to config * finish * add tests * better naming * make style * fix flake8 * clean docstring * make style * rename	2020-04-28 15:11:09 +02:00
Lorenzo Ampil	f16540fcba	Pipeline for Text Generation: GenerationPipeline (#3758 ) * Add GenerationPipeline * Fix parameter names * Correct parameter __call__ parameters * Add model type attribute and correct function calls for prepare_input * Take out trailing commas from init attributes * Remove unnecessary tokenization line * Implement support for multiple text inputs * Apply generation support for multiple input text prompts * Take out tensor coersion * Take out batch index * Add text prompt to return sequence * Squeeze token tensore before decoding * Return only a single list of sequences if only one prompt was used * Correct results variable name * Add GenerationPipeline to SUPPORTED_TASKS with the alias , initalized w GPT2 * Registedred AutoModelWithLMHead for both pt and t * Update docstring for GenerationPipeline * Add kwargs parameter to mode.generate * Take out kwargs parameter after all * Add generation pipeline example in pipeline docstring * Fix max length by squeezing tokens tensor * Apply ensure_tensor_on_device to pytorch tensor * Include generation step in torch.no_grad * Take out input from prepare_xlm_input and set 'en' as default xlm_language * Apply framework specific encoding during prepare_input * Format w make style * Move GenerationPipeline import to follow proper import sorting * Take out training comma from generation dict * Apply requested changes * Change name to TextGenerationPipeline * Apply TextGenerationPipeline rename to __init___ * Changing alias to * Set input mapping as input to ensure_tensor_on_device * Fix assertion placement * Add test_text_generation * Add TextGenerationPipeline to PipelineCommonTests * Take out whitespace * Format __init__ w black * Fix __init__ style * Forman __init___ * Add line to end of __init__ * Correct model tokenizer set for test_text_generation * Ensure to return list of list, not list of string (to pass test) * Limit test models to only 3 to limit runtime to address circleCI timeout error * 
Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Remove argument docstring, __init__, add additional __call__ arguments, and reformat results to list of dict * Fix blank result list * Add TextGenerationPipeline to pipelines.rst * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Fix typos from adding PADDING_TEXT_TOKEN_LENGTH * Fix incorrectly moved result list * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen 
<patrick.v.platen@gmail.com> * Add back generation line and make style * Take out blank whitespace * Apply new alis, text-generation, to test_pipelines * Fix text generation alias in test * Update src/transformers/pipelines.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-22 09:37:03 -04:00
Julien Chaumond	dd9d483d03	Trainer (#3800 ) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00
Thomas Wolf	827d6d6ef0	Cleanup fast tokenizers integration (#3706 ) * First pass on utility classes and python tokenizers * finishing cleanup pass * style and quality * Fix tests * Updating following @mfuntowicz comment * style and quality * Fix Roberta * fix batch_size/seq_length inBatchEncoding * add alignement methods + tests * Fix OpenAI and Transfo-XL tokenizers * adding trim_offsets=True default for GPT2 et RoBERTa * style and quality * fix tests * add_prefix_space in roberta * bump up tokenizers to rc7 * style * unfortunately tensorfow does like these - removing shape/seq_len for now * Update src/transformers/tokenization_utils.py Co-Authored-By: Stefan Schweter <stefan@schweter.it> * Adding doc and docstrings * making flake8 happy Co-authored-by: Stefan Schweter <stefan@schweter.it>	2020-04-18 13:43:57 +02:00
Lysandre Debut	8b63a01d95	XLM tokenizer should encode with bos token (#3791) * XLM tokenizer should encode with bos token * Update tests	2020-04-17 11:28:55 -04:00
Patrick von Platen	1d4a35b396	Higher tolerance for past testing in TF T5 (#3844)	2020-04-17 11:26:16 -04:00
Patrick von Platen	d13eca11e2	Higher tolerance for past testing in T5 (#3843)	2020-04-17 11:25:14 -04:00
Pierric Cistac	6d00033e97	Question Answering support for Albert and Roberta in TF (#3812) * Add TFAlbertForQuestionAnswering * Add TFRobertaForQuestionAnswering * Update TFAutoModel with Roberta/Albert for QA * Clean `super` TF Albert calls	2020-04-17 10:45:30 -04:00
Patrick von Platen	baca8fa8e6	clean pipelines (#3795)	2020-04-16 10:21:34 -04:00
Patrick von Platen	38f7461df3	[TFT5, Cache] Add cache to TFT5 (#3772 ) * correct gpt2 test inputs * make style * delete modeling_gpt2 change in test file * translate from pytorch * correct tests * fix conflicts * fix conflicts * fix conflicts * fix conflicts * make tensorflow t5 caching work * make style * clean reorder cache * remove unnecessary spaces * fix test	2020-04-16 16:14:52 +02:00
Patrick von Platen	01c37dcdb5	[Config, Caching] Remove `output_past` everywhere and replace by `use_cache` argument (#3734 ) * remove output_past from pt * make style * add optional input length for gpt2 * add use cache to prepare input * save memory in gpt2 * correct gpt2 test inputs * make past input optional for gpt2 * finish use_cache for all models * make style * delete modeling_gpt2 change in test file * correct docstring * correct is true statements for gpt2	2020-04-14 14:40:28 -04:00
Teven	352d5472b0	Shift labels internally within TransfoXLLMHeadModel when called with labels (#3716 ) * Shifting labels inside TransfoXLLMHead * Changed doc to reflect change * Updated pytorch test * removed IDE whitespace changes * black reformat Co-authored-by: TevenLeScao <teven.lescao@gmail.com>	2020-04-13 18:11:23 +02:00
Julien Chaumond	b169ac9c2b	[examples] Generate argparsers from type hints on dataclasses (#3669 ) * [examples] Generate argparsers from type hints on dataclasses * [HfArgumentParser] way simpler API * Restore run_language_modeling.py for easier diff * [HfArgumentParser] final tweaks from code review	2020-04-10 12:21:58 -04:00
Sam Shleifer	7a7fdf71f8	Multilingual BART - (#3602) - support mbart-en-ro weights - add MBartTokenizer	2020-04-10 11:25:39 -04:00
Patrick von Platen	ce2298fb5f	[T5, generation] Add decoder caching for T5 (#3682 ) * initial commit to add decoder caching for T5 * better naming for caching * finish T5 decoder caching * correct test * added extensive past testing for T5 * clean files * make tests cleaner * improve docstring * improve docstring * better reorder cache * make style * Update src/transformers/modeling_t5.py Co-Authored-By: Yacine Jernite <yjernite@users.noreply.github.com> * make set output past work for all layers * improve docstring * improve docstring Co-authored-by: Yacine Jernite <yjernite@users.noreply.github.com>	2020-04-10 01:02:50 +02:00
LysandreJik	31baeed614	Update quotes cc @julien-c	2020-04-09 09:09:00 -04:00
Lysandre Debut	6435b9f908	Updating the TensorFlow models to work as expected with tokenizers v3.0.0 (#3684) * Updating modeling tf files; adding tests * Merge `encode_plus` and `batch_encode_plus`	2020-04-08 16:22:44 -04:00
Sam Shleifer	715aa5b135	[Bart] Replace config.output_past with use_cache kwarg (#3632)	2020-04-07 19:08:26 -04:00
Sam Shleifer	0a4b1068e1	Speedup torch summarization tests (#3663)	2020-04-07 14:01:30 -04:00
Funtowicz Morgan	96ab75b8dd	Tokenizers v3.0.0 (#3185 ) * Renamed num_added_tokens to num_special_tokens_to_add Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Cherry-Pick: Partially fix space only input without special tokens added to the output #3091 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added property is_fast on PretrainedTokenizer and PretrainedTokenizerFast Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Make fast tokenizers unittests work on Windows. * Entirely refactored unittest for tokenizers fast. * Remove ABC class for CommonFastTokenizerTest * Added embeded_special_tokens tests from allenai @dirkgr * Make embeded_special_tokens tests from allenai more generic * Uniformize vocab_size as a property for both Fast and normal tokenizers * Move special tokens handling out of PretrainedTokenizer (SpecialTokensMixin) * Ensure providing None input raise the same ValueError than Python tokenizer + tests. * Fix invalid input for assert_padding when testing batch_encode_plus * Move add_special_tokens from constructor to tokenize/encode/[batch_]encode_plus methods parameter. * Ensure tokenize() correctly forward add_special_tokens to rust. * Adding None checking on top on encode / encode_batch for TransfoXLTokenizerFast. Avoid stripping on None values. * unittests ensure tokenize() also throws a ValueError if provided None * Added add_special_tokens unittest for all supported models. * Style * Make sure TransfoXL test run only if PyTorch is provided. * Split up tokenizers tests for each model type. * Fix invalid unittest with new tokenizers API. * Filter out Roberta openai detector models from unittests. * Introduce BatchEncoding on fast tokenizers path. This new structure exposes all the mappings retrieved from Rust. It also keeps the current behavior with model forward. * Introduce BatchEncoding on slow tokenizers path. Backward compatibility. 
* Improve error message on BatchEncoding for slow path * Make add_prefix_space True by default on Roberta fast to match Python in majority of cases. * Style and format. * Added typing on all methods for PretrainedTokenizerFast * Style and format * Added path for feeding pretokenized (List[str]) input to PretrainedTokenizerFast. * Style and format * encode_plus now supports pretokenized inputs. * Remove user warning about add_special_tokens when working on pretokenized inputs. * Always go through the post processor. * Added support for pretokenized input pairs on encode_plus * Added is_pretokenized flag on encode_plus for clarity and improved error message on input TypeError. * Added pretokenized inputs support on batch_encode_plus * Update BatchEncoding methods name to match Encoding. * Bump setup.py tokenizers dependency to 0.7.0rc1 * Remove unused parameters in BertTokenizerFast * Make sure Roberta returns token_type_ids for unittests. * Added missing typings * Update add_tokens prototype to match tokenizers side and allow AddedToken * Bumping tokenizers to 0.7.0rc2 * Added documentation for BatchEncoding * Added (unused) is_pretokenized parameter on PreTrainedTokenizer encode_plus/batch_encode_plus methods. * Added higher-level typing for tokenize / encode_plus / batch_encode_plus. * Fix unittests failing because add_special_tokens was defined as a constructor parameter on Rust Tokenizers. * Fix text-classification pipeline using the wrong tokenizer * Make pipelines works with BatchEncoding * Turn off add_special_tokens on tokenize by default. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Remove add_prefix_space from tokenize call in unittest. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Style and quality Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Correct message for batch_encode_plus none input exception. 
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix invalid list comprehension for offset_mapping overriding content every iteration. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * TransfoXL uses Strip normalizer. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bump tokenizers dependency to 0.7.0rc3 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Support AddedTokens for special_tokens and use left stripping on mask for Roberta. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * SpecilaTokenMixin can use slots to faster access to underlying attributes. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Remove update_special_tokens from fast tokenizers. * Ensure TransfoXL unittests are run only when torch is available. * Style. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Style * Style 🙏🙏 * Remove slots on SpecialTokensMixin, need deep dive into pickle protocol. * Remove Roberta warning on __init__. * Move documentation to Google style. Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-04-07 00:29:15 +02:00
Patrick von Platen	2ee410560e	[Generate, Test] Split generate test function into beam search, no beam search (#3601) * split beam search and no beam search test * fix test * clean generate tests	2020-04-06 10:37:05 +02:00
Lysandre Debut	d5d7d88612	ELECTRA (#3257 ) * Electra wip * helpers * Electra wip * Electra v1 * ELECTRA may be saved/loaded * Generator & Discriminator * Embedding size instead of halving the hidden size * ELECTRA Tokenizer * Revert BERT helpers * ELECTRA Conversion script * Archive maps * PyTorch tests * Start fixing tests * Tests pass * Same configuration for both models * Compatible with base + large * Simplification + weight tying * Archives * Auto + Renaming to standard names * ELECTRA is uncased * Tests * Slight API changes * Update tests * wip * ElectraForTokenClassification * temp * Simpler arch + tests Removed ElectraForPreTraining which will be in a script * Conversion script * Auto model * Update links to S3 * Split ElectraForPreTraining and ElectraForTokenClassification * Actually test PreTraining model * Remove num_labels from configuration * wip * wip * From discriminator and generator to electra * Slight API changes * Better naming * TensorFlow ELECTRA tests * Accurate conversion script * Added to conversion script * Fast ELECTRA tokenizer * Style * Add ELECTRA to README * Modeling Pytorch Doc + Real style * TF Docs * Docs * Correct links * Correct model intialized * random fixes * style * Addressing Patrick's and Sam's comments * Correct links in docs	2020-04-03 14:10:54 -04:00
Yohei Tamura	8594dd80dd	BertJapaneseTokenizer accept options for mecab (#3566) * BertJapaneseTokenizer accept options for mecab * black * fix mecab_option to Option[str]	2020-04-03 11:12:19 -04:00
Patrick von Platen	a4ee4da18a	[T5, TF 2.2] change tf t5 argument naming (#3547) * change tf t5 argument naming for TF 2.2 * correct bug in testing	2020-04-01 22:04:20 +02:00
Patrick von Platen	b815edf69f	[T5, Testst] Add extensive hard-coded integration tests and make sure PT and TF give equal results (#3550) * add some t5 integration tests * finish summarization and translation integration tests for T5 - results loook good * add tf test * fix == vs is bug * fix tf beam search error and make tf t5 tests pass	2020-04-01 18:01:33 +02:00
Patrick von Platen	b38d552a92	[Generate] Add bad words list argument to the generate function (#3367) * add bad words list * make style * add bad_words_tokens * make style * better naming * make style * fix typo	2020-03-31 18:42:31 +02:00
Sam Shleifer	8deff3acf2	[bart-tiny-random] Put a 5MB model on S3 to allow faster exampl… (#3488)	2020-03-30 12:28:27 -04:00
Patrick von Platen	75ec6c9e3a	[T5] make decoder input ids optional for t5 training (#3521) * make decoder input ids optional for t5 training * lm_lables should not be shifted in t5 * add tests * finish shift right functionality for PT T5 * move shift right to correct class * cleaner code * replace -100 values with pad token id * add assert statement * remove unnecessary for loop * make style	2020-03-30 13:45:26 +02:00
Sam Shleifer	f6a23d1911	[BART] add bart-large-xsum weights (#3422)	2020-03-29 10:51:13 -04:00
Sam Shleifer	3ee431dd4c	[Bart/Memory] Two separate, smaller decoder attention masks (#3371)	2020-03-26 21:34:15 -04:00
Sam Shleifer	39371ee454	[Bart/Memory] don't create lm_head (#3323) * delete lm_head, skips weight tying * Fixed s3	2020-03-26 18:40:39 -04:00
sakares saengkaew	1a6c546c6f	Add missing token classification for XLM (#3277 ) * Add the missing token classification for XLM * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add the missing token classification for XLM * fix styling * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add missing description for AlbertForTokenClassification * fix styling * Add missing docstring for AlBert * Slow tests should be slow Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com> Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-03-26 10:22:13 -04:00
Patrick von Platen	022e8fab97	Adds translation pipeline (#3419) * fix merge conflicts * add t5 summarization example * change parameters for t5 summarization * make style * add first code snippet for translation * only add prefixes * add prefix patterns * make style * renaming * fix conflicts * remove unused patterns * solve conflicts * fix merge conflicts * remove translation example * remove summarization example * make sure tensors are in numpy for float comparsion * re-add t5 config * fix t5 import config typo * make style * remove unused numpy statements * update doctstring * import translation pipeline	2020-03-26 13:50:58 +01:00
Patrick von Platen	9c683ef01e	Add t5 to pipeline(task='summarization') (#3413) * solve conflicts * move warnings below * incorporate changes * add pad_to_max_length to pipelines * add bug fix for T5 beam search * add prefix patterns * make style * fix conflicts * adapt pipelines for task specific parameters * improve docstring * remove unused patterns	2020-03-26 11:03:13 +01:00
Patrick von Platen	e392ba6938	Add camembert integration tests (#3375) * add integration tests for camembert * use jplu/tf-camembert fro the moment * make style	2020-03-24 10:18:37 +01:00
Patrick von Platen	95e00d0808	Clean special token init in modeling_....py (#3264) * make style * fix conflicts	2020-03-20 21:41:04 +01:00
Patrick von Platen	bbf26c4e61	Support T5 Generation (#3228 ) * fix conflicts * update bart max length test * correct spelling mistakes * implemented model specific encode function * fix merge conflicts * better naming * save intermediate state -> need to rethink strucuture a bit * leave tf problem as it is for now * current version * add layers.pop * remove ipdb * make style * clean return cut decoding * remove ipdbs * Fix restoring layers in the decoders that doesnt exists. * push good intermediate solution for now * fix conflicts * always good to refuse to merge conflicts when rebasing * fix small bug * improve function calls * remove unused file * add correct scope behavior for t5_generate Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-03-19 23:18:23 +01:00
Sam Shleifer	ad7233fc01	[BART] cleanup: remove redundant kwargs, improve docstrings (#3319)	2020-03-19 11:16:51 -04:00
Lysandre Debut	d6afbd323d	XLM-R Tokenizer now passes common tests + Integration tests (#3198) * XLM-R now passes common tests + Integration tests * Correct mask index * Model input names * Style * Remove text preprocessing * Unneccessary import	2020-03-18 09:52:49 -04:00
Patrick von Platen	292186a3e7	Adding LM Head to Transfo-XL and first step to fixing problem with Adaptive Embeddings in TransfoXL (#3286) * first commit * work in progress * make language generation task pass * update to working version for LM * delete print * remove dead code * make style	2020-03-18 09:24:27 -04:00
Sam Shleifer	38a555a83c	Add Summarization to Pipelines (#3128) * passing * Undo stupid chg * docs * undo rename * delete-cruft * only import if you have torch * Dont rely on dict ordering * Fix dict ordering upstream * docstring link * docstring link * remove trailing comma for 3.5 compat * new name * delegate kwarging * Update kwargs	2020-03-17 18:04:21 -04:00
Patrick von Platen	e8f44af5bf	[generate] do_sample default back to False (#3298) * change do_samples back * None better default as boolean * adapt do_sample to True in test example * make style	2020-03-17 10:52:37 -04:00
Sam Shleifer	b2c1a447fe	[BART] Delete redundant unit test (#3302)	2020-03-16 23:09:10 -04:00
Sam Shleifer	5ea8ba67b4	[BART] Remove unused kwargs (#3279) * Remove unused kwargs * dont call forward in tests	2020-03-15 23:00:44 -04:00
Thomas Wolf	3814e167d9	Merge pull request #3225 from patrickvonplaten/finalize_merge_bart_generate_into_default_generate Complete merge Seq-2-Seq generation into default generation	2020-03-14 15:08:59 +01:00
Sam Shleifer	2bd79e23de	[BART] FP16 testing fixes (#3266 )	2020-03-13 19:48:26 -04:00
Patrick von Platen	6a82f774f2	fix typo	2020-03-12 21:10:51 +01:00
Patrick von Platen	f1c71da115	fix eos_token_ids in test	2020-03-12 21:00:54 +01:00
Patrick von Platen	6047f46b19	re-add eos token to get good bart results	2020-03-12 20:17:50 +01:00
Patrick von Platen	ac303eae46	fix problem with half	2020-03-11 12:24:30 +01:00
Patrick von Platen	bc9d5d917c	make all tensors half precision	2020-03-11 12:15:38 +01:00
Patrick von Platen	a332cc9f7f	finalize generation merge	2020-03-11 11:53:36 +01:00
Patrick von Platen	7351a8dbaf	re-add scoring filtering	2020-03-11 11:06:56 +01:00
Patrick von Platen	374deef48d	fixed typo	2020-03-11 11:06:56 +01:00
patrickvonplaten	41b437ea3a	add draft version of proposed changes for ROUGE score	2020-03-11 11:06:56 +01:00
patrickvonplaten	a5751f7578	fix bug with attention_mask as optional input argument	2020-03-11 11:06:56 +01:00
patrickvonplaten	d880a5fbde	finalized PR	2020-03-11 11:06:56 +01:00
patrickvonplaten	2acfe63964	best current version and make style	2020-03-11 11:06:56 +01:00
patrickvonplaten	c62444da39	fix conflicts	2020-03-11 11:06:56 +01:00
Patrick von Platen	77e6775065	add current changes	2020-03-11 11:06:56 +01:00
Patrick von Platen	421216997b	comment out stuff	2020-03-11 11:06:56 +01:00
Patrick von Platen	7a11e925cf	work in progress	2020-03-11 11:06:56 +01:00
Patrick von Platen	aceb3fbaf4	only do output_past=True for language generation in bart	2020-03-11 11:06:56 +01:00
Patrick von Platen	7cba11fb9b	better naming	2020-03-11 11:06:56 +01:00
Patrick von Platen	ff648221bd	fix conflicts	2020-03-11 11:06:56 +01:00
Patrick von Platen	c0d9dd3ba9	refactored code a bit and made more generic	2020-03-11 11:06:56 +01:00
Patrick von Platen	d8e2b3c547	fix conflicts	2020-03-11 11:06:56 +01:00
Patrick von Platen	31f2437f07	Merge pull request #3191 from patrickvonplaten/add_integration_tests_lm_generate_torch_tf Add integration tests lm generate torch tf	2020-03-10 11:29:17 +01:00
Julien Chaumond	cbf8f5d32b	[model upload] Support for organizations	2020-03-09 17:33:57 -04:00
Lysandre	525b6b1c54	TFQA pipeline marked as slow test	2020-03-09 16:52:30 -04:00
Lysandre Debut	5164ea91a7	Skipping outputs (#3116 ) * Minimal example * Proposal 2 * Proposal 2 for fast tokenizers * Typings * Docs * Revert "Docs" for easier review This reverts commit eaf0f97062e809887704a542144c537f769d5223. * Remove unnecessary assignments * Tests * Fix faulty type * Remove prints * return_outputs -> model_input_names * Revert "Revert "Docs" for easier review" This reverts commit 6fdc69408102bf695797f2dfddbb6350c6b9e722. * code quality	2020-03-09 13:48:58 -04:00
Patrick von Platen	efb619235c	add print statement to avoid code quality problem	2020-03-09 15:31:21 +01:00
Patrick von Platen	b12541c4dc	test ctrl	2020-03-09 13:58:01 +00:00
Patrick von Platen	b73dd1a0e4	fix typo in test xlm tf	2020-03-09 11:34:31 +01:00
Patrick von Platen	4620caa864	fix if use lang embeddings in tf xlm	2020-03-09 11:18:54 +01:00
patrickvonplaten	fbd02d4693	fixed all tests, still need to check ctrl tf and pt and xlm tf	2020-03-08 21:45:55 +01:00
patrickvonplaten	b4a3a64744	fix xlnet & transfotests	2020-03-08 16:25:03 +01:00
patrickvonplaten	66c827656f	fix typo in test gpt2	2020-03-08 15:35:08 +01:00
patrickvonplaten	314bdc7c14	fix typo in test	2020-03-08 15:34:20 +01:00
patrickvonplaten	575976144a	updated all tests	2020-03-08 15:29:10 +01:00
Sam Shleifer	ed37f9fa4f	[Bart] _prepare_decoder_inputs should use large negative (#3158 )	2020-03-06 16:06:36 -05:00
Thomas Wolf	3e5da38dae	Merge pull request #3132 from huggingface/hf_api_model_list [hf_api] Get the public list of all the models on huggingface	2020-03-06 13:05:52 +01:00
Thomas Wolf	9499a3778e	Merge pull request #3103 from gthb/keras-serialization Support keras JSON/HDF5 serialization of main layers	2020-03-06 12:59:13 +01:00
patrickvonplaten	58fc8f97a3	fix renaming problem	2020-03-06 00:35:47 +01:00
Sam Shleifer	857e0a0d3b	Rename BartForMaskedLM -> BartForConditionalGeneration (#3114 ) * improved documentation	2020-03-05 17:41:18 -05:00
Lysandre Debut	146c521235	Merge branch 'master' into add_models_special_tokens_to_specific_configs	2020-03-05 17:24:42 -05:00
Lysandre Debut	b623ddc000	Pass kwargs to configuration (#3147 ) * Pass kwargs to configuration * Setter * test	2020-03-05 17:16:57 -05:00
Lysandre Debut	0001d05686	Correct missing keys + test (#3143 )	2020-03-05 17:01:54 -05:00
sshleifer	1360dacaa3	cleanup deltas	2020-03-05 12:57:42 -05:00
sshleifer	c36fdc88d4	tests pass	2020-03-05 12:33:08 -05:00
Julien Chaumond	f564f93c84	[hf_api] Get the public list of all the models on huggingface	2020-03-04 23:33:09 -05:00
Julien Chaumond	ff9e79ba3a	make style	2020-03-04 20:18:07 -05:00
Lysandre	07a79db505	Fix failing doc samples	2020-03-04 19:11:31 -05:00
Thomas Wolf	bdd3d0c76d	Merge pull request #3118 from patrickvonplaten/add_beam_search_to_generation_tf_2_0 Add beam search to generation tf 2 0	2020-03-04 23:28:00 +01:00
Patrick von Platen	932eab943d	include tf gpt2 tests for attn mask and past variable (#3122 )	2020-03-04 12:03:46 -05:00
patrickvonplaten	61fef6e957	added beam_search generation for tf 2.0	2020-03-04 17:27:47 +01:00
Gunnlaugur Thor Briem	96c4990165	fix unused imports and style	2020-03-03 22:57:05 +00:00
Gunnlaugur Thor Briem	470753bcf5	Put @keras_serializable only on layers it works on And only run the test on TF*MainLayer classes so marked.	2020-03-03 22:44:45 +00:00
Gunnlaugur Thor Briem	0c716ede8c	Use class decorator instead of superclass When supplied by Keras deserialization, the config parameter to initializers will be a dict. So intercept it and convert to PretrainedConfig object (and store in instance attribute for get_config to get at it) before passing to the actual initializer. To accomplish this, and repeat as little code as possible, use a class decorator on TF*MainLayer classes.	2020-03-03 22:31:42 +00:00
Sam Shleifer	e9e6efdc45	BartForSequenceClassification: fix num_labels, add test (#3110 )	2020-03-03 15:54:29 -05:00
Julien Chaumond	f631e01d2c	[ci] Re-run integration ground truth from fairseq Adopted best practice set by @patrickvonplaten of commenting lines run on fairseq, for easy comparison also see #3020	2020-03-03 15:31:40 -05:00
Gunnlaugur Thor Briem	b8da16f390	Add (failing) tests for Keras save/load	2020-03-03 15:22:34 +00:00
Patrick von Platen	4134100363	Add generate() functionality to TF 2.0 (#3063 ) * add first copy past test to tf 2 generate * add tf top_k_top_p_filter fn * add generate function for TF * add generate function for TF * implemented generate for all models expect transfoXL * implemented generate for all models expect transfoXL * implemented generate for all models expect transfoXL * make style * change permission of test file to correct ones * delete ipdb * delete ipdb * fix bug and finish simple gpt2 integration test * clean test file * clean test file * make style * make style * make style * make style * change import style * change import style * make style * make style * add decorators * add decorators * fix tf ctrl bug dim => axis in TF * make style * make style * refactored test file * refactored test file * take out test_torch_tf_conversion if nothing is defined * take out test_torch_tf_conversion if nothing is defined * remove useless files * remove useless files * fix conflicts * fix conflicts * fix conflicts * fix conflicts * fix conflicts * solve conflicts * solve conflicts * fix conflicts * fix conflicts * merge conflicts * delete ipdb * exposed top_k_top_p_filtering fns * delete weirdly created w! file * add comment to test tf common modeling * fix conflicts * fix conflicts * make style * merge conflicts * make style * change tf.tensor.shape to shape_list(tensor)	2020-03-03 09:42:15 -05:00
Julien Chaumond	f169957d0c	TF GPU CI (#3085 ) * debug env * Restrict TF GPU memory * Fixup * One more test * rm debug logs * Fixup	2020-03-02 15:45:25 -05:00
Lysandre Debut	d3eb7d23a4	Pipeline doc (#3055 ) * Pipeline doc initial commit * pipeline abstraction * Remove modelcard argument from pipeline * Task-specific pipelines can be instantiated with no model or tokenizer * All pipelines doc	2020-03-02 14:07:10 -05:00
Julien Chaumond	0e56b37e80	rm bogus file cc @patrickvonplaten	2020-03-02 12:27:12 -05:00
Patrick von Platen	2fdc7f6ce8	correct greedy generation when doing beam search (#3078 ) * correct greedy generation when doing beam search * improve comment	2020-03-02 12:00:09 -05:00
Patrick von Platen	c0135194eb	Force pad_token_id to be set before padding for standard tokenizer (#3035 ) * force pad_token_id to be set before padding * fix tests and forbid padding without having a padding_token_id set	2020-03-02 10:53:55 -05:00
Sam Shleifer	b54ef78d0c	Bart-CNN (#3059 ) `generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.	2020-03-02 10:35:53 -05:00
Lysandre Debut	8bcb37bfb8	NER support for Albert in run_ner.py and NerPipeline (#2983 ) * * Added support for Albert when fine-tuning for NER * Added support for Albert in NER pipeline * Added command-line options to examples/ner/run_ner.py to better control tokenization * Added class AlbertForTokenClassification * Changed output for NerPipeline to use .convert_ids_to_tokens(...) instead of .decode(...) to better reflect tokens * Added , * Now passes style guide enforcement * Changes from reviews. * Code now passes style enforcement * Added test for AlbertForTokenClassification * Added test for AlbertForTokenClassification	2020-02-27 10:22:55 -05:00
Martin Malmsten	f71157529e	Added test for AlbertForTokenClassification	2020-02-27 12:24:20 +01:00
Martin Malmsten	aceb6a0907	Added test for AlbertForTokenClassification	2020-02-27 11:52:46 +01:00
Julien Chaumond	b370cc7e99	[gpu] Fixup `fdd61b1992`	2020-02-26 21:48:49 +00:00
Julien Chaumond	f5516805c2	Fix bart slow test	2020-02-26 20:47:49 +00:00
Patrick von Platen	fdd61b1992	Fix attn mask gpt2 when using past (#3033 ) * fix issue and add some tests * fix issue and add some tests * updated doc string gpt2	2020-02-26 12:04:37 -05:00
Julien Chaumond	9cda3620b6	Fix (non-slow) tests on GPU (torch) (#3024 ) * Fix tests on GPU (torch) * Fix bart slow tests Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-02-26 11:59:25 -05:00
Sam Shleifer	9df74b8bc4	Delete all mentions of Model2Model (#3019 )	2020-02-26 11:36:27 -05:00
Patrick von Platen	c913eb9c38	Add integration tests for xlm roberta modelling and xlm roberta tokenizer (#3014 ) * add first files * add xlm roberta integration tests * make style * flake 8 issues solved	2020-02-25 16:51:25 -05:00
Patrick von Platen	f5b50c6b8e	make style	2020-02-25 16:41:54 +01:00
Patrick von Platen	e645dcbb70	add special tokens to pretrain configs of respective lm head models	2020-02-25 16:37:56 +01:00
Lysandre Debut	b90745c590	Test correct tokenizers after default switch (#3003 )	2020-02-24 18:45:53 -05:00
Funtowicz Morgan	4cd9c0971c	Fix for fast tokenizers save_pretrained compatibility with Python. (#2933 ) * Renamed file generate by tokenizers when calling save_pretrained to match python. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added save_vocabulary tests. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Remove python quick and dirty fix for clean Rust impl. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bump tokenizers dependency to 0.5.1 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * TransfoXLTokenizerFast uses a json vocabulary file + warning about incompatibility between Python and Rust Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added some save_pretrained / from_pretrained unittests. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Update tokenizers to 0.5.2 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Quality and format. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * flake8 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Making sure there is really a bug in unittest * Fix TransfoXL constructor vocab_file / pretrained_vocab_file mixin. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-24 18:20:42 -05:00
Lysandre Debut	21d8b6a33e	Testing that batch_encode_plus is the same as encode_plus (#2973 ) * Testing that encode_plus and batch_encode_plus behave the same way Spoiler alert: they don't * Testing rest of arguments in batch_encode_plus * Test tensor return in batch_encode_plus * Addressing Sam's comments * flake8 * Simplified with `num_added_tokens`	2020-02-24 12:09:46 -05:00
Patrick von Platen	17c45c39ed	Add slow generate tests for pretrained lm models (#2909 ) * add slow generate lm_model tests * fix conflicts * merge conflicts * fix conflicts * add slow generate lm_model tests * make style * delete unused variable * fix conflicts * fix conflicts * fix conflicts * delete unused variable * fix conflicts * finished hard coded tests	2020-02-24 11:51:57 -05:00
Sam Shleifer	92487a1dc0	Bart: fix layerdrop and cached decoder_input_ids for generation (#2969 )	2020-02-22 16:25:04 -05:00
Joe Davison	c36416e53c	Add standardized get_vocab method to tokenizers	2020-02-22 12:09:01 -05:00
Funtowicz Morgan	cc6775cdf5	Fix max_length not taken into account when using pad_to_max_length on fast tokenizers (#2961 ) * enable_padding should pad up to max_length if set. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added more testing on padding. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-22 09:27:47 -05:00
Patrick von Platen	fc38d4c86f	Improve special_token_id logic in run_generation.py and add tests (#2885 ) * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed 
* delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * changed fast random lm generation testing design to more general one * delete in old testing design in gpt2 * correct old variable name * temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed * adapted all fast random generate tests to new design * better warning description in modeling_utils * better comment * better comment and error message Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-02-21 12:09:59 -05:00
Sam Shleifer	53ce3854a1	New BartModel (#2745 ) * Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs	2020-02-20 18:11:13 -05:00
Joe Davison	197d74f988	Add get_vocab method to PretrainedTokenizer	2020-02-20 15:26:49 -05:00
Funtowicz Morgan	d490b5d500	Fast Tokenizers save pretrained should return the list of generated file paths. (#2918 ) * Correctly return the tuple of generated file(s) when calling save_pretrained Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Quality and format. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-20 00:58:04 +01:00
Funtowicz Morgan	e676764241	Override build_inputs_with_special_tokens for fast tokenizers (#2912 ) * Override build_inputs_with_special_tokens for fast impl + unittest. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Quality + format. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-19 16:09:51 -05:00
Funtowicz Morgan	3f3fa7f7da	Integrate fast tokenizers library inside transformers (#2674 ) * Implemented fast version of tokenizers Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bumped tokenizers version requirements to latest 0.2.1 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added matching tests Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Matching OpenAI GPT tokenization ! Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Matching GPT2 on tokenizers Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Expose add_prefix_space as constructor parameter for GPT2 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Matching Roberta tokenization ! Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Removed fast implementation of CTRL. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Binding TransformerXL tokenizers to Rust. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Updating tests accordingly. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added tokenizers as top-level modules. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Black & isort. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Rename LookupTable to WordLevel to match Rust side. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Black. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Use "fast" suffix instead of "ru" for rust tokenizers implementations. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Introduce tokenize() method on fast tokenizers. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * encode_plus dispatchs to batch_encode_plus Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * batch_encode_plus now dispatchs to encode if there is only one input element. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bind all the encode_plus parameter to the forwarded batch_encode_plus call. 
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bump tokenizers dependency to 0.3.0 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Formatting. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix tokenization_auto with support for new (python, fast) mapping schema. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Give correct fixtures path in test_tokenization_fast.py for the CLI. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Expose max_len_ properties on BertTokenizerFast Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Move max_len_ properties to PreTrainedTokenizerFast and override in specific subclasses. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * _convert_encoding should keep the batch axis tensor if only one sample in the batch. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Add warning message for RobertaTokenizerFast if used for MLM. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added use_fast (bool) parameter on AutoTokenizer.from_pretrained(). This allows to easily enable/disable Rust-based tokenizer instantiation. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Let's tokenizers handle all the truncation and padding stuff. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Allow to provide tokenizer arguments during pipeline creation. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Update test_fill_mask pipeline to not use fast tokenizers. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix too much parameters for convert_encoding. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * When enabling padding, max_length should be set to None. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Avoid returning nested tensors of length 1 when calling encode_plus Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Ensure output is padded when return_tensor is not None. 
Tensor creation requires the inital list input to be of the exact same size. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Disable transfoxl unittest if pytorch is not available (required to load the model) Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * encode_plus should not remove the leading batch axis if return_tensor is set Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Temporary disable fast tokenizers on QA pipelines. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix formatting issues. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Update tokenizers to 0.4.0 * Update style * Enable truncation + stride unit test on fast tokenizers. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Add unittest ensuring special_tokens set match between Python and Rust. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Ensure special_tokens are correctly set during construction. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Give more warning feedback to the user in case of padding without pad_token. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * quality & format. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added possibility to add a single token as str Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added unittest for add_tokens and add_special_tokens on fast tokenizers. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix rebase mismatch on pipelines qa default model. QA requires cased input while the tokenizers would be uncased. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Using offset mapping relative to the original string + unittest. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: save_vocabulary requires folder and file name Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Simplify import for Bert. 
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: truncate_and_pad disables padding according to the same heuristic than the one enabling padding. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Remove private member access in tokenize() Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Bump tokenizers dependency to 0.4.2 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * format & quality. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Use named arguments when applicable. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Add Github link to Roberta/GPT2 space issue on masked input. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Move max_len_single_sentence / max_len_sentences_pair to PreTrainedTokenizerFast + tests. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Relax type checking to include tuple and list object. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing review comment: Document the truncate_and_pad manager behavior. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Raise an exception if return_offsets_mapping is not available with the current tokenizer. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Ensure padding is set on the tokenizers before setting any padding strategy + unittest. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * On pytorch we need to stack tensor to get proper new axis. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Generalize tests to different framework removing hard written return_tensors="..." 
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bump tokenizer dependency for num_special_tokens_to_add Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Overflowing tokens in batch_encode_plus are now stacked over the batch axis. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Improved error message for padding strategy without pad token. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Bumping tokenizers dependency to 0.5.0 for release. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Optimizing convert_encoding around 4x improvement. 🚀 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * expose pad_to_max_length in encode_plus to avoid duplicating the parameters in kwargs Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Generate a proper overflow_to_sampling_mapping when return_overflowing_tokens is True. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix unittests for overflow_to_sampling_mapping not being returned as tensor. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Format & quality. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Remove perfect alignment constraint for Roberta (allowing 1% difference max) Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Triggering final CI Co-authored-by: MOI Anthony <xn1t0x@gmail.com>	2020-02-19 11:35:40 -05:00
Sam Shleifer	20fc18fbda	Skip flaky test_tf_question_answering (#2845 ) * Skip flaky test * Style	2020-02-18 16:14:50 -05:00
Joe Davison	f1e8a51f08	Preserve spaces in GPT-2 tokenizers (#2778 ) * Preserve spaces in GPT-2 tokenizers Preserves spaces after special tokens in GPT-2 and inherited (RoBERTa) tokenizers, enabling correct BPE encoding. Automatically inserts a space in front of first token in encode function when adding special tokens. * Add tokenization preprocessing method * Add framework argument to pipeline factory Also fixes pipeline test issue. Each test input now treated as a distinct sequence.	2020-02-13 13:29:43 -05:00
Sam Shleifer	ef74b0f07a	get_activation('relu') provides a simple mapping from strings i… (#2807 ) * activations.py contains a mapping from string to activation function * resolves some `gelu` vs `gelu_new` ambiguity	2020-02-13 08:28:33 -05:00
Oleksiy Syvokon	ee5de0ba44	BERT decoder: Fix causal mask dtype. PyTorch < 1.3 requires multiplication operands to be of the same type. This was violated when using default attention mask (i.e., attention_mask=None in arguments) given BERT in the decoder mode. In particular, this was breaking Model2Model and made tutorial from the quickstart failing.	2020-02-11 15:19:22 -05:00
VictorSanh	d8b43600fd	omission	2020-02-07 15:28:13 -05:00
VictorSanh	ee5a6856ca	distilbert-base-cased weights + Readmes + omissions	2020-02-07 15:28:13 -05:00
Lysandre	2184f87003	RoBERTa TensorFlow Tests	2020-02-04 18:05:35 -05:00
Lysandre	e615269cb8	Correct slow test	2020-02-04 18:05:35 -05:00
Lysandre	5f96ebc0be	Style	2020-02-04 18:05:35 -05:00
Lysandre	950c6a4f09	Flaubert PyTorch tests	2020-02-04 18:05:35 -05:00
Lysandre	d28b81dc29	RoBERTa Pytorch tests	2020-02-04 18:05:35 -05:00
sshleifer	9e5b549b4d	fix default getattr	2020-02-04 16:38:52 -05:00
sshleifer	25848a6094	double quotes	2020-02-04 16:38:52 -05:00
sshleifer	cbcb83f21d	minor cleanup of test_attention_outputs	2020-02-04 16:38:52 -05:00
Lysandre	1e82cd8457	Flaubert auto tokenizer + tests cc @julien-c	2020-01-31 14:16:52 -05:00
Julien Chaumond	9fa836a73f	fill_mask helper (#2576 ) * fill_mask helper * [poc] FillMaskPipeline * Revert "[poc] FillMaskPipeline" This reverts commit `67eeea55b0`. * Revert "fill_mask helper" This reverts commit `cacc17b884`. * README: clarify that Pipelines can also do text-classification cf. question at the AI&ML meetup last week, @mfuntowicz * Fix test: test feature-extraction pipeline * Test tweaks * Slight refactor of existing pipeline (in preparation of new FillMaskPipeline) * Extraneous doc * More robust way of doing this @mfuntowicz as we don't rely on the model name anymore (see AutoConfig) * Also add RobertaConfig as a quickfix for wrong token_type_ids * cs * [BIG] FillMaskPipeline	2020-01-30 18:15:42 -05:00
Lysandre	df27648bd9	Rename test_examples to test_doc_samples	2020-01-30 10:07:22 -05:00
Lysandre	e63a81dd25	Style	2020-01-29 16:29:20 -05:00
Lysandre	217349016a	Copy object instead of passing the reference	2020-01-29 16:15:39 -05:00
Lysandre	ea2600bd5f	Absolute definitive HeisenDistilBug solve cc @julien-c @thomwolf	2020-01-27 21:58:36 -05:00
thomwolf	0e31e06a75	Add AutoModelForPreTraining	2020-01-27 14:27:07 -05:00
Lysandre	875c4ae48f	Definitive HeisenDistilBug fix cc @julien-c @thomwolf	2020-01-27 12:09:58 -05:00
Lysandre	24d5ad1dcc	Run the examples in slow	2020-01-23 09:38:45 -05:00
Lysandre	f81b6c95f2	Flake8 violation	2020-01-23 09:38:45 -05:00
Lysandre	632675ea88	Can test examples spread over multiple blocks	2020-01-23 09:38:45 -05:00
Lysandre	eaa6b9afc6	Require Torch when testing examples	2020-01-23 09:38:45 -05:00
Lysandre	64abd3e0aa	Multi-line examples can be tested + ALBERT patch for CircleCI All tests should now work fine.	2020-01-23 09:38:45 -05:00
Lysandre	837577256b	Automatic testing of examples The CircleCI test should fail.	2020-01-23 09:38:45 -05:00
Mark Neumann	65a89a8976	Fix BasicTokenizer to respect `never_split` parameters (#2557 ) * add failing test * fix call to _run_split_on_punc * format with black	2020-01-17 14:57:56 -05:00
Julien Chaumond	23a2cea8cb	Tokenizer.from_pretrained: fetch all possible files remotely	2020-01-16 16:47:19 -05:00
Julien Chaumond	9d8fd2d40e	tokenizer.save_pretrained: only save file if non-empty	2020-01-16 16:47:19 -05:00
Thomas Wolf	dc17f2a111	Merge pull request #2538 from huggingface/py3_super 💄 super	2020-01-16 13:17:15 +01:00
Julien Chaumond	d9fa1bad72	Fix failing torchscript test for xlnet model.parameters() order is apparently not stable (only for xlnet, for some reason)	2020-01-15 20:22:21 -05:00
Julien Chaumond	83a41d39b3	💄 super	2020-01-15 18:33:50 -05:00
Julien Chaumond	eb59e9f705	Graduate sst-2 to a canonical one	2020-01-15 16:28:50 +00:00
Julien Chaumond	e184ad13cf	Close #2392	2020-01-15 15:43:44 +00:00
Julien Chaumond	715fa638a7	Merge branch 'master' into from_scratch_training	2020-01-14 18:58:21 +00:00
Lysandre	100e3b6f21	Bias should be resized with the weights Created a link between the linear layer bias and the model attribute bias. This does not change anything for the user nor for the conversion scripts, but allows the `resize_token_embeddings` method to resize the bias as well as the weights of the decoder. Added a test.	2020-01-14 13:43:45 -05:00
Julien Chaumond	764f836d52	Update test_tokenization_auto.py	2020-01-13 22:50:34 -05:00
Julien Chaumond	d5831acb07	Update test_tokenization_auto.py	2020-01-13 22:47:33 -05:00
Julien Chaumond	ed6cd597cc	Update test_tokenization_auto.py	2020-01-13 22:46:35 -05:00
Julien Chaumond	5cb463a714	Update test_tokenization_auto.py	2020-01-13 22:38:29 -05:00
Julien Chaumond	0304628590	Map configs to models and tokenizers	2020-01-13 23:11:44 +00:00
Julien Chaumond	1fc855e456	[tests] Safety checks on CONFIG_MAPPING	2020-01-13 21:52:55 +00:00
Julien Chaumond	cf8a70bf68	More AutoConfig tests	2020-01-11 03:43:57 +00:00
Julien Chaumond	c6f682c1eb	flake	2020-01-11 03:18:31 +00:00
Julien Chaumond	4d1c98c012	AutoConfig + other Auto classes honor model_type	2020-01-11 02:46:17 +00:00
Julien Chaumond	2f32dfd33b	Convention: name mixins mixins	2020-01-11 01:24:29 +00:00
Julien Chaumond	055e80cfad	rm old ConfigTester	2020-01-10 21:36:18 +00:00
Julien Chaumond	84c0aa1868	num_parameters helper	2020-01-10 17:40:02 +00:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	0ffc8eaf53	Enforce target version for black. This should stabilize formatting.	2020-01-05 12:52:14 -05:00
Julien Chaumond	594ca6dead	[debug] Debug Heisenbug, the old school way.	2019-12-29 10:07:21 -05:00
Julien Chaumond	f78ebc22ad	[cli] Add ability to delete remote object	2019-12-27 22:53:49 -05:00
Thomas Wolf	9f5f646442	Merge pull request #2211 from huggingface/fast-tokenizers Fast tokenizers	2019-12-27 10:24:29 +01:00
Anthony MOI	2818e50569	Add tests for fast tokenizers	2019-12-24 13:29:01 -05:00
Aymeric Augustin	e6c0019c80	Remove unused variables in tests.	2019-12-23 22:38:18 +01:00
Aymeric Augustin	1c62e87b34	Use built-in open(). On Python 3, `open is io.open`.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	798b3b3899	Remove sys.version_info[0] == 2 or 3.	2019-12-22 18:38:42 +01:00
Aymeric Augustin	8af25b1664	Remove six.	2019-12-22 17:56:09 +01:00
Aymeric Augustin	c824d15aa1	Remove __future__ imports.	2019-12-22 17:47:54 +01:00
Aymeric Augustin	00204f2b4c	Replace CommonTestCases for tokenizers with a mixin. This is the same change as for (TF)CommonTestCases for modeling.	2019-12-22 15:35:25 +01:00
Aymeric Augustin	a3c5883f2c	Rename file for consistency.	2019-12-22 15:35:25 +01:00
Aymeric Augustin	daf8bebcdd	Remove unused GPTModelTester. It isn't imported anywhere.	2019-12-22 15:35:25 +01:00
Aymeric Augustin	345c23a60f	Replace (TF)CommonTestCases for modeling with a mixin. I suspect the wrapper classes were created in order to prevent the abstract base class (TF)CommonModelTester from being included in test discovery and running, because that would fail. I solved this by replacing the abstract base class with a mixin. Code changes are just de-indenting and automatic reformattings performed by black to use the extra line space.	2019-12-22 15:35:18 +01:00
Aymeric Augustin	7e98e211f0	Remove unittest.main() in test modules. This construct isn't used anymore these days. Running python tests/test_foo.py puts the tests/ directory on PYTHONPATH, which isn't representative of how we run tests. Use python -m unittest tests/test_foo.py instead.	2019-12-22 14:42:03 +01:00
Aymeric Augustin	ced0a94204	Switch test files to the standard test_*.py scheme.	2019-12-22 14:15:13 +01:00
Aymeric Augustin	067395d5c5	Move tests outside of library.	2019-12-22 13:47:17 +01:00
thomwolf	1484d67de9	[LARGE] updating all tests and API	2019-07-02 12:13:17 +02:00
thomwolf	4f8b5f687c	add fix for serialization of tokenizer	2019-06-29 23:35:21 +02:00
thomwolf	d9184620f9	fix tests and new API	2019-06-29 23:10:40 +02:00
thomwolf	7e3070ae4f	add from_pretrained method to all configuration classes	2019-06-26 11:12:00 +02:00
thomwolf	93e9971c54	fix tests	2019-06-26 10:02:45 +02:00
thomwolf	62d78aa37e	updating GLUE utils for compatibility with XLNet	2019-06-24 14:36:11 +02:00
thomwolf	c946bb51a6	fix xlnet tokenizer and python2	2019-06-22 22:28:49 +02:00
thomwolf	ebd2cb8d74	update from_pretrained to load XLNetModel as well	2019-06-21 21:08:44 +02:00
thomwolf	483cbc36a9	test deviation with tf model: max ~1e-3 should be ok	2019-06-21 16:38:01 +02:00
thomwolf	24d8068982	weights loading script ok	2019-06-21 12:33:44 +02:00
thomwolf	32da75486b	add tokenizer and tests	2019-06-21 11:09:51 +02:00
thomwolf	45709d7532	model running with simple inputs	2019-06-21 00:28:42 +02:00
thomwolf	34d706a0e1	pruning in bertology	2019-06-19 15:25:49 +02:00
thomwolf	33d3db5c43	updating head masking, readme and docstrings	2019-06-17 15:51:28 +02:00
thomwolf	965f172de6	output all hidden layers states in GPT/GPT-2	2019-06-17 14:34:12 +02:00
thomwolf	f12007e421	add head masking and pruning to openai GPT	2019-06-17 14:19:40 +02:00
thomwolf	b860e47cf5	add head masking and pruning to gpt-2	2019-06-17 14:12:10 +02:00
thomwolf	7220d47a1c	adding head pruning and tests	2019-06-17 13:20:45 +02:00
thomwolf	96c4d3d988	add head masking tests	2019-06-17 12:17:26 +02:00
thomwolf	5e1207b8ad	add attention to all bert models and add test	2019-06-14 16:28:25 +02:00
thomwolf	bcc9e93e6f	fix test	2019-06-14 15:38:20 +02:00
thomwolf	a3274ac40b	adding attention outputs in bert	2019-06-03 16:11:45 -05:00
thomwolf	c30139a013	add special tokens to gpt-2	2019-04-30 10:45:26 +02:00
lukovnikov	56a47ce2b7	- replaced OpenAIGPTAdam with OpenAIAdam in docs	2019-04-25 16:05:28 +02:00
lukovnikov	704037ad51	- updated docs for new LR API - added some images for illustration - updated comments in optimization	2019-04-25 15:59:39 +02:00
lukovnikov	bb7557d3ab	- removed __all__ in optimization - removed unused plotting code - using ABC for LRSchedule - added some schedule object init tests	2019-04-21 13:48:33 +02:00
lukovnikov	34ccc8ebf4	Merge remote-tracking branch 'upstream/master'	2019-04-21 13:16:15 +02:00
thomwolf	34ae5bf838	small clean up in tests	2019-04-17 14:52:12 +02:00
thomwolf	265550ec34	relax network connection requirements	2019-04-17 14:22:35 +02:00
thomwolf	31d387604c	adding s3 model tests with --runslow	2019-04-17 11:58:27 +02:00
thomwolf	bc70779bf0	fixed GPT-2 tokenization on python 2	2019-04-17 10:56:15 +02:00
thomwolf	18a8a15f78	improving GPT2 tokenization and adding tests	2019-04-16 17:00:55 +02:00
thomwolf	9761aa4845	add to_json_file method to configuration classes	2019-04-15 14:12:08 +02:00
thomwolf	e8568a3b17	fixing tests	2019-04-15 12:55:38 +02:00
thomwolf	870b734bfd	added tokenizers serialization tests	2019-04-15 12:03:56 +02:00
lukovnikov	20686b78fc	schedule fix	2019-04-03 18:13:52 +02:00
lukovnikov	1b4ce76c38	schedule fix	2019-04-03 17:40:12 +02:00
lukovnikov	23bd2eebf5	schedule fix	2019-04-03 17:10:34 +02:00
lukovnikov	91a073f804	schedule fix	2019-04-03 17:10:08 +02:00
lukovnikov	b64cc63a77	optimization schedule test update	2019-04-03 16:42:40 +02:00
lukovnikov	d164867d90	- updated docs for optimization	2019-04-03 16:13:51 +02:00
lukovnikov	262a9992d7	class weights	2019-03-18 18:29:12 +01:00
thomwolf	2dd8f524f5	removing test for long sequences error following #337	2019-03-06 10:10:41 +01:00
thomwolf	009ee86a19	fix tests - bump up version	2019-02-17 23:57:23 +01:00
thomwolf	ffd623823d	adding gpt2	2019-02-17 23:38:51 +01:00
thomwolf	884ca81d87	transposing the inputs of Transformer-XL to have a unified interface	2019-02-11 13:19:59 +01:00
thomwolf	0a9860daa7	tests pass on python 2 and 3	2019-02-11 10:47:52 +01:00
thomwolf	2071a9b86e	fix python 2.7 imports	2019-02-11 10:35:36 +01:00
thomwolf	b514a60c36	added tests for OpenAI GPT and Transformer-XL tokenizers	2019-02-11 10:17:16 +01:00
thomwolf	9bdcba53fd	fix tests	2019-02-09 17:07:12 +01:00
thomwolf	1320e4ec0c	mc_token_mask => mc_token_ids	2019-02-09 16:58:53 +01:00
thomwolf	2df41663f1	added test	2019-02-07 17:05:49 +01:00
thomwolf	ba9e4eb354	fix unicode in tokenization tests	2019-02-06 00:28:00 +01:00
thomwolf	448937c00d	python 2 compatibility	2019-02-06 00:07:46 +01:00
thomwolf	98c96fb1a7	splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests	2019-01-29 10:31:42 +01:00
thomwolf	a45a9cc0e1	update tests	2019-01-28 17:16:02 +01:00
thomwolf	dc5df92fa8	added LM head for OpenAI	2019-01-08 17:18:47 +01:00
thomwolf	3cf12b235a	added tests + fixed losses	2019-01-08 16:24:23 +01:00
Patrick Lewis	78cf7b4ab4	added code to raise value error for bert tokenizer for covert_tokens_to_indices	2018-12-18 14:41:30 +00:00
thomwolf	0f544625f4	fix swag example for work with apex	2018-12-13 13:35:59 +01:00
thomwolf	52c53f39d0	clean up apex integration	2018-12-13 13:02:17 +01:00
thomwolf	85fff78c2d	compatibility PT 1.0 and 0.4.1	2018-12-13 12:48:13 +01:00
Deyu Fu	c8ea286048	change to apex for better fp16 and multi-gpu support	2018-12-11 17:13:58 -08:00
thomwolf	7f7c41b0c1	tests for all model classes with and without labels	2018-11-30 22:54:33 +01:00
thomwolf	757750d6f6	fix tests	2018-11-17 11:58:14 +01:00
thomwolf	1de35b624b	preparing for first release	2018-11-15 20:56:10 +01:00
Yaser Martinez Palenzuela	4d124baf8f	Add test for Chinese tokenization	2018-11-05 23:04:29 +01:00
thomwolf	3d291dea4a	clean up tests	2018-11-04 21:27:19 +01:00
thomwolf	87da161c2a	finishing model test	2018-11-04 21:27:10 +01:00
thomwolf	f8276008df	update readme, file names, removing TF code, moving tests	2018-11-03 23:35:14 +01:00

5042 Commits