transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 09:12:21 +06:00

Author	SHA1	Message	Date
Suraj Patil	e93ccb3290	BartForQuestionAnswering (#4908 )	2020-06-12 15:47:57 -04:00
Sylvain Gugger	538531cde5	Add AlbertForMultipleChoice (#4959 ) * Add AlbertForMultipleChoice * Make up to date and add all models to common tests	2020-06-12 14:20:19 -04:00
Patrick von Platen	86578bb04c	[AutoModel] Split AutoModelWithLMHead into clm, mlm, encoder-decoder (#4933 ) * first commit * add new auto models * better naming * fix bert automodel * fix automodel for pretraining * add models to init * fix name typo * fix typo * better naming * future warning instead of depreciation warning	2020-06-12 10:01:49 +02:00
Sam Shleifer	5620033115	[mbart] Fix fp16 testing logic (#4949 )	2020-06-11 22:11:34 -04:00
Sam Shleifer	08b59d10e5	MBartTokenizer:add language codes (#3776 )	2020-06-11 13:02:33 -04:00
Sylvain Gugger	20451195f0	Support multiple choice in tf common model tests (#4920 ) * Support multiple choice in tf common model tests * Add the input_embeds test	2020-06-11 10:31:26 -04:00
RafaelWO	e80d6c689b	Fix resize_token_embeddings for Transformer-XL (#4759 ) * Fixed resize_token_embeddings for transfo_xl model * Fixed resize_token_embeddings for transfo_xl. Added custom methods to TransfoXLPreTrainedModel for resizing layers of the AdaptiveEmbedding. * Updated docstring * Fixed resizinhg cutoffs; added check for new size of embedding layer. * Added test for resize_token_embeddings * Fixed code quality * Fixed unchanged cutoffs in model.config Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>	2020-06-10 19:03:06 -04:00
Sylvain Gugger	d541938c48	Make multiple choice models work with input_embeds (#4921 )	2020-06-10 18:38:34 -04:00
Sylvain Gugger	1e2631d6f8	Split LMBert model in two (#4874 ) * Split LMBert model in two * Fix example * Remove lm_labels * Adapt tests, refactor prepare_for_generation * Fix merge * Hide BeartLMHeadModel	2020-06-10 18:26:42 -04:00
Suraj Patil	ef2dcdccaa	ElectraForQuestionAnswering (#4913 ) * ElectraForQuestionAnswering * udate __init__ * add test for electra qa model * add ElectraForQuestionAnswering in auto models * add ElectraForQuestionAnswering in all_model_classes * fix outputs, input_ids defaults to None * add ElectraForQuestionAnswering in docs * remove commented line	2020-06-10 15:17:52 -04:00
Amil Khare	5d63ca6c38	[ctrl] fix pruning of MultiHeadAttention (#4904 )	2020-06-10 14:06:55 -04:00
Sylvain Gugger	4e10acb3e5	Add more models to common tests (#4910 )	2020-06-10 13:19:53 -04:00
Sylvain Gugger	ac99217e92	Fix the CI (#4903 ) * Fix CI	2020-06-10 09:26:06 -04:00
Sylvain Gugger	0a375f5abd	Deal with multiple choice in common tests (#4886 ) * Deal with multiple choice in common tests	2020-06-10 08:10:20 -04:00
Bharat Raghunathan	6e603cb789	[All models] Extend config.output_attentions with output_attentions function arguments (#4538 ) * DOC: Replace instances of ``config.output_attentions`` with function argument ``output_attentions`` * DOC: Apply Black Formatting * Fix errors where output_attentions was undefined * Remove output_attentions in classes per review * Fix regressions on tests having `output_attention` * Fix further regressions in tests relating to `output_attentions` Ensure proper propagation of `output_attentions` as a function parameter to all model subclasses * Fix more regressions in `test_output_attentions` * Fix issues with BertEncoder * Rename related variables to `output_attentions` * fix pytorch tests * fix bert and gpt2 tf * Fix most TF tests for `test_output_attentions` * Fix linter errors and more TF tests * fix conflicts * DOC: Apply Black Formatting * Fix errors where output_attentions was undefined * Remove output_attentions in classes per review * Fix regressions on tests having `output_attention` * fix conflicts * fix conflicts * fix conflicts * fix conflicts * fix pytorch tests * fix conflicts * fix conflicts * Fix linter errors and more TF tests * fix tf tests * make style * fix isort * improve output_attentions * improve tensorflow Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-06-09 23:39:06 +02:00
Patrick von Platen	2cfb947f59	[Benchmark] add tpu and torchscipt for benchmark (#4850 ) * add tpu and torchscipt for benchmark * fix name in tests * "fix email" * make style * better log message for tpu * add more print and info for tpu * allow possibility to print tpu metrics * correct cpu usage * fix test for non-install * remove bugus file * include psutil in testing * run a couple of times before tracing in torchscript * do not allow tpu memory tracing for now * make style * add torchscript to env * better name for torch tpu Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2020-06-09 23:12:43 +02:00
Patrick von Platen	c0554776de	fix PR (#4810 )	2020-06-08 15:31:12 +02:00
Sam Shleifer	c58e6c129a	[marian tests ] pass device to pipeline (#4815 )	2020-06-06 00:52:17 -04:00
Sam Shleifer	4ab7424597	[cleanup/marian] pipelines test and new kwarg (#4812 )	2020-06-05 18:45:19 -04:00
Patrick von Platen	8cca875569	[EncoderDecoderConfig] automatically set decoder config to decoder (#4809 ) * automatically set decoder config to decoder * add more tests	2020-06-05 23:16:37 +02:00
Sylvain Gugger	f1fe18465d	Use labels to remove deprecation warnings (#4807 )	2020-06-05 16:41:46 -04:00
Sylvain Gugger	4dd5cf2207	Fix argument label (#4792 ) * Fix argument label * Fix test	2020-06-05 15:20:29 -04:00
Julien Plu	f9414f7553	Tensorflow improvements (#4530 ) * Better None gradients handling * Apply Style * Apply Style * Create a loss class per task to compute its respective loss * Add loss classes to the ALBERT TF models * Add loss classes to the BERT TF models * Add question answering and multiple choice to TF Camembert * Remove prints * Add multiple choice model to TF DistilBERT + loss computation * Add question answering model to TF Electra + loss computation * Add token classification, question answering and multiple choice models to TF Flaubert * Add multiple choice model to TF Roberta + loss computation * Add multiple choice model to TF XLM + loss computation * Add multiple choice and question answering models to TF XLM-Roberta * Add multiple choice model to TF XLNet + loss computation * Remove unused parameters * Add task loss classes * Reorder TF imports + add new model classes * Add new model classes * Bugfix in TF T5 model * Bugfix for TF T5 tests * Bugfix in TF T5 model * Fix TF T5 model tests * Fix T5 tests + some renaming * Fix inheritance issue in the AutoX tests * Add tests for TF Flaubert and TF XLM Roberta * Add tests for TF Flaubert and TF XLM Roberta * Remove unused piece of code in the TF trainer * bugfix and remove unused code * Bugfix for TF 2.2 * Apply Style * Divide TFSequenceClassificationAndMultipleChoiceLoss into their two respective name * Apply style * Mirror the PT Trainer in the TF one: fp16, optimizers and tb_writer as class parameter and better dataset handling * Fix TF optimizations tests and apply style * Remove useless parameter * Bugfix and apply style * Fix TF Trainer prediction * Now the TF models return the loss such as their PyTorch couterparts * Apply Style * Ignore some tests output * Take into account the SQuAD cls_index, p_mask and is_impossible parameters for the QuestionAnswering task models. * Fix names for SQuAD data * Apply Style * Fix conflicts with 2.11 release * Fix conflicts with 2.11 * Fix wrongname * Add better documentation on the new create_optimizer function * Fix isort * logging_dir: use same default as PyTorch Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-04 19:45:53 -04:00
Funtowicz Morgan	5bf9afbf35	Introduce a new tensor type for return_tensors on tokenizer for NumPy (#4585 ) * Refactor tensor creation in tokenizers. * Make sure to convert string to TensorType * Refactor convert_to_tensors_ * Introduce numpy tensor creation * Format * Add unittest for TensorType creation from str * sorting imports * Added unittests for numpy tensor conversion. * Do not use in-place version for squeeze as numpy doesn't provide such feature. * Added extra parameter prepend_batch_axis: bool on prepare_for_model. * Ensure test_np_encode_plus_sent_to_model is not executed if encoder/decoder model. * style. * numpy tests require_torch for now while flax not merged. * Hopefully will make flake8 happy. * One more time 🎶	2020-06-04 06:57:01 +02:00
Sylvain Gugger	1b5820a565	Unify label args (#4722 ) * Deprecate masked_lm_labels argument * Apply to all models * Better error message	2020-06-03 09:36:26 -04:00
Patrick von Platen	9ca485734a	[Reformer] Improved memory if input is shorter than chunk length (#4720 ) * improve handling of short inputs for reformer * correct typo in assert statement * fix other tests	2020-06-02 23:08:39 +02:00
Sam Shleifer	70f7423436	TFRobertaModelIntegrationTest requires tf (#4726 )	2020-06-02 12:59:00 -04:00
Julien Chaumond	b42586ea56	Fix CI after killing archive maps (#4724 ) * 🐛 Fix model ids for BART and Flaubert	2020-06-02 10:21:09 -04:00
Julien Chaumond	d4c2cb402d	Kill model archive maps (#4636 ) * Kill model archive maps * Fixup * Also kill model_archive_map for MaskedBertPreTrainedModel * Unhook config_archive_map * Tokenizers: align with model id changes * make style && make quality * Fix CI	2020-06-02 09:39:33 -04:00
Rens	ec62b7d953	Fix onnx export input names order (#4641 ) * pass on tokenizer to pipeline * order input names when convert to onnx * update style * remove unused imports * make ordered inputs list needs to be mutable * add test custom bert model * remove unused imports	2020-06-01 16:12:48 +02:00
Patrick von Platen	0866669e75	[EncoderDecoder] Fix initialization and save/load bug (#4680 ) * fix bug * add more tests	2020-05-30 01:25:19 +02:00
Patrick von Platen	56ee2560be	[Longformer] Better handling of global attention mask vs local attention mask (#4672 ) * better api * improve automatic setting of global attention mask * fix longformer bug * fix global attention mask in test * fix global attn mask flatten * fix slow tests * update docstring * update docs and make more robust * improve attention mask	2020-05-29 17:58:42 +02:00
Patrick von Platen	9c17256447	[Longformer] Multiple choice for longformer (#4645 ) * add multiple choice for longformer * add models to docs * adapt docstring * add test to longformer * add longformer for mc in init and modeling auto * fix tests	2020-05-29 13:46:08 +02:00
Anthony MOI	5e737018e1	Fix add_special_tokens on fast tokenizers (#4531 )	2020-05-28 10:54:45 -04:00
Suraj Patil	e444648a30	LongformerForTokenClassification (#4638 )	2020-05-28 12:48:18 +02:00
Patrick von Platen	96f57c9ccb	[Benchmark] Memory benchmark utils (#4198 ) * improve memory benchmarking * correct typo * fix current memory * check torch memory allocated * better pytorch function * add total cached gpu memory * add total gpu required * improve torch gpu usage * update memory usage * finalize memory tracing * save intermediate benchmark class * fix conflict * improve benchmark * improve benchmark * finalize * make style * improve benchmarking * correct typo * make train function more flexible * fix csv save * better repr of bytes * better print * fix __repr__ bug * finish plot script * rename plot file * delete csv and small improvements * fix in plot * fix in plot * correct usage of timeit * remove redundant line * remove redundant line * fix bug * add hf parser tests * add versioning and platform info * make style * add gpu information * ensure backward compatibility * finish adding all tests * Update src/transformers/benchmark/benchmark_args.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/benchmark/benchmark_args_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * delete csv files * fix isort ordering * add out of memory handling * add better train memory handling Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-05-27 23:22:16 +02:00
Suraj Patil	ec4cdfdd05	LongformerForSequenceClassification (#4580 ) * LongformerForSequenceClassification * better naming x=>hidden_states, fix typo in doc * Update src/transformers/modeling_longformer.py * Update src/transformers/modeling_longformer.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-05-27 22:30:00 +02:00
Sam Shleifer	07797c4da4	[testing] LanguageModelGenerationTests require_tf or require_torch (#4616 )	2020-05-27 09:10:26 -04:00
Sam Shleifer	b86e42e0ac	[ci] fix 3 remaining slow GPU failures (#4584 )	2020-05-25 19:20:50 -04:00
Suraj Patil	03d8527de0	Longformer for question answering (#4500 ) * added LongformerForQuestionAnswering * add LongformerForQuestionAnswering * fix import for LongformerForMaskedLM * add LongformerForQuestionAnswering * hardcoded sep_token_id * compute attention_mask if not provided * combine global_attention_mask with attention_mask when provided * update example in docstring * add assert error messages, better attention combine * add test for longformerForQuestionAnswering * typo * cast gloabl_attention_mask to long * make style * Update src/transformers/configuration_longformer.py * Update src/transformers/configuration_longformer.py * fix the code quality * Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-05-25 18:43:36 +02:00
Anthony MOI	35df911485	Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503 )	2020-05-22 12:45:10 -04:00
Frankie Liuzzi	bd6e301832	added functionality for electra classification head (#4257 ) * added functionality for electra classification head * unneeded dropout * Test ELECTRA for sequence classification * Style Co-authored-by: Frankie <frankie@frase.io> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-05-22 09:48:21 -04:00
Zhangyx	49296533ca	Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463 ) * Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website. * Use Split enum + always output the label name Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-21 09:17:44 -04:00
Julien Chaumond	865d4d595e	[ci] Close #4481	2020-05-20 18:27:42 -04:00
Julien Chaumond	a3af8e86cb	Update test_trainer_distributed.py	2020-05-20 18:26:51 -04:00
Lysandre Debut	14cb5b35fa	Fix slow gpu tests lysandre (#4487 ) * There is one missing key in BERT * Correct device for CamemBERT model * RoBERTa tokenization adding prefix space * Style	2020-05-20 11:59:45 -04:00
Sam Shleifer	efbc1c5a9d	[MarianTokenizer] implement save_vocabulary and other common methods (#4389 )	2020-05-19 19:45:49 -04:00
Sam Shleifer	956c4c4eb4	[gpu slow tests] fix mbart-large-enro gpu tests (#4472 )	2020-05-19 19:45:31 -04:00
Patrick von Platen	aa925a52fa	[Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468 ) * fix gpu slow tests in pytorch * change model to device syntax	2020-05-19 21:35:04 +02:00
Sam Shleifer	07dd7c2fd8	[cleanup] test_tokenization_common.py (#4390 )	2020-05-19 10:46:55 -04:00


351 Commits