* DOC: Replace instances of ``config.output_attentions`` with function argument ``output_attentions``
* DOC: Apply Black Formatting
* Fix errors where output_attentions was undefined
* Remove output_attentions in classes per review
* Fix regressions in tests using `output_attentions`
* Fix further regressions in tests relating to `output_attentions`
Ensure proper propagation of `output_attentions` as a function parameter to all model subclasses
* Fix more regressions in `test_output_attentions`
* Fix issues with BertEncoder
* Rename related variables to `output_attentions`
* Fix PyTorch tests
* Fix TF BERT and GPT-2
* Fix most TF tests for `test_output_attentions`
* Fix linter errors and more TF tests
* Fix conflicts
* Fix TF tests
* make style
* Fix isort
* Improve `output_attentions`
* Improve TensorFlow
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
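A minimal sketch of the change described above (model and checkpoint chosen purely for illustration): attention weights can now be requested per call via the `output_attentions` argument instead of `config.output_attentions`.

```python
# Minimal sketch: output_attentions is now a forward argument, so attention
# weights can be requested for a single call rather than via the config flag.
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

input_ids = tokenizer.encode("Hello world", return_tensors="pt")

with torch.no_grad():
    outputs = model(input_ids, output_attentions=True)

# With tuple outputs, the attentions are the last element: one tensor per
# layer of shape (batch, num_heads, seq_len, seq_len).
attentions = outputs[-1]
print(len(attentions), attentions[0].shape)
```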
* Add TPU and TorchScript for benchmark
* Fix name in tests
* Fix email
* make style
* Better log message for TPU
* Add more prints and info for TPU
* Allow printing TPU metrics
* Correct CPU usage
* Fix test for non-install
* Remove bogus file
* Include psutil in testing
* Run a couple of times before tracing in TorchScript
* Do not allow TPU memory tracing for now
* make style
* Add TorchScript to env
* Better name for torch TPU
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
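The benchmark commits above mention running the model a couple of times before TorchScript tracing; here is a minimal sketch of that pattern (the model and input are illustrative assumptions, not the benchmark script itself).

```python
# Minimal sketch: run a few warm-up forward passes before torch.jit.trace so
# one-time initialization does not distort the traced graph or the timings.
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
# torchscript=True makes the model return tuples suitable for tracing.
model = BertModel.from_pretrained("bert-base-uncased", torchscript=True)
model.eval()

input_ids = tokenizer.encode("Hello world", return_tensors="pt")

with torch.no_grad():
    for _ in range(3):  # warm-up passes before tracing
        model(input_ids)
    traced_model = torch.jit.trace(model, (input_ids,))
    traced_model(input_ids)
```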
* Better None gradients handling
* Apply Style
* Apply Style
* Create a loss class per task to compute its respective loss
* Add loss classes to the ALBERT TF models
* Add loss classes to the BERT TF models
* Add question answering and multiple choice to TF Camembert
* Remove prints
* Add multiple choice model to TF DistilBERT + loss computation
* Add question answering model to TF Electra + loss computation
* Add token classification, question answering and multiple choice models to TF Flaubert
* Add multiple choice model to TF Roberta + loss computation
* Add multiple choice model to TF XLM + loss computation
* Add multiple choice and question answering models to TF XLM-Roberta
* Add multiple choice model to TF XLNet + loss computation
* Remove unused parameters
* Add task loss classes
* Reorder TF imports + add new model classes
* Add new model classes
* Bugfix in TF T5 model
* Bugfix for TF T5 tests
* Bugfix in TF T5 model
* Fix TF T5 model tests
* Fix T5 tests + some renaming
* Fix inheritance issue in the AutoX tests
* Add tests for TF Flaubert and TF XLM Roberta
* Add tests for TF Flaubert and TF XLM Roberta
* Remove unused piece of code in the TF trainer
* bugfix and remove unused code
* Bugfix for TF 2.2
* Apply Style
* Divide TFSequenceClassificationAndMultipleChoiceLoss into its two respective classes
* Apply style
* Mirror the PT Trainer in the TF one: fp16, optimizers and tb_writer as class parameter and better dataset handling
* Fix TF optimizations tests and apply style
* Remove useless parameter
* Bugfix and apply style
* Fix TF Trainer prediction
* Now the TF models return the loss, like their PyTorch counterparts
* Apply Style
* Ignore some tests output
* Take into account the SQuAD `cls_index`, `p_mask` and `is_impossible` parameters for the QuestionAnswering task models.
* Fix names for SQuAD data
* Apply Style
* Fix conflicts with 2.11 release
* Fix conflicts with 2.11
* Fix wrong name
* Add better documentation on the new create_optimizer function
* Fix isort
* logging_dir: use same default as PyTorch
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
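A minimal sketch of the loss behavior described in the commits above ("the TF models return the loss, like their PyTorch counterparts"), assuming a sequence classification head and tuple outputs.

```python
# Minimal sketch: when labels are passed, the TF task models compute the task
# loss and return it first, mirroring the PyTorch models.
import tensorflow as tf
from transformers import BertTokenizer, TFBertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = TFBertForSequenceClassification.from_pretrained("bert-base-uncased")

inputs = tokenizer.encode_plus("A short example", return_tensors="tf")
labels = tf.constant([1])

outputs = model(inputs["input_ids"], labels=labels)
loss, logits = outputs[:2]  # loss comes first when labels are provided
```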
* ner: add preprocessing script for examples that splits longer sentences
* ner: example shell scripts use local preprocessing now
* ner: add new example section for WNUT’17 NER task. Remove old English CoNLL-03 results
* ner: satisfy black and isort
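The NER preprocessing script mentioned above splits sentences that would exceed the model's maximum subtoken length; the sketch below illustrates the idea (the command-line arguments and CoNLL-style one-token-per-line input format are assumptions, not the exact script).

```python
# Minimal sketch: insert a sentence break whenever the running subtoken count
# would exceed the maximum sequence length of the model.
import sys
from transformers import AutoTokenizer

dataset_path, model_name, max_len = sys.argv[1], sys.argv[2], int(sys.argv[3])
tokenizer = AutoTokenizer.from_pretrained(model_name)
max_len -= tokenizer.num_special_tokens_to_add()  # reserve room for [CLS]/[SEP] etc.

subword_len_counter = 0
with open(dataset_path, encoding="utf-8") as f:
    for line in f:
        line = line.rstrip()
        if not line:  # blank line: end of sentence, reset the counter
            print(line)
            subword_len_counter = 0
            continue
        token = line.split()[0]
        current_subwords_len = len(tokenizer.tokenize(token))
        if subword_len_counter + current_subwords_len > max_len:
            print("")  # force a sentence break before this token
            subword_len_counter = 0
        subword_len_counter += current_subwords_len
        print(line)
```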
* Refactor tensor creation in tokenizers.
* Make sure to convert string to TensorType
* Refactor convert_to_tensors_
* Introduce numpy tensor creation
* Format
* Add unittest for TensorType creation from str
* Sort imports
* Added unittests for numpy tensor conversion.
* Do not use the in-place version of squeeze, as numpy doesn't provide such a feature.
* Added extra parameter `prepend_batch_axis: bool` to `prepare_for_model`.
* Ensure `test_np_encode_plus_sent_to_model` is not executed for encoder/decoder models.
* style.
* Numpy tests marked `require_torch` for now, while flax is not merged.
* Hopefully will make flake8 happy.
* One more time 🎶
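A minimal sketch of the tokenizer tensor-creation behavior touched by these commits, assuming `TensorType` is importable from the top-level package (its exact module location may differ by version).

```python
# Minimal sketch: return_tensors accepts a string or a TensorType, and "np"
# yields NumPy arrays (squeeze is done out-of-place since NumPy has no
# in-place variant).
from transformers import BertTokenizer, TensorType

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

np_encoding = tokenizer.encode_plus("Hello world", return_tensors="np")
print(type(np_encoding["input_ids"]))  # <class 'numpy.ndarray'>

# The string is converted to a TensorType internally; passing the enum works too.
pt_encoding = tokenizer.encode_plus("Hello world", return_tensors=TensorType.PYTORCH)
print(type(pt_encoding["input_ids"]))  # <class 'torch.Tensor'>
```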