transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 11:11:05 +06:00

Author	SHA1	Message	Date
Sam Shleifer	f90bc44d9a	[examples] Cleanup summarization docs (#4876 )	2020-06-09 17:38:28 -04:00
Patrick von Platen	2cfb947f59	[Benchmark] add tpu and torchscipt for benchmark (#4850 ) * add tpu and torchscipt for benchmark * fix name in tests * "fix email" * make style * better log message for tpu * add more print and info for tpu * allow possibility to print tpu metrics * correct cpu usage * fix test for non-install * remove bugus file * include psutil in testing * run a couple of times before tracing in torchscript * do not allow tpu memory tracing for now * make style * add torchscript to env * better name for torch tpu Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2020-06-09 23:12:43 +02:00
Hamza Harkous	f0340b3031	Removes from the of the parent of TFRobertaClassificationHead (#4884 ) Co-authored-by: Hamza Harkous <harkous@google.com>	2020-06-09 16:14:01 -04:00
Amil Khare	02e5f79662	[examples] consolidate summarization examples (#4837 )	2020-06-09 11:14:12 -04:00
Julien Plu	9f5d5a531d	Fix the __getattr__ method in BatchEncoding (#4772 )	2020-06-09 09:44:00 +02:00
Sylvain Gugger	41a1d27cde	Add XLMRobertaForQuestionAnswering (#4855 ) * Add XLMRobertaForQuestionAnswering * Formatting * Make test happy	2020-06-08 21:22:37 -04:00
Sam Shleifer	a139d1a160	[cleanup] consolidate some prune_heads logic (#4799 )	2020-06-08 17:08:04 -04:00
ZhuBaohe	4c7f564f9a	fix (#4839 )	2020-06-08 18:28:50 +02:00
Sylvain Gugger	37be3786cf	Clean documentation (#4849 ) * Clean documentation	2020-06-08 11:28:19 -04:00
Lysandre	42860e92a4	Turn off codecov patch for now	2020-06-08 09:47:13 -04:00
Julien Plu	36dfc317b3	TF Checkpoints (#4831 ) * Align checkpoint dir with the PT trainer * Use args for max to keep checkpoints	2020-06-08 09:45:23 -04:00
Patrick von Platen	439f1cab20	[Generate] beam search should generate without replacement (#4845 ) * fix flaky beam search * fix typo	2020-06-08 15:31:32 +02:00
Patrick von Platen	c0554776de	fix PR (#4810 )	2020-06-08 15:31:12 +02:00
Sylvain Gugger	e817747941	Expose classes used in documentation (#4808 ) * Expose classes used in documentation * Format code	2020-06-08 08:14:32 -04:00
daniel-shan	b6f365a8ed	Updates args in tf squad example. (#4820 ) Co-authored-by: Daniel Shan <daniel.shan@workday.com>	2020-06-08 05:36:09 -04:00
Bram Vanroy	e33fdc93b4	Export PretrainedBartModel from __init__ (#4819 )	2020-06-07 11:55:10 -04:00
Sam Shleifer	c58e6c129a	[marian tests ] pass device to pipeline (#4815 )	2020-06-06 00:52:17 -04:00
Mr Ruben	ddf9a3dfc7	Updated path "cd examples/text-generation/pplm" (#4778 ) https://github.com/huggingface/transformers/issues/4776	2020-06-05 21:16:48 -04:00
Sylvain Gugger	2d372a990b	Explain how to preview the docs in a PR (#4795 )	2020-06-05 20:47:02 -04:00
Sylvain Gugger	56d5d160cd	Add model and doc badges (#4811 ) * Add badges for models and docs	2020-06-05 18:45:42 -04:00
Sam Shleifer	4ab7424597	[cleanup/marian] pipelines test and new kwarg (#4812 )	2020-06-05 18:45:19 -04:00
Sam Shleifer	875288b344	[isort] add matplotlib to known 3rd party dependencies (#4800 )	2020-06-05 17:27:31 -04:00
Patrick von Platen	8cca875569	[EncoderDecoderConfig] automatically set decoder config to decoder (#4809 ) * automatically set decoder config to decoder * add more tests	2020-06-05 23:16:37 +02:00
Sylvain Gugger	f1fe18465d	Use labels to remove deprecation warnings (#4807 )	2020-06-05 16:41:46 -04:00
Sylvain Gugger	5c0cfc2cf0	Add link to community models (#4804 )	2020-06-05 15:29:20 -04:00
Sylvain Gugger	4dd5cf2207	Fix argument label (#4792 ) * Fix argument label * Fix test	2020-06-05 15:20:29 -04:00
Sam Shleifer	3723f30a18	[cleanup] MarianTokenizer: delete unused constants (#4802 )	2020-06-05 14:57:24 -04:00
Sylvain Gugger	acaa2e6267	Clean-up code (#4790 )	2020-06-05 12:36:22 -04:00
Sylvain Gugger	fa661ce749	Add model summary (#4789 ) * Add model summary * Add link to pretrained models	2020-06-05 12:22:50 -04:00
Lysandre Debut	79ab881eb1	No silent error when d_head already in the configuration (#4747 ) * No silent error when d_head already in the configuration * Update src/transformers/configuration_xlnet.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-05 12:01:43 -04:00
Julien Chaumond	b9109f2de1	[doc] Make it clearer that `text-generation` does not involve training	2020-06-05 14:59:22 +02:00
Sylvain Gugger	ceaab8dd22	Add .vs to gitignore (#4774 )	2020-06-05 07:56:11 -04:00
Julien Plu	f9414f7553	Tensorflow improvements (#4530 ) * Better None gradients handling * Apply Style * Apply Style * Create a loss class per task to compute its respective loss * Add loss classes to the ALBERT TF models * Add loss classes to the BERT TF models * Add question answering and multiple choice to TF Camembert * Remove prints * Add multiple choice model to TF DistilBERT + loss computation * Add question answering model to TF Electra + loss computation * Add token classification, question answering and multiple choice models to TF Flaubert * Add multiple choice model to TF Roberta + loss computation * Add multiple choice model to TF XLM + loss computation * Add multiple choice and question answering models to TF XLM-Roberta * Add multiple choice model to TF XLNet + loss computation * Remove unused parameters * Add task loss classes * Reorder TF imports + add new model classes * Add new model classes * Bugfix in TF T5 model * Bugfix for TF T5 tests * Bugfix in TF T5 model * Fix TF T5 model tests * Fix T5 tests + some renaming * Fix inheritance issue in the AutoX tests * Add tests for TF Flaubert and TF XLM Roberta * Add tests for TF Flaubert and TF XLM Roberta * Remove unused piece of code in the TF trainer * bugfix and remove unused code * Bugfix for TF 2.2 * Apply Style * Divide TFSequenceClassificationAndMultipleChoiceLoss into their two respective name * Apply style * Mirror the PT Trainer in the TF one: fp16, optimizers and tb_writer as class parameter and better dataset handling * Fix TF optimizations tests and apply style * Remove useless parameter * Bugfix and apply style * Fix TF Trainer prediction * Now the TF models return the loss such as their PyTorch couterparts * Apply Style * Ignore some tests output * Take into account the SQuAD cls_index, p_mask and is_impossible parameters for the QuestionAnswering task models. * Fix names for SQuAD data * Apply Style * Fix conflicts with 2.11 release * Fix conflicts with 2.11 * Fix wrongname * Add better documentation on the new create_optimizer function * Fix isort * logging_dir: use same default as PyTorch Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-04 19:45:53 -04:00
Théophile Blard	ccd26c2862	Create model card for tblard/allocine (#4775 ) https://huggingface.co/tblard/tf-allocine	2020-06-04 19:15:07 -04:00
Stefan Schweter	2a4b9e09c0	NER: Add new WNUT’17 example (#4681 ) * ner: add preprocessing script for examples that splits longer sentences * ner: example shell scripts use local preprocessing now * ner: add new example section for WNUT’17 NER task. Remove old English CoNLL-03 results * ner: satisfy black and isort	2020-06-04 19:13:17 -04:00
Setu Shah	0e1869cc28	Add drop_last arg for data loader	2020-06-04 18:30:31 -04:00
prajjwal1	48a05026de	removed deprecared use of Variable api from pplm example	2020-06-04 18:07:49 -04:00
Sylvain Gugger	12d0eb5f3e	Don't access pad_token_id if there is no pad_token (#4773 )	2020-06-04 17:57:04 -04:00
Manuel Romero	17a88d3192	Create model card for T5-base fine-tuned for Sentiment Span Extraction (#4737 )	2020-06-04 16:59:56 -04:00
Oren Amsalem	fb52143cf6	Create README.md (#4743 )	2020-06-04 16:59:37 -04:00
Suraj Parmar	5f077a3445	Model Card for RoBERTa trained on Sanskrit (#4763 ) * Model cad for SanBERTa Model Card for RoBERTa trained on Sanskrit * Model card for SanBERTa model card for RoBERTa trained on Sanskrit	2020-06-04 16:58:40 -04:00
Sylvain Gugger	cd4e07a85e	Add note about doc generation (#4770 )	2020-06-04 13:43:14 -04:00
Jason Phang	492b352ab6	Remove unnecessary model_type arg in example (#4771 )	2020-06-04 13:41:24 -04:00
Lysandre Debut	e645b9ab94	Codecov setup (#4768 ) * Codecov setup * Understanding codecov	2020-06-04 11:44:38 -04:00
Sam Shleifer	2b8b6c929e	[cleanup] PretrainedModel.generate: remove unused kwargs (#4761 )	2020-06-04 08:13:52 -04:00
Funtowicz Morgan	5bf9afbf35	Introduce a new tensor type for return_tensors on tokenizer for NumPy (#4585 ) * Refactor tensor creation in tokenizers. * Make sure to convert string to TensorType * Refactor convert_to_tensors_ * Introduce numpy tensor creation * Format * Add unittest for TensorType creation from str * sorting imports * Added unittests for numpy tensor conversion. * Do not use in-place version for squeeze as numpy doesn't provide such feature. * Added extra parameter prepend_batch_axis: bool on prepare_for_model. * Ensure test_np_encode_plus_sent_to_model is not executed if encoder/decoder model. * style. * numpy tests require_torch for now while flax not merged. * Hopefully will make flake8 happy. * One more time 🎶	2020-06-04 06:57:01 +02:00
Funtowicz Morgan	efae154929	never_split on slow tokenizers should not split (#4723 ) * Ensure tokens in never_split are not splitted when using basic tokenizer before wordpiece. * never_split only use membership attempt to use a set() which is 10x faster for this operation. * Use union to concatenate two sets. * Updated docstring for never_split parameter. * Avoid set.union() if never_split is None * Added comments. * Correct docstring format.	2020-06-03 16:48:28 -04:00
Lysandre Debut	2e4de76231	Update encode documentation (#4751 )	2020-06-03 16:30:59 -04:00
Patrick von Platen	ed4df85572	fix beam search bug in tf as well (#4745 )	2020-06-03 12:53:23 -04:00
Sylvain Gugger	1b5820a565	Unify label args (#4722 ) * Deprecate masked_lm_labels argument * Apply to all models * Better error message	2020-06-03 09:36:26 -04:00

1 2 3 4 5 ...

4155 Commits