transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 01:32:23 +06:00

Author	SHA1	Message	Date
Suraj Patil	9208f57b16	BartTokenizerFast (#4878 )	2020-06-14 13:04:49 -04:00
Sylvain Gugger	403d309857	Hans data (#4854 ) * Update hans data to be able to use Trainer * Fixes * Deal with tokenizer that don't have token_ids * Clean up things * Simplify data use * Fix the input dict * Formatting + proper path in README	2020-06-13 09:35:13 -04:00
Julien Chaumond	ca5e1cdf8e	model_cards: we can now tag datasets see corresponding model pages to see how it's rendered	2020-06-12 23:19:07 +02:00
Suraj Patil	e93ccb3290	BartForQuestionAnswering (#4908 )	2020-06-12 15:47:57 -04:00
Sylvain Gugger	538531cde5	Add AlbertForMultipleChoice (#4959 ) * Add AlbertForMultipleChoice * Make up to date and add all models to common tests	2020-06-12 14:20:19 -04:00
Manuel Romero	fe24139702	Create README.md (#4865 )	2020-06-12 09:03:43 -04:00
Yannis Papanikolaou	9aa219a1fe	Create README.md (#4872 )	2020-06-12 09:03:13 -04:00
Patrick von Platen	86578bb04c	[AutoModel] Split AutoModelWithLMHead into clm, mlm, encoder-decoder (#4933 ) * first commit * add new auto models * better naming * fix bert automodel * fix automodel for pretraining * add models to init * fix name typo * fix typo * better naming * future warning instead of depreciation warning	2020-06-12 10:01:49 +02:00
Sam Shleifer	5620033115	[mbart] Fix fp16 testing logic (#4949 )	2020-06-11 22:11:34 -04:00
VictorSanh	473808da0d	update `mvmt-pruning/saving_prunebert` (updating torch to 1.5)	2020-06-11 19:42:45 +00:00
Patrick von Platen	caf3746678	fix indentation issue (#4941 )	2020-06-11 21:28:01 +02:00
Suraj Patil	6293eb04df	[Model card] model card for electra-base QA model (#4936 )	2020-06-11 13:16:34 -04:00
Sam Shleifer	08b59d10e5	MBartTokenizer:add language codes (#3776 )	2020-06-11 13:02:33 -04:00
Sylvain Gugger	20451195f0	Support multiple choice in tf common model tests (#4920 ) * Support multiple choice in tf common model tests * Add the input_embeds test	2020-06-11 10:31:26 -04:00
Setu Shah	699541c4b3	TFTrainer: Add dataloader_drop_last (#4925 )	2020-06-11 02:11:22 -04:00
RafaelWO	e80d6c689b	Fix resize_token_embeddings for Transformer-XL (#4759 ) * Fixed resize_token_embeddings for transfo_xl model * Fixed resize_token_embeddings for transfo_xl. Added custom methods to TransfoXLPreTrainedModel for resizing layers of the AdaptiveEmbedding. * Updated docstring * Fixed resizinhg cutoffs; added check for new size of embedding layer. * Added test for resize_token_embeddings * Fixed code quality * Fixed unchanged cutoffs in model.config Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>	2020-06-10 19:03:06 -04:00
Sylvain Gugger	d541938c48	Make multiple choice models work with input_embeds (#4921 )	2020-06-10 18:38:34 -04:00
Sylvain Gugger	1e2631d6f8	Split LMBert model in two (#4874 ) * Split LMBert model in two * Fix example * Remove lm_labels * Adapt tests, refactor prepare_for_generation * Fix merge * Hide BeartLMHeadModel	2020-06-10 18:26:42 -04:00
Matthew Goldey	f6da8b2200	check type before logging in trainer to ensure values are scalars (#4883 ) * check type before logging to ensure it's a scalar * log when Trainer attempts to add a non-scalar value using TensorboardX's writer.add_scalar so we know what kinds of fixes are appropriate * black it * rephrase log message to clarify attribute was dropped Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-10 18:25:55 -04:00
Yannis Papanikolaou	1c986f42ff	Create README.md (#4871 )	2020-06-10 17:29:41 -04:00
Lysandre Debut	3ae2e86baf	Run a single wandb instance per TPU run (#4851 ) * Run a single wandb instance per TPU run * wandb: self.is_world_master * make style Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-10 16:28:18 -04:00
Lysandre Debut	466aa57a45	Don't init TPU device twice (#4916 )	2020-06-10 15:53:15 -04:00
Suraj Patil	ef2dcdccaa	ElectraForQuestionAnswering (#4913 ) * ElectraForQuestionAnswering * udate __init__ * add test for electra qa model * add ElectraForQuestionAnswering in auto models * add ElectraForQuestionAnswering in all_model_classes * fix outputs, input_ids defaults to None * add ElectraForQuestionAnswering in docs * remove commented line	2020-06-10 15:17:52 -04:00
Amil Khare	5d63ca6c38	[ctrl] fix pruning of MultiHeadAttention (#4904 )	2020-06-10 14:06:55 -04:00
Sylvain Gugger	4e10acb3e5	Add more models to common tests (#4910 )	2020-06-10 13:19:53 -04:00
Patrick von Platen	3b3619a327	[All models] fix docs after adding output attentions to all forward functions (#4909 ) * fix doc * add format file * add output attentions to all docs * add also for bart * fix naming * re-add doc to config	2020-06-10 18:10:59 +02:00
Sylvain Gugger	ac99217e92	Fix the CI (#4903 ) * Fix CI	2020-06-10 09:26:06 -04:00
Sylvain Gugger	0a375f5abd	Deal with multiple choice in common tests (#4886 ) * Deal with multiple choice in common tests	2020-06-10 08:10:20 -04:00
Sylvain Gugger	e8db8b845a	Remove unused arguments in Multiple Choice example (#4853 ) * Remove unused arguments * Formatting * Remove second todo comment	2020-06-09 20:05:09 -04:00
songyouwei	29c36e9f36	run_pplm.py bug fix (#4867 ) `is_leaf` may become `False` after `.to(device=device)` function call.	2020-06-09 19:14:27 -04:00
Lysandre	13aa174112	uninstalled wandb raises AttributeError	2020-06-09 18:50:56 -04:00
Bharat Raghunathan	6e603cb789	[All models] Extend config.output_attentions with output_attentions function arguments (#4538 ) * DOC: Replace instances of ``config.output_attentions`` with function argument ``output_attentions`` * DOC: Apply Black Formatting * Fix errors where output_attentions was undefined * Remove output_attentions in classes per review * Fix regressions on tests having `output_attention` * Fix further regressions in tests relating to `output_attentions` Ensure proper propagation of `output_attentions` as a function parameter to all model subclasses * Fix more regressions in `test_output_attentions` * Fix issues with BertEncoder * Rename related variables to `output_attentions` * fix pytorch tests * fix bert and gpt2 tf * Fix most TF tests for `test_output_attentions` * Fix linter errors and more TF tests * fix conflicts * DOC: Apply Black Formatting * Fix errors where output_attentions was undefined * Remove output_attentions in classes per review * Fix regressions on tests having `output_attention` * fix conflicts * fix conflicts * fix conflicts * fix conflicts * fix pytorch tests * fix conflicts * fix conflicts * Fix linter errors and more TF tests * fix tf tests * make style * fix isort * improve output_attentions * improve tensorflow Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-06-09 23:39:06 +02:00
Sam Shleifer	f90bc44d9a	[examples] Cleanup summarization docs (#4876 )	2020-06-09 17:38:28 -04:00
Patrick von Platen	2cfb947f59	[Benchmark] add tpu and torchscipt for benchmark (#4850 ) * add tpu and torchscipt for benchmark * fix name in tests * "fix email" * make style * better log message for tpu * add more print and info for tpu * allow possibility to print tpu metrics * correct cpu usage * fix test for non-install * remove bugus file * include psutil in testing * run a couple of times before tracing in torchscript * do not allow tpu memory tracing for now * make style * add torchscript to env * better name for torch tpu Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2020-06-09 23:12:43 +02:00
Hamza Harkous	f0340b3031	Removes from the of the parent of TFRobertaClassificationHead (#4884 ) Co-authored-by: Hamza Harkous <harkous@google.com>	2020-06-09 16:14:01 -04:00
Amil Khare	02e5f79662	[examples] consolidate summarization examples (#4837 )	2020-06-09 11:14:12 -04:00
Julien Plu	9f5d5a531d	Fix the __getattr__ method in BatchEncoding (#4772 )	2020-06-09 09:44:00 +02:00
Sylvain Gugger	41a1d27cde	Add XLMRobertaForQuestionAnswering (#4855 ) * Add XLMRobertaForQuestionAnswering * Formatting * Make test happy	2020-06-08 21:22:37 -04:00
Sam Shleifer	a139d1a160	[cleanup] consolidate some prune_heads logic (#4799 )	2020-06-08 17:08:04 -04:00
ZhuBaohe	4c7f564f9a	fix (#4839 )	2020-06-08 18:28:50 +02:00
Sylvain Gugger	37be3786cf	Clean documentation (#4849 ) * Clean documentation	2020-06-08 11:28:19 -04:00
Lysandre	42860e92a4	Turn off codecov patch for now	2020-06-08 09:47:13 -04:00
Julien Plu	36dfc317b3	TF Checkpoints (#4831 ) * Align checkpoint dir with the PT trainer * Use args for max to keep checkpoints	2020-06-08 09:45:23 -04:00
Patrick von Platen	439f1cab20	[Generate] beam search should generate without replacement (#4845 ) * fix flaky beam search * fix typo	2020-06-08 15:31:32 +02:00
Patrick von Platen	c0554776de	fix PR (#4810 )	2020-06-08 15:31:12 +02:00
Sylvain Gugger	e817747941	Expose classes used in documentation (#4808 ) * Expose classes used in documentation * Format code	2020-06-08 08:14:32 -04:00
daniel-shan	b6f365a8ed	Updates args in tf squad example. (#4820 ) Co-authored-by: Daniel Shan <daniel.shan@workday.com>	2020-06-08 05:36:09 -04:00
Bram Vanroy	e33fdc93b4	Export PretrainedBartModel from __init__ (#4819 )	2020-06-07 11:55:10 -04:00
Sam Shleifer	c58e6c129a	[marian tests ] pass device to pipeline (#4815 )	2020-06-06 00:52:17 -04:00
Mr Ruben	ddf9a3dfc7	Updated path "cd examples/text-generation/pplm" (#4778 ) https://github.com/huggingface/transformers/issues/4776	2020-06-05 21:16:48 -04:00

1 2 3 4 5 ...

4187 Commits