transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-20 13:08:21 +06:00

Author	SHA1	Message	Date
Binny Mathew	8be260f18a	dehate-bert Model Card (#6254 ) Added citation and paper links.	2020-08-07 17:46:27 +08:00
Binny Mathew	dce7278cdf	dehate-bert Model Card (#6255 ) Added citation and paper links.	2020-08-07 17:45:52 +08:00
idoh	3be2d04884	fix consistency CrossEntropyLoss in modeling_bart (#6265 )	2020-08-07 17:44:28 +08:00
Lysandre	c72f9c90a1	Remove --no-cache-dir from github CI	2020-08-07 09:07:22 +02:00
Lysandre Debut	0d9328f2ef	Patch GPU failures (#6281 ) * Pin to 1.5.0 * Patch XLM GPU test	2020-08-07 02:58:15 -04:00
Lysandre Debut	80a0676a51	CI dependency wheel caching (#6287 ) * Single workflow cache test Remove cache dir, re-trigger cache Only pip archives Not sudo when pip * All workflow cache Remove no-cache-dir instruction Remove last sudo occurrences v0.3	2020-08-07 02:48:59 -04:00
Stas Bekman	175cd45e13	fix the shuffle agrument usage and the default (#6307 )	2020-08-06 20:32:28 -04:00
Bhashithe Abeysinghe	ffceef2042	[Fix] text-classification PL example (#6027 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-06 15:46:43 -04:00
xujiaze13	eb2bd8d6eb	Remove redundant line in run_pl_glue.py (#6305 )	2020-08-06 15:43:45 -04:00
Patrick von Platen	118ecfd427	fix for pytorch < 1.6 (#6300 )	2020-08-06 21:14:46 +02:00
Sam Shleifer	2804fff839	[s2s]Use prepare_translation_batch for Marian finetuning (#6293 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-06 14:58:38 -04:00
Teven	2f2aa0c89c	added `n_inner` argument to gpt2 config (#6296 )	2020-08-06 17:47:32 +02:00
Manuel Romero	0a0d53dcf8	Update model card (#6290 ) Add links to RuPERTa models fine-tuned on Spanish SQUAD datasets	2020-08-06 11:42:43 -04:00
Doug Blank	b923871bb7	Adds comet_ml to the list of auto-experiment loggers (#6176 ) * Support for Comet.ml * Need to import comet first * Log this model, not the one in the backprop step * Log args as hyperparameters; use framework to allow fine control * Log hyperparameters with context * Apply black formatting * isort fix integrations * isort fix __init__ * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer_tf.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Address review comments * Style + Quality, remove Tensorboard import test Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-08-06 11:31:30 -04:00
Philip May	d5bc32ce92	Add strip_accents to basic BertTokenizer. (#6280 ) * Add strip_accents to basic tokenizer * Add tests for strip_accents. * fix style with black * Fix strip_accents test * empty commit to trigger CI * Improved strip_accents check * Add code quality with is not False	2020-08-06 18:52:28 +08:00
JME-P	31da35cc89	Create README.md (#6273 ) I am adding a descriptive README.md file to my recently uploaded twitter classification model: shrugging-grace/tweetclassifier.	2020-08-05 12:36:24 -04:00
JME-P	a8bdba232f	Create README.md for uploaded classifier (#6272 ) I am adding a descriptive README.md file to my recently uploaded twitter classification model: shrugging-grace/tweetclassifier.	2020-08-05 12:27:46 -04:00
HUSEIN ZOLKEPLI	a23a535c10	added t5 bahasa summarization readme (#6269 )	2020-08-05 12:27:27 -04:00
Sylvain Gugger	c67d1a0259	Tf model outputs (#6247 ) * TF outputs and test on BERT * Albert to DistilBert * All remaining TF models except T5 * Documentation * One file forgotten * TF outputs and test on BERT * Albert to DistilBert * All remaining TF models except T5 * Documentation * One file forgotten * Add new models and fix issues * Quality improvements * Add T5 * A bit of cleanup * Fix for slow tests * Style	2020-08-05 11:34:39 -04:00
Teven	bd0eab351a	Trainer + wandb quality of life logging tweaks (#6241 ) * added `name` argument for wandb logging, also logging model config with trainer arguments * Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * added tf, post-review changes Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-05 09:05:52 -04:00
Julien Plu	33966811bd	Add SequenceClassification and MultipleChoice TF models to Electra (#6227 ) * Add SequenceClassification and MultipleChoice TF models to Electra * Apply style * Add summary_proj_to_labels to Electra config * Finally mirroring the PT version of these models * Apply style * Fix Electra test	2020-08-05 09:04:27 -04:00
Stas Bekman	376c02e9a9	[WIP] lightning_base: support --lr_scheduler with multiple possibilities (#6232 ) * support --lr_scheduler with multiple possibilities * correct the error message * add a note about supported schedulers * cleanup * cleanup2 * needs the argument default * style * add another assert in the test * implement requested changes * cleanups * fix relative import * cleanup	2020-08-05 09:01:17 -04:00
Zhu Baohe	d89acd07cc	fix (#6257 )	2020-08-05 07:37:57 -04:00
Ninnart Fuengfusin	24c5a6e351	Update optimization.py (#6261 )	2020-08-05 07:34:57 -04:00
Lilian Bordeau	ed6b8f3128	Update to match renamed attributes in fairseq master (#5972 ) * Update to match renamed attributes in fairseq master RobertaModel no longer have model.encoder and args.num_classes attributes as of 5/28/20. * Quality Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-08-05 07:23:55 -04:00
Ali Safaya	d9149f00d1	Update README.md (#6201 )	2020-08-04 17:44:14 -04:00
Ali Safaya	ddfdbb86c1	Update README.md (#6200 )	2020-08-04 17:44:05 -04:00
Ali Safaya	4f67955662	Update README.md (#6199 )	2020-08-04 17:43:48 -04:00
Ali Safaya	869ec441c9	Update README.md (#6198 )	2020-08-04 17:43:38 -04:00
Adam Montgomerie	5177dca634	Create README.md (#6123 )	2020-08-04 17:42:53 -04:00
Manuel Romero	3f30ebe6ca	Create README.md (#6075 )	2020-08-04 17:41:23 -04:00
Binny Mathew	aa7c22a283	Update Model Card (#6246 ) Added citation and paper links.	2020-08-04 17:40:47 -04:00
Joe Davison	972535ea74	fix zero shot pipeline docs (#6245 )	2020-08-04 16:37:49 -04:00
Timo Moeller	5920a37a4c	Add license info to German Bert models (#6242 ) * Add xlm-r QA model card * Add tags * Add license info to german bert	2020-08-04 13:40:49 -04:00
Patrick von Platen	6c9ba1d8fc	[Reformer] Make random seed generator available on random seed and not on model device (#6244 ) * improve if else statement random seeds * Apply suggestions from code review * Update src/transformers/modeling_reformer.py	2020-08-04 13:22:43 -04:00
Sam Shleifer	d5b0a0e235	mBART Conversion script (#6230 )	2020-08-04 09:53:51 -04:00
Stas Bekman	268bf34630	typo (#6225 )	2020-08-04 09:31:49 -04:00
Patrick von Platen	7f65daa2e1	fix reformer fp16 (#6237 )	2020-08-04 13:02:25 +02:00
Andrés Felipe Cruz	7ea9b2db37	Encoder decoder config docs (#6195 ) * Adding docs for how to load encoder_decoder pretrained model with individual config objects * Adding docs for loading encoder_decoder config from pretrained folder * Fixing W293 blank line contains whitespace * Update src/transformers/modeling_encoder_decoder.py * Update src/transformers/modeling_encoder_decoder.py * Update src/transformers/modeling_encoder_decoder.py * Apply suggestions from code review model file should only show examples for how to load save model * Update src/transformers/configuration_encoder_decoder.py * Update src/transformers/configuration_encoder_decoder.py * fix space Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-08-04 09:23:28 +02:00
Lysandre Debut	1d5c3a3d96	Test with --no-cache-dir (#6235 )	2020-08-04 03:20:19 -04:00
Sam Shleifer	6730ecdd3c	Remove redundant coverage (#6224 )	2020-08-04 02:59:21 -04:00
Stas Bekman	5deed37f9f	cleanup torch unittests (#6196 ) * improve unit tests this is a sample of one test according to the request in https://github.com/huggingface/transformers/issues/5973 before I apply it to the rest * batch 1 * batch 2 * batch 3 * batch 4 * batch 5 * style * non-tf template * last deletion of check_loss_output	2020-08-04 02:42:56 -04:00
Gong Linyuan	b390a5672a	Make the order of additional special tokens deterministic (#5704 ) * Make the order of additional special tokens deterministic regardless of hash seeds * Fix	2020-08-04 02:38:30 -04:00
Lysandre Debut	d740351f7d	Upgrade pip when doing CI (#6234 ) * Upgrade pip when doing CI * Don't forget Github CI	2020-08-04 02:37:12 -04:00
Sam Shleifer	57eb1cb68d	[s2s] Document better mbart finetuning command (#6229 ) * Document better MT command * improve multigpu command	2020-08-03 18:22:31 -04:00
Victor SANH	0513f8d275	correct label extraction + add note on discrepancies on trained MNLI model and HANS (#6221 )	2020-08-03 15:02:51 -04:00
Kevin Canwen Xu	3c289fb38c	Remove outdated BERT tips (#6217 ) * Remove out-dated BERT tips * Update modeling_outputs.py * Update bert.rst * Update bert.rst	2020-08-04 01:17:56 +08:00
Sylvain Gugger	e4920c92d6	Doc pipelines (#6175 ) * Init work on pipelines doc * Work in progress * Work in progress * Doc pipelines * Rm unwanted default * Apply suggestions from code review Lysandre comments Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-03 11:44:46 -04:00
Sam Shleifer	b6b2f2270f	s2s: fix LR logging, remove some dead code. (#6205 )	2020-08-03 10:36:26 -04:00
Maurice Gonzenbach	06f1692b02	Fix _shift_right function in TFT5PreTrainedModel (#6214 )	2020-08-03 16:21:23 +02:00

... 19 20 21 22 23 ...

5759 Commits