transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sam Shleifer	9bdce3a4f9	[s2s] fix lockfile and peg distillation constants (#7545 )	2020-10-02 15:58:14 -04:00
Sam Shleifer	de4d7b004a	[s2s] Adafactor support for builtin trainer (#7522 )	2020-10-01 17:27:45 -04:00
Sam Shleifer	d3a9601a11	[s2s] trainer scripts: Remove --run_name, thanks sylvain! (#7521 )	2020-10-01 17:18:47 -04:00
Sylvain Gugger	bdcc4b78a2	Fix seq2seq example test (#7518 ) * Fix seq2seq example test * Fix bad copy-paste * Also save the state	2020-10-01 14:13:29 -04:00
Sylvain Gugger	29baa8fabe	Clean the Trainer state (#7490 ) * Trainer should not modify its TrainingArguments * Trainer should not modify its TrainingArguments * Trainer should not modify its TrainingArguments * Add test of resumed training * Fixes * Non multiGPU test * Clean Trainer state * Add more to the state * Documentation * One last test * Make resume training test more complete * Unwanted changes	2020-10-01 13:07:04 -04:00
Sam Shleifer	2a358f45ef	[s2s] fix nltk pytest race condition with FileLock (#7515 )	2020-10-01 12:51:09 -04:00
Suraj Patil	72d363d979	[examples/s2s] clean up finetune_trainer (#7509 )	2020-10-01 12:19:29 -04:00
Patrick von Platen	bd2621583b	fix data type (#7513 )	2020-10-01 18:15:41 +02:00
Patrick von Platen	62f5ae68ec	[Seq2Seq] Fix a couple of bugs and clean examples (#7474 ) * clean T5 * fix t5 tests * fix index typo * fix tf common test * fix examples * change positional ordering for Bart and FSTM * add signature test * clean docs and add tests * add docs to encoder decoder * clean docs * correct two doc strings * remove sig test for TF Elektra & Funnel * fix tf t5 slow tests * fix input_ids to inputs in tf * Update src/transformers/modeling_bart.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_bart.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * implement lysandre results * make style * fix encoder decoder typo * fix tf slow tests * fix slow tests * renaming * remove unused input Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-10-01 17:38:50 +02:00
Muhammad Harris	a42f62d34f	Train T5 in Tensoflow 2 Community Notebook (#7428 ) * t5 t5 community notebook added * author link updated * t5 t5 community notebook added * author link updated * new colab link updated Co-authored-by: harris <muhammad.harris@visionx.io>	2020-10-01 16:54:29 +02:00
Kai Fricke	5fc3b5cba4	Fix Tune progress_reporter kwarg (#7508 )	2020-10-01 10:34:31 -04:00
Kai Fricke	dabc85d1ba	Report Tune metrics in final evaluation (#7507 )	2020-10-01 09:52:36 -04:00
Alexandr	9a92afb6d0	Update LayoutLM doc (#7388 ) Co-authored-by: Alexandr Maslov <avmaslov3@gmail.com>	2020-10-01 09:11:42 -04:00
Julien Chaumond	e32390931d	[model_card] distilbert-base-german-cased	2020-10-01 09:08:49 -04:00
Julien Chaumond	9a4e163b58	[model_card] Fix metadata, adalbertojunior/PTT5-SMALL-SUM	2020-10-01 08:54:06 -04:00
Adalberto	8435e10e24	Create README.md (#7299 ) * Create README.md * language metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-01 08:52:28 -04:00
Martin Müller	d727432072	Update README.md (#7459 )	2020-10-01 08:51:26 -04:00
allenyummy	664da5b077	Create README.md (#7468 )	2020-10-01 08:50:26 -04:00
ahotrod	f745f61c99	Update README.md (#7491 ) Model now fine-tuned on Transformers 3.1.0, previous out-of-date model was fine-tuned on Transformers 2.3.0.	2020-10-01 08:50:07 -04:00
Abed khooli	6ef7658c0a	Create README.md (#7349 ) Model card for akhooli/personachat-arabic	2020-10-01 08:48:51 -04:00
Bayartsogt Yadamsuren	15ab3f049b	Creating readme for bert-base-mongolian-cased (#7439 ) * Creating readme for bert-base-mongolian-cased * Update model_cards/bayartsogt/bert-base-mongolian-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-10-01 08:46:27 -04:00
Bayartsogt Yadamsuren	0c2b9fa831	creating readme for bert-base-mongolian-uncased (#7440 )	2020-10-01 08:45:22 -04:00
Akshay Gupta	381443c096	Update README.md (#7498 ) Making transformers readme more robust.	2020-10-01 07:42:07 -04:00
Lysandre Debut	85d2d8c920	Fix local_files_only for TF (#6091 )	2020-10-01 05:06:02 -04:00
Sam Shleifer	9e80f972fb	Enable pegasus fp16 by clamping large activations (#7243 ) * Clean clamp * boom boom * Take some other changes * boom boom * boom boom * boom boom * one chg * fix test * Use finfo * style	2020-10-01 04:48:37 -04:00
Sylvain Gugger	be51c1039d	Add forgotten return_dict argument in the docs (#7483 )	2020-10-01 04:41:29 -04:00
Sam Shleifer	48f23f92a8	[s2sTrainer] test + code cleanup (#7467 )	2020-10-01 00:33:01 -04:00
Sam Shleifer	097049b81b	Distributed Trainer: 2 little fixes (#7461 ) * reset model.config * Update src/transformers/trainer.py * use lower case tensor * Just tensor change	2020-09-30 22:14:14 -04:00
Julien Chaumond	0acd1ffa09	[doc] rm Azure buttons as not implemented yet	2020-09-30 17:31:08 -04:00
Sam Shleifer	03e46c1de3	[s2s] fix kwargs style (#7488 )	2020-09-30 17:00:06 -04:00
Sam Shleifer	6fe8a693eb	[s2s] Fix t5 warning for distributed eval (#7487 )	2020-09-30 16:58:03 -04:00
Sylvain Gugger	4c6728460a	Bump isort version. (#7484 )	2020-09-30 13:44:58 -04:00
Amanpreet Singh	c031d01023	Seq2SeqDataset: avoid passing src_lang everywhere (#7470 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-09-30 13:27:48 -04:00
Suraj Patil	08939cfdf7	[s2strainer] fix eval dataset loading (#7477 )	2020-09-30 12:39:13 -04:00
Sylvain Gugger	a97a73e0ee	Small QOL improvements to TrainingArguments (#7475 ) * Small QOL improvements to TrainingArguments * With the self.	2020-09-30 12:12:03 -04:00
Sylvain Gugger	dc7d2daa4c	Alphabetize model lists (#7478 )	2020-09-30 10:43:58 -04:00
Sylvain Gugger	fdccf82e28	Remove config assumption in Trainer (#7464 ) * Remove config assumption in Trainer * Initialize for eval	2020-09-30 09:03:25 -04:00
François REMY	cc4eff8087	Make transformers install check positive (#7473 ) When transformers is correctly installed, you should get a positive message ^_^	2020-09-30 07:44:40 -04:00
Pengcheng He	7a0cf0ec93	Add DeBERTa model (#5929 ) * Add DeBERTa model * Remove dependency of deberta * Address comments * Patch DeBERTa Documentation Style * Add final tests * Style * Enable tests + nitpicks * position IDs * BERT -> DeBERTa * Quality * Style * Tokenization * Last updates. * @patrickvonplaten's comments * Not everything can be a copy * Apply most of @sgugger's review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Last reviews * DeBERTa -> Deberta Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-09-30 07:07:30 -04:00
Lysandre Debut	44a93c981f	Number of GPUs for multi-gpu (#7472 )	2020-09-30 06:53:20 -04:00
Lysandre Debut	886ef35ce6	Fix LXMERT with DataParallel (#7471 )	2020-09-30 06:41:24 -04:00
Lysandre	35e94c68df	Number of GPUs	2020-09-30 12:29:26 +02:00
Lysandre Debut	056723ad1d	Multi-GPU setup (#7453 )	2020-09-30 05:53:34 -04:00
Sylvain Gugger	4ba248748f	Get a better error when check_copies fails (#7457 ) * Get a better error when check_copies fails * Fix tests	2020-09-30 10:05:14 +02:00
Sam Shleifer	bef0175168	remove codecov PR comments (#7400 )	2020-09-29 15:16:43 -04:00
Sylvain Gugger	a1c2ef7bd0	Add documentation for v3.3.1	2020-09-29 14:31:43 -04:00
Sylvain Gugger	1ba08dc221	Release: v3.3.1	2020-09-29 14:17:34 -04:00
Sylvain Gugger	8546dc55c2	Fix Trainer tests in a multiGPU env (#7458 )	2020-09-29 14:06:41 -04:00
Sylvain Gugger	d0fd7154c5	Catch import datasets common errors (#7456 )	2020-09-29 13:42:09 -04:00
Sylvain Gugger	f1220c5fe2	Add a code of conduct (#7433 )	2020-09-29 13:38:47 -04:00

5392 Commits