transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	cb3e5c33f7	Fix a few last paths for the new repo org (#8666)	2020-11-19 11:56:42 -05:00
Matthias	a79a96ddaa	fix small typo (#8644) Fixed a small typo on the XLNet and permutation language modelling section	2020-11-19 11:24:11 -05:00
Sylvain Gugger	4208f496ee	Better filtering of the model outputs in Trainer (#8633) * Better filtering of the model outputs in Trainer * Fix examples tests * Add test for Lysandre	2020-11-19 10:43:15 -05:00
Lysandre Debut	f2e07e7272	Fix a bunch of slow tests (#8634) * CI should install `sentencepiece` * Requiring TF * Fixing some TFDPR bugs * remove return_dict=False/True hack Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2020-11-19 10:41:41 -05:00
elk-cloner	5362bb8a6b	Tf longformer for sequence classification (#8231) * working on LongformerForSequenceClassification * add TFLongformerForMultipleChoice * add TFLongformerForTokenClassification * use add_start_docstrings_to_model_forward * test TFLongformerForSequenceClassification * test TFLongformerForMultipleChoice * test TFLongformerForTokenClassification * remove test from repo * add test and doc for TFLongformerForSequenceClassification, TFLongformerForTokenClassification, TFLongformerForMultipleChoice * add requested classes to modeling_tf_auto.py update dummy_tf_objects fix tests fix bugs in requested classes * pass all tests except test_inputs_embeds * sync with master * pass all tests except test_inputs_embeds * pass all tests * pass all tests * work on test_inputs_embeds * fix style and quality * make multi choice work * fix TFLongformerForTokenClassification signature * fix TFLongformerForMultipleChoice, TFLongformerForSequenceClassification signature * fix mult choice * fix mc hint * fix input embeds * fix input embeds * refactor input embeds * fix copy issue * apply sylvains changes and clean more Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-11-19 10:37:27 -05:00
Quentin Lhoest	62cd9ce9f8	fix missing return dict (#8653)	2020-11-19 15:17:18 +01:00
Amine Abdaoui	0c2677f529	[model card] : fix bert-base-15lang-cased (#8655) the table was badly formatted because of a single line break	2020-11-19 05:41:02 -05:00
Amine Abdaoui	0a80959bdd	Add cards for all Geotrend models (#8617) * docs(bert-base-15lang-cased): add model card * add cards for all Geotrend models * [model cards] fix language tag for all Geotrend models	2020-11-19 04:47:24 -05:00
cronoik	dcc9c64299	Updated the Extractive Question Answering code snippets (#8636) * Updated the Extractive Question Answering code snippets The Extractive Question Answering code snippets do not work anymore since the models return task-specific output objects. This commit fixes the pytorch and tensorflow examples but adding `.values()` to the model call. * Update task_summary.rst	2020-11-18 18:56:47 -05:00
Tim Isbister	28d16e7ac5	Update README.md (#8635)	2020-11-18 18:35:23 -05:00
cronoik	b290195ac7	grammar (#8639)	2020-11-18 18:04:25 -05:00
Stas Bekman	d86d57faa3	[s2s] distillation apex breaks return_dict obj (#8631) * apex breaks return_dict obj * style	2020-11-18 12:51:29 -08:00
Perez Ogayo	bf3611b2ab	Created ModelCard for Hel-ach-en MT model (#8496) * Updated ModelCard * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-18 14:42:13 -05:00
Yifan Peng	c95b26a719	Create README.md (#8362)	2020-11-18 13:37:14 -05:00
Manuel Romero	fdbbb6c17a	Model card: T5-base fine-tuned on QuaRTz (#8369) * Model card: T5-base fine-tuned on QuaRTz * Update model_cards/mrm8488/t5-base-finetuned-quartz/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-18 13:34:27 -05:00
Yifan Peng	6e6d24c5d8	Create README.md (#8363)	2020-11-18 13:33:04 -05:00
Divyanshu Kakwani	35fd3d64e3	Add model card for ai4bharat/indic-bert (#8464)	2020-11-18 13:28:49 -05:00
dartrevan	38f01dfe03	Update README.md (#8405) * Update README.md * Update README.md	2020-11-18 13:23:08 -05:00
Abhilash Majumder	2d8fbf012a	Model Card for abhilash1910/financial_roberta (#8625) * Model Card for abhilash1910/financial_roberta * Update model_cards/abhilash1910/financial_roberta/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-18 13:22:28 -05:00
Vishal Singh	26dc6593f3	Update README.md (#8544) Modified Model in Action section. The class `AutoModelWithLMHead` is deprecated so changed it to `AutoModelForSeq2SeqLM` for encoder-decoder models. Removed duplicate eos token.	2020-11-18 13:19:32 -05:00
smanjil	6c8fad4f0d	replace performance table with markdown (#8565) * replace performance table with markdown * Update model_cards/smanjil/German-MedBERT/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-18 13:17:46 -05:00
hhou435	e7f77fc52a	model_cards for Chinese Couplet and Poem GPT2 models (#8620)	2020-11-18 13:06:30 -05:00
Sylvain Gugger	a0c62d2493	Fix training from scratch in new scripts (#8623)	2020-11-18 12:15:26 -05:00
Sylvain Gugger	1e62e999e8	Fixes the training resuming with gradient accumulation (#8624)	2020-11-18 12:00:11 -05:00
Patrick von Platen	cdfa56afe0	[Tokenizer Doc] Improve tokenizer summary (#8622) * improve summary * small fixes * cleaned line length * correct "" formatting * apply sylvains suggestions	2020-11-18 17:14:15 +01:00
Nicola De Cao	2f9d49b389	Adding PrefixConstrainedLogitsProcessor (#8529) * Adding PrefixConstrainedLogitsProcessor * fixing RAG and style_doc * fixing black (v20 instead of v19) * Improving doc in generation_logits_process.py * Improving docs and typing in generation_utils.py * docs improvement * adding test and fixing doc typo * fixing doc_len * isort on test * fixed test * improve docstring a bit Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-11-18 17:06:25 +01:00
Julien Plu	3bc1540070	New TF loading weights (#8490) * New TF loading weights * apply style * Better naming * Largely comment the loading method * Apply style * Address Patrick's comments * Remove useless line of code * Update Docstring * Address Sylvain's and Lysandre's comments * Simplify the names computation * Typos	2020-11-18 10:48:31 -05:00
Ratthachat (Jung)	0df91ee4f7	self.self.activation_dropout -> self.activation_dropout (#8611) (one line typo)	2020-11-18 10:30:29 -05:00
Stas Bekman	cdf1b7ae82	fix to adjust for #8530 changes (#8612)	2020-11-18 10:25:00 -05:00
Stas Bekman	2819da02f7	[s2s] broken test (#8613)	2020-11-18 10:15:53 -05:00
Michał Pogoda	9fa3ed1a7f	Fix missing space in multiline warning (#8593) Multiline string informing about missing PyTorch/TensorFlow had missing space.	2020-11-18 10:09:26 -05:00
Sylvain Gugger	8fcb6935a1	Fix DataCollatorForLanguageModeling (#8621)	2020-11-18 10:02:50 -05:00
Benjamin Minixhofer	f6fe41c96b	Reset loss to zero on logging in Trainer to avoid bfloat16 issues (#8561) * make tr_loss regular float * Revert "make tr_loss regular float" This reverts commit `c9d7ccfaf0`. * reset loss at each logging step * keep track of total loss with _total_loss_scalar * add remaining tr_loss at the end	2020-11-18 09:58:08 -05:00
cronoik	b592728eff	Fixed link to the wrong paper. (#8607)	2020-11-17 19:00:44 -05:00
Sylvain Gugger	0512444ee5	Remove old doc	2020-11-17 17:34:25 -05:00
Caitlin Ostroff	5cf9c79665	Add Harry Potter Model Card (#8605) * Add Harry Potter Model * Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md * Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md * Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-17 16:50:58 -05:00
Sylvain Gugger	dd52804f5f	Remove deprecated (#8604) * Remove old deprecated arguments Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr> * Remove needless imports * Fix tests Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-11-17 15:11:29 -05:00
Lysandre Debut	3095ee9dab	Tokenizers should be framework agnostic (#8599) * Tokenizers should be framework agnostic * Run the slow tests * Not testing * Fix documentation * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-11-17 14:03:03 -05:00
Sylvain Gugger	7f3b41a306	Fix check repo utils (#8600)	2020-11-17 14:01:46 -05:00
Stas Bekman	f0435f5a61	these should run fine on multi-gpu (#8582)	2020-11-17 14:00:41 -05:00
Sylvain Gugger	36a19915ea	Fix model templates (#8595) * First fixes * Fix imports and add init * Fix typo * Move init to final dest * Fix tokenization import * More fixes * Styling	2020-11-17 10:35:38 -05:00
Julien Chaumond	042a6aa777	Tokenizers: ability to load from model subfolder (#8586) * <small>tiny typo</small> * Tokenizers: ability to load from model subfolder * use subfolder for local files as well * Uniformize model shortcut name => model id * from s3 => from huggingface.co Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>	2020-11-17 08:58:45 -05:00
Sylvain Gugger	48395d6b8e	Fix init for MT5 (#8591)	2020-11-17 08:52:13 -05:00
sgugger	a6cf9ca00b	Add __init__ to the models folder	2020-11-17 07:39:37 -05:00
Patrick von Platen	5104223552	[MT5] More docs (#8589) * add docs * make style	2020-11-17 12:47:57 +01:00
Patrick von Platen	86822a358b	T5 & mT5 (#8552) * add mt5 and t5v1_1 model * fix tests * correct some imports * add tf model * finish tf t5 * improve examples * fix copies * clean doc	2020-11-17 12:23:09 +01:00
fajri91	9e01f988dd	model_card for indolem/indobert-base-uncased (#8579)	2020-11-17 03:36:50 -05:00
Sylvain Gugger	c89bdfbe72	Reorganize repo (#8580) * Put models in subfolders * Styling * Fix imports in tests * More fixes in test imports * Sneaky hidden imports * Fix imports in doc files * More sneaky imports * Finish fixing tests * Fix examples * Fix path for copies * More fixes for examples * Fix dummy files * More fixes for example * More model import fixes * Is this why you're unhappy GitHub? * Fix imports in conver command	2020-11-16 21:43:42 -05:00
Julien Plu	901507335f	Fix mixed precision issue for GPT2 (#8572) * Fix mixed precision issue for GPT2 * Forgot one cast * oops * Forgotten casts	2020-11-16 14:44:19 -05:00
Sylvain Gugger	1073a2bde5	Switch `return_dict` to `True` by default. (#8530) * Use the CI to identify failing tests * Remove from all examples and tests * More default switch * Fixes * More test fixes * More fixes * Last fixes hopefully * Use the CI to identify failing tests * Remove from all examples and tests * More default switch * Fixes * More test fixes * More fixes * Last fixes hopefully * Run on the real suite * Fix slow tests	2020-11-16 11:43:00 -05:00

5924 Commits