transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 10:12:23 +06:00

Author	SHA1	Message	Date
Julien Chaumond	1dc9b3c784	Fixes #3877	2020-04-22 01:15:10 +00:00
Julien Chaumond	dd9d483d03	Trainer (#3800) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00
Julien Chaumond	eb5601b0a5	[ci] Pin torch version while we update	2020-04-21 15:46:18 -04:00
Spencer Adams	53f5ef6df5	create readme for spentaur/yelp model (#3874) * create readme for spentaur/yelp model * update spentaur/yelp/README.md * remove typo	2020-04-21 15:31:36 -04:00
Julien Chaumond	d32585a304	Fix Torch.hub + Integration test	2020-04-21 14:13:30 -04:00
Bharat Raghunathan	7d40901ce3	Fix Documentation issue in BertForMaskedLM forward (#3855)	2020-04-21 09:08:20 +02:00
Andrey Kulagin	b1ff0b2ae7	Fix bug in examples: double wrap into DataParallel during eval	2020-04-20 19:37:44 -04:00
husein zolkepli	7f23af1684	added electra model (cherry picked from commit `b5f2dc5d62`)	2020-04-20 17:17:58 -04:00
Punyajoy Saha	03121deba3	New model added The first model added to the repo	2020-04-20 17:10:01 -04:00
Manuel Romero	15b9868f8b	Create model card	2020-04-20 17:07:34 -04:00
Funtowicz Morgan	2c05b8a56c	Remove tqdm logging when using pipelines. (#3833) Introduce tqdm_enabled parameter on squad_convert_examples_to_features() default to True and set to False in QA pipelines.	2020-04-20 22:58:52 +02:00
Jared T Nielsen	c79b550dd0	Add `qas_id` to SquadResult and SquadExample (#3745) * Add qas_id * Fix incorrect name in squad.py * Make output files optional for squad eval	2020-04-20 16:08:57 -04:00
Patrick von Platen	c4158a6314	[Pipelines] Encode to max length of input not max length of tokenizer for batch input (#3857) * remove max_length = tokenizer.max_length when encoding * make style	2020-04-20 14:39:16 -04:00
Mohamed El-Geish	857ccdb259	exbert links for my albert model cards (#3729) * exbert links for my albert model cards * Added exbert tag to the metadata block * Adding "how to cite"	2020-04-20 10:54:39 -04:00
Sam Shleifer	a504cb49ec	[examples] fix summarization do_predict (#3866)	2020-04-20 10:49:56 -04:00
ahotrod	52c85f847a	Update README.md	2020-04-20 10:10:56 -04:00
Patrick von Platen	a21d4fa410	add "by" to ReadMe	2020-04-18 18:07:17 +02:00
Thomas Wolf	827d6d6ef0	Cleanup fast tokenizers integration (#3706) * First pass on utility classes and python tokenizers * finishing cleanup pass * style and quality * Fix tests * Updating following @mfuntowicz comment * style and quality * Fix Roberta * fix batch_size/seq_length inBatchEncoding * add alignement methods + tests * Fix OpenAI and Transfo-XL tokenizers * adding trim_offsets=True default for GPT2 et RoBERTa * style and quality * fix tests * add_prefix_space in roberta * bump up tokenizers to rc7 * style * unfortunately tensorfow does like these - removing shape/seq_len for now * Update src/transformers/tokenization_utils.py Co-Authored-By: Stefan Schweter <stefan@schweter.it> * Adding doc and docstrings * making flake8 happy Co-authored-by: Stefan Schweter <stefan@schweter.it>	2020-04-18 13:43:57 +02:00
Julien Chaumond	60a42ef1c0	[model_cards] Fix CamemBERT table markdown see https://github.com/huggingface/transformers/pull/3836	2020-04-17 20:21:15 -04:00
Julien Chaumond	88aecee6a2	[ci] GitHub-hosted runner has no space left on device	2020-04-17 20:16:00 -04:00
Benjamin Muller	73efa694e6	Update camembert-base-README.md (#3836)	2020-04-17 20:08:13 -04:00
Patrick von Platen	e9d0bc027a	[Config, Serialization] more readable config serialization (#3797) * better config serialization * finish configuration utils	2020-04-17 20:07:18 -04:00
Lysandre Debut	8b63a01d95	XLM tokenizer should encode with bos token (#3791) * XLM tokenizer should encode with bos token * Update tests	2020-04-17 11:28:55 -04:00
Patrick von Platen	1d4a35b396	Higher tolerance for past testing in TF T5 (#3844)	2020-04-17 11:26:16 -04:00
Patrick von Platen	d13eca11e2	Higher tolerance for past testing in T5 (#3843)	2020-04-17 11:25:14 -04:00
Harutaka Kawamura	b0c9fbb293	Add workflow to build docs (#3763)	2020-04-17 11:23:18 -04:00
Santiago Castro	c19727fd38	Add support for the null answer in `QuestionAnsweringPipeline` (#3441) * Add support for the null answer in `QuestionAnsweringPipeline` * black * Fix min null score computation * Fix a PR comment	2020-04-17 11:17:21 -04:00
Simon Böhm	edf0582c0b	Fix token_type_id in BERT question-answering example (#3790) token_type_id is converted into the segment embedding. For question answering, this needs to highlight whether a token belongs to sequence 0 or 1. encode_plus takes care of correctly setting this parameter automatically.	2020-04-17 11:14:12 -04:00
Pierric Cistac	6d00033e97	Question Answering support for Albert and Roberta in TF (#3812) * Add TFAlbertForQuestionAnswering * Add TFRobertaForQuestionAnswering * Update TFAutoModel with Roberta/Albert for QA * Clean `super` TF Albert calls	2020-04-17 10:45:30 -04:00
Patrick von Platen	f399c00610	Update README	2020-04-17 09:42:22 +02:00
Sam Shleifer	f0c96fafd1	[examples] summarization/bart/finetune.py supports t5 (#3824) renames `run_bart_sum.py` to `finetune.py`	2020-04-16 15:15:19 -04:00
Jonathan Sum	0cec4fab7d	typo: fine-grained token-leven Changing from "fine-grained token-leven" to "fine-grained token-level"	2020-04-16 15:11:23 -04:00
Aryansh Omray	14cdeee75a	Tanh torch warnings	2020-04-16 15:10:35 -04:00
Sam Shleifer	16469fedbd	[PretrainedTokenizer] Factor out tensor conversion method (#3777)	2020-04-16 15:02:43 -04:00
Patrick von Platen	80a1694514	[Examples, T5] Change newstest2013 to newstest2014 and clean up (#3817) * Refactored use of newstest2013 to newstest2014. Fixed bug where argparse consumed first command line argument as model_size argument rather than using default model_size by forcing explicit --model_size flag inclusion * More pythonic file handling through 'with' context * COSMETIC - ran Black and isort * Fixed reference to number of lines in newstest2014 * Fixed failing test. More pythonic file handling * finish PR from tholiao * remove outcommented lines * make style * make isort happy Co-authored-by: Thomas Liao <tholiao@gmail.com>	2020-04-16 20:00:41 +02:00
Lysandre Debut	d486795158	JIT not compatible with PyTorch/XLA (#3743)	2020-04-16 11:19:24 -04:00
Davide Fiocco	b1e2368b32	Typo fix (#3821)	2020-04-16 11:04:32 -04:00
Patrick von Platen	baca8fa8e6	clean pipelines (#3795)	2020-04-16 10:21:34 -04:00
Patrick von Platen	38f7461df3	[TFT5, Cache] Add cache to TFT5 (#3772) * correct gpt2 test inputs * make style * delete modeling_gpt2 change in test file * translate from pytorch * correct tests * fix conflicts * fix conflicts * fix conflicts * fix conflicts * make tensorflow t5 caching work * make style * clean reorder cache * remove unnecessary spaces * fix test	2020-04-16 16:14:52 +02:00
Patrick von Platen	a5b249472e	change pad token id to config pad token id (#3793)	2020-04-16 15:58:57 +02:00
Sam Shleifer	dbd041243d	[cleanup] factor out get_head_mask, invert_attn_mask, get_exten… (#3806) * Delete some copy pasted code	2020-04-16 09:55:25 -04:00
Patrick von Platen	d22894dfd4	[Docs] Add DialoGPT (#3755) * add dialoGPT * update README.md * fix conflict * update readme * add code links to docs * Update README.md * Update dialo_gpt2.rst * Update pretrained_models.rst * Update docs/source/model_doc/dialo_gpt2.rst Co-Authored-By: Julien Chaumond <chaumond@gmail.com> * change filename of dialogpt Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-16 09:04:32 +02:00
Sam Shleifer	c59b1e682d	[examples] unit test for run_bart_sum (#3544) - adds pytorch-lightning dependency	2020-04-15 18:35:01 -04:00
Patrick von Platen	301bf8d1b4	Create Modelcard for Reformer Model	2020-04-15 16:26:24 +02:00
Patrick von Platen	01c37dcdb5	[Config, Caching] Remove `output_past` everywhere and replace by `use_cache` argument (#3734) * remove output_past from pt * make style * add optional input length for gpt2 * add use cache to prepare input * save memory in gpt2 * correct gpt2 test inputs * make past input optional for gpt2 * finish use_cache for all models * make style * delete modeling_gpt2 change in test file * correct docstring * correct is true statements for gpt2	2020-04-14 14:40:28 -04:00
Patrick von Platen	092cf881a5	[Generation, EncoderDecoder] Apply Encoder Decoder 1.5GB memory… (#3778)	2020-04-13 22:29:28 -04:00
Teven	352d5472b0	Shift labels internally within TransfoXLLMHeadModel when called with labels (#3716) * Shifting labels inside TransfoXLLMHead * Changed doc to reflect change * Updated pytorch test * removed IDE whitespace changes * black reformat Co-authored-by: TevenLeScao <teven.lescao@gmail.com>	2020-04-13 18:11:23 +02:00
elk-cloner	5ebd898953	fix dataset shuffling for Distributed training (#huggingface#3721) (#3766)	2020-04-13 10:11:18 -04:00
HenrykBorzymowski	7972a4019f	updated dutch squad model card (#3736) * added model_cards for polish squad models * corrected mistake in polish design cards * updated model_cards for squad2_dutch model * added links to benchmark models Co-authored-by: Henryk Borzymowski <henryk.borzymowski@pwc.com>	2020-04-11 06:44:59 -04:00
HUSEIN ZOLKEPLI	f8c1071c51	Added README huseinzol05/albert-tiny-bahasa-cased (#3746) * add bert bahasa readme * update readme * update readme * added xlnet * added tiny-bert and fix xlnet readme * added albert base * added albert tiny	2020-04-11 06:42:06 -04:00

3771 Commits