transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-25 15:28:59 +06:00

Author	SHA1	Message	Date
Patrick von Platen	180585741c	[Generation] Generation should allow to start with empty prompt (#3993 ) * fix empty prompt * fix length in generation pipeline	2020-04-28 14:33:15 +02:00
Patrick von Platen	52679fbc2e	add dialogpt training tips (#3996 )	2020-04-28 14:32:31 +02:00
Stefan Schweter	b5c6d3d4c7	notebooks: minor fix for community provided models example (#4025 )	2020-04-28 09:12:25 +02:00
martindh	2fade302ac	camembert-base-fquad Model card for illuin release of camembert-base-fquad	2020-04-27 18:29:55 -04:00
Manuel Romero	20c3b8cab4	Create model card	2020-04-27 18:27:46 -04:00
Manuel Romero	b3f272ffcb	Create model card	2020-04-27 18:27:04 -04:00
Nick Doiron	518f291eef	add model card for Hindi-BERT	2020-04-27 18:25:16 -04:00
monologg	d7b3bf547c	Model cards for KoELECTRA	2020-04-27 18:21:01 -04:00
Sai Saketh Aluru	db9d56c08a	Add modelcard for Hate-speech-CNERG/dehatebert-mono-arabic model (#3979 ) * Add dehatebert-mono-arabic readme card * Update dehatebert-mono-arabic model card	2020-04-27 18:18:54 -04:00
sshleifer	41750a6cff	Fix typos	2020-04-27 13:25:53 -04:00
Lorenzo Ampil	12bb7fe770	Fix t5 doc typos (#3978 ) * Fix tpo in into and add line under * Add missing blank line under * Correct types under	2020-04-27 18:27:15 +02:00
Julien Chaumond	97a375484c	rm boto3 dependency	2020-04-27 11:17:14 -04:00
Txus	4e817ff418	Create README.md (#3966 )	2020-04-25 09:16:40 -04:00
Junyi_Li	73d6a2f901	[model_cards] xlnet_chinese_large & roberta_chinese_large	2020-04-24 16:12:42 -04:00
Manuel Romero	623ba0236d	Create README.md (#3882 )	2020-04-24 15:57:01 -04:00
Leandro von Werra	f4078e0db6	Feat/add model card (#3923 ) * add model card for gpt2-imdb-ctrl * fix title * add sentiment control description	2020-04-24 10:24:28 -04:00
YuvalPeleg	03322b4261	Create README.md (#3917 )	2020-04-24 10:24:00 -04:00
Julien Chaumond	c811526004	[examples] For convenience, also save the tokenizer Close #3921	2020-04-24 09:52:42 -04:00
Cola	b0167632ce	Shuffle train subset for summarization example (#3909 ) * Shuffle train subset * Cleaner shuffle	2020-04-24 07:55:34 -04:00
Julien Chaumond	c53cc018de	[Trainer] Fix _rotate_checkpoints Close #3920	2020-04-23 23:59:43 +00:00
Julien Chaumond	cbbb3c43c5	[hubconf] Modify pythonpath to get canonical imports to work See https://github.com/huggingface/transformers/pull/3881/files#r412292660 Should we remove SRC_DIR from sys.path right after the imports, @aaugustin?	2020-04-23 16:27:43 -04:00
mneilly-et	77b75d2c78	Fix for #3873 to change type of exponent parameter for torch.pow() call from int to float (#3924 )	2020-04-23 14:25:31 -04:00
Clement	6ba254ee54	quick fix wording readme for community models (#3900 )	2020-04-23 14:19:45 -04:00
Jared T Nielsen	a79a9e1241	Fix TFAlbertForSequenceClassification classifier dropout probability. It was set to config.hidden_dropout_prob, but should be config.classifier_dropout_prob. (#3928 )	2020-04-23 13:18:16 -04:00
peterandluc	8e093e5981	Remove 50k limits bug	2020-04-23 11:15:09 -04:00
Julien Chaumond	6af5a54c28	[Trainer] reuse constant	2020-04-23 11:02:05 -04:00
Julien Chaumond	7c2a32ff88	[housekeeping] super()	2020-04-23 10:43:22 -04:00
Julien Chaumond	a946b6b51b	[housekeeping] Upgrade `# type` Python 2 syntax cc @sshleifer	2020-04-23 10:39:24 -04:00
Manuel Romero	cb3c2212c7	Create model card (#3890 ) Model: TinyBERT-spanish-uncased-finetuned-ner	2020-04-22 14:56:43 -04:00
Manuel Romero	d698b87f20	Update comparison table (#3889 )	2020-04-22 14:54:17 -04:00
Anthony MOI	13dd2acca4	Bump tokenizers version to final 0.7.0 (#3898 )	2020-04-22 11:02:29 -04:00
Lorenzo Ampil	f16540fcba	Pipeline for Text Generation: GenerationPipeline (#3758 ) * Add GenerationPipeline * Fix parameter names * Correct parameter __call__ parameters * Add model type attribute and correct function calls for prepare_input * Take out trailing commas from init attributes * Remove unnecessary tokenization line * Implement support for multiple text inputs * Apply generation support for multiple input text prompts * Take out tensor coersion * Take out batch index * Add text prompt to return sequence * Squeeze token tensore before decoding * Return only a single list of sequences if only one prompt was used * Correct results variable name * Add GenerationPipeline to SUPPORTED_TASKS with the alias , initalized w GPT2 * Registedred AutoModelWithLMHead for both pt and t * Update docstring for GenerationPipeline * Add kwargs parameter to mode.generate * Take out kwargs parameter after all * Add generation pipeline example in pipeline docstring * Fix max length by squeezing tokens tensor * Apply ensure_tensor_on_device to pytorch tensor * Include generation step in torch.no_grad * Take out input from prepare_xlm_input and set 'en' as default xlm_language * Apply framework specific encoding during prepare_input * Format w make style * Move GenerationPipeline import to follow proper import sorting * Take out training comma from generation dict * Apply requested changes * Change name to TextGenerationPipeline * Apply TextGenerationPipeline rename to __init___ * Changing alias to * Set input mapping as input to ensure_tensor_on_device * Fix assertion placement * Add test_text_generation * Add TextGenerationPipeline to PipelineCommonTests * Take out whitespace * Format __init__ w black * Fix __init__ style * Forman __init___ * Add line to end of __init__ * Correct model tokenizer set for test_text_generation * Ensure to return list of list, not list of string (to pass test) * Limit test models to only 3 to limit runtime to address circleCI timeout error * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Remove argument docstring, __init__, add additional __call__ arguments, and reformat results to list of dict * Fix blank result list * Add TextGenerationPipeline to pipelines.rst * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Fix typos from adding PADDING_TEXT_TOKEN_LENGTH * Fix incorrectly moved result list * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Add back generation line and make style * Take out blank whitespace * Apply new alis, text-generation, to test_pipelines * Fix text generation alias in test * Update src/transformers/pipelines.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-22 09:37:03 -04:00
Julien Chaumond	1dc9b3c784	Fixes #3877	2020-04-22 01:15:10 +00:00
Julien Chaumond	dd9d483d03	Trainer (#3800 ) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00
Julien Chaumond	eb5601b0a5	[ci] Pin torch version while we update	2020-04-21 15:46:18 -04:00
Spencer Adams	53f5ef6df5	create readme for spentaur/yelp model (#3874 ) * create readme for spentaur/yelp model * update spentaur/yelp/README.md * remove typo	2020-04-21 15:31:36 -04:00
Julien Chaumond	d32585a304	Fix Torch.hub + Integration test	2020-04-21 14:13:30 -04:00
Bharat Raghunathan	7d40901ce3	Fix Documentation issue in BertForMaskedLM forward (#3855 )	2020-04-21 09:08:20 +02:00
Andrey Kulagin	b1ff0b2ae7	Fix bug in examples: double wrap into DataParallel during eval	2020-04-20 19:37:44 -04:00
husein zolkepli	7f23af1684	added electra model (cherry picked from commit `b5f2dc5d62`)	2020-04-20 17:17:58 -04:00
Punyajoy Saha	03121deba3	New model added The first model added to the repo	2020-04-20 17:10:01 -04:00
Manuel Romero	15b9868f8b	Create model card	2020-04-20 17:07:34 -04:00
Funtowicz Morgan	2c05b8a56c	Remove tqdm logging when using pipelines. (#3833 ) Introduce tqdm_enabled parameter on squad_convert_examples_to_features() default to True and set to False in QA pipelines.	2020-04-20 22:58:52 +02:00
Jared T Nielsen	c79b550dd0	Add `qas_id` to SquadResult and SquadExample (#3745 ) * Add qas_id * Fix incorrect name in squad.py * Make output files optional for squad eval	2020-04-20 16:08:57 -04:00
Patrick von Platen	c4158a6314	[Pipelines] Encode to max length of input not max length of tokenizer for batch input (#3857 ) * remove max_length = tokenizer.max_length when encoding * make style	2020-04-20 14:39:16 -04:00
Mohamed El-Geish	857ccdb259	exbert links for my albert model cards (#3729 ) * exbert links for my albert model cards * Added exbert tag to the metadata block * Adding "how to cite"	2020-04-20 10:54:39 -04:00
Sam Shleifer	a504cb49ec	[examples] fix summarization do_predict (#3866 )	2020-04-20 10:49:56 -04:00
ahotrod	52c85f847a	Update README.md	2020-04-20 10:10:56 -04:00
Patrick von Platen	a21d4fa410	add "by" to ReadMe	2020-04-18 18:07:17 +02:00
Thomas Wolf	827d6d6ef0	Cleanup fast tokenizers integration (#3706 ) * First pass on utility classes and python tokenizers * finishing cleanup pass * style and quality * Fix tests * Updating following @mfuntowicz comment * style and quality * Fix Roberta * fix batch_size/seq_length inBatchEncoding * add alignement methods + tests * Fix OpenAI and Transfo-XL tokenizers * adding trim_offsets=True default for GPT2 et RoBERTa * style and quality * fix tests * add_prefix_space in roberta * bump up tokenizers to rc7 * style * unfortunately tensorfow does like these - removing shape/seq_len for now * Update src/transformers/tokenization_utils.py Co-Authored-By: Stefan Schweter <stefan@schweter.it> * Adding doc and docstrings * making flake8 happy Co-authored-by: Stefan Schweter <stefan@schweter.it>	2020-04-18 13:43:57 +02:00

... 224 225 226 227 228 ...

15053 Commits