transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-23 22:38:58 +06:00

Author	SHA1	Message	Date
Sam Shleifer	1a5aefc95c	[Seq2Seq Generation] Call encoder before expanding input_ids (#3370 )	2020-03-26 18:41:19 -04:00
Sam Shleifer	39371ee454	[Bart/Memory] don't create lm_head (#3323 ) * delete lm_head, skips weight tying * Fixed s3	2020-03-26 18:40:39 -04:00
Patrick von Platen	5ad2ea06af	Add wmt translation example (#3428 ) * add translation example * make style * adapt docstring * add gpu device as input for example * small renaming * better README	2020-03-26 19:07:59 +01:00
Patrick von Platen	b4fb94fe6d	revert unpin isort commit	2020-03-26 13:19:18 -04:00
Patrick von Platen	e703e923ca	Add t5 summarization example (#3411 ) * rebase to master * change tf to pytorch * change to pytorch * small fix * renaming * add gpu training possibility * renaming * improve README * incoorporate collins feedback * better Readme * better README.md	2020-03-26 18:17:55 +01:00
sakares saengkaew	1a6c546c6f	Add missing token classification for XLM (#3277 ) * Add the missing token classification for XLM * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add the missing token classification for XLM * fix styling * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add missing description for AlbertForTokenClassification * fix styling * Add missing docstring for AlBert * Slow tests should be slow Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com> Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-03-26 10:22:13 -04:00
Patrick von Platen	311970546f	rename string in pipeline	2020-03-26 14:59:49 +01:00
Manuel Romero	7420a6a9cc	Create card for model GPT-2-finetuned-CORD19	2020-03-26 09:10:09 -04:00
Patrick von Platen	022e8fab97	Adds translation pipeline (#3419 ) * fix merge conflicts * add t5 summarization example * change parameters for t5 summarization * make style * add first code snippet for translation * only add prefixes * add prefix patterns * make style * renaming * fix conflicts * remove unused patterns * solve conflicts * fix merge conflicts * remove translation example * remove summarization example * make sure tensors are in numpy for float comparsion * re-add t5 config * fix t5 import config typo * make style * remove unused numpy statements * update doctstring * import translation pipeline	2020-03-26 13:50:58 +01:00
HUSEIN ZOLKEPLI	3c5c567507	Update model card huseinzol05/bert-base-bahasa-cased (#3425 ) * add bert bahasa readme * update readme * update readme * added xlnet	2020-03-26 07:50:27 -04:00
Patrick von Platen	9c683ef01e	Add t5 to pipeline(task='summarization') (#3413 ) * solve conflicts * move warnings below * incorporate changes * add pad_to_max_length to pipelines * add bug fix for T5 beam search * add prefix patterns * make style * fix conflicts * adapt pipelines for task specific parameters * improve docstring * remove unused patterns	2020-03-26 11:03:13 +01:00
Lysandre Debut	ffcffebe85	Force the return of token type IDs (#3439 )	2020-03-26 09:41:36 +01:00
Travis McGuire	010e0460b2	Updated/added model cards (#3435 )	2020-03-25 16:40:03 -04:00
Patrick von Platen	ffa17fe322	Extend config with task specific configs. (#3433 ) * add new default configs * change prefix default to None	2020-03-25 21:32:04 +01:00
Julien Chaumond	83272a3853	Experiment w/ dataclasses (including Py36) (#3423 ) * [ci] Also run test_examples in py37 (will revert at the end of the experiment) * InputExample: use immutable dataclass * [deps] Install dataclasses for Py<3.7 * [skip ci] Revert "[ci] Also run test_examples in py37" This reverts commit `d29afd9959`.	2020-03-25 11:10:20 -04:00
Gabriele Sarti	ccbe839ee0	Added BioBERT-NLI model card (#3421 )	2020-03-24 21:15:55 -04:00
Andre Carrera	3d76df3a12	BART for summarization training with CNN/DM using pytorch-lightning	2020-03-24 21:00:24 -04:00
Julien Chaumond	eaabaaf750	[run_language_modeling] Fix: initialize a new model from a config object	2020-03-24 17:56:40 -04:00
Julien Chaumond	f8823bad9a	Expose missing mappings (see #3415 )	2020-03-24 17:46:25 -04:00
Julien Chaumond	d0c36a7b72	[ci] Partial revert of `18eec3a984` due to `fbc5bf10cf`	2020-03-24 12:10:43 -04:00
LysandreJik	fbc5bf10cf	v2.6.0 release: isort un-pinned	2020-03-24 11:52:02 -04:00
Manuel Romero	b88bda6af3	Add right model and tokenizer path in example	2020-03-24 11:30:12 -04:00
Stefan Schweter	b31ef225cf	[model_cards] 🇹🇷 Add new (uncased, 128k) BERTurk model	2020-03-24 11:29:06 -04:00
Stefan Schweter	b4009cb001	[model_cards] 🇹🇷 Add new (cased, 128k) BERTurk model	2020-03-24 11:29:06 -04:00
Stefan Schweter	d3283490ef	[model_cards] 🇹🇷 Add new (uncased) BERTurk model	2020-03-24 11:29:06 -04:00
Mohamed El-Geish	e279a312d6	Model cards for CS224n SQuAD2.0 models (#3406 ) * Model cards for CS224n SQuAD2.0 models * consistent spacing	2020-03-24 11:28:33 -04:00
Gabriele Sarti	7372e62b2c	Added precisions in SciBERT-NLI model card (#3410 )	2020-03-24 11:01:56 -04:00
LysandreJik	471cce24b3	Release: v2.6.0	2020-03-24 10:37:32 -04:00
Patrick von Platen	e392ba6938	Add camembert integration tests (#3375 ) * add integration tests for camembert * use jplu/tf-camembert fro the moment * make style	2020-03-24 10:18:37 +01:00
Julien Chaumond	a8e3336a85	[examples] Use AutoModels in more examples	2020-03-23 20:11:14 -04:00
Julien Chaumond	ec6766a363	[deps] scikit-learn's transient issue was fixed	2020-03-23 18:38:09 -04:00
Julien Chaumond	f7dcf8fcea	[BertAbs] Move files around for more consistent naming	2020-03-23 13:58:49 -04:00
Julien Chaumond	e25c4f4027	[ALBERT] move things around for more consistent naming see #3359 cc @lysandrejik	2020-03-23 13:58:21 -04:00
Manuel Romero	85b324bee5	Add comparison table with older brother in family	2020-03-23 12:11:20 -04:00
Manuel Romero	b7aa077a63	Create card for the model	2020-03-23 12:10:41 -04:00
Manuel Romero	f740177c87	Add comparison table with new models	2020-03-23 12:10:23 -04:00
LysandreJik	e52482909b	Correct order for dev/quality dependencies cc @julien-c	2020-03-23 12:01:23 -04:00
Gabriele Sarti	28424906c2	Added scibert-nli model card	2020-03-23 11:55:41 -04:00
Julien Chaumond	18eec3a984	[ci] simpler way to load correct version of isort hat/tip @bramvanroy	2020-03-23 10:03:22 -04:00
Julien Chaumond	cf72479bf1	One last reorder of {scheduler,optimizer}.step()	2020-03-20 18:05:50 -04:00
Elijah Rippeth	634bf6cf7e	fixes lr_scheduler warning For more details, see https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate	2020-03-20 18:03:50 -04:00
Travis McGuire	265709f5cd	New model, new model cards	2020-03-20 18:01:01 -04:00
Bram Vanroy	115abd2166	Handle pinned version of isort The CONTRIBUTING file pins to a specific version of isort, so we might as well install that in `dev` . This makes it easier for contributors so they don't have to manually install the specific commit.	2020-03-20 18:00:04 -04:00
Patrick von Platen	95e00d0808	Clean special token init in modeling_....py (#3264 ) * make style * fix conflicts	2020-03-20 21:41:04 +01:00
Nitish Shirish Keskar	8becb73293	removing torch.cuda.empty_cache() from TF function (#3267 ) torch.cuda.empty_cache() was being called from a TF function (even when torch is unavailable) not sure any replacement is needed if TF OOMs	2020-03-19 23:25:30 +01:00
Julien Chaumond	ecfd336318	Simpler Error message when loading config/model with .from_pretrained() (#3341 )	2020-03-19 23:23:03 +01:00
Kyeongpil Kang	8eeefcb576	Update 01-training-tokenizers.ipynb (typo issue) (#3343 ) I found there are two grammar errors or typo issues in the explanation of the encoding properties. The original sentences: If your was made of multiple \"parts\" such as (question, context), then this would be a vector with for each token the segment it belongs to If your has been truncated into multiple subparts because of a length limit (for BERT for example the sequence length is limited to 512), this will contain all the remaining overflowing parts. I think "input" should be inserted after the phrase "If your".	2020-03-19 23:21:49 +01:00
Patrick von Platen	bbf26c4e61	Support T5 Generation (#3228 ) * fix conflicts * update bart max length test * correct spelling mistakes * implemented model specific encode function * fix merge conflicts * better naming * save intermediate state -> need to rethink strucuture a bit * leave tf problem as it is for now * current version * add layers.pop * remove ipdb * make style * clean return cut decoding * remove ipdbs * Fix restoring layers in the decoders that doesnt exists. * push good intermediate solution for now * fix conflicts * always good to refuse to merge conflicts when rebasing * fix small bug * improve function calls * remove unused file * add correct scope behavior for t5_generate Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-03-19 23:18:23 +01:00
Julien Chaumond	656e1386a2	Fix #3305 : run_ner only possible on ModelForTokenClassification models	2020-03-19 16:41:28 -04:00
husein zolkepli	0c44b11917	add bert bahasa readme	2020-03-19 15:08:19 -04:00

... 228 229 230 231 232 ...

15053 Commits