transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 01:02:25 +06:00

Author	SHA1	Message	Date
Manuel Romero	b48a1f08c1	Add text shown in example of usage (#3464 )	2020-03-31 07:59:36 -04:00
Manuel Romero	99833a9cbf	Create model card (#3487 )	2020-03-31 07:59:22 -04:00
Sho Arora	ebceeeacda	Add electra and alectra model cards (#3524 )	2020-03-31 07:58:48 -04:00
Leandro von Werra	a6c4ee27fd	Add model cards (#3537 ) * feat: add model card bert-imdb * feat: add model card gpt2-imdb-pos * feat: add model card gpt2-imdb	2020-03-31 07:54:45 -04:00
Ethan Perez	e5c393dceb	[Bug fix] Using loaded checkpoint with --do_predict (instead of… (#3437 ) * Using loaded checkpoint with --do_predict Without this fix, I'm getting near-random validation performance for a trained model, and the validation performance differs per validation run. I think this happens since the `model` variable isn't set with the loaded checkpoint, so I'm using a randomly initialized model. Looking at the model activations, they differ each time I run evaluation (but they don't with this fix). * Update checkpoint loading * Fixing model loading	2020-03-30 17:06:08 -04:00
Sam Shleifer	8deff3acf2	[bart-tiny-random] Put a 5MB model on S3 to allow faster exampl… (#3488 )	2020-03-30 12:28:27 -04:00
dougian	1f72865726	[BART] Update encoder and decoder on set_input_embedding (#3501 ) Co-authored-by: Ioannis Douratsos <ioannisd@amazon.com>	2020-03-30 12:20:37 -04:00
Julien Chaumond	cc598b312b	[InputExample] Unfreeze for now, cf. #3423	2020-03-30 10:41:49 -04:00
Julien Plu	d38bbb225f	Update the NER TF script (#3511 ) * Update the NER TF script to remove the softmax and make the pad token label id to -1 * Reformat the quality and style Co-authored-by: Julien Plu <julien.plu@adevinta.com>	2020-03-30 09:50:12 -04:00
LysandreJik	eff757f2e3	Re-pin isort version	2020-03-30 09:00:47 -04:00
LysandreJik	a009d751c2	Un-pin isort for v2.7.0 pypi	2020-03-30 08:55:10 -04:00
LysandreJik	6f5a12a583	Release: v2.7.0	2020-03-30 08:49:24 -04:00
Patrick von Platen	296252c49e	fix lm lables in docstring (#3529 )	2020-03-30 14:26:24 +02:00
Patrick von Platen	75ec6c9e3a	[T5] make decoder input ids optional for t5 training (#3521 ) * make decoder input ids optional for t5 training * lm_lables should not be shifted in t5 * add tests * finish shift right functionality for PT T5 * move shift right to correct class * cleaner code * replace -100 values with pad token id * add assert statement * remove unnecessary for loop * make style	2020-03-30 13:45:26 +02:00
Patrick von Platen	5b44e0a31b	[T5] Add training documenation (#3507 ) * Add clear description of how to train T5 * correct docstring in T5 * correct typo * correct docstring format * update t5 model docs * implement collins feedback * fix typo and add more explanation for sentinal tokens * delete unnecessary todos	2020-03-30 13:35:53 +02:00
Sam Shleifer	33ef7002e1	[Docs] examples/summarization/bart: Simplify CNN/DM preprocessi… (#3516 )	2020-03-29 13:25:42 -04:00
Sam Shleifer	f6a23d1911	[BART] add bart-large-xsum weights (#3422 )	2020-03-29 10:51:13 -04:00
Stefan Schweter	601ac5b1dc	[model_cards]: use MIT license for all dbmdz models	2020-03-27 18:06:25 -04:00
Patrick von Platen	17dceae7a1	Fix circle ci flaky fail of wmt example (#3485 ) * force bleu * fix wrong file name * rename file * different filenames for each example test * test files should clean up after themselves * test files should clean up after themselves * do not force bleu * correct typo * fix isort	2020-03-27 13:01:28 -04:00
Patrick von Platen	00ea100e96	add summarization and translation to notebook (#3478 )	2020-03-27 11:05:37 -04:00
Funtowicz Morgan	b08259a120	run_ner.py / bert-base-multilingual-cased can output empty tokens (#2991 ) * Use tokenizer.num_added_tokens to count number of added special_tokens instead of hardcoded numbers. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * run_ner.py - Do not add a label to the labels_ids if word_tokens is empty. This can happen when using bert-base-multilingual-cased with an input containing an unique space. In this case, the tokenizer will output just an empty word_tokens thus leading to an non-consistent behavior over the labels_ids tokens adding one more tokens than tokens vector. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-03-27 10:59:55 -04:00
Patrick von Platen	f4f4946836	Rename `t5-large` to `t5-base` in README.md	2020-03-27 15:57:58 +01:00
Patrick von Platen	fa9af2468a	Add T5 to docs (#3461 ) * add t5 docs basis * improve docs * add t5 docs * improve t5 docstring * add t5 tokenizer docstring * finish docstring * make style * add pretrained models * correct typo * make examples work * finalize docs	2020-03-27 10:57:16 -04:00
Lysandre Debut	ff80b73157	Add option to choose T5 model size. (#3480 ) T5-small in test isort	2020-03-27 15:56:59 +01:00
LysandreJik	e2c05f06ef	Correct indentation in docstring For some reason Sphinx extremely dislikes this and crashes.	2020-03-27 09:28:52 -04:00
Sam Shleifer	3ee431dd4c	[Bart/Memory] Two separate, smaller decoder attention masks (#3371 )	2020-03-26 21:34:15 -04:00
Manuel Romero	53fe733805	Model Cards: Fix grammar error (#3467 )	2020-03-26 21:33:33 -04:00
Sam Shleifer	c10decf7a0	[Bart: example] drop columns that are exclusively pad_token_id… (#3400 ) * trim seq_len below 1024 if there are columns full of pad_token_id * Centralize trim_batch so SummarizationDataset can use it too	2020-03-26 19:33:54 -04:00
Sam Shleifer	63f4d8cad0	[Bart/Memory] SelfAttention only returns weights if config.outp… (#3369 )	2020-03-26 18:42:39 -04:00
Sam Shleifer	2b2a2f8df2	[Bart] Fix: put dummy_inputs on correct device (#3398 ) * Dummy inputs to model.device * Move self.device to ModuleUtilsMixin	2020-03-26 18:42:09 -04:00
Sam Shleifer	1a5aefc95c	[Seq2Seq Generation] Call encoder before expanding input_ids (#3370 )	2020-03-26 18:41:19 -04:00
Sam Shleifer	39371ee454	[Bart/Memory] don't create lm_head (#3323 ) * delete lm_head, skips weight tying * Fixed s3	2020-03-26 18:40:39 -04:00
Patrick von Platen	5ad2ea06af	Add wmt translation example (#3428 ) * add translation example * make style * adapt docstring * add gpu device as input for example * small renaming * better README	2020-03-26 19:07:59 +01:00
Patrick von Platen	b4fb94fe6d	revert unpin isort commit	2020-03-26 13:19:18 -04:00
Patrick von Platen	e703e923ca	Add t5 summarization example (#3411 ) * rebase to master * change tf to pytorch * change to pytorch * small fix * renaming * add gpu training possibility * renaming * improve README * incoorporate collins feedback * better Readme * better README.md	2020-03-26 18:17:55 +01:00
sakares saengkaew	1a6c546c6f	Add missing token classification for XLM (#3277 ) * Add the missing token classification for XLM * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add the missing token classification for XLM * fix styling * fix styling * Add XLMForTokenClassification to AutoModelForTokenClassification class * Fix docstring typo for non-existing class * Add missing description for AlbertForTokenClassification * fix styling * Add missing docstring for AlBert * Slow tests should be slow Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com> Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-03-26 10:22:13 -04:00
Patrick von Platen	311970546f	rename string in pipeline	2020-03-26 14:59:49 +01:00
Manuel Romero	7420a6a9cc	Create card for model GPT-2-finetuned-CORD19	2020-03-26 09:10:09 -04:00
Patrick von Platen	022e8fab97	Adds translation pipeline (#3419 ) * fix merge conflicts * add t5 summarization example * change parameters for t5 summarization * make style * add first code snippet for translation * only add prefixes * add prefix patterns * make style * renaming * fix conflicts * remove unused patterns * solve conflicts * fix merge conflicts * remove translation example * remove summarization example * make sure tensors are in numpy for float comparsion * re-add t5 config * fix t5 import config typo * make style * remove unused numpy statements * update doctstring * import translation pipeline	2020-03-26 13:50:58 +01:00
HUSEIN ZOLKEPLI	3c5c567507	Update model card huseinzol05/bert-base-bahasa-cased (#3425 ) * add bert bahasa readme * update readme * update readme * added xlnet	2020-03-26 07:50:27 -04:00
Patrick von Platen	9c683ef01e	Add t5 to pipeline(task='summarization') (#3413 ) * solve conflicts * move warnings below * incorporate changes * add pad_to_max_length to pipelines * add bug fix for T5 beam search * add prefix patterns * make style * fix conflicts * adapt pipelines for task specific parameters * improve docstring * remove unused patterns	2020-03-26 11:03:13 +01:00
Lysandre Debut	ffcffebe85	Force the return of token type IDs (#3439 )	2020-03-26 09:41:36 +01:00
Travis McGuire	010e0460b2	Updated/added model cards (#3435 )	2020-03-25 16:40:03 -04:00
Patrick von Platen	ffa17fe322	Extend config with task specific configs. (#3433 ) * add new default configs * change prefix default to None	2020-03-25 21:32:04 +01:00
Julien Chaumond	83272a3853	Experiment w/ dataclasses (including Py36) (#3423 ) * [ci] Also run test_examples in py37 (will revert at the end of the experiment) * InputExample: use immutable dataclass * [deps] Install dataclasses for Py<3.7 * [skip ci] Revert "[ci] Also run test_examples in py37" This reverts commit `d29afd9959`.	2020-03-25 11:10:20 -04:00
Gabriele Sarti	ccbe839ee0	Added BioBERT-NLI model card (#3421 )	2020-03-24 21:15:55 -04:00
Andre Carrera	3d76df3a12	BART for summarization training with CNN/DM using pytorch-lightning	2020-03-24 21:00:24 -04:00
Julien Chaumond	eaabaaf750	[run_language_modeling] Fix: initialize a new model from a config object	2020-03-24 17:56:40 -04:00
Julien Chaumond	f8823bad9a	Expose missing mappings (see #3415 )	2020-03-24 17:46:25 -04:00
Julien Chaumond	d0c36a7b72	[ci] Partial revert of `18eec3a984` due to `fbc5bf10cf`	2020-03-24 12:10:43 -04:00

... 314 315 316 317 318 ...

19383 Commits