LysandreJik
e2c05f06ef
Correct indentation in docstring
...
For some reason Sphinx extremely dislikes this and crashes.
2020-03-27 09:28:52 -04:00
Sam Shleifer
3ee431dd4c
[Bart/Memory] Two separate, smaller decoder attention masks ( #3371 )
2020-03-26 21:34:15 -04:00
Manuel Romero
53fe733805
Model Cards: Fix grammar error ( #3467 )
2020-03-26 21:33:33 -04:00
Sam Shleifer
c10decf7a0
[Bart: example] drop columns that are exclusively pad_token_id… ( #3400 )
...
* trim seq_len below 1024 if there are columns full of pad_token_id
* Centralize trim_batch so SummarizationDataset can use it too
2020-03-26 19:33:54 -04:00
Sam Shleifer
63f4d8cad0
[Bart/Memory] SelfAttention only returns weights if config.outp… ( #3369 )
2020-03-26 18:42:39 -04:00
Sam Shleifer
2b2a2f8df2
[Bart] Fix: put dummy_inputs on correct device ( #3398 )
...
* Dummy inputs to model.device
* Move self.device to ModuleUtilsMixin
2020-03-26 18:42:09 -04:00
Sam Shleifer
1a5aefc95c
[Seq2Seq Generation] Call encoder before expanding input_ids ( #3370 )
2020-03-26 18:41:19 -04:00
Sam Shleifer
39371ee454
[Bart/Memory] don't create lm_head ( #3323 )
...
* delete lm_head, skips weight tying
* Fixed s3
2020-03-26 18:40:39 -04:00
Patrick von Platen
5ad2ea06af
Add wmt translation example ( #3428 )
...
* add translation example
* make style
* adapt docstring
* add gpu device as input for example
* small renaming
* better README
2020-03-26 19:07:59 +01:00
Patrick von Platen
b4fb94fe6d
revert unpin isort commit
2020-03-26 13:19:18 -04:00
Patrick von Platen
e703e923ca
Add t5 summarization example ( #3411 )
...
* rebase to master
* change tf to pytorch
* change to pytorch
* small fix
* renaming
* add gpu training possibility
* renaming
* improve README
* incoorporate collins feedback
* better Readme
* better README.md
2020-03-26 18:17:55 +01:00
sakares saengkaew
1a6c546c6f
Add missing token classification for XLM ( #3277 )
...
* Add the missing token classification for XLM
* fix styling
* Add XLMForTokenClassification to AutoModelForTokenClassification class
* Fix docstring typo for non-existing class
* Add the missing token classification for XLM
* fix styling
* fix styling
* Add XLMForTokenClassification to AutoModelForTokenClassification class
* Fix docstring typo for non-existing class
* Add missing description for AlbertForTokenClassification
* fix styling
* Add missing docstring for AlBert
* Slow tests should be slow
Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com>
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
2020-03-26 10:22:13 -04:00
Patrick von Platen
311970546f
rename string in pipeline
2020-03-26 14:59:49 +01:00
Manuel Romero
7420a6a9cc
Create card for model GPT-2-finetuned-CORD19
2020-03-26 09:10:09 -04:00
Patrick von Platen
022e8fab97
Adds translation pipeline ( #3419 )
...
* fix merge conflicts
* add t5 summarization example
* change parameters for t5 summarization
* make style
* add first code snippet for translation
* only add prefixes
* add prefix patterns
* make style
* renaming
* fix conflicts
* remove unused patterns
* solve conflicts
* fix merge conflicts
* remove translation example
* remove summarization example
* make sure tensors are in numpy for float comparsion
* re-add t5 config
* fix t5 import config typo
* make style
* remove unused numpy statements
* update doctstring
* import translation pipeline
2020-03-26 13:50:58 +01:00
HUSEIN ZOLKEPLI
3c5c567507
Update model card huseinzol05/bert-base-bahasa-cased ( #3425 )
...
* add bert bahasa readme
* update readme
* update readme
* added xlnet
2020-03-26 07:50:27 -04:00
Patrick von Platen
9c683ef01e
Add t5 to pipeline(task='summarization') ( #3413 )
...
* solve conflicts
* move warnings below
* incorporate changes
* add pad_to_max_length to pipelines
* add bug fix for T5 beam search
* add prefix patterns
* make style
* fix conflicts
* adapt pipelines for task specific parameters
* improve docstring
* remove unused patterns
2020-03-26 11:03:13 +01:00
Lysandre Debut
ffcffebe85
Force the return of token type IDs ( #3439 )
2020-03-26 09:41:36 +01:00
Travis McGuire
010e0460b2
Updated/added model cards ( #3435 )
2020-03-25 16:40:03 -04:00
Patrick von Platen
ffa17fe322
Extend config with task specific configs. ( #3433 )
...
* add new default configs
* change prefix default to None
2020-03-25 21:32:04 +01:00
Julien Chaumond
83272a3853
Experiment w/ dataclasses (including Py36) ( #3423 )
...
* [ci] Also run test_examples in py37
(will revert at the end of the experiment)
* InputExample: use immutable dataclass
* [deps] Install dataclasses for Py<3.7
* [skip ci] Revert "[ci] Also run test_examples in py37"
This reverts commit d29afd9959
.
2020-03-25 11:10:20 -04:00
Gabriele Sarti
ccbe839ee0
Added BioBERT-NLI model card ( #3421 )
2020-03-24 21:15:55 -04:00
Andre Carrera
3d76df3a12
BART for summarization training with CNN/DM using pytorch-lightning
2020-03-24 21:00:24 -04:00
Julien Chaumond
eaabaaf750
[run_language_modeling] Fix: initialize a new model from a config object
2020-03-24 17:56:40 -04:00
Julien Chaumond
f8823bad9a
Expose missing mappings (see #3415 )
2020-03-24 17:46:25 -04:00
Julien Chaumond
d0c36a7b72
[ci] Partial revert of 18eec3a984
due to fbc5bf10cf
2020-03-24 12:10:43 -04:00
LysandreJik
fbc5bf10cf
v2.6.0 release: isort un-pinned
2020-03-24 11:52:02 -04:00
Manuel Romero
b88bda6af3
Add right model and tokenizer path in example
2020-03-24 11:30:12 -04:00
Stefan Schweter
b31ef225cf
[model_cards] 🇹🇷 Add new (uncased, 128k) BERTurk model
2020-03-24 11:29:06 -04:00
Stefan Schweter
b4009cb001
[model_cards] 🇹🇷 Add new (cased, 128k) BERTurk model
2020-03-24 11:29:06 -04:00
Stefan Schweter
d3283490ef
[model_cards] 🇹🇷 Add new (uncased) BERTurk model
2020-03-24 11:29:06 -04:00
Mohamed El-Geish
e279a312d6
Model cards for CS224n SQuAD2.0 models ( #3406 )
...
* Model cards for CS224n SQuAD2.0 models
* consistent spacing
2020-03-24 11:28:33 -04:00
Gabriele Sarti
7372e62b2c
Added precisions in SciBERT-NLI model card ( #3410 )
2020-03-24 11:01:56 -04:00
LysandreJik
471cce24b3
Release: v2.6.0
2020-03-24 10:37:32 -04:00
Patrick von Platen
e392ba6938
Add camembert integration tests ( #3375 )
...
* add integration tests for camembert
* use jplu/tf-camembert fro the moment
* make style
2020-03-24 10:18:37 +01:00
Julien Chaumond
a8e3336a85
[examples] Use AutoModels in more examples
2020-03-23 20:11:14 -04:00
Julien Chaumond
ec6766a363
[deps] scikit-learn's transient issue was fixed
2020-03-23 18:38:09 -04:00
Julien Chaumond
f7dcf8fcea
[BertAbs] Move files around for more consistent naming
2020-03-23 13:58:49 -04:00
Julien Chaumond
e25c4f4027
[ALBERT] move things around for more consistent naming
...
see #3359
cc @lysandrejik
2020-03-23 13:58:21 -04:00
Manuel Romero
85b324bee5
Add comparison table with older brother in family
2020-03-23 12:11:20 -04:00
Manuel Romero
b7aa077a63
Create card for the model
2020-03-23 12:10:41 -04:00
Manuel Romero
f740177c87
Add comparison table with new models
2020-03-23 12:10:23 -04:00
LysandreJik
e52482909b
Correct order for dev/quality dependencies
...
cc @julien-c
2020-03-23 12:01:23 -04:00
Gabriele Sarti
28424906c2
Added scibert-nli model card
2020-03-23 11:55:41 -04:00
Julien Chaumond
18eec3a984
[ci] simpler way to load correct version of isort
...
hat/tip @bramvanroy
2020-03-23 10:03:22 -04:00
Julien Chaumond
cf72479bf1
One last reorder of {scheduler,optimizer}.step()
2020-03-20 18:05:50 -04:00
Elijah Rippeth
634bf6cf7e
fixes lr_scheduler warning
...
For more details, see https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
2020-03-20 18:03:50 -04:00
Travis McGuire
265709f5cd
New model, new model cards
2020-03-20 18:01:01 -04:00
Bram Vanroy
115abd2166
Handle pinned version of isort
...
The CONTRIBUTING file pins to a specific version of isort, so we might as well install that in `dev` . This makes it easier for contributors so they don't have to manually install the specific commit.
2020-03-20 18:00:04 -04:00
Patrick von Platen
95e00d0808
Clean special token init in modeling_....py ( #3264 )
...
* make style
* fix conflicts
2020-03-20 21:41:04 +01:00