Suchin
529534dc2f
BioMed Roberta-Base (AllenAI) ( #3643 )
...
* added model card
* updated README
* updated README
* updated README
* added evals
* removed pico eval
* Tweaks
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-06 16:12:09 -04:00
Lysandre Debut
261c4ff4e2
Update notebooks ( #3620 )
...
* Update notebooks
* From local to global link
* from local links to *actual* global links
2020-04-06 14:32:39 -04:00
Julien Chaumond
39a34cc375
[model_cards] ELECTRA (w/ examples of usage)
...
Co-Authored-By: Kevin Clark <clarkkev@users.noreply.github.com>
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2020-04-06 11:43:33 -04:00
LysandreJik
ea6dba2787
Re-pin isort
2020-04-06 10:09:54 -04:00
LysandreJik
11c3257a18
unpin isort for pypi
2020-04-06 10:06:41 -04:00
LysandreJik
36bffc81b3
Release: v2.8.0
2020-04-06 10:03:53 -04:00
Patrick von Platen
2ee410560e
[Generate, Test] Split generate test function into beam search, no beam search ( #3601 )
...
* split beam search and no beam search test
* fix test
* clean generate tests
2020-04-06 10:37:05 +02:00
Patrick von Platen
1789c7daf1
fix argument order ( #3637 )
2020-04-05 12:33:41 +02:00
Patrick von Platen
b809d2f073
Fix TF T5 docstring ( #3636 )
2020-04-05 12:23:09 +02:00
Timo Moeller
4ab8ab4f50
Adjust model card to reflect changes to vocabulary
...
(cherry picked from commit 8e25c4bf28
)
2020-04-04 15:27:41 -04:00
ktrapeznikov
ac40eed1a5
Create README.md
...
adding readme for
ktrapeznikov/albert-xlarge-v2-squad-v2
2020-04-04 15:18:54 -04:00
ktrapeznikov
fd9995ebc5
Create README.md
2020-04-04 15:18:31 -04:00
Julien Chaumond
5d912e7ed4
Tweak typing for #3566
2020-04-04 15:04:03 -04:00
Julien Chaumond
94eb68d742
weigths*weights
2020-04-04 15:03:26 -04:00
Manuel Romero
243e687be6
Create model card
2020-04-04 08:20:34 -04:00
Julien Chaumond
3e4b4dd190
[model_cards] Link to ExBERT visualisation
...
Hat/tip @bhoov @HendrikStrobelt @sebastianGehrmann
Also cc @srush and @thomwolf
2020-04-03 20:03:29 -04:00
Max Ryabinin
c6acd246ec
Speed up GELU computation with torch.jit ( #2988 )
...
* Compile gelu_new with torchscript
* Compile _gelu_python with torchscript
* Wrap gelu_new with torch.jit for torch>=1.4
2020-04-03 15:20:21 -04:00
Lysandre Debut
d5d7d88612
ELECTRA ( #3257 )
...
* Electra wip
* helpers
* Electra wip
* Electra v1
* ELECTRA may be saved/loaded
* Generator & Discriminator
* Embedding size instead of halving the hidden size
* ELECTRA Tokenizer
* Revert BERT helpers
* ELECTRA Conversion script
* Archive maps
* PyTorch tests
* Start fixing tests
* Tests pass
* Same configuration for both models
* Compatible with base + large
* Simplification + weight tying
* Archives
* Auto + Renaming to standard names
* ELECTRA is uncased
* Tests
* Slight API changes
* Update tests
* wip
* ElectraForTokenClassification
* temp
* Simpler arch + tests
Removed ElectraForPreTraining which will be in a script
* Conversion script
* Auto model
* Update links to S3
* Split ElectraForPreTraining and ElectraForTokenClassification
* Actually test PreTraining model
* Remove num_labels from configuration
* wip
* wip
* From discriminator and generator to electra
* Slight API changes
* Better naming
* TensorFlow ELECTRA tests
* Accurate conversion script
* Added to conversion script
* Fast ELECTRA tokenizer
* Style
* Add ELECTRA to README
* Modeling Pytorch Doc + Real style
* TF Docs
* Docs
* Correct links
* Correct model intialized
* random fixes
* style
* Addressing Patrick's and Sam's comments
* Correct links in docs
2020-04-03 14:10:54 -04:00
Yohei Tamura
8594dd80dd
BertJapaneseTokenizer accept options for mecab ( #3566 )
...
* BertJapaneseTokenizer accept options for mecab
* black
* fix mecab_option to Option[str]
2020-04-03 11:12:19 -04:00
HUSEIN ZOLKEPLI
216e167ce6
Added albert-base-bahasa-cased README and fixed tiny-bert-bahasa-cased README ( #3613 )
...
* add bert bahasa readme
* update readme
* update readme
* added xlnet
* added tiny-bert and fix xlnet readme
* added albert base
2020-04-03 09:28:43 -04:00
ahotrod
1ac6a246d8
Update README.md ( #3604 )
...
Update AutoModel & AutoTokernizer loading.
2020-04-03 09:28:25 -04:00
ahotrod
e91692f4a3
Update README.md ( #3603 )
2020-04-03 09:27:57 -04:00
HenrykBorzymowski
8e287d507d
corrected mistake in polish model cards ( #3611 )
...
* added model_cards for polish squad models
* corrected mistake in polish design cards
Co-authored-by: Henryk Borzymowski <henryk.borzymowski@pwc.com>
2020-04-03 09:07:15 -04:00
redewiedergabe
81484b447b
Create README.md ( #3568 )
...
* Create README.md
* added meta block (language: german)
* Added additional information about test data
2020-04-02 21:48:31 -04:00
ahotrod
9f6349aba9
Create README.md
2020-04-02 21:43:12 -04:00
Henryk Borzymowski
ddb1ce7418
added model_cards for polish squad models
2020-04-02 21:40:16 -04:00
Patrick von Platen
f68d22850c
delete bogus print statement ( #3595 )
2020-04-02 21:49:34 +02:00
Nicolas
c50aa67bff
Resizing embedding matrix before sending it to the optimizer. ( #3532 )
...
* Resizing embedding matrix after sending it to the optimizer prevents from updating the newly resized matrix.
* Remove space for style matter
2020-04-02 15:00:05 -04:00
Mark Kockerbeck
1b10159950
Adding should_continue check for retraining ( #3509 )
2020-04-02 14:07:08 -04:00
Patrick von Platen
390c128592
[Encoder-Decoder] Force models outputs to always have batch_size as their first dim ( #3536 )
...
* solve conflicts
* improve comments
2020-04-02 15:18:33 +02:00
Patrick von Platen
ab5d06a094
[T5, examples] replace heavy t5 models with tiny random models ( #3556 )
...
* replace heavy t5 models with tiny random models as was done by sshleifer
* fix isort
2020-04-02 12:34:05 +02:00
Patrick von Platen
a4ee4da18a
[T5, TF 2.2] change tf t5 argument naming ( #3547 )
...
* change tf t5 argument naming for TF 2.2
* correct bug in testing
2020-04-01 22:04:20 +02:00
Patrick von Platen
06dd597552
fix bug in warnings T5 pipelines ( #3545 )
2020-04-01 21:59:12 +02:00
Anirudh Srinivasan
9de9ceb6c5
Correct output shape for Bert NSP models in docs ( #3482 )
2020-04-01 15:04:38 -04:00
Patrick von Platen
b815edf69f
[T5, Testst] Add extensive hard-coded integration tests and make sure PT and TF give equal results ( #3550 )
...
* add some t5 integration tests
* finish summarization and translation integration tests for T5 - results loook good
* add tf test
* fix == vs is bug
* fix tf beam search error and make tf t5 tests pass
2020-04-01 18:01:33 +02:00
HUSEIN ZOLKEPLI
8538ce9044
Add tiny-bert-bahasa-cased model card ( #3567 )
...
* add bert bahasa readme
* update readme
* update readme
* added xlnet
* added tiny-bert and fix xlnet readme
2020-04-01 07:15:00 -04:00
Manuel Romero
c1a6252be1
Create model card ( #3557 )
...
Create model card for: distilbert-multi-finetuned-for-xqua-on-tydiqa
2020-04-01 07:14:23 -04:00
Julien Chaumond
50e15c825c
Tokenizers: Start cleaning examples a little ( #3455 )
...
* Start cleaning examples
* Fixup
2020-04-01 07:13:40 -04:00
Patrick von Platen
b38d552a92
[Generate] Add bad words list argument to the generate function ( #3367 )
...
* add bad words list
* make style
* add bad_words_tokens
* make style
* better naming
* make style
* fix typo
2020-03-31 18:42:31 +02:00
Patrick von Platen
ae6834e028
[Examples] Clean summarization and translation example testing files for T5 and Bart ( #3514 )
...
* fix conflicts
* add model size argument to summarization
* correct wrong import
* fix isort
* correct imports
* other isort make style
* make style
2020-03-31 17:54:13 +02:00
Manuel Romero
0373b60c4c
Update README.md ( #3552 )
...
- Show that the last uploaded version was trained on more data (custom_license files)
2020-03-31 10:40:34 -04:00
Patrick von Platen
83d1fbcff6
[Docs] Add usage examples for translation and summarization ( #3538 )
2020-03-31 09:36:03 -04:00
Patrick von Platen
55bcae7f25
remove useless and confusing lm_labels line ( #3531 )
2020-03-31 09:32:25 -04:00
Patrick von Platen
42e1e3c67f
Update usage doc regarding generate fn ( #3504 )
2020-03-31 09:31:46 -04:00
Patrick von Platen
57b0fab692
Add better explanation to check docs
locally. ( #3459 )
2020-03-31 09:30:17 -04:00
Manuel Romero
a8d4dff0a1
Update README.md ( #3470 )
...
Fix typo
2020-03-31 08:01:09 -04:00
Manuel Romero
4a5663568f
Create card for the model: GPT-2-finetuned-covid-bio-medrxiv ( #3453 )
2020-03-31 08:01:03 -04:00
Branden Chan
bbedb59675
Create README.md ( #3393 )
...
* Create README.md
* Update README.md
2020-03-31 08:00:35 -04:00
Manuel Romero
c2cf192943
Add link to 16 POS tags model ( #3465 )
2020-03-31 08:00:00 -04:00
Gabriele Sarti
c82ef72158
Added CovidBERT-NLI model card ( #3477 )
2020-03-31 07:59:49 -04:00