Jannes
9750e1300c
Create README.md ( #5847 )
2020-07-17 14:03:53 -04:00
Julien Chaumond
1bca4fbd39
[model_card] Fix metadata
2020-07-17 13:55:37 -04:00
Gianpaolo Di Pietro
a9d56a675a
Added model card for neuraly/bert-base-italian-cased-sentiment ( #5845 )
...
* Added model card for neuraly/bert-base-italian-cased-sentiment
* Update model_cards/neuraly/bert-base-italian-cased-sentiment/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Gianpy15 <g.dipietro@neuraly.ai>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-17 13:50:49 -04:00
Patrick von Platen
12f14710ce
[Model card] Bert2Bert
...
Add Rouge2 results
2020-07-17 18:22:05 +02:00
Patrick von Platen
9d37c56bab
[Reformer] - Cache hidden states and buckets to speed up inference ( #5578 )
...
* fix merge rebase
* add intermediate reformer code
* save intermediate caching results
* save intermediate
* save intermediate results
* save intermediate
* upload next step
* fix generate tests
* make tests work
* add named tuple output
* Apply suggestions from code review
* fix use_cache for False case
* fix tensor to gpu
* fix tensor to gpu
* refactor
* refactor and make style
2020-07-17 16:17:42 +02:00
Patrick von Platen
0b6c255a95
[Model card] Bert2Bert ( #5841 )
...
* Create README.md
* Update README.md
* Update README.md
* Update README.md
2020-07-17 11:41:56 +02:00
Sam Shleifer
3d9556a72b
[cleanups] make Marian save as Marian ( #5830 )
2020-07-17 02:54:25 -04:00
Sam Shleifer
e238e3d55a
[seq2seq] Don't copy self.source in sortishsampler ( #5818 )
2020-07-17 01:53:25 -04:00
Bayartsogt Yadamsuren
2e4624b415
language tag addition on albert-mongolian ( #5828 )
...
* language tag addition on albert-mongolian
* Update model_cards/bayartsogt/albert-mongolian/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-17 01:40:38 -04:00
Manuel Romero
d088d744ad
Create README.md ( #5821 )
2020-07-16 15:18:31 -04:00
Nick Doiron
233072fc1e
dv-wave ( #5823 )
2020-07-16 15:13:51 -04:00
Sam Shleifer
283500ff9f
[seq2seq] pack_dataset.py rewrites dataset in max_tokens format ( #5819 )
2020-07-16 14:06:49 -04:00
Manuel Romero
c45d7a707d
Update README.md ( #5812 )
...
Fix missig "-" in meta data
2020-07-16 10:25:50 -04:00
Patrick von Platen
057411c56a
fix longformer slow down ( #5811 )
2020-07-16 16:19:37 +02:00
Patrick von Platen
89a78be51f
fix benchmark for longformer ( #5808 )
2020-07-16 15:15:10 +02:00
Patrick von Platen
aefc0c0429
fix benchmark non standard model ( #5801 )
2020-07-16 12:13:10 +02:00
Martin Müller
8ce610bc96
Update README.md ( #5789 )
2020-07-16 05:26:17 -04:00
Julien Chaumond
6b6d035d8f
[model_card] illuin/lepetit
2020-07-16 03:50:47 -04:00
HuYong
d1f74b9aff
ADD ERNIE model ( #5763 )
...
* ERNIE model card
* Update Readme.md
* Update Readme.md
* Update Readme.md
* Rename Readme.md to README.md
* Update README.md
* Update Readme.md
* Update README.md
* Rename Readme.md to README.md
* Update Readme.md
* Update Readme.md
* Rename Readme.md to README.md
* Update and rename Readme.md to README.md
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
2020-07-16 11:03:05 +08:00
Clement
3b924fabee
Create distilbert squad tags
2020-07-15 17:59:06 -04:00
Clement
067814102c
fix readme
2020-07-15 17:50:46 -04:00
Clement
d179fd69ca
test readme change
2020-07-15 17:48:22 -04:00
Manuel Romero
63761614eb
Update README.md ( #5776 )
...
Add cherry picked example for the widget
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-15 16:19:21 -04:00
Manuel Romero
221e23c6c1
Create README.md ( #5781 )
...
* Create README.md
* Update model_cards/mrm8488/RoBasquERTa/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-15 16:17:25 -04:00
Manuel Romero
d4cda29af1
Create README.md ( #5782 )
...
* Create README.md
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-15 16:17:19 -04:00
Julien Chaumond
62ec28ce4f
[model_cards] Fix pierreguillou/gpt2-small-portuguese
2020-07-15 22:14:52 +02:00
Pierre Guillou
a946724bbf
metadata ( #5758 )
...
* metadata
* Update model_cards/pierreguillou/gpt2-small-portuguese/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-15 16:13:28 -04:00
Julien Chaumond
015dc51fe3
[model_card] bert-portuguese: add language meta
...
cc @rodrigonogueira4 @abiocapsouza @robertoalotufo
Also cc @piegu
Obrigado :)
2020-07-15 21:25:52 +02:00
Sam Shleifer
1a647abf0b
[fix] check code quality ( #5772 )
2020-07-15 14:59:38 -04:00
Julien Chaumond
b23d3a5ad4
[model_cards] Switch all languages codes to ISO-639-{1,2,3}
2020-07-15 18:59:20 +02:00
Funtowicz Morgan
d533c7e9b9
[fix] T5 ONNX test: model.to(torch_device) ( #5769 )
...
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
2020-07-15 10:11:22 -04:00
Sam Shleifer
d0486c8bc2
[cleanup] T5 test, warnings ( #5761 )
2020-07-15 08:23:22 -04:00
Patrick von Platen
ec0a945cf9
[AutoModels] Fix config params handling of all PT and TF AutoModels ( #5665 )
...
* fix auto model causal lm
* leverage given functionality
* apply unused kwargs to all auto models
2020-07-15 09:51:14 +02:00
Julien Chaumond
8ab565a4be
[model_card] Fix syntax
2020-07-14 22:27:07 +02:00
Bashar Talafha
92dc959224
Update README.md ( #5752 )
2020-07-14 15:48:59 -04:00
Bashar Talafha
baf93b02c4
Update README.md ( #5696 )
2020-07-14 12:51:57 -04:00
Joe Davison
5d178954c9
tiny ppl doc typo fix ( #5751 )
2020-07-14 10:39:44 -06:00
Manuel Romero
ac921f0385
RuPERTa model card ( #5743 )
...
* Customize inference widget input
* Update model_cards/mrm8488/RuPERTa-base/README.md
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
2020-07-14 22:58:45 +08:00
dartrevan
21c1fe5290
RuDR-BERT model card ( #5698 )
2020-07-14 22:51:53 +08:00
Doron Adler
2db1cc807b
Norod78/hewiki-articles-distilGPT2py-il model card ( #5735 )
...
Model card for hewiki-articles-distilGPT2py-il
A tiny GPT2 model for generating Hebrew text
2020-07-14 22:50:44 +08:00
Pierre Guillou
dae244ad89
GPorTuguese-2 model card ( #5744 )
2020-07-14 22:48:52 +08:00
Sam Shleifer
b2505f7db7
Cleanup bart caching logic ( #5640 )
2020-07-14 06:13:05 -04:00
Sam Shleifer
838950ee44
[fix] mbart_en_ro_generate test now identical to fairseq ( #5731 )
2020-07-14 06:12:24 -04:00
Boris Dayma
4d5a8d6557
docs(wandb): explain how to use W&B integration ( #5607 )
...
* docs(wandb): explain how to use W&B integration
fix #5262
* Also mention TensorBoard
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-14 05:12:33 -04:00
Gunnlaugur Thor Briem
cd30f98fd2
doc: fix apparent copy-paste error in docstring ( #5626 )
2020-07-14 09:47:41 +02:00
as-stevens
f867000f56
[Reformer classification head] Implement the reformer model classification head for text classification ( #5198 )
...
* Reformer model head classification implementation for text classification
* Reformat the reformer model classification code
* PR review comments, and test case implementation for reformer for classification head changes
* CI/CD reformer for classification head test import error fix
* CI/CD test case implementation added ReformerForSequenceClassification to all_model_classes
* Code formatting- fixed
* Normal test cases added for reformer classification head
* Fix test cases implementation for the reformer classification head
* removed token_type_id parameter from the reformer classification head
* fixed the test case for reformer classification head
* merge conflict with master fixed
* merge conflict, changed reformer classification to accept the choice_label parameter added in latest code
* refactored the the reformer classification head test code
* reformer classification head, common transform test cases fixed
* final set of the review comment, rearranging the reformer classes and docstring add to classification forward method
* fixed the compilation error and text case fix for reformer classification head
* Apply suggestions from code review
Remove unnecessary dup
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-07-14 09:16:22 +02:00
Gaurav Mishra
f0bda06f43
Update tokenization_t5.py ( #5717 )
...
Minor doc fix.
2020-07-14 00:02:03 -04:00
Sam Shleifer
c3c61ea017
[Fix] github actions CI by reverting #5138 ( #5686 )
2020-07-13 17:12:18 -04:00
Stas Bekman
45addfe96d
FlaubertForTokenClassification ( #5644 )
...
* implement FlaubertForTokenClassification as a subclass of XLMForTokenClassification
* fix mapping order
* add the doc
* add common tests
2020-07-13 14:59:53 -04:00
Patrick von Platen
7096e47513
[Longformer] fix longformer global attention output ( #5659 )
...
* fix longformer global attention output
* fix multi gpu problem
* replace -10000 with 0
* better comment
* make attention output equal local and global
* Update src/transformers/modeling_longformer.py
2020-07-13 17:23:22 +02:00