Pedro Lima
52d250f6aa
[model_cards] pvl/labse_bert model card
...
From **Language-Agnostic BERT Sentence Embedding**
https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html
2020-09-15 08:54:12 -04:00
tuner007
84d64805b0
Create README.md ( #7097 )
...
Model card for PEGASUS finetuned for paraphrasing task
2020-09-15 08:48:25 -04:00
Philip May
52bb7ccce5
German electra model card v3 update ( #7089 )
...
* changed eval table model order
* Update install
* update mc
2020-09-15 08:48:13 -04:00
Siddharth Jain
1a85299a5e
Tiny typo fix ( #7143 )
2020-09-15 08:18:42 -04:00
Paul O'Leary McCann
e29c3f1b11
Add quotes to paths in MeCab arguments ( #7142 )
...
Without quotes directories with spaces in them will fail to be processed
correctly.
2020-09-15 19:04:50 +08:00
Yih-Dar
cb061e78e1
Fix TF Trainer loss calculation ( #6998 )
...
* create branch for issue #6968
* First attempt to fix incorrect tf trainer loss calculation
* Fix training loss in metric
* fix tf trainer evaluation loss
* apply count_instances_in_batch() for eval and test datasets
* prototype of using a new argument in trainer_tf.py to fix loss issue
* some renaming and fix, in particular for evaluation methods
* fix bugs to have a running version
* change to @staticmethod
* apply style
2020-09-15 05:41:00 -04:00
Stas Bekman
b0cbcdb05b
[logging] remove no longer needed verbosity override ( #7100 )
2020-09-15 04:01:14 -04:00
Sylvain Gugger
2bf70e2150
Fix reproducible tests in Trainer ( #7119 )
...
* Fix reproducible tests in Trainer
* Deal with multiple GPUs
2020-09-15 03:32:44 -04:00
Sam Shleifer
9e89390ce1
[QOL] add signature for prepare_seq2seq_batch ( #7108 )
2020-09-14 20:33:08 -04:00
Sam Shleifer
33d479d2b2
[s2s] distributed eval in one command ( #7124 )
2020-09-14 15:57:56 -04:00
sgugger
206b78d485
Pin version of TF and torch
2020-09-14 14:08:51 -04:00
Kevin Canwen Xu
90cde2e938
Add Mirror Option for Downloads ( #6679 )
...
* Add Tuna Mirror for Downloads from China
* format fix
* Use preset instead of hardcoding URL
* Fix
* make style
* update the mirror option doc
* update the mirror
2020-09-14 23:50:22 +08:00
Antonio V Mendoza
e0e0675ac7
Demoing LXMERT with raw images by incorporating the FRCNN model for roi-pooled extraction and bounding-box predction on the GQA answer set. ( #6986 )
...
* adding demo
* Update examples/lxmert/requirements.txt
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update examples/lxmert/checkpoint.sh
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* added user input for .py demo
* updated model loading, data extrtaction, checkpoints, and lots of other automation
* adding normalizing for bounding boxes
* Update requirements.txt
* some optimizations for extracting data
* added data extracting file
* added data extraction file
* minor fixes to reqs and readme
* Style
* remove options
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2020-09-14 10:07:04 -04:00
sgugger
5636cbb25d
Extra )
2020-09-14 09:37:55 -04:00
Sylvain Gugger
ccc8e30c8a
Clean up autoclass doc ( #7081 )
2020-09-14 09:26:41 -04:00
Stas Bekman
3ca1874ca4
[examples testing] restore code ( #7099 )
...
For some reason https://github.com/huggingface/transformers/pull/5512 re-added temp dir creation code that was removed by
https://github.com/huggingface/transformers/pull/6494 defeating the purpose of that PR for those tests.
2020-09-14 08:54:23 -04:00
Stas Bekman
4d39148419
fix deprecation warnings ( #7033 )
...
* fix deprecation warnings
* remove tests/test_tokenization_common.py's test_padding_to_max_length
* revert test_padding_to_max_length
2020-09-14 07:51:19 -04:00
Stas Bekman
576eec98e0
ignore FutureWarning in tests ( #7079 )
2020-09-14 07:50:51 -04:00
Bartosz Telenczuk
15d18e0307
fix link to paper ( #7116 )
2020-09-14 07:43:40 -04:00
Lysandre Debut
bb3106f741
Temporarily skip failing tests due to dependency change ( #7118 )
...
* Temporarily skip failing tests due to dependency change
* Remove trace
2020-09-14 07:42:13 -04:00
Sam Shleifer
0fab39695a
[s2s distill] allow pegasus-12-12 ( #7104 )
2020-09-14 00:03:59 -04:00
Sam Shleifer
de9e297964
[s2s] distributed eval cleanup ( #7110 )
2020-09-13 23:40:38 -04:00
Sam Shleifer
54395d87a6
Update xsum length penalty to better values ( #7107 )
2020-09-13 20:48:47 -04:00
Sam Shleifer
e7f8d2ab64
[s2s] two stage run_distributed_eval.py ( #7105 )
2020-09-13 17:28:18 -04:00
Sam Shleifer
0ec63afec2
fix bug in pegasus converter ( #7094 )
2020-09-13 15:11:47 -04:00
Sam Shleifer
b76cb1c3df
[s2s] run_eval supports --prefix clarg. ( #6953 )
2020-09-12 01:08:21 -04:00
李明浩
563ffb3dc3
Create README.md ( #7066 )
2020-09-11 15:21:05 -04:00
李明浩
1ad49cde3a
Create README.md ( #7067 )
2020-09-11 15:20:54 -04:00
Sagor Sarker
4753816e39
added bangla-bert-base model card and also modified other model cards ( #7071 )
...
* added bangla-bert-base
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-09-11 15:17:25 -04:00
Suraj Patil
0a8c17d53c
[T5Tokenizer] remove prefix_tokens ( #7078 )
2020-09-11 14:18:45 -04:00
Sylvain Gugger
4cbd50e611
Compute loss method ( #7074 )
2020-09-11 12:06:31 -04:00
Sylvain Gugger
ae736163d0
Add tests and fix various bugs in ModelOutput ( #7073 )
...
* Add tests and fix various bugs in ModelOutput
* Update tests/test_model_output.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-09-11 12:01:33 -04:00
Sylvain Gugger
e841b75dec
Automate the lists in auto-xxx docs ( #7061 )
...
* More readable dict
* More nlp -> datasets
* Revert "More nlp -> datasets"
This reverts commit 3cd1883d22
.
* Automate the lists in auto-xxx docs
* More readable dict
* Revert "More nlp -> datasets"
This reverts commit 3cd1883d22
.
* Automate the lists in auto-xxx docs
* nlp -> datasets
* Fix new key
2020-09-11 10:42:09 -04:00
Sylvain Gugger
0054a48cdd
Add dep on datasets ( #7058 )
2020-09-11 04:43:19 -04:00
Patrick von Platen
221d4c63a3
clean naming ( #7068 )
2020-09-11 09:57:53 +02:00
Stas Bekman
8fcbe486e1
these tests require non-multigpu env ( #7059 )
...
* these tests require non-multigpu env
* cleanup
* clarify
2020-09-10 18:52:55 -04:00
Sam Shleifer
77950c485a
[wip/s2s] DistributedSortishSampler ( #7056 )
2020-09-10 15:23:44 -04:00
Sylvain Gugger
514486739c
Fix CI with change of name of nlp ( #7054 )
...
* nlp -> datasets
* More nlp -> datasets
* Woopsie
* More nlp -> datasets
* One last
2020-09-10 14:51:08 -04:00
Sam Shleifer
e9a2f772bc
[s2s] --eval_max_generate_length ( #7018 )
2020-09-10 14:11:34 -04:00
Stas Bekman
df4594a9da
[xlm tok] config dict: fix str into int to match definition ( #7034 )
2020-09-10 19:31:01 +02:00
Julien Chaumond
d6c08b07a0
[AutoTokenizer] Correct error message
2020-09-10 17:19:01 +02:00
Patrick von Platen
db38f7ce29
[BertGeneration, Docs] Fix another old name in docs ( #7050 )
...
* correct docs for bert generation
* upload
2020-09-10 17:12:33 +02:00
Patrick von Platen
3bd95b0faf
correct docs for bert generation ( #7048 )
2020-09-10 17:08:40 +02:00
Patrick von Platen
eb2feb5d90
Create README.md
2020-09-10 17:05:50 +02:00
Ashwin Geet Dsa
66a5a6fda8
fix to ensure that returned tensors after the tokenization is Long ( #7039 )
...
* fix to ensure that returned tensors after the tokenization is Long
* fix to ensure that returned tensors after the tokenization is Long
Co-authored-by: Ashwin Geet Dsa <adsa@grvingt-6.nancy.grid5000.fr>
2020-09-10 11:04:03 -04:00
Patrick von Platen
9ccdb1d517
Update README.md
2020-09-10 17:01:19 +02:00
Patrick von Platen
60698936fc
Create README.md
2020-09-10 17:00:10 +02:00
Patrick von Platen
e0c3bc8ee0
Create README.md
2020-09-10 16:51:15 +02:00
Patrick von Platen
c356b9878d
Create README.md
2020-09-10 16:45:44 +02:00
Patrick von Platen
5afd3f6196
Create README.md
2020-09-10 16:44:47 +02:00