Thomas Wolf
|
04c69db399
|
Merge pull request #1628 from huggingface/tfglue
run_tf_glue works with all tasks
|
2019-10-30 17:04:03 +01:00 |
|
Thomas Wolf
|
5c6a19a94a
|
Merge pull request #1604 from huggingface/deploy_doc
Versioning in documentation
|
2019-10-30 17:03:14 +01:00 |
|
Thomas Wolf
|
3df4367244
|
Merge pull request #1601 from huggingface/clean-roberta
Clean roberta model & all tokenizers now add special tokens by default (breaking change)
|
2019-10-30 17:00:40 +01:00 |
|
Thomas Wolf
|
6d73c92cae
|
Merge pull request #1455 from huggingface/conditional-generation
[WIP] Sequence generation using pretrained BERT
|
2019-10-30 16:54:18 +01:00 |
|
Thomas Wolf
|
36174696cc
|
Merge branch 'master' into clean-roberta
|
2019-10-30 16:51:06 +01:00 |
|
Thomas Wolf
|
228cdd6a6e
|
Merge branch 'master' into conditional-generation
|
2019-10-30 16:40:35 +01:00 |
|
Rémi Louf
|
3cf2020c6b
|
change kwargs processing
|
2019-10-30 16:27:51 +01:00 |
|
Rémi Louf
|
a88a0e4413
|
add tests to encoder-decoder model
|
2019-10-30 16:06:29 +01:00 |
|
Rémi Louf
|
3f07cd419c
|
update test on Bert to include decoder mode
|
2019-10-30 15:09:53 +01:00 |
|
Thomas Wolf
|
55fbfea369
|
Update CONTRIBUTING.md
Co-Authored-By: Stefan Schweter <stefan.schweter@bsb-muenchen.de>
|
2019-10-30 12:25:40 +01:00 |
|
Thomas Wolf
|
cef2a8f900
|
Update CONTRIBUTING.md
Co-Authored-By: Stefan Schweter <stefan.schweter@bsb-muenchen.de>
|
2019-10-30 12:25:31 +01:00 |
|
thomwolf
|
328a86d2af
|
adding links to the templates in readme and contributing
|
2019-10-30 11:37:55 +01:00 |
|
thomwolf
|
7f4226f9e6
|
adding templates
|
2019-10-30 11:31:56 +01:00 |
|
Rémi Louf
|
070507df1f
|
format utils for summarization
|
2019-10-30 11:24:12 +01:00 |
|
Rémi Louf
|
da10de8466
|
fix bug with padding mask + add corresponding test
|
2019-10-30 11:19:58 +01:00 |
|
Rémi Louf
|
3b0d2fa30e
|
rename seq2seq to encoder_decoder
|
2019-10-30 10:54:46 +01:00 |
|
Rémi Louf
|
9c1bdb5b61
|
revert renaming of lm_labels to ltr_lm_labels
|
2019-10-30 10:43:13 +01:00 |
|
Timothy Liu
|
842f3bf049
|
Fixed training for TF XLM
|
2019-10-30 01:32:15 +00:00 |
|
Rémi Louf
|
098a89f312
|
update docstrings; rename lm_labels to more explicit ltr_lm_labels
|
2019-10-29 20:08:03 +01:00 |
|
Rémi Louf
|
dfce409691
|
resolve PR comments
|
2019-10-29 17:10:20 +01:00 |
|
altsoph
|
079bfb32fb
|
Evaluation fixed.
|
2019-10-28 10:18:58 -04:00 |
|
altsoph
|
438f2730a0
|
Evaluation code fixed.
|
2019-10-28 10:18:58 -04:00 |
|
Rémi Louf
|
4c3ac4a7d8
|
here's one big commit
|
2019-10-28 10:49:50 +01:00 |
|
Rémi Louf
|
932543f77e
|
fix test of truncation function
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
a67413ccc8
|
extend works in-place
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
cb26b035c6
|
remove potential UndefinedError
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
b915ba9dfe
|
pad sequence with 0, mask with -1
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
dc580dd4c7
|
add lm_labels for the LM cross-entropy
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
f873a3edb2
|
the decoder attends to the output of the encoder stack (last layer)
|
2019-10-28 10:49:00 +01:00 |
|
Lorenzo Ampil
|
d36680df54
|
Revert changes to TF distilbert due to failed test: TFDistilBertModelTest.test_pt_tf_model_equivalence
|
2019-10-27 14:51:36 +08:00 |
|
Lorenzo Ampil
|
ec276d6aba
|
Add special tokens to documentation for the tensorflow model examples #1561
|
2019-10-27 14:00:40 +08:00 |
|
Lorenzo Ampil
|
6e011690a9
|
Add special tokens to documentation for the rest of pytorch model examples #1561
|
2019-10-27 13:59:14 +08:00 |
|
Lysandre
|
beaf66b1f3
|
Remove break
|
2019-10-24 21:43:28 +00:00 |
|
Lysandre
|
bab6ad01aa
|
run_tf_glue works with all tasks
|
2019-10-24 21:41:45 +00:00 |
|
Matt Maybeno
|
ae1d03fc51
|
Add roberta to doc
|
2019-10-24 14:32:48 -04:00 |
|
Matt Maybeno
|
4e5f88b74f
|
Add Roberta to run_ner.py
|
2019-10-24 14:32:48 -04:00 |
|
Matt Maybeno
|
b92d68421d
|
Use roberta model and update doc strings
|
2019-10-24 14:32:48 -04:00 |
|
Matt Maybeno
|
66085a1321
|
RoBERTa token classification
[WIP] copy paste bert token classification for roberta
|
2019-10-24 14:32:48 -04:00 |
|
Lysandre
|
b82bfbd0c3
|
Updated README to show all available documentation
|
2019-10-24 15:55:31 +00:00 |
|
VictorSanh
|
5b6cafb11b
|
[release] fix table weirdness
|
2019-10-23 10:35:16 -04:00 |
|
VictorSanh
|
8ad5c591cd
|
[RELEASE] DistilRoBERTa
|
2019-10-23 10:29:47 -04:00 |
|
focox@qq.com
|
bd847ce7d7
|
Fixed the bug raised by "tmp_eval_loss += tmp_eval_loss.item()" when running in parallel on multiple GPUs.
|
2019-10-23 20:27:13 +08:00 |
|
Lysandre Debut
|
6e85bccafc
|
Fixed typo
|
2019-10-22 18:07:01 -04:00 |
|
Lysandre
|
fbcc5ff9fb
|
Change branch to master
|
2019-10-22 18:01:10 -04:00 |
|
Lysandre
|
69eba0ab19
|
Edit script path
|
2019-10-22 17:53:52 -04:00 |
|
Lysandre
|
bc3e57d551
|
Multi version doc deployment
|
2019-10-22 17:51:30 -04:00 |
|
Julien Chaumond
|
ef1b8b2ae5
|
[CTRL] warn if generation prompt does not start with a control code
see also https://github.com/salesforce/ctrl/pull/50
|
2019-10-22 21:30:32 +00:00 |
|
Julián Peller (dataista)
|
e16d46843a
|
Fix architectures count
|
2019-10-22 15:13:47 -04:00 |
|
Lysandre
|
7d709e55ed
|
Remove
|
2019-10-22 14:12:33 -04:00 |
|
Lysandre
|
44286b94d3
|
RoBERTa doesn't print a warning when no special tokens are passed.
|
2019-10-22 13:46:48 -04:00 |
|