Stefan Schweter
|
2b07b9e5ee
|
examples: add DistilBert support for NER fine-tuning
|
2019-11-11 16:19:34 +01:00 |
|
Julien Chaumond
|
f88c104d8f
|
[run_tf_glue] Add comment for context
|
2019-11-05 19:56:43 -05:00 |
|
Julien Chaumond
|
30968d70af
|
misc doc
|
2019-11-05 19:06:12 -05:00 |
|
Thomas Wolf
|
e99071f105
|
Merge pull request #1734 from orena1/patch-1
add progress bar to convert_examples_to_features
|
2019-11-05 11:34:20 +01:00 |
|
Thomas Wolf
|
ba973342e3
|
Merge pull request #1553 from WilliamTambellini/timeSquadInference
Add speed log to examples/run_squad.py
|
2019-11-05 11:13:12 +01:00 |
|
Thomas Wolf
|
237fad339c
|
Merge pull request #1709 from oneraghavan/master
Fixing mode in evaluate during training
|
2019-11-05 10:55:33 +01:00 |
|
Oren Amsalem
|
d7906165a3
|
add progress bar for convert_examples_to_features
It takes considerate amount of time (~10 min) to parse the examples to features, it is good to have a progress-bar to track this
|
2019-11-05 10:34:27 +02:00 |
|
thomwolf
|
89d6272898
|
Fix #1623
|
2019-11-04 16:21:12 +01:00 |
|
Thomas Wolf
|
9a3b173cd3
|
Merge branch 'master' into master
|
2019-11-04 11:41:26 +01:00 |
|
thomwolf
|
ad90868627
|
Update example readme
|
2019-11-04 11:27:22 +01:00 |
|
Raghavan
|
e5b1048bae
|
Fixing mode in evaluate during training
|
2019-11-03 16:14:46 +05:30 |
|
Lysandre
|
1a2b40cb53
|
run_tf_glue MRPC evaluation only for MRPC
|
2019-10-31 18:00:51 -04:00 |
|
Timothy Liu
|
be36cf92fb
|
Added mixed precision support to benchmarks.py
|
2019-10-31 17:24:37 -04:00 |
|
Julien Chaumond
|
f96ce1c241
|
[run_generation] Fix generation with batch_size>1
|
2019-10-31 18:27:11 +00:00 |
|
Julien Chaumond
|
3c1b6f594e
|
Merge branch 'master' into fix_top_k_top_p_filtering
|
2019-10-31 13:53:51 -04:00 |
|
Victor SANH
|
fa735208c9
|
update readme - fix example command distil*
|
2019-10-30 14:27:28 -04:00 |
|
Thomas Wolf
|
c7058d8224
|
Merge pull request #1608 from focox/master
Error raised by "tmp_eval_loss += tmp_eval_loss.item()" when using multi-gpu
|
2019-10-30 17:14:07 +01:00 |
|
Thomas Wolf
|
04c69db399
|
Merge pull request #1628 from huggingface/tfglue
run_tf_glue works with all tasks
|
2019-10-30 17:04:03 +01:00 |
|
Thomas Wolf
|
3df4367244
|
Merge pull request #1601 from huggingface/clean-roberta
Clean roberta model & all tokenizers now add special tokens by default (breaking change)
|
2019-10-30 17:00:40 +01:00 |
|
Thomas Wolf
|
36174696cc
|
Merge branch 'master' into clean-roberta
|
2019-10-30 16:51:06 +01:00 |
|
Thomas Wolf
|
228cdd6a6e
|
Merge branch 'master' into conditional-generation
|
2019-10-30 16:40:35 +01:00 |
|
Rémi Louf
|
070507df1f
|
format utils for summarization
|
2019-10-30 11:24:12 +01:00 |
|
Rémi Louf
|
da10de8466
|
fix bug with padding mask + add corresponding test
|
2019-10-30 11:19:58 +01:00 |
|
Rémi Louf
|
3b0d2fa30e
|
rename seq2seq to encoder_decoder
|
2019-10-30 10:54:46 +01:00 |
|
Rémi Louf
|
9c1bdb5b61
|
revert renaming of lm_labels to ltr_lm_labels
|
2019-10-30 10:43:13 +01:00 |
|
Rémi Louf
|
098a89f312
|
update docstrings; rename lm_labels to more explicit ltr_lm_labels
|
2019-10-29 20:08:03 +01:00 |
|
Rémi Louf
|
dfce409691
|
resolve PR comments
|
2019-10-29 17:10:20 +01:00 |
|
altsoph
|
079bfb32fb
|
Evaluation fixed.
|
2019-10-28 10:18:58 -04:00 |
|
altsoph
|
438f2730a0
|
Evaluation code fixed.
|
2019-10-28 10:18:58 -04:00 |
|
Rémi Louf
|
4c3ac4a7d8
|
here's one big commit
|
2019-10-28 10:49:50 +01:00 |
|
Rémi Louf
|
932543f77e
|
fix test of truncation function
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
a67413ccc8
|
extend works in-place
|
2019-10-28 10:49:49 +01:00 |
|
Rémi Louf
|
b915ba9dfe
|
pad sequence with 0, mask with -1
|
2019-10-28 10:49:49 +01:00 |
|
Lysandre
|
bab6ad01aa
|
run_tf_glue works with all tasks
|
2019-10-24 21:41:45 +00:00 |
|
Matt Maybeno
|
ae1d03fc51
|
Add roberta to doc
|
2019-10-24 14:32:48 -04:00 |
|
Matt Maybeno
|
4e5f88b74f
|
Add Roberta to run_ner.py
|
2019-10-24 14:32:48 -04:00 |
|
VictorSanh
|
5b6cafb11b
|
[release] fix table weirdness
|
2019-10-23 10:35:16 -04:00 |
|
VictorSanh
|
8ad5c591cd
|
[RELEASE] DistilRoBERTa
|
2019-10-23 10:29:47 -04:00 |
|
focox@qq.com
|
bd847ce7d7
|
fixed the bug raised by "tmp_eval_loss += tmp_eval_loss.item()" when parallelly using multi-gpu.
|
2019-10-23 20:27:13 +08:00 |
|
Julien Chaumond
|
ef1b8b2ae5
|
[CTRL] warn if generation prompt does not start with a control code
see also https://github.com/salesforce/ctrl/pull/50
|
2019-10-22 21:30:32 +00:00 |
|
Lysandre
|
7d709e55ed
|
Remove
|
2019-10-22 14:12:33 -04:00 |
|
Lysandre
|
1cfd974868
|
Option to benchmark only one of the two libraries
|
2019-10-22 13:32:23 -04:00 |
|
Pasquale Minervini
|
abd7110e21
|
gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well
|
2019-10-21 19:56:52 +01:00 |
|
Pasquale Minervini
|
3775550c4b
|
gradient norm clipping should be done right before calling the optimiser
|
2019-10-20 22:33:56 +01:00 |
|
LysandreJik
|
7dd29ed2f1
|
Benchmarks example script
|
2019-10-18 10:53:04 -04:00 |
|
William Tambellini
|
0919389d9a
|
Add speed log to examples/run_squad.py
Add a speed estimate log (time per example)
for evaluation to examples/run_squad.py
|
2019-10-17 14:41:04 -07:00 |
|
leo-du
|
ecd15667f3
|
fix repetition penalty
|
2019-10-17 14:47:14 -04:00 |
|
thomwolf
|
8cd56e3036
|
fix data processing in script
|
2019-10-17 16:33:26 +02:00 |
|
Rémi Louf
|
578d23e061
|
add training pipeline (formatting temporary)
|
2019-10-17 14:02:27 +02:00 |
|
Rémi Louf
|
47a06d88a0
|
use two different tokenizers for storyand summary
|
2019-10-17 13:04:26 +02:00 |
|