piero
821de121e8
Minor changes
2019-12-03 10:14:02 -05:00
w4nderlust
7469d03b1c
Fixed minor bug when running training on cuda
2019-12-03 10:14:02 -05:00
piero
0b51fba20b
Added script for training a discriminator for pplm to use
2019-12-03 10:14:02 -05:00
Piero Molino
34a83faabe
Let's make PPLM great again
2019-12-03 10:14:02 -05:00
Julien Chaumond
d5faa74cd6
tokenizer white space: revert to previous behavior
2019-12-03 10:14:02 -05:00
Julien Chaumond
0b77d66a6d
rm extraneous import
2019-12-03 10:14:02 -05:00
Rosanne Liu
83b1e6ac9e
fix the loss backward issue
...
(cherry picked from commit 566468cc984c6ec7e10dfc62b5b4191781a99cd2)
2019-12-03 10:14:02 -05:00
Julien Chaumond
572c24cfa2
PPLM (squashed)
...
Co-authored-by: piero <piero@uber.com>
Co-authored-by: Rosanne Liu <mimosavvy@gmail.com>
2019-12-03 10:14:02 -05:00
Thomas Wolf
f19a78a634
Merge pull request #1903 from valohai/master
...
Valohai integration
2019-12-03 16:13:01 +01:00
Thomas Wolf
d100ad99c0
Merge pull request #2014 from aaugustin/mark-tf-auto-model-test-as-slow
...
Mark tests in TFAutoModelTest as slow.
2019-12-03 16:03:48 +01:00
Juha Kiili
66fc8d25a5
Change ref to original GLUE downloader script
2019-12-03 10:49:50 +02:00
LysandreJik
fbaf05bd92
Remove annoying tokenization message
2019-12-02 18:23:00 -05:00
Lysandre
e85855f2c4
Fix ALBERT exports with pretraining + sp classifier; Fix naming for ALBERT TF models
2019-12-02 18:00:19 -05:00
Lysandre
b3d834ae11
Reorganize ALBERT conversion script
2019-12-02 15:01:52 -05:00
thomwolf
f3776df0f3
WIP debugging
2019-12-02 15:47:00 +01:00
Aymeric Augustin
5ab93083e4
Mark tests in TFAutoModelTest as slow.
...
Each test forces downloading the same 536MB file, which is slow
even with a decent internet connection.
2019-12-01 18:25:15 +01:00
Aditya Soni
c356290c8d
typo fix as per PyTorch v1.1+
2019-12-01 14:08:14 +05:30
Rostislav Nedelchev
76c0bc06d5
[XLNet] Changed post-processing of attention w.r.t. target_mapping
...
Whenever target_mapping is provided to the input, XLNet outputs two different attention streams.
Based on that, the attention output is one of the two:
- a list of tensors (usual case for most transformers)
- a list of 2-tuples of tensors, one tensor for each of the attention streams
Docs and unit-tests have been updated
2019-11-30 21:01:04 +01:00
Rostislav Nedelchev
b90791e950
fixed XLNet attention output for both attention streams
2019-11-30 15:57:51 +01:00
maxvidal
b0ee7c7df3
Added Camembert to available models
2019-11-29 14:17:02 -05:00
Elad Segal
ecf15ebf3b
Add ALBERT to AutoClasses
2019-11-29 11:25:37 -05:00
thomwolf
4a666885b5
reducing my level of enthusiasm
2019-11-29 09:40:50 -05:00
thomwolf
adb5c79ff2
update all tf.shape and tensor.shape to shape_list
2019-11-29 09:40:50 -05:00
Juha Kiili
2421e54f8c
Add link to original source and license to download_glue_data.py
2019-11-29 15:39:28 +02:00
Juha Kiili
41aa0e8003
Refactor logs and fix loss bug
2019-11-29 15:33:25 +02:00
Thomas Wolf
1ab8dc44b3
Merge pull request #1876 from huggingface/mean-fix
...
Mean does not exist in TF2
2019-11-29 09:26:33 +01:00
Thomas Wolf
f0d22b6363
Merge pull request #1873 from stefan-it/distilbert-german
...
German DistilBERT
2019-11-29 09:25:47 +01:00
Lysandre
1e9ac5a7cf
New -> normal
2019-11-28 17:43:47 -05:00
Lysandre
0b84b9fd8a
Add processors to __init__
2019-11-28 17:38:52 -05:00
Lysandre
f671997ef7
Interface with TFDS
2019-11-28 17:17:20 -05:00
Lysandre
bd41e8292a
Cleanup & Evaluation now works
2019-11-28 16:03:56 -05:00
Thomas Wolf
d49c43ff78
Merge pull request #1778 from eukaryote31/patch-2
...
from_pretrained: convert DialoGPT format
2019-11-28 16:08:37 +01:00
Thomas Wolf
91caf2462c
Merge pull request #1770 from huggingface/initi-encoder-mask
...
Only init encoder_attention_mask if stack is decoder
2019-11-28 16:06:55 +01:00
Thomas Wolf
49a69d5b78
Merge pull request #1753 from digantamisra98/patch-1
...
Added Mish Activation Function
2019-11-28 15:24:08 +01:00
Thomas Wolf
96e7ee7238
Merge pull request #1740 from huggingface/fix-ctrl-past
...
Fix CTRL past
2019-11-27 23:28:30 +01:00
thomwolf
8da47b078d
fix merge tests
2019-11-27 23:11:37 +01:00
Stefan Schweter
8c276b9c92
Merge branch 'master' into distilbert-german
2019-11-27 18:11:49 +01:00
Yao Lu
3c28a2daac
add add_special_tokens=True for input examples
2019-11-27 12:05:23 -05:00
Thomas Wolf
a36f981d1b
Merge branch 'master' into fix-ctrl-past
2019-11-27 17:25:46 +01:00
Thomas Wolf
5afca00b47
Merge pull request #1724 from huggingface/fix_encode_plus
...
Fix encode_plus
2019-11-27 17:14:49 +01:00
Thomas Wolf
49108288ba
Merge pull request #1624 from Huawei-MRC-OSI/resumable_http
...
Add support for resumable downloads for HTTP protocol.
2019-11-27 17:11:07 +01:00
Thomas Wolf
5340d1f21f
Merge branch 'master' into resumable_http
2019-11-27 17:10:36 +01:00
VictorSanh
10bd1ddb39
soft launch distilbert multilingual
2019-11-27 11:07:22 -05:00
VictorSanh
d5478b939d
add distilbert + update run_xnli wrt run_glue
2019-11-27 11:07:22 -05:00
VictorSanh
07ab8d7af6
fix bug
2019-11-27 11:07:22 -05:00
VictorSanh
d474022639
cleaning simple_accuracy since not used anymore
2019-11-27 11:07:22 -05:00
VictorSanh
bcd8dc6b48
move xnli_compute_metrics to data/metrics
2019-11-27 11:07:22 -05:00
VictorSanh
73fe2e7385
remove fstrings
2019-11-27 11:07:22 -05:00
VictorSanh
3e7656f7ac
update readme
2019-11-27 11:07:22 -05:00
VictorSanh
abd397e954
uniformize w/ the cache_dir update
2019-11-27 11:07:22 -05:00