thomwolf
|
45709d7532
|
model running with simple inputs
|
2019-06-21 00:28:42 +02:00 |
|
thomwolf
|
de713fa9b4
|
starting
|
2019-06-20 10:54:19 +02:00 |
|
thomwolf
|
7f00a36e27
|
pruning should keep on device
|
2019-06-19 22:23:12 +02:00 |
|
thomwolf
|
34d706a0e1
|
pruning in bertology
|
2019-06-19 15:25:49 +02:00 |
|
thomwolf
|
dc8e0019b7
|
updating examples
|
2019-06-19 13:23:20 +02:00 |
|
thomwolf
|
68ab9599ce
|
small fix and updates to readme
|
2019-06-19 09:38:38 +02:00 |
|
thomwolf
|
4d8c4337ae
|
test barrier in distrib training
|
2019-06-18 22:41:28 +02:00 |
|
thomwolf
|
868de8d1d7
|
updating weights loading
|
2019-06-18 10:58:20 +02:00 |
|
thomwolf
|
64e0adda81
|
better error message
|
2019-06-18 10:51:31 +02:00 |
|
thomwolf
|
382e2d1e50
|
spliting config and weight files for bert also
|
2019-06-18 10:37:16 +02:00 |
|
thomwolf
|
33d3db5c43
|
updating head masking, readme and docstrings
|
2019-06-17 15:51:28 +02:00 |
|
thomwolf
|
7220d47a1c
|
adding head pruning and tests
|
2019-06-17 13:20:45 +02:00 |
|
thomwolf
|
8415a38b23
|
better error messages
|
2019-06-17 13:03:48 +02:00 |
|
thomwolf
|
96c4d3d988
|
add head masking tests
|
2019-06-17 12:17:26 +02:00 |
|
thomwolf
|
34858ae1d9
|
adding bert whole words, bertgerman and gpt-2 medium models, head masking
|
2019-06-17 11:02:39 +02:00 |
|
timoeller
|
16af9ff7b0
|
Add German Bert model to code, update readme
|
2019-06-14 17:42:46 +02:00 |
|
Thomas Wolf
|
35e6baab37
|
Merge branch 'master' into attention
|
2019-06-14 16:41:56 +02:00 |
|
thomwolf
|
5e1207b8ad
|
add attention to all bert models and add test
|
2019-06-14 16:28:25 +02:00 |
|
thomwolf
|
a3274ac40b
|
adding attention outputs in bert
|
2019-06-03 16:11:45 -05:00 |
|
VictorSanh
|
826496580b
|
Revert "add output_attentions for BertModel"
This reverts commit de5e5682a1 .
|
2019-06-03 17:10:25 -04:00 |
|
VictorSanh
|
de5e5682a1
|
add output_attentions for BertModel
|
2019-06-03 17:05:24 -04:00 |
|
Victor SANH
|
96592b544b
|
default in __init__s for classification BERT models (#650)
|
2019-05-30 15:53:13 -04:00 |
|
huntzhan
|
101ab4dd8e
|
Make the epsilon of LayerNorm configurable.
|
2019-05-06 00:26:21 +08:00 |
|
Thomas Wolf
|
e6cf62d499
|
Merge pull request #488 from dhpollack/fix_multichoice
fixed BertForMultipleChoice model init and forward pass
|
2019-04-25 21:04:16 +02:00 |
|
Thomas Wolf
|
d76a57b0ba
|
Merge pull request #506 from ailzhang/hubconf
Hubconf
|
2019-04-24 20:59:21 +02:00 |
|
thomwolf
|
80f995a141
|
revert BertForMultipleChoice linear classifier
|
2019-04-24 16:51:54 +02:00 |
|
Ailing Zhang
|
bfd6f6b257
|
fix from_pretrained positional args
|
2019-04-17 16:31:40 -07:00 |
|
Thomas Wolf
|
3d78e226e6
|
Merge pull request #489 from huggingface/tokenization_serialization
Better serialization for Tokenizers and Configuration classes - Also fix #466
|
2019-04-16 08:49:54 +02:00 |
|
thomwolf
|
df5d9c3551
|
load all models on cpu
|
2019-04-15 15:43:01 +02:00 |
|
thomwolf
|
60ea6c59d2
|
added best practices for serialization in README and examples
|
2019-04-15 15:00:33 +02:00 |
|
thomwolf
|
9761aa4845
|
add to_json_file method to configuration classes
|
2019-04-15 14:12:08 +02:00 |
|
David Pollack
|
38ba7b439b
|
fixed BertForMultipleChoice model init and forward pass
|
2019-04-15 10:38:01 +02:00 |
|
Martin Boyanov
|
34cf67fd6c
|
Extend the BertForSequenceClassification docs to mention the special CLS token.
|
2019-04-12 21:30:28 +03:00 |
|
thomwolf
|
19666dcb3b
|
Should fix #438
|
2019-04-03 11:01:01 +02:00 |
|
Mike Arpaia
|
8b5c63e4de
|
Fixes to the TensorFlow conversion tool
|
2019-04-01 13:17:54 -06:00 |
|
Ikuya Yamada
|
0401317b23
|
Remove padding_idx from position_embeddings and token_type_embeddings
|
2019-03-26 21:56:35 +09:00 |
|
Thomas Wolf
|
9b03d67b83
|
Merge pull request #362 from Bharat123rox/patch-1
Make the hyperlink of NVIDIA Apex clickable
|
2019-03-11 09:08:51 +01:00 |
|
Bharat Raghunathan
|
f91ce0b803
|
Make the hyperlink of NVIDIA Apex clickable
|
2019-03-09 20:05:39 +05:30 |
|
Haozhe Ji
|
72fa8d03a7
|
add 'padding_idx=0' for BertEmbeddings
|
2019-03-07 20:02:55 +08:00 |
|
thomwolf
|
5c85fc3977
|
fix typo - logger info
|
2019-03-06 10:05:21 +01:00 |
|
Dan Hendrycks
|
434d15da8e
|
Update activation function docstring
|
2019-02-16 12:17:52 -08:00 |
|
thomwolf
|
5ee4f17234
|
adding option to load on cpu
|
2019-02-08 10:37:40 +01:00 |
|
thomwolf
|
edcb56fd96
|
more explicit variable name
|
2019-02-08 09:54:49 +01:00 |
|
Thomas Wolf
|
848aae49e1
|
Merge branch 'master' into python_2
|
2019-02-06 00:13:20 +01:00 |
|
thomwolf
|
448937c00d
|
python 2 compatibility
|
2019-02-06 00:07:46 +01:00 |
|
thomwolf
|
822915142b
|
fix docstring
|
2019-02-05 16:34:32 +01:00 |
|
Thibault Fevry
|
f3bda2352a
|
Only keep the active part mof the loss for token classification
|
2019-02-04 11:46:36 -05:00 |
|
thomwolf
|
98c96fb1a7
|
splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests
|
2019-01-29 10:31:42 +01:00 |
|
thomwolf
|
5456d82311
|
more versatile model loading
|
2019-01-29 09:54:18 +01:00 |
|
thomwolf
|
bd3b3aee9c
|
update
|
2019-01-28 17:47:29 +01:00 |
|