thomwolf
|
a3274ac40b
|
adding attention outputs in bert
|
2019-06-03 16:11:45 -05:00 |
|
VictorSanh
|
826496580b
|
Revert "add output_attentions for BertModel"
This reverts commit de5e5682a1 .
|
2019-06-03 17:10:25 -04:00 |
|
VictorSanh
|
de5e5682a1
|
add output_attentions for BertModel
|
2019-06-03 17:05:24 -04:00 |
|
Thomas Wolf
|
e6cf62d499
|
Merge pull request #488 from dhpollack/fix_multichoice
fixed BertForMultipleChoice model init and forward pass
|
2019-04-25 21:04:16 +02:00 |
|
Thomas Wolf
|
d76a57b0ba
|
Merge pull request #506 from ailzhang/hubconf
Hubconf
|
2019-04-24 20:59:21 +02:00 |
|
thomwolf
|
80f995a141
|
revert BertForMultipleChoice linear classifier
|
2019-04-24 16:51:54 +02:00 |
|
Ailing Zhang
|
bfd6f6b257
|
fix from_pretrained positional args
|
2019-04-17 16:31:40 -07:00 |
|
Thomas Wolf
|
3d78e226e6
|
Merge pull request #489 from huggingface/tokenization_serialization
Better serialization for Tokenizers and Configuration classes - Also fix #466
|
2019-04-16 08:49:54 +02:00 |
|
thomwolf
|
df5d9c3551
|
load all models on cpu
|
2019-04-15 15:43:01 +02:00 |
|
thomwolf
|
60ea6c59d2
|
added best practices for serialization in README and examples
|
2019-04-15 15:00:33 +02:00 |
|
thomwolf
|
9761aa4845
|
add to_json_file method to configuration classes
|
2019-04-15 14:12:08 +02:00 |
|
David Pollack
|
38ba7b439b
|
fixed BertForMultipleChoice model init and forward pass
|
2019-04-15 10:38:01 +02:00 |
|
Martin Boyanov
|
34cf67fd6c
|
Extend the BertForSequenceClassification docs to mention the special CLS token.
|
2019-04-12 21:30:28 +03:00 |
|
thomwolf
|
19666dcb3b
|
Should fix #438
|
2019-04-03 11:01:01 +02:00 |
|
Mike Arpaia
|
8b5c63e4de
|
Fixes to the TensorFlow conversion tool
|
2019-04-01 13:17:54 -06:00 |
|
Ikuya Yamada
|
0401317b23
|
Remove padding_idx from position_embeddings and token_type_embeddings
|
2019-03-26 21:56:35 +09:00 |
|
Thomas Wolf
|
9b03d67b83
|
Merge pull request #362 from Bharat123rox/patch-1
Make the hyperlink of NVIDIA Apex clickable
|
2019-03-11 09:08:51 +01:00 |
|
Bharat Raghunathan
|
f91ce0b803
|
Make the hyperlink of NVIDIA Apex clickable
|
2019-03-09 20:05:39 +05:30 |
|
Haozhe Ji
|
72fa8d03a7
|
add 'padding_idx=0' for BertEmbeddings
|
2019-03-07 20:02:55 +08:00 |
|
thomwolf
|
5c85fc3977
|
fix typo - logger info
|
2019-03-06 10:05:21 +01:00 |
|
Dan Hendrycks
|
434d15da8e
|
Update activation function docstring
|
2019-02-16 12:17:52 -08:00 |
|
thomwolf
|
5ee4f17234
|
adding option to load on cpu
|
2019-02-08 10:37:40 +01:00 |
|
thomwolf
|
edcb56fd96
|
more explicit variable name
|
2019-02-08 09:54:49 +01:00 |
|
Thomas Wolf
|
848aae49e1
|
Merge branch 'master' into python_2
|
2019-02-06 00:13:20 +01:00 |
|
thomwolf
|
448937c00d
|
python 2 compatibility
|
2019-02-06 00:07:46 +01:00 |
|
thomwolf
|
822915142b
|
fix docstring
|
2019-02-05 16:34:32 +01:00 |
|
Thibault Fevry
|
f3bda2352a
|
Only keep the active part mof the loss for token classification
|
2019-02-04 11:46:36 -05:00 |
|
thomwolf
|
98c96fb1a7
|
splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests
|
2019-01-29 10:31:42 +01:00 |
|
thomwolf
|
5456d82311
|
more versatile model loading
|
2019-01-29 09:54:18 +01:00 |
|
thomwolf
|
bd3b3aee9c
|
update
|
2019-01-28 17:47:29 +01:00 |
|
thomwolf
|
b12616fd8e
|
updating code organization to fix imports
|
2019-01-28 17:03:39 +01:00 |
|
thomwolf
|
d77dd62ff8
|
directly load from TF checkpoints + code cleanup
|
2019-01-28 16:50:23 +01:00 |
|
thomwolf
|
e5c78c6684
|
update readme and few typos
|
2019-01-10 01:40:00 +01:00 |
|
thomwolf
|
ab90d4cddd
|
adding docs and example for OpenAI GPT
|
2019-01-09 00:12:43 +01:00 |
|
thomwolf
|
3cf12b235a
|
added tests + fixed losses
|
2019-01-08 16:24:23 +01:00 |
|
thomwolf
|
eed51c5bdf
|
add OpenAI GPT
|
2019-01-08 12:26:58 +01:00 |
|
thomwolf
|
793dcd236b
|
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT into fifth-release
|
2019-01-07 13:37:55 +01:00 |
|
thomwolf
|
93f563b8a8
|
adding OpenAI GPT
|
2019-01-07 12:55:36 +01:00 |
|
Thomas Wolf
|
e048c7f1c8
|
Merge pull request #171 from donglixp/patch-1
LayerNorm initialization
|
2019-01-07 12:44:46 +01:00 |
|
Thomas Wolf
|
bcd607542c
|
Merge pull request #145 from wlhgtc/master
Correct the wrong note
|
2019-01-07 12:23:05 +01:00 |
|
Li Dong
|
d0d9b384f2
|
LayerNorm initialization
The LayerNorm gamma and beta should be initialized by .fill_(1.0) and .zero_().
reference links:
989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2298)
989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2308)
|
2019-01-07 15:51:33 +08:00 |
|
wlhgtc
|
e626eecc25
|
Update modeling.py
|
2018-12-22 20:26:05 +08:00 |
|
Grégory Châtel
|
7176674849
|
Fixing various class documentations.
|
2018-12-20 13:11:17 +01:00 |
|
thomwolf
|
4a4b0e5783
|
remove logging. basicConfig from library code
|
2018-12-14 14:46:25 +01:00 |
|
thomwolf
|
ae88eb88a4
|
set encoding to 'utf-8' in calls to open
|
2018-12-14 13:48:58 +01:00 |
|
thomwolf
|
52c53f39d0
|
clean up apex integration
|
2018-12-13 13:02:17 +01:00 |
|
thomwolf
|
d23eed85bb
|
model loading apex modification
|
2018-12-13 12:53:17 +01:00 |
|
thomwolf
|
93f335ef86
|
add pretrained loading from state_dict
|
2018-12-13 12:48:13 +01:00 |
|
Thomas Wolf
|
91aab2a6d3
|
Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex
Change to use apex for better fp16 and multi-gpu support
|
2018-12-13 12:32:37 +01:00 |
|
Deyu Fu
|
3b0a14b761
|
add fallback path for apex used in modeling.py
|
2018-12-12 15:05:45 -08:00 |
|