thomwolf
|
01a3966bc6
|
more options on special tokens
|
2019-02-04 17:26:25 +01:00 |
|
thomwolf
|
05f961840b
|
logging
|
2019-02-04 13:06:19 +01:00 |
|
thomwolf
|
3a848111e6
|
update config, docstrings and readme to switch to seperated tokens and position embeddings
|
2019-01-29 11:00:11 +01:00 |
|
thomwolf
|
98c96fb1a7
|
splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests
|
2019-01-29 10:31:42 +01:00 |
|
thomwolf
|
5456d82311
|
more versatile model loading
|
2019-01-29 09:54:18 +01:00 |
|
thomwolf
|
9b2540b5a7
|
update __init__
|
2019-01-29 09:54:08 +01:00 |
|
thomwolf
|
bd3b3aee9c
|
update
|
2019-01-28 17:47:29 +01:00 |
|
thomwolf
|
b12616fd8e
|
updating code organization to fix imports
|
2019-01-28 17:03:39 +01:00 |
|
thomwolf
|
d77dd62ff8
|
directly load from TF checkpoints + code cleanup
|
2019-01-28 16:50:23 +01:00 |
|
thomwolf
|
9c35c132fa
|
apex LayerNorm
|
2019-01-17 09:19:19 +01:00 |
|
thomwolf
|
b9c77b98d5
|
fix transposition in model conversion and memory initialization
|
2019-01-17 00:33:21 +01:00 |
|
thomwolf
|
009101de12
|
fix loading bug and check full conversion of model
|
2019-01-16 12:16:20 +01:00 |
|
thomwolf
|
fea15cc9f5
|
update model conversion
|
2019-01-16 11:54:54 +01:00 |
|
thomwolf
|
c03c12687f
|
fix __main__ entry script
|
2019-01-16 10:55:22 +01:00 |
|
thomwolf
|
8831c68803
|
fixing various parts of model conversion, loading and weights sharing
|
2019-01-16 10:31:16 +01:00 |
|
thomwolf
|
a69ec2c722
|
improved corpus and tokenization conversion - added evaluation script
|
2019-01-15 23:17:46 +01:00 |
|
thomwolf
|
7d03c53718
|
conversion working
|
2019-01-15 16:07:25 +01:00 |
|
thomwolf
|
3a9c88377f
|
adding Transformer XL
|
2019-01-15 12:59:38 +01:00 |
|
thomwolf
|
e5c78c6684
|
update readme and few typos
|
2019-01-10 01:40:00 +01:00 |
|
thomwolf
|
ab90d4cddd
|
adding docs and example for OpenAI GPT
|
2019-01-09 00:12:43 +01:00 |
|
thomwolf
|
dc5df92fa8
|
added LM head for OpenAI
|
2019-01-08 17:18:47 +01:00 |
|
thomwolf
|
3cf12b235a
|
added tests + fixed losses
|
2019-01-08 16:24:23 +01:00 |
|
thomwolf
|
eed51c5bdf
|
add OpenAI GPT
|
2019-01-08 12:26:58 +01:00 |
|
thomwolf
|
793dcd236b
|
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT into fifth-release
|
2019-01-07 13:37:55 +01:00 |
|
thomwolf
|
93f563b8a8
|
adding OpenAI GPT
|
2019-01-07 12:55:36 +01:00 |
|
Thomas Wolf
|
e048c7f1c8
|
Merge pull request #171 from donglixp/patch-1
LayerNorm initialization
|
2019-01-07 12:44:46 +01:00 |
|
Thomas Wolf
|
bcd607542c
|
Merge pull request #145 from wlhgtc/master
Correct the wrong note
|
2019-01-07 12:23:05 +01:00 |
|
Li Dong
|
d0d9b384f2
|
LayerNorm initialization
The LayerNorm gamma and beta should be initialized by .fill_(1.0) and .zero_().
reference links:
989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2298)
989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2308)
|
2019-01-07 15:51:33 +08:00 |
|
wlhgtc
|
e626eecc25
|
Update modeling.py
|
2018-12-22 20:26:05 +08:00 |
|
Grégory Châtel
|
7176674849
|
Fixing various class documentations.
|
2018-12-20 13:11:17 +01:00 |
|
Thomas Wolf
|
7fb94ab934
|
Merge pull request #127 from patrick-s-h-lewis/tokenizer-error-on-long-seqs
raises value error for bert tokenizer for long sequences
|
2018-12-19 10:29:17 +01:00 |
|
Patrick Sodré
|
87c1244c7d
|
Convert scripts into entry_points
The recommended approach to create launch scripts is to use entry_points
and console_scripts.
xref: https://packaging.python.org/guides/distributing-packages-using-setuptools/#scripts
|
2018-12-19 02:26:08 +00:00 |
|
Julien Chaumond
|
d57763f582
|
Fix typos
|
2018-12-18 19:23:22 -05:00 |
|
Patrick Lewis
|
78cf7b4ab4
|
added code to raise value error for bert tokenizer for covert_tokens_to_indices
|
2018-12-18 14:41:30 +00:00 |
|
thomwolf
|
4a4b0e5783
|
remove logging. basicConfig from library code
|
2018-12-14 14:46:25 +01:00 |
|
thomwolf
|
ae88eb88a4
|
set encoding to 'utf-8' in calls to open
|
2018-12-14 13:48:58 +01:00 |
|
thomwolf
|
52c53f39d0
|
clean up apex integration
|
2018-12-13 13:02:17 +01:00 |
|
thomwolf
|
d23eed85bb
|
model loading apex modification
|
2018-12-13 12:53:17 +01:00 |
|
thomwolf
|
1cbb32a542
|
include version number + comment in setup.py
|
2018-12-13 12:50:44 +01:00 |
|
thomwolf
|
ce52177638
|
added version in __init__.py
|
2018-12-13 12:50:44 +01:00 |
|
thomwolf
|
93f335ef86
|
add pretrained loading from state_dict
|
2018-12-13 12:48:13 +01:00 |
|
thomwolf
|
13bf0d4659
|
fixing Adam weights skip in TF convert script
|
2018-12-13 12:48:13 +01:00 |
|
Thomas Wolf
|
91aab2a6d3
|
Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex
Change to use apex for better fp16 and multi-gpu support
|
2018-12-13 12:32:37 +01:00 |
|
Thomas Wolf
|
32a227f507
|
Merge pull request #113 from hzhwcmhf/master
fix compatibility with python 3.5.2
|
2018-12-13 12:15:15 +01:00 |
|
Thomas Wolf
|
ffe9075f48
|
Merge pull request #96 from rodgzilla/multiple-choice-code
BertForMultipleChoice and Swag dataset example.
|
2018-12-13 12:05:11 +01:00 |
|
Deyu Fu
|
3b0a14b761
|
add fallback path for apex used in modeling.py
|
2018-12-12 15:05:45 -08:00 |
|
Deyu Fu
|
c8ea286048
|
change to apex for better fp16 and multi-gpu support
|
2018-12-11 17:13:58 -08:00 |
|
hzhwcmhf
|
485adde742
|
add pathlib support for file_utils.py on python 3.5
|
2018-12-11 22:49:19 +08:00 |
|
hzhwcmhf
|
bc659f86ad
|
fix compatibility with python 3.5.2; convert path to str
|
2018-12-11 20:18:56 +08:00 |
|
thomwolf
|
1df6f26214
|
Merge branch 'fourth-release' of https://github.com/huggingface/pytorch-pretrained-BERT into fourth-release
|
2018-12-11 12:20:31 +01:00 |
|