Rémi Louf
81ee29ee8d
remove the staticmethod used to load the config
2019-10-10 14:13:37 +02:00
thomwolf
bb04edb45b
Add tests that TF 2.0 model can be integrated with other Keras modules
2019-10-10 13:08:24 +02:00
Rémi Louf
d7092d592c
rename the attributes in the Bert Layer
...
Since the preloading of weights relies on the name of the class's
attributes changing the namespace breaks loading pretrained weights on
Bert and all related models. I reverted `self_attention` to `attention`
and us `crossattention` for the decoder instead.
2019-10-10 12:51:14 +02:00
Rémi Louf
51261167b4
prune both attention and self-attention heads
2019-10-10 12:17:22 +02:00
Rémi Louf
17177e7379
add is_decoder as an attribute to Config class
2019-10-10 12:03:58 +02:00
Thomas Wolf
6596e3d566
Merge pull request #1454 from bkkaggle/pytorch-built-in-tensorboard
...
Change tensorboard imports to use built-in tensorboard if available
2019-10-10 11:56:55 +02:00
Thomas Wolf
4bc4601192
Merge pull request #1480 from huggingface/fix_ctrl_tokenizer
...
Fixing CTRL tokenizer - Update error messages - XLM-MLM in run_generation
2019-10-10 11:56:20 +02:00
thomwolf
177a721205
move back to simple space spliting
2019-10-10 11:45:47 +02:00
Rémi Louf
df85a0ff0b
replace double quotes with simple quotes
2019-10-10 11:38:26 +02:00
Rémi Louf
9ca788b2e8
merge the two Bert layers classes
2019-10-10 11:33:28 +02:00
thomwolf
a5997dd81a
better error messages
2019-10-10 11:31:01 +02:00
Rémi Louf
edfc8f8225
Remove and do the branching in
2019-10-10 10:17:27 +02:00
Rémi Louf
09cfd12235
remove and do the branching in
2019-10-10 10:15:27 +02:00
thomwolf
43a237f15e
switching to moses tokenizer
2019-10-10 10:11:16 +02:00
Rémi Louf
877ef2c6ca
override from_pretrained
in Bert2Rnd
...
In the seq2seq model we need to both load pretrained weights in the
encoder and initialize the decoder randomly. Because the
`from_pretrained` method defined in the base class relies on module
names to assign weights, it would also initialize the decoder with
pretrained weights. To avoid this we override the method to only
initialize the encoder with pretrained weights.
2019-10-10 10:02:18 +02:00
Rémi Louf
851ef592c5
add comment on recursive weights loading
2019-10-10 10:02:03 +02:00
LysandreJik
036483fae5
Temporary CTRL tokenizer fix
2019-10-09 16:33:15 -04:00
LysandreJik
9c2e0a4acf
Release: 2.1.0
2019-10-09 12:14:03 -04:00
LysandreJik
7fe98d8c18
Update CTRL documentation
2019-10-09 12:12:36 -04:00
LysandreJik
89f86f9661
CTRL added to the documentation
2019-10-09 12:04:06 -04:00
LysandreJik
e17ea08e24
Pycharm folder added to gitignore
2019-10-09 11:32:21 -04:00
Lysandre Debut
2431fea98a
Merge pull request #1383 from keskarnitish/master
...
Adding CTRL
2019-10-09 11:31:05 -04:00
thomwolf
d9e60f4f0d
Merge branch 'master' into pr/1383
2019-10-09 17:25:08 +02:00
Lysandre Debut
e84470ef81
Merge pull request #1384 from huggingface/encoding-qol
...
Quality of life enhancements in encoding + patch MLM masking
2019-10-09 11:18:24 -04:00
thomwolf
07d055f849
higher tolerance
2019-10-09 17:10:04 +02:00
thomwolf
48b438ff2a
doc and conversion
2019-10-09 17:06:30 +02:00
jinoobaek-qz
69629c4f0f
Improve naming and only do regex when necessary
2019-10-09 08:48:40 -04:00
jinoobaek-qz
bf34a252b8
Golden path
2019-10-09 08:48:40 -04:00
jinoobaek-qz
528d3f327b
Improve readability and improve make less assumptions about checkpoint format
2019-10-09 08:48:40 -04:00
jinoobaek-qz
56301bd9e8
Extract method
2019-10-09 08:48:40 -04:00
jinoobaek-qz
d6c5469712
Delete older checkpoint after saving new checkpoint
2019-10-09 08:48:40 -04:00
jinoobaek-qz
54a31f50fb
Add save_total_limit
2019-10-09 08:48:40 -04:00
thomwolf
c19b8e4ae0
fixing CTRL tests and OpenAI GPT tests
2019-10-09 13:51:05 +02:00
thomwolf
6dce6dda1b
fixing TF 2.0 model - adding more severe test on pt/tf equivalence
2019-10-09 11:57:55 +02:00
thomwolf
c56d921dda
adding TF 2.0 model
2019-10-09 11:07:43 +02:00
thomwolf
1c5079952f
simpler distilbert mask - fix tf tests
2019-10-09 04:26:20 +02:00
Thomas Wolf
58b302caf3
Merge pull request #1398 from dveselov/patch-1
...
Fixed typo in docs README
2019-10-09 03:52:42 +02:00
Thomas Wolf
439fac723a
Merge pull request #1409 from brian41005/master
...
Evaluation result.txt path changing #1286
2019-10-09 03:14:34 +02:00
thomwolf
23b7138ab4
fix #1378 and #1453
2019-10-09 01:54:44 +02:00
Bilal Khan
5ce8d29abe
Change tensorboard imports to use built-in tensorboard if available
2019-10-08 16:29:43 -05:00
Julien Chaumond
d688af19e5
Update link to swift-coreml-transformers
...
cc @lysandrejik
2019-10-08 16:37:52 -04:00
thomwolf
45dc04f33d
tf model [WIP]
2019-10-08 17:37:17 +02:00
Rémi Louf
770b15b58c
rename class in __init__
2019-10-08 17:32:28 +02:00
thomwolf
248314772f
fix tokenization
2019-10-08 17:19:28 +02:00
thomwolf
03c2c762a6
update tokenizer
2019-10-08 17:12:03 +02:00
thomwolf
3edfa1d6aa
update model to use past
2019-10-08 17:11:58 +02:00
Rémi Louf
f4d41fe33e
Merge pull request #1448 from huggingface/contributing
...
add contribution guidelines
2019-10-08 16:55:34 +02:00
Rémi Louf
61ed889005
remove old seq2seq file
2019-10-08 16:30:58 +02:00
Rémi Louf
8abfee9ec3
rename Bert2Bert -> Bert2Rnd
2019-10-08 16:30:58 +02:00
Rémi Louf
82628b0fc9
add a placeholder test
2019-10-08 16:30:58 +02:00