Thomas Wolf
3f05de6dde
Merge branch 'master' into reorder_arguments
2019-09-09 15:42:25 +03:00
thomwolf
2a667b1eb9
split configuration and modeling files
2019-09-05 00:27:11 +02:00
thomwolf
e25cba78cf
WIP reodering arguments for torchscript and TF
2019-09-04 22:39:23 +02:00
thomwolf
bdb4409ed8
updated pruning logic with sets - Bert and GPT-2
2019-08-31 00:33:50 -04:00
LysandreJik
0c8e823b03
Added patch to remaining models
2019-08-31 00:33:50 -04:00
LysandreJik
87747518e9
Blocks deletion from already deleted heads. Necessary integration test.
...
Now raises a warning when a head to be deleted already has been deleted. An integration test verifying the total pipeline (-> from config -> save model -> load model -> additional head pruning) has been added.
2019-08-31 00:33:50 -04:00
Lysandre
42e00cf9e1
Pruning saved to configuration first try
2019-08-31 00:33:50 -04:00
Thomas Wolf
41f35d0b3d
Merge pull request #1089 from dhpollack/dhp/use_pytorch_layernorm
...
change layernorm code to pytorch's native layer norm
2019-08-30 14:49:08 +02:00
thomwolf
c9bce1811c
fixing model to add torchscript, embedding resizing, head pruning and masking + tests
2019-08-28 13:22:45 +02:00
VictorSanh
7f5d85347e
fix small typo
2019-08-28 02:44:51 +00:00
David Pollack
e13465fb8b
change layernorm code to pytorch's native layer norm
2019-08-23 12:12:12 +02:00
thomwolf
53c8f700f4
fix #808
2019-08-20 11:29:26 +02:00
Lysandre
c589862b78
Doc: loading from config alone does not load the model weights
2019-08-19 10:17:47 -04:00
Kevin Trebing
7060766490
Corrected logger.error info
...
Signed-off-by: Kevin Trebing <Kevin.Trebing@gmx.net>
2019-08-09 19:36:44 -04:00
LysandreJik
f2b300df6b
fix #976
2019-08-08 10:38:57 -04:00
LysandreJik
7df303f5ad
fix #971
2019-08-08 10:36:26 -04:00
Thomas Wolf
7729ef7381
Merge pull request #955 from FeiWang96/master
...
Fix comment typo
2019-08-07 10:11:25 +02:00
Thomas Wolf
7e6102ce74
Merge pull request #963 from guotong1988/patch-2
...
Update modeling_bert.py
2019-08-07 10:09:04 +02:00
wangfei
6ec1ee9ec2
Fix examples in docstring
2019-08-06 11:32:54 +08:00
wangfei
beb03ec6c5
Fix examples of loading pretrained models in docstring
2019-08-06 11:24:46 +08:00
雷打不动!
d7fd10568c
Update modeling_bert.py
2019-08-05 08:58:19 +08:00
wangfei
a24f830604
Fix comment typo
2019-08-03 12:17:06 +08:00
thomwolf
bfbe52ec39
cleaning up example docstrings
2019-07-27 20:25:39 +02:00
thomwolf
0227b4a940
fix #827
2019-07-23 14:06:43 +02:00
thomwolf
5fe0b378d8
adding missing docstring fix #793
2019-07-16 21:35:53 +02:00
thomwolf
f289e6cfe4
fix docstrings
2019-07-16 15:31:21 +02:00
thomwolf
f7cd7392fd
fixed tests
2019-07-15 12:32:19 +02:00
thomwolf
44c985facd
update doc for XLM and XLNet
2019-07-15 11:36:50 +02:00
thomwolf
4cb489457f
added doc for openai GPT
2019-07-15 09:58:01 +02:00
thomwolf
62b8eb43c1
fix add_start_docstrings on python 2 (removed)
2019-07-15 09:49:02 +02:00
thomwolf
5bc3d0cc5b
added gpt2 doc
2019-07-15 09:40:05 +02:00
thomwolf
183fedfed5
fix doc on python2
2019-07-15 09:00:09 +02:00
thomwolf
2397f958f9
updating examples and doc
2019-07-14 23:20:10 +02:00
thomwolf
2918b7d2a0
updating tests
2019-07-12 10:57:58 +02:00
LysandreJik
3fbceed8d2
Fix layer reference loss + previous attempted fix
2019-07-11 22:29:55 -04:00
LysandreJik
6c2ee16c04
Test suite testing the tie_weights function as well as the resize_token_embeddings function.
...
Patched an issue relating to the tied weights I had introduced with the TorchScript addition.
Byte order mark management in TSV glue reading.
2019-07-11 22:09:16 -04:00
thomwolf
bd404735a7
embeddings resizing + tie_weights
2019-07-12 00:02:49 +02:00
Thomas Wolf
b87eb82b4f
Merge branch 'xlnet' into doc-sphinx
2019-07-11 15:46:27 +02:00
thomwolf
4fef5919a5
updating examples
2019-07-11 12:03:08 +02:00
LysandreJik
f773faa258
Fixed all links. Removed TPU. Changed CLI to Converting TF models. Many minor formatting adjustments. Added "TODO Lysandre filled" where necessary.
2019-07-10 14:45:56 -04:00
LysandreJik
83fb311ef7
Patched warnings + Refactored XLNet's Docstrings
2019-07-09 16:38:30 -04:00
LysandreJik
8fe2c9d98e
Refactored Docstrings of BERT, GPT2, GPT, TransfoXL, XLM and XLNet.
2019-07-09 15:55:31 -04:00
thomwolf
ed6c8d37f4
fix merge
2019-07-09 17:14:52 +02:00
thomwolf
e468192e2f
Merge branch 'pytorch-transformers' into xlnet
2019-07-09 17:05:37 +02:00
thomwolf
d5481cbe1b
adding tests to examples - updating summary module - coverage update
2019-07-09 15:29:42 +02:00
thomwolf
b19786985d
unified tokenizer api and serialization + tests
2019-07-09 10:25:18 +02:00
thomwolf
36bca545ff
tokenization abstract class - tests for examples
2019-07-05 15:02:59 +02:00
thomwolf
0bab55d5d5
[BIG] name change
2019-07-05 11:55:36 +02:00