Julien Chaumond
62760baf46
tiny fixes
2019-09-17 18:29:15 -04:00
mattolson93
f2cf6ce4a9
Fixing typo in gpt2 for doc site's class link
2019-09-10 09:12:01 -07:00
Thomas Wolf
3f05de6dde
Merge branch 'master' into reorder_arguments
2019-09-09 15:42:25 +03:00
thomwolf
2a667b1eb9
split configuration and modeling files
2019-09-05 00:27:11 +02:00
thomwolf
7fba47b7d9
WIP reordering
2019-09-04 22:39:23 +02:00
thomwolf
e25cba78cf
WIP reodering arguments for torchscript and TF
2019-09-04 22:39:23 +02:00
thomwolf
bdb4409ed8
updated pruning logic with sets - Bert and GPT-2
2019-08-31 00:33:50 -04:00
LysandreJik
0cd283522a
Attempt to fix head index
2019-08-31 00:33:50 -04:00
LysandreJik
87747518e9
Blocks deletion from already deleted heads. Necessary integration test.
...
Now raises a warning when a head to be deleted already has been deleted. An integration test verifying the total pipeline (-> from config -> save model -> load model -> additional head pruning) has been added.
2019-08-31 00:33:50 -04:00
LysandreJik
719cb3738d
Pruning for GPT and GPT-2
2019-08-31 00:33:50 -04:00
Thomas Wolf
50e615f43d
Merge branch 'master' into improved_testing
2019-08-30 13:40:35 +02:00
thomwolf
0f5a799456
fix GPT2DoubleHeadModel docstring
2019-08-30 11:49:23 +02:00
thomwolf
d51f72d5de
adding shortcut to the ids of all the special tokens
2019-08-30 11:41:11 +02:00
thomwolf
0517e7a1cb
Fix GPT2 and RoBERTa tokenizer to beging with a space - update Roberta tokenizer
2019-08-30 11:23:49 +02:00
Thomas Wolf
edc5222fc3
Merge pull request #1118 from CrafterKolyan/patch-2
...
Documentation fix #1117
2019-08-27 14:58:50 +02:00
Nikolay Korolev
26bda77225
Fix documentation #1117
...
Rename parameter in documentation + Delete its second occurrence.
2019-08-27 12:22:42 +03:00
Nikolay Korolev
c8933bb2d9
Delete nonexistent parameter from documentation
...
Changed documentation of GPT2Model, GPT2LMHeadModel and GPT2DoubleHeadsModel
2019-08-27 12:10:36 +03:00
Lysandre
034aa0c2d7
Fixed GPT2DoubleHeadsModel example and weight tying
2019-08-21 17:27:38 -04:00
Lysandre
814a3f4e01
Removed attention_mask
from GPT-2 and GPT documentation. Corrected multiple_choice_labels
to actual name mc_labels
2019-08-21 14:11:14 -04:00
thomwolf
aa05dc8935
adding gpt-2 large
2019-08-21 02:29:34 +02:00
Thomas Wolf
e4515faf54
Merge pull request #1057 from huggingface/fixes
...
Add a few of typos corrections, bugs fixes and small improvements
2019-08-21 01:54:05 +02:00
Julien Chaumond
b0b9b8091b
minor typo
2019-08-20 11:33:46 +02:00
thomwolf
53c8f700f4
fix #808
2019-08-20 11:29:26 +02:00
Lysandre
c589862b78
Doc: loading from config alone does not load the model weights
2019-08-19 10:17:47 -04:00
thomwolf
aaedfc35a8
Merge branch 'master' of https://github.com/huggingface/pytorch-transformers
2019-08-10 20:04:37 +02:00
thomwolf
c683c3d5a5
fix #993
2019-08-10 20:04:35 +02:00
Kevin Trebing
7060766490
Corrected logger.error info
...
Signed-off-by: Kevin Trebing <Kevin.Trebing@gmx.net>
2019-08-09 19:36:44 -04:00
LysandreJik
7df303f5ad
fix #971
2019-08-08 10:36:26 -04:00
wangfei
f889e77b9c
Fix examples of loading pretrained models in docstring
2019-08-06 11:30:35 +08:00
wangfei
beb03ec6c5
Fix examples of loading pretrained models in docstring
2019-08-06 11:24:46 +08:00
thomwolf
b90e29d52c
working on automodels
2019-08-05 16:06:34 +02:00
thomwolf
bfbe52ec39
cleaning up example docstrings
2019-07-27 20:25:39 +02:00
thomwolf
0227b4a940
fix #827
2019-07-23 14:06:43 +02:00
thomwolf
f289e6cfe4
fix docstrings
2019-07-16 15:31:21 +02:00
thomwolf
f7cd7392fd
fixed tests
2019-07-15 12:32:19 +02:00
thomwolf
44c985facd
update doc for XLM and XLNet
2019-07-15 11:36:50 +02:00
thomwolf
0201d86015
added doc for transformer-xl
2019-07-15 10:11:09 +02:00
thomwolf
62b8eb43c1
fix add_start_docstrings on python 2 (removed)
2019-07-15 09:49:02 +02:00
thomwolf
5bc3d0cc5b
added gpt2 doc
2019-07-15 09:40:05 +02:00
thomwolf
7d4b200e40
good quality generation example for GPT, GPT-2, Transfo-XL, XLNet
2019-07-13 15:25:03 +02:00
thomwolf
2918b7d2a0
updating tests
2019-07-12 10:57:58 +02:00
LysandreJik
3fbceed8d2
Fix layer reference loss + previous attempted fix
2019-07-11 22:29:55 -04:00
LysandreJik
6c2ee16c04
Test suite testing the tie_weights function as well as the resize_token_embeddings function.
...
Patched an issue relating to the tied weights I had introduced with the TorchScript addition.
Byte order mark management in TSV glue reading.
2019-07-11 22:09:16 -04:00
thomwolf
bd404735a7
embeddings resizing + tie_weights
2019-07-12 00:02:49 +02:00
LysandreJik
50e62a4cb4
fix gpt/gpt-2 from pretrained
2019-07-11 16:50:21 -04:00
thomwolf
273617b86d
update config - fix gpt/gpt-2 from pretrained
2019-07-11 22:45:03 +02:00
Thomas Wolf
b87eb82b4f
Merge branch 'xlnet' into doc-sphinx
2019-07-11 15:46:27 +02:00
thomwolf
4fef5919a5
updating examples
2019-07-11 12:03:08 +02:00
LysandreJik
5288913bdd
All TODOs to be checked by Thom have been added.
2019-07-10 15:16:40 -04:00
LysandreJik
8fe2c9d98e
Refactored Docstrings of BERT, GPT2, GPT, TransfoXL, XLM and XLNet.
2019-07-09 15:55:31 -04:00