Commit Graph

54 Commits

Author SHA1 Message Date
mattolson93
f2cf6ce4a9
Fixing typo in gpt2 for doc site's class link 2019-09-10 09:12:01 -07:00
Thomas Wolf
3f05de6dde
Merge branch 'master' into reorder_arguments 2019-09-09 15:42:25 +03:00
thomwolf
2a667b1eb9 split configuration and modeling files 2019-09-05 00:27:11 +02:00
thomwolf
7fba47b7d9 WIP reordering 2019-09-04 22:39:23 +02:00
thomwolf
e25cba78cf WIP reodering arguments for torchscript and TF 2019-09-04 22:39:23 +02:00
thomwolf
bdb4409ed8 updated pruning logic with sets - Bert and GPT-2 2019-08-31 00:33:50 -04:00
LysandreJik
0cd283522a Attempt to fix head index 2019-08-31 00:33:50 -04:00
LysandreJik
87747518e9 Blocks deletion from already deleted heads. Necessary integration test.
Now raises a warning when a head to be deleted already has been deleted. An integration test verifying the total pipeline (-> from config -> save model -> load model -> additional head pruning) has been added.
2019-08-31 00:33:50 -04:00
LysandreJik
719cb3738d Pruning for GPT and GPT-2 2019-08-31 00:33:50 -04:00
Thomas Wolf
50e615f43d
Merge branch 'master' into improved_testing 2019-08-30 13:40:35 +02:00
thomwolf
0f5a799456 fix GPT2DoubleHeadModel docstring 2019-08-30 11:49:23 +02:00
thomwolf
d51f72d5de adding shortcut to the ids of all the special tokens 2019-08-30 11:41:11 +02:00
thomwolf
0517e7a1cb Fix GPT2 and RoBERTa tokenizer to beging with a space - update Roberta tokenizer 2019-08-30 11:23:49 +02:00
Thomas Wolf
edc5222fc3
Merge pull request #1118 from CrafterKolyan/patch-2
Documentation fix #1117
2019-08-27 14:58:50 +02:00
Nikolay Korolev
26bda77225
Fix documentation #1117
Rename parameter in documentation + Delete its second occurrence.
2019-08-27 12:22:42 +03:00
Nikolay Korolev
c8933bb2d9
Delete nonexistent parameter from documentation
Changed documentation of GPT2Model, GPT2LMHeadModel and GPT2DoubleHeadsModel
2019-08-27 12:10:36 +03:00
Lysandre
034aa0c2d7 Fixed GPT2DoubleHeadsModel example and weight tying 2019-08-21 17:27:38 -04:00
Lysandre
814a3f4e01 Removed attention_mask from GPT-2 and GPT documentation. Corrected multiple_choice_labels to actual name mc_labels 2019-08-21 14:11:14 -04:00
thomwolf
aa05dc8935 adding gpt-2 large 2019-08-21 02:29:34 +02:00
Thomas Wolf
e4515faf54
Merge pull request #1057 from huggingface/fixes
Add a few of typos corrections, bugs fixes and small improvements
2019-08-21 01:54:05 +02:00
Julien Chaumond
b0b9b8091b minor typo 2019-08-20 11:33:46 +02:00
thomwolf
53c8f700f4 fix #808 2019-08-20 11:29:26 +02:00
Lysandre
c589862b78 Doc: loading from config alone does not load the model weights 2019-08-19 10:17:47 -04:00
thomwolf
aaedfc35a8 Merge branch 'master' of https://github.com/huggingface/pytorch-transformers 2019-08-10 20:04:37 +02:00
thomwolf
c683c3d5a5 fix #993 2019-08-10 20:04:35 +02:00
Kevin Trebing
7060766490 Corrected logger.error info
Signed-off-by: Kevin Trebing <Kevin.Trebing@gmx.net>
2019-08-09 19:36:44 -04:00
LysandreJik
7df303f5ad fix #971 2019-08-08 10:36:26 -04:00
wangfei
f889e77b9c Fix examples of loading pretrained models in docstring 2019-08-06 11:30:35 +08:00
wangfei
beb03ec6c5 Fix examples of loading pretrained models in docstring 2019-08-06 11:24:46 +08:00
thomwolf
b90e29d52c working on automodels 2019-08-05 16:06:34 +02:00
thomwolf
bfbe52ec39 cleaning up example docstrings 2019-07-27 20:25:39 +02:00
thomwolf
0227b4a940 fix #827 2019-07-23 14:06:43 +02:00
thomwolf
f289e6cfe4 fix docstrings 2019-07-16 15:31:21 +02:00
thomwolf
f7cd7392fd fixed tests 2019-07-15 12:32:19 +02:00
thomwolf
44c985facd update doc for XLM and XLNet 2019-07-15 11:36:50 +02:00
thomwolf
0201d86015 added doc for transformer-xl 2019-07-15 10:11:09 +02:00
thomwolf
62b8eb43c1 fix add_start_docstrings on python 2 (removed) 2019-07-15 09:49:02 +02:00
thomwolf
5bc3d0cc5b added gpt2 doc 2019-07-15 09:40:05 +02:00
thomwolf
7d4b200e40 good quality generation example for GPT, GPT-2, Transfo-XL, XLNet 2019-07-13 15:25:03 +02:00
thomwolf
2918b7d2a0 updating tests 2019-07-12 10:57:58 +02:00
LysandreJik
3fbceed8d2 Fix layer reference loss + previous attempted fix 2019-07-11 22:29:55 -04:00
LysandreJik
6c2ee16c04 Test suite testing the tie_weights function as well as the resize_token_embeddings function.
Patched an issue relating to the tied weights I had introduced with the TorchScript addition.
Byte order mark management in TSV glue reading.
2019-07-11 22:09:16 -04:00
thomwolf
bd404735a7 embeddings resizing + tie_weights 2019-07-12 00:02:49 +02:00
LysandreJik
50e62a4cb4 fix gpt/gpt-2 from pretrained 2019-07-11 16:50:21 -04:00
thomwolf
273617b86d update config - fix gpt/gpt-2 from pretrained 2019-07-11 22:45:03 +02:00
Thomas Wolf
b87eb82b4f
Merge branch 'xlnet' into doc-sphinx 2019-07-11 15:46:27 +02:00
thomwolf
4fef5919a5 updating examples 2019-07-11 12:03:08 +02:00
LysandreJik
5288913bdd All TODOs to be checked by Thom have been added. 2019-07-10 15:16:40 -04:00
LysandreJik
8fe2c9d98e Refactored Docstrings of BERT, GPT2, GPT, TransfoXL, XLM and XLNet. 2019-07-09 15:55:31 -04:00
thomwolf
d0efbd3cd1 update sequencesummary module 2019-07-09 15:46:43 +02:00