lukovnikov
90a41dbe14
BertAdam schedule objects
2019-03-09 02:23:20 +01:00
Junjie Qian
d648a02203
Correct line number in README for classes
2019-03-08 16:28:03 -08:00
lukovnikov
88874f6cf0
BertAdam schedule objects
2019-03-08 19:08:30 +01:00
Elon Musk
66d8206809
Update run_gpt2.py
2019-03-08 11:59:08 -05:00
Haozhe Ji
72fa8d03a7
add 'padding_idx=0' for BertEmbeddings
2019-03-07 20:02:55 +08:00
Philipp Glock
6190e8ce4c
Fix: use dropout layer
2019-03-07 10:12:45 +01:00
thomwolf
7cc35c3104
fix openai gpt example and updating readme
2019-03-06 11:43:21 +01:00
thomwolf
906b638efa
updating readme
2019-03-06 10:24:19 +01:00
thomwolf
994d86609b
fixing PYTORCH_PRETRAINED_BERT_CACHE use in examples
2019-03-06 10:21:24 +01:00
thomwolf
2dd8f524f5
removing test for long sequences error following #337
2019-03-06 10:10:41 +01:00
thomwolf
5c85fc3977
fix typo - logger info
2019-03-06 10:05:21 +01:00
Thomas Wolf
8e36da7acb
Merge pull request #347 from jplehmann/feature/sst2-processor
Processor for SST-2 task
2019-03-06 09:48:27 +01:00
Thomas Wolf
21c88a07b7
Merge pull request #341 from potatochip/patch-1
catch exception if pathlib not install
2019-03-06 09:48:01 +01:00
Thomas Wolf
3c01dfb775
Merge pull request #338 from CatalinVoss/patch-3
Fix top k generation for k != 0
2019-03-06 09:47:33 +01:00
Thomas Wolf
477ec4b6cc
Merge pull request #337 from CatalinVoss/patch-2
Allow tokenization of sequences > 512 for caching
2019-03-06 09:45:49 +01:00
Thomas Wolf
7b9e5a54b5
Merge pull request #327 from lukovnikov/master
Issue#324: warmup linear fixes
2019-03-06 09:44:56 +01:00
Thomas Wolf
4784b04f47
Merge pull request #325 from john-hewitt/master
add BertTokenizer flag to skip basic tokenization
2019-03-06 09:37:11 +01:00
Catalin Voss
4a49c22584
Warn instead of raising in BERT and GPT-2 tokenizers as well, to allow for pre-caching of tokens
2019-03-05 12:31:45 -08:00
Catalin Voss
e99bc87e4d
Merge branch 'patch-1' into patch-2
2019-03-05 12:24:18 -08:00
John Lehmann
0f96d4b1f7
Run classifier processor for SST-2.
2019-03-05 13:38:28 -06:00
Aaron Mangum
0c970caa4a
catch exception if pathlib not install
2019-03-04 14:30:19 -08:00
Catalin Voss
4b4b079272
Fix top k generation for k != 0
2019-03-02 21:54:44 -08:00
Catalin Voss
9775b2eb27
Allow tokenization of sequences > 512 for caching
For many applications requiring randomized data access, it's easier to cache the tokenized representations than the words. So why not turn this into a warning?
2019-03-02 16:30:21 -08:00
Catalin Voss
c0cf0a04d5
Fix typo
2019-02-27 18:01:06 -08:00
John Hewitt
4d1ad83236
update docstring of BERT tokenizer to reflect do_wordpiece_only
2019-02-27 14:50:41 -08:00
lukovnikov
35410da758
added warning
2019-02-27 17:11:42 +01:00
lukovnikov
4d79e0d386
added warning
2019-02-27 16:50:05 +01:00
lukovnikov
66a84b63b0
added warning
2019-02-27 16:38:00 +01:00
lukovnikov
070f3b21d8
added warning
2019-02-27 16:26:45 +01:00
lukovnikov
46ef646016
added warning
2019-02-27 16:22:27 +01:00
lukovnikov
9bc3773c84
added warning
2019-02-27 16:10:31 +01:00
lukovnikov
60a372387f
added warning
2019-02-27 15:54:09 +01:00
John Hewitt
e14c6b52e3
add BertTokenizer flag to skip basic tokenization
2019-02-26 20:11:24 -08:00
lukovnikov
da2d8ca265
fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly)
+ copied BERT optimization warmup functions to OpenAI optimization file + added comments
2019-02-26 17:16:06 +01:00
lukovnikov
e04bab59e1
fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly)
+ copied BERT optimization warmup functions to OpenAI optimization file + added comments
2019-02-26 16:22:52 +01:00
Thomas Wolf
2152bfeae8
Merge pull request #316 from joelgrus/gpt2docs
update documentation for gpt-2
2019-02-24 09:38:29 +01:00
Joel Grus
8722e9eb3b
finish updating docstrings
2019-02-23 06:31:59 -08:00
Joel Grus
33aa7a80ca
update documentation
2019-02-22 15:37:59 -08:00
Thomas Wolf
a5b3a89545
Merge pull request #310 from spolu/spolu-nits_gpt2
Few small nits in GPT-2's README code examples
2019-02-21 10:23:27 +01:00
Stanislas Polu
ff22b3acc0
Few small nits in GPT-2's code examples
2019-02-21 09:15:27 +00:00
Thomas Wolf
cbb7fad319
Merge pull request #307 from guotong1988/patch-1
Update README.md
2019-02-21 09:25:19 +01:00
Tong Guo
09efcece75
Update README.md
2019-02-21 11:25:33 +08:00
Thomas Wolf
97c815dae2
Merge pull request #305 from bkj/patch-1
Update run_openai_gpt.py
2019-02-20 21:24:06 +01:00
Ben Johnson
8607233679
Update run_openai_gpt.py
2019-02-20 13:58:54 -05:00
Thomas Wolf
f50b82af04
Merge pull request #302 from yongbowin/master
typo
2019-02-20 14:14:02 +01:00
Yongbo Wang
2fdab323d1
typo
2019-02-20 21:11:06 +08:00
Yongbo Wang
813e4d18ba
typo
2019-02-20 21:10:07 +08:00
Thomas Wolf
8337740754
Merge pull request #295 from tnlin/master
fix broken link in readme
2019-02-19 14:00:28 +01:00
Tony Lin
5b0e0b61f0
fix typo in readme
2019-02-19 20:34:18 +08:00
Thomas Wolf
3ca35b99ba
Merge pull request #293 from davidefiocco/patch-2
Minor README typos corrected
2019-02-19 09:00:01 +01:00