Matthew Carrigan
b8e2a9c584
Made --reduce_memory actually do something in finetune_on_pregenerated
2019-04-22 14:01:48 +01:00
Matt
af8a0384fc
Merge pull request #1 from huggingface/master
Pulling commits from main repo
2019-04-22 13:56:47 +01:00
Sangwhan Moon
14b1f719f4
Fix indentation weirdness in GPT-2 example.
2019-04-22 02:20:22 +09:00
lukovnikov
69850b4011
python 2 compat
2019-04-21 14:02:38 +02:00
lukovnikov
bb7557d3ab
- removed __all__ in optimization
- removed unused plotting code
- using ABC for LRSchedule
- added some schedule object init tests
2019-04-21 13:48:33 +02:00
lukovnikov
34ccc8ebf4
Merge remote-tracking branch 'upstream/master'
2019-04-21 13:16:15 +02:00
Ailing Zhang
bfd6f6b257
fix from_pretrained positional args
2019-04-17 16:31:40 -07:00
Ailing Zhang
ae4c9fee73
add hubconf
2019-04-17 13:34:34 -07:00
Thomas Wolf
68a889ee43
Merge pull request #500 from huggingface/network
Updating network handling
2019-04-17 15:22:14 +02:00
thomwolf
34ae5bf838
small clean up in tests
2019-04-17 14:52:12 +02:00
thomwolf
23d4554ec0
is python 2 happy now
2019-04-17 14:48:34 +02:00
thomwolf
265550ec34
relax network connection requirements
2019-04-17 14:22:35 +02:00
thomwolf
fa76520240
fix file_utils on python 2
2019-04-17 13:32:22 +02:00
thomwolf
bcde2c61cb
fix #497
2019-04-17 12:35:38 +02:00
thomwolf
929579f3b5
fix #497
2019-04-17 12:35:08 +02:00
thomwolf
31d387604c
adding s3 model tests with --runslow
2019-04-17 11:58:27 +02:00
Thomas Wolf
8407429d74
Merge pull request #494 from SudoSharma/patch-1
Fix indentation for unconditional generation
2019-04-17 11:11:36 +02:00
Thomas Wolf
2e153930cf
Merge pull request #495 from SudoSharma/patch-2
Fix gradient overflow issue during attention mask
2019-04-17 11:10:36 +02:00
Thomas Wolf
46078e1b46
Merge pull request #496 from 8enmann/patch-1
[run_gpt2.py] temperature should be a float, not int
2019-04-17 11:08:54 +02:00
Thomas Wolf
b8686130ca
Merge pull request #498 from huggingface/GPT2_tokenization
Gpt2 tokenization
2019-04-17 11:06:41 +02:00
thomwolf
5afa497cbf
fix GPT-2 tokenization to work also on python 3...
2019-04-17 11:04:41 +02:00
thomwolf
bc70779bf0
fixed GPT-2 tokenization on python 2
2019-04-17 10:56:15 +02:00
Ben Mann
87677fcc4d
[run_gpt2.py] temperature should be a float, not int
2019-04-16 15:23:21 -07:00
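A minimal sketch of the kind of change this commit title describes, assuming the script reads its sampling temperature through argparse. The flag name and default below are illustrative assumptions, not copied from run_gpt2.py.

```python
import argparse

parser = argparse.ArgumentParser()
# Hypothetical flag definition. With type=int, a fractional value such as
# "0.7" would be rejected at parse time; type=float accepts it.
parser.add_argument("--temperature", type=float, default=1.0)

args = parser.parse_args(["--temperature", "0.7"])
print(args.temperature)
```

Declaring the type as float matters because temperatures below 1 are a common way to sharpen the sampling distribution, and an integer-typed flag cannot express them.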
Abhi Sharma
9e666aaa29
Fix gradient overflow issue during attention mask
This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!
2019-04-16 11:42:34 -07:00
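The commit above replaces 1e10 with 1e4 in the attention mask. The sketch below illustrates one way a constant that large can break half-precision arithmetic. It assumes the mask is applied additively as `score * keep - big * (1 - keep)`; the helper names `to_fp16` and `masked_score` are hypothetical, and the exact failure mode in the library may differ.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def to_fp16(x):
    """Round a Python float to half precision, saturating to inf on overflow."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        return math.copysign(math.inf, x)


def masked_score(score, keep, big):
    """Additive mask (assumed form): keep the score where keep == 1,
    push it down to -big where keep == 0."""
    big16 = to_fp16(big)
    return score * keep - big16 * (1.0 - keep)


# 1e10 exceeds FP16_MAX and becomes inf, so inf * 0 yields nan even on a
# position that should have been kept.
old = masked_score(0.5, 1.0, 1e10)
# 1e4 is representable in half precision, so the kept score is unchanged.
new = masked_score(0.5, 1.0, 1e4)
print(math.isnan(old), new)
```

A masked position still receives a strongly negative score with the smaller constant (`masked_score(0.5, 0.0, 1e4)` gives `-10000.0`), which is consistent with the commit's report that generation output did not change.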
Abhi Sharma
07154dadb4
Fix indentation for unconditional generation
2019-04-16 11:11:49 -07:00
thomwolf
bdaba1897c
updating GPT tokenization
2019-04-16 17:44:06 +02:00
thomwolf
18a8a15f78
improving GPT2 tokenization and adding tests
2019-04-16 17:00:55 +02:00
Thomas Wolf
3d78e226e6
Merge pull request #489 from huggingface/tokenization_serialization
Better serialization for Tokenizers and Configuration classes - Also fix #466
2019-04-16 08:49:54 +02:00
thomwolf
3571187ef6
fix saving models in distributed setting examples
2019-04-15 16:43:56 +02:00
Thomas Wolf
64b6ef4db0
Merge pull request #490 from huggingface/better_finetuning_GPT_GPT-2
Clean up GPT and GPT-2 losses computation
2019-04-15 16:14:50 +02:00
thomwolf
d616022455
fix openai special tokens loading
2019-04-15 16:07:45 +02:00
thomwolf
df5d9c3551
load all models on cpu
2019-04-15 15:43:01 +02:00
thomwolf
2499b0a5fc
add ptvsd to run_squad
2019-04-15 15:33:04 +02:00
thomwolf
7816f7921f
clean up distributed training logging in run_squad example
2019-04-15 15:27:10 +02:00
thomwolf
1135f2384a
clean up logger in examples for distributed case
2019-04-15 15:22:40 +02:00
thomwolf
cc43307023
update readme
2019-04-15 15:06:10 +02:00
thomwolf
60ea6c59d2
added best practices for serialization in README and examples
2019-04-15 15:00:33 +02:00
thomwolf
179a2c2ff6
update example to work with new serialization semantic
2019-04-15 14:33:23 +02:00
thomwolf
b3c6ee0ac1
tokenization updates
2019-04-15 14:24:52 +02:00
thomwolf
20577d8a7c
add configuration serialization to readme
2019-04-15 14:21:41 +02:00
thomwolf
9761aa4845
add to_json_file method to configuration classes
2019-04-15 14:12:08 +02:00
thomwolf
b17963d82f
update readme
2019-04-15 13:44:30 +02:00
thomwolf
e8568a3b17
fixing tests
2019-04-15 12:55:38 +02:00
thomwolf
870b734bfd
added tokenizers serialization tests
2019-04-15 12:03:56 +02:00
thomwolf
3e65f255dc
add serialization semantics to tokenizers - fix transfo-xl tokenizer
2019-04-15 11:47:25 +02:00
Thomas Wolf
6b35cfd28f
Merge pull request #423 from dhanajitb/master
making unconditional generation work
2019-04-15 11:01:53 +02:00
Thomas Wolf
aff44f0c08
Merge branch 'master' into master
2019-04-15 10:58:34 +02:00
Thomas Wolf
7e7e4753c8
Merge pull request #480 from mboyanov/docs/cls_token_info
Extend the BertForSequenceClassification docs to mention the special CLS token.
2019-04-15 10:57:25 +02:00
Thomas Wolf
bb61b747df
Merge pull request #474 from jiesutd/master
Fix tsv read error in Windows
2019-04-15 10:56:48 +02:00
Thomas Wolf
7873d76464
Merge pull request #478 from Rocketknight1/master
Added a helpful error for users with single-document corpuses - fixes #452
2019-04-15 10:55:57 +02:00