Commit Graph

102 Commits

Author SHA1 Message Date
thomwolf
b514a60c36 added tests for OpenAI GPT and Transformer-XL tokenizers 2019-02-11 10:17:16 +01:00
thomwolf
9f9909ea2f update readme 2019-02-09 16:59:21 +01:00
thomwolf
0c1a6f9b1d update readme 2019-02-08 22:32:25 +01:00
thomwolf
009b581316 updated readme 2019-02-07 23:15:05 +01:00
thomwolf
f99f2fb661 docstrings 2019-02-07 17:07:22 +01:00
Thomas Wolf
848aae49e1
Merge branch 'master' into python_2 2019-02-06 00:13:20 +01:00
thomwolf
ba37ddc5ce fix run_lm_modeling example command line 2019-02-06 00:07:08 +01:00
Girishkumar
0dd2b750ca
Minor update in README
Update links to classes in `modeling.py`
2019-01-30 23:49:15 +05:30
thomwolf
3a848111e6 update config, docstrings and readme to switch to seperated tokens and position embeddings 2019-01-29 11:00:11 +01:00
Davide Fiocco
35115eaf93
(very) minor update to README 2019-01-16 21:05:24 +01:00
nhatchan
8edc898f63 Fix documentation (missing backslashes)
This PR adds missing backslashes in LM Fine-tuning subsection in README.md.
2019-01-13 21:23:19 +09:00
thomwolf
e5c78c6684 update readme and few typos 2019-01-10 01:40:00 +01:00
thomwolf
fa5222c296 update readme 2019-01-10 01:25:28 +01:00
Thomas Wolf
c18bdb4433
Merge pull request #124 from deepset-ai/master
Add example for fine tuning BERT language model
2019-01-07 12:03:51 +01:00
Julien Chaumond
8da280ebbe Setup CI 2018-12-20 16:33:39 -05:00
tholor
e5fc98c542 add exemplary training data. update to nvidia apex. refactor 'item -> line in doc' mapping. add warning for unknown word. 2018-12-20 18:30:52 +01:00
tholor
67f4dd56a3 update readme for run_lm_finetuning 2018-12-19 09:22:37 +01:00
Julien Chaumond
d57763f582 Fix typos 2018-12-18 19:23:22 -05:00
Thomas Wolf
786cc41299
Typos in readme 2018-12-17 09:22:18 +01:00
Daniel Khashabi
8b1b93947f
Minor fix. 2018-12-14 14:10:36 -05:00
Thomas Wolf
8809eb6c93
update readme with information on NVIDIA's apex 2018-12-14 16:59:39 +01:00
thomwolf
d821358884 update readme 2018-12-14 15:15:17 +01:00
thomwolf
087798b7fa fix reloading model for evaluation in examples 2018-12-13 14:48:12 +01:00
thomwolf
0f544625f4 fix swag example for work with apex 2018-12-13 13:35:59 +01:00
thomwolf
4946c2c500 run_swag example in readme 2018-12-13 13:02:07 +01:00
Thomas Wolf
91aab2a6d3
Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex
Change to use apex for better fp16 and multi-gpu support
2018-12-13 12:32:37 +01:00
Thomas Wolf
ffe9075f48
Merge pull request #96 from rodgzilla/multiple-choice-code
BertForMultipleChoice and Swag dataset example.
2018-12-13 12:05:11 +01:00
Grégory Châtel
dcb50eaa4b Swag example readme section update with gradient accumulation run. 2018-12-12 18:17:46 +01:00
Deyu Fu
c8ea286048 change to apex for better fp16 and multi-gpu support 2018-12-11 17:13:58 -08:00
Thomas Wolf
a3a3180c86
Bump up requirements to Python 3.6 2018-12-11 11:29:45 +01:00
Grégory Châtel
0876b77f7f Change to the README file to add SWAG results. 2018-12-10 15:34:19 +01:00
Davide Fiocco
c9f67e037c
Adding --do_lower_case for all uncased BERTs
I had missed those, it should make sense to use them
2018-12-07 20:40:56 +01:00
Grégory Châtel
150f3cd9fa Few typos in README.md 2018-12-06 19:22:07 +01:00
Grégory Châtel
4fa7892d64 Wrong line number link to modeling file. 2018-12-06 19:18:29 +01:00
Grégory Châtel
6a26e19ea3 Updating README.md with SWAG example informations. 2018-12-06 19:15:08 +01:00
Grégory Châtel
0a7c8bdcac Fixing badly formatted links. 2018-12-04 13:43:56 +01:00
Grégory Châtel
3113e967db Adding links to examples files. 2018-12-04 13:40:38 +01:00
Davide Fiocco
8a8aa59d8c
Update finetuning example adding --do_lower_case
Should be consistent with the fact that an uncased model is used
2018-12-01 01:00:05 +01:00
thomwolf
f9f3bdd60b update readme 2018-11-30 23:05:18 +01:00
thomwolf
52ff0590ff tup => tpu 2018-11-30 23:01:10 +01:00
thomwolf
296f006132 added BertForTokenClassification model 2018-11-30 13:56:53 +01:00
thomwolf
298107fed7 Added new bert models 2018-11-30 13:56:02 +01:00
Davide Fiocco
ec2c339b53
Updated quick-start example with BertForMaskedLM
As `convert_ids_to_tokens` returns a list, the code in the README currently throws an `AssertionError`, so I propose I quick fix.
2018-11-28 14:53:46 +01:00
thomwolf
05053d163c update cache_dir in readme and examples 2018-11-26 10:45:13 +01:00
thomwolf
029bdc0d50 fixing readme examples 2018-11-26 09:56:41 +01:00
Thomas Wolf
60e01ac427
fix link in readme 2018-11-21 12:08:30 +01:00
Thomas Wolf
fd32ebed81
Merge pull request #42 from weiyumou/master
Fixed UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2
2018-11-20 10:09:50 +01:00
thomwolf
eed255a58d fixing CLI typo in readme 2018-11-20 10:02:57 +01:00
weiyumou
9ff2b7d86d Fixed README typo 2018-11-19 23:13:10 -05:00
Thomas Wolf
da73925f6a
fix typos 2018-11-19 20:58:48 +01:00