thomwolf
|
93e9971c54
|
fix tests
|
2019-06-26 10:02:45 +02:00 |
|
thomwolf
|
e55d4c4ede
|
various updates to conversion, models and examples
|
2019-06-26 00:57:53 +02:00 |
|
thomwolf
|
603c513b35
|
update main conversion script and readme
|
2019-06-25 10:45:07 +02:00 |
|
thomwolf
|
62d78aa37e
|
updating GLUE utils for compatibility with XLNet
|
2019-06-24 14:36:11 +02:00 |
|
thomwolf
|
c304593d8f
|
BERTology details in readme
|
2019-06-20 10:05:06 +02:00 |
|
thomwolf
|
34d706a0e1
|
pruning in bertology
|
2019-06-19 15:25:49 +02:00 |
|
thomwolf
|
dc8e0019b7
|
updating examples
|
2019-06-19 13:23:20 +02:00 |
|
thomwolf
|
68ab9599ce
|
small fix and updates to readme
|
2019-06-19 09:38:38 +02:00 |
|
thomwolf
|
4d8c4337ae
|
test barrier in distrib training
|
2019-06-18 22:41:28 +02:00 |
|
thomwolf
|
15ebd67d4e
|
cache in run_classifier + various fixes to the examples
|
2019-06-18 15:58:22 +02:00 |
|
thomwolf
|
d82e5deeb1
|
set find_unused_parameters=True in DDP
|
2019-06-18 12:13:14 +02:00 |
|
thomwolf
|
f964753090
|
explanation on the current location of the caching folder
|
2019-06-18 11:36:28 +02:00 |
|
thomwolf
|
382e2d1e50
|
splitting config and weight files for bert also
|
2019-06-18 10:37:16 +02:00 |
|
thomwolf
|
4447f270b2
|
updating hub
|
2019-06-17 16:21:28 +02:00 |
|
thomwolf
|
33d3db5c43
|
updating head masking, readme and docstrings
|
2019-06-17 15:51:28 +02:00 |
|
thomwolf
|
34858ae1d9
|
adding bert whole words, bertgerman and gpt-2 medium models, head masking
|
2019-06-17 11:02:39 +02:00 |
|
timoeller
|
16af9ff7b0
|
Add German Bert model to code, update readme
|
2019-06-14 17:42:46 +02:00 |
|
Colanim
|
1eba8b9d96
|
Fix link in README
|
2019-05-30 14:01:46 +09:00 |
|
lukovnikov
|
331a46ff04
|
- replaced OpenAIGPTAdam with OpenAIAdam in docs
|
2019-04-25 16:04:37 +02:00 |
|
lukovnikov
|
704037ad51
|
- updated docs for new LR API
- added some images for illustration
- updated comments in optimization
|
2019-04-25 15:59:39 +02:00 |
|
thomwolf
|
18a8a15f78
|
improving GPT2 tokenization and adding tests
|
2019-04-16 17:00:55 +02:00 |
|
thomwolf
|
1135f2384a
|
clean up logger in examples for distributed case
|
2019-04-15 15:22:40 +02:00 |
|
thomwolf
|
cc43307023
|
update readme
|
2019-04-15 15:06:10 +02:00 |
|
thomwolf
|
60ea6c59d2
|
added best practices for serialization in README and examples
|
2019-04-15 15:00:33 +02:00 |
|
thomwolf
|
20577d8a7c
|
add configuration serialization to readme
|
2019-04-15 14:21:41 +02:00 |
|
thomwolf
|
b17963d82f
|
update readme
|
2019-04-15 13:44:30 +02:00 |
|
Weixin Wang
|
f26ce6992e
|
Fix links in README
|
2019-04-02 17:20:32 +08:00 |
|
Sepehr Sameni
|
b588ff362a
|
fix lm_finetuning's link
|
2019-03-29 12:39:24 +04:30 |
|
Thomas Wolf
|
694e2117f3
|
Merge pull request #388 from ananyahjha93/master
Added remaining GLUE tasks to 'run_classifier.py'
|
2019-03-28 09:06:53 +01:00 |
|
Thomas Wolf
|
bbff03fbfc
|
Merge pull request #394 from desireevl/master
Minor change in README
|
2019-03-27 12:03:00 +01:00 |
|
thomwolf
|
34561e61a5
|
update main readme also
|
2019-03-27 12:00:04 +01:00 |
|
Ananya Harsh Jha
|
f471979167
|
added GLUE dev set results and details on how to run GLUE tasks
|
2019-03-21 15:38:30 -04:00 |
|
Desiree Vogt-Lee
|
d52f914e24
|
weigths to weights
|
2019-03-21 15:02:59 +10:00 |
|
Junjie Qian
|
d648a02203
|
Correct line number in README for classes
|
2019-03-08 16:28:03 -08:00 |
|
thomwolf
|
7cc35c3104
|
fix openai gpt example and updating readme
|
2019-03-06 11:43:21 +01:00 |
|
thomwolf
|
906b638efa
|
updating readme
|
2019-03-06 10:24:19 +01:00 |
|
John Hewitt
|
e14c6b52e3
|
add BertTokenizer flag to skip basic tokenization
|
2019-02-26 20:11:24 -08:00 |
|
Joel Grus
|
8722e9eb3b
|
finish updating docstrings
|
2019-02-23 06:31:59 -08:00 |
|
Stanislas Polu
|
ff22b3acc0
|
Few small nits in GPT-2's code examples
|
2019-02-21 09:15:27 +00:00 |
|
Tong Guo
|
09efcece75
|
Update README.md
|
2019-02-21 11:25:33 +08:00 |
|
Tony Lin
|
5b0e0b61f0
|
fix typo in readme
|
2019-02-19 20:34:18 +08:00 |
|
Davide Fiocco
|
0ae8eece55
|
Minor README typos corrected
|
2019-02-18 21:28:28 +01:00 |
|
sam-qordoba
|
1cb9c76ec5
|
Fix typo in GPT2Model code sample
Typo prevented code from running
|
2019-02-18 09:27:26 -08:00 |
|
Thomas Wolf
|
a25d056b7a
|
update readme
|
2019-02-18 15:30:11 +01:00 |
|
Thomas Wolf
|
517d7c8624
|
update readme
|
2019-02-18 14:39:55 +01:00 |
|
Thomas Wolf
|
ada22a1c9e
|
more details in GPT-2 usage example
|
2019-02-18 14:37:41 +01:00 |
|
Thomas Wolf
|
522733f6cb
|
readme typo fixes
|
2019-02-18 14:32:10 +01:00 |
|
thomwolf
|
d44db1145c
|
update readme
|
2019-02-18 11:12:09 +01:00 |
|
Thomas Wolf
|
0e774e57a6
|
Update readme
Adding details on how to extract a full list of hidden states for the Transformer-XL
|
2019-02-14 08:39:58 +01:00 |
|
Thomas Wolf
|
4e56da38d9
|
Merge pull request #268 from wangxiaodiu/master
fixed a minor bug in README.md
|
2019-02-13 10:19:25 +01:00 |
|
thomwolf
|
67376c02e2
|
update readme for tokenizers
|
2019-02-13 10:11:11 +01:00 |
|
Liang Niu
|
e1b3cfb504
|
fixed a minor bug in README.md
|
2019-02-12 15:54:23 +04:00 |
|
Thomas Wolf
|
3c33499f87
|
fix typo in readme
|
2019-02-12 10:22:54 +01:00 |
|
thomwolf
|
1e71f11dec
|
Release: 0.5.0
|
2019-02-11 14:16:27 +01:00 |
|
thomwolf
|
eebc8abbe2
|
clarify and unify model saving logic in examples
|
2019-02-11 14:04:19 +01:00 |
|
thomwolf
|
81c7e3ec9f
|
fix typo in readme
|
2019-02-11 13:37:12 +01:00 |
|
thomwolf
|
884ca81d87
|
transposing the inputs of Transformer-XL to have a unified interface
|
2019-02-11 13:19:59 +01:00 |
|
thomwolf
|
32fea876bb
|
add distant debugging to run_transfo_xl
|
2019-02-11 12:53:32 +01:00 |
|
thomwolf
|
b31ba23913
|
cuda on in the examples by default
|
2019-02-11 12:15:43 +01:00 |
|
thomwolf
|
2071a9b86e
|
fix python 2.7 imports
|
2019-02-11 10:35:36 +01:00 |
|
thomwolf
|
b514a60c36
|
added tests for OpenAI GPT and Transformer-XL tokenizers
|
2019-02-11 10:17:16 +01:00 |
|
thomwolf
|
9f9909ea2f
|
update readme
|
2019-02-09 16:59:21 +01:00 |
|
thomwolf
|
0c1a6f9b1d
|
update readme
|
2019-02-08 22:32:25 +01:00 |
|
thomwolf
|
009b581316
|
updated readme
|
2019-02-07 23:15:05 +01:00 |
|
thomwolf
|
f99f2fb661
|
docstrings
|
2019-02-07 17:07:22 +01:00 |
|
Thomas Wolf
|
848aae49e1
|
Merge branch 'master' into python_2
|
2019-02-06 00:13:20 +01:00 |
|
thomwolf
|
ba37ddc5ce
|
fix run_lm_modeling example command line
|
2019-02-06 00:07:08 +01:00 |
|
Girishkumar
|
0dd2b750ca
|
Minor update in README
Update links to classes in `modeling.py`
|
2019-01-30 23:49:15 +05:30 |
|
thomwolf
|
3a848111e6
|
update config, docstrings and readme to switch to separated tokens and position embeddings
|
2019-01-29 11:00:11 +01:00 |
|
Davide Fiocco
|
35115eaf93
|
(very) minor update to README
|
2019-01-16 21:05:24 +01:00 |
|
nhatchan
|
8edc898f63
|
Fix documentation (missing backslashes)
This PR adds missing backslashes in LM Fine-tuning subsection in README.md.
|
2019-01-13 21:23:19 +09:00 |
|
thomwolf
|
e5c78c6684
|
update readme and few typos
|
2019-01-10 01:40:00 +01:00 |
|
thomwolf
|
fa5222c296
|
update readme
|
2019-01-10 01:25:28 +01:00 |
|
Thomas Wolf
|
c18bdb4433
|
Merge pull request #124 from deepset-ai/master
Add example for fine tuning BERT language model
|
2019-01-07 12:03:51 +01:00 |
|
Julien Chaumond
|
8da280ebbe
|
Setup CI
|
2018-12-20 16:33:39 -05:00 |
|
tholor
|
e5fc98c542
|
add exemplary training data. update to nvidia apex. refactor 'item -> line in doc' mapping. add warning for unknown word.
|
2018-12-20 18:30:52 +01:00 |
|
tholor
|
67f4dd56a3
|
update readme for run_lm_finetuning
|
2018-12-19 09:22:37 +01:00 |
|
Julien Chaumond
|
d57763f582
|
Fix typos
|
2018-12-18 19:23:22 -05:00 |
|
Thomas Wolf
|
786cc41299
|
Typos in readme
|
2018-12-17 09:22:18 +01:00 |
|
Daniel Khashabi
|
8b1b93947f
|
Minor fix.
|
2018-12-14 14:10:36 -05:00 |
|
Thomas Wolf
|
8809eb6c93
|
update readme with information on NVIDIA's apex
|
2018-12-14 16:59:39 +01:00 |
|
thomwolf
|
d821358884
|
update readme
|
2018-12-14 15:15:17 +01:00 |
|
thomwolf
|
087798b7fa
|
fix reloading model for evaluation in examples
|
2018-12-13 14:48:12 +01:00 |
|
thomwolf
|
0f544625f4
|
fix swag example to work with apex
|
2018-12-13 13:35:59 +01:00 |
|
thomwolf
|
4946c2c500
|
run_swag example in readme
|
2018-12-13 13:02:07 +01:00 |
|
Thomas Wolf
|
91aab2a6d3
|
Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex
Change to use apex for better fp16 and multi-gpu support
|
2018-12-13 12:32:37 +01:00 |
|
Thomas Wolf
|
ffe9075f48
|
Merge pull request #96 from rodgzilla/multiple-choice-code
BertForMultipleChoice and Swag dataset example.
|
2018-12-13 12:05:11 +01:00 |
|
Grégory Châtel
|
dcb50eaa4b
|
Swag example readme section update with gradient accumulation run.
|
2018-12-12 18:17:46 +01:00 |
|
Deyu Fu
|
c8ea286048
|
change to apex for better fp16 and multi-gpu support
|
2018-12-11 17:13:58 -08:00 |
|
Thomas Wolf
|
a3a3180c86
|
Bump up requirements to Python 3.6
|
2018-12-11 11:29:45 +01:00 |
|
Grégory Châtel
|
0876b77f7f
|
Change to the README file to add SWAG results.
|
2018-12-10 15:34:19 +01:00 |
|
Davide Fiocco
|
c9f67e037c
|
Adding --do_lower_case for all uncased BERTs
I had missed those, it should make sense to use them
|
2018-12-07 20:40:56 +01:00 |
|
Grégory Châtel
|
150f3cd9fa
|
Few typos in README.md
|
2018-12-06 19:22:07 +01:00 |
|
Grégory Châtel
|
4fa7892d64
|
Wrong line number link to modeling file.
|
2018-12-06 19:18:29 +01:00 |
|
Grégory Châtel
|
6a26e19ea3
|
Updating README.md with SWAG example information.
|
2018-12-06 19:15:08 +01:00 |
|
Grégory Châtel
|
0a7c8bdcac
|
Fixing badly formatted links.
|
2018-12-04 13:43:56 +01:00 |
|
Grégory Châtel
|
3113e967db
|
Adding links to examples files.
|
2018-12-04 13:40:38 +01:00 |
|
Davide Fiocco
|
8a8aa59d8c
|
Update finetuning example adding --do_lower_case
Should be consistent with the fact that an uncased model is used
|
2018-12-01 01:00:05 +01:00 |
|
thomwolf
|
f9f3bdd60b
|
update readme
|
2018-11-30 23:05:18 +01:00 |
|
thomwolf
|
52ff0590ff
|
tup => tpu
|
2018-11-30 23:01:10 +01:00 |
|
thomwolf
|
296f006132
|
added BertForTokenClassification model
|
2018-11-30 13:56:53 +01:00 |
|
thomwolf
|
298107fed7
|
Added new bert models
|
2018-11-30 13:56:02 +01:00 |
|
Davide Fiocco
|
ec2c339b53
|
Updated quick-start example with BertForMaskedLM
As `convert_ids_to_tokens` returns a list, the code in the README currently throws an `AssertionError`, so I propose a quick fix.
|
2018-11-28 14:53:46 +01:00 |
|
thomwolf
|
05053d163c
|
update cache_dir in readme and examples
|
2018-11-26 10:45:13 +01:00 |
|
thomwolf
|
029bdc0d50
|
fixing readme examples
|
2018-11-26 09:56:41 +01:00 |
|
Thomas Wolf
|
60e01ac427
|
fix link in readme
|
2018-11-21 12:08:30 +01:00 |
|
Thomas Wolf
|
fd32ebed81
|
Merge pull request #42 from weiyumou/master
Fixed UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2
|
2018-11-20 10:09:50 +01:00 |
|
thomwolf
|
eed255a58d
|
fixing CLI typo in readme
|
2018-11-20 10:02:57 +01:00 |
|
weiyumou
|
9ff2b7d86d
|
Fixed README typo
|
2018-11-19 23:13:10 -05:00 |
|
Thomas Wolf
|
da73925f6a
|
fix typos
|
2018-11-19 20:58:48 +01:00 |
|
Joel Grus
|
dd56cfd89a
|
update pip package name
|
2018-11-19 09:50:34 -08:00 |
|
Thomas Wolf
|
956c917344
|
fix typos in readme
|
2018-11-17 23:25:23 +01:00 |
|
Thomas Wolf
|
7c91e51c26
|
update links in readme
|
2018-11-17 22:54:15 +01:00 |
|
Thomas Wolf
|
e113101702
|
fix typos in readme
|
2018-11-17 12:36:35 +01:00 |
|
thomwolf
|
47a7d4ec14
|
update examples from master
|
2018-11-17 12:21:35 +01:00 |
|
thomwolf
|
c8cba67742
|
clean up readme and examples
|
2018-11-17 12:19:16 +01:00 |
|
thomwolf
|
757750d6f6
|
fix tests
|
2018-11-17 11:58:14 +01:00 |
|
thomwolf
|
d0673c7dbd
|
fix links
|
2018-11-17 08:59:29 +01:00 |
|
thomwolf
|
68b937aa40
|
sub section overviews
|
2018-11-17 08:55:56 +01:00 |
|
thomwolf
|
c54d8b1847
|
fixing links in readme
|
2018-11-17 08:46:17 +01:00 |
|
thomwolf
|
f920eff8c3
|
update readme
|
2018-11-17 08:42:45 +01:00 |
|
thomwolf
|
886cb49792
|
updating readme and notebooks
|
2018-11-16 14:31:15 +01:00 |
|
thomwolf
|
1de35b624b
|
preparing for first release
|
2018-11-15 20:56:10 +01:00 |
|
Thomas Wolf
|
278fd28a32
|
added results for 16-bit fine-tuning in readme
|
2018-11-13 09:34:49 +01:00 |
|
thomwolf
|
d940eeda54
|
typo
|
2018-11-12 15:26:46 +01:00 |
|
thomwolf
|
1cf0a16c67
|
cleaning up readme
|
2018-11-12 15:24:47 +01:00 |
|
thomwolf
|
66b0090877
|
add fp16 training
|
2018-11-12 15:15:02 +01:00 |
|
Thomas Wolf
|
5dfd19060a
|
fix typo in readme
|
2018-11-12 12:39:57 +01:00 |
|
Thomas Wolf
|
fa1aa81f26
|
fix typo in readme bach examples
|
2018-11-12 08:37:43 +01:00 |
|
Thomas Wolf
|
6d6b916f48
|
update to BERT-large results
|
2018-11-11 17:00:49 +01:00 |
|
Thomas Wolf
|
c4bfc646f5
|
Add results of fine-tuning BERT-large on GPUs
|
2018-11-11 16:59:35 +01:00 |
|
thomwolf
|
ea85cca8ab
|
adding optimize_on_cpu explanation in readme
|
2018-11-09 11:42:37 +01:00 |
|
Thomas Wolf
|
0c24db9d5f
|
update results for SQuAD
|
2018-11-09 09:11:59 +01:00 |
|
thomwolf
|
2c5d993ba4
|
update readme - fix SQuAD model on multi-GPU
|
2018-11-08 21:22:22 +01:00 |
|
Gopal Krishna
|
4850ec5888
|
fixed small typos in the README.md (#8)
|
2018-11-08 15:00:02 -05:00 |
|
Thomas Wolf
|
3bfbc21376
|
updating pytest command
|
2018-11-08 00:44:17 +01:00 |
|
Thomas Wolf
|
0ed7696191
|
Updated MRPC results
|
2018-11-08 00:39:42 +01:00 |
|
Thomas Wolf
|
d92a7f7721
|
Removing note on run_squad.py example
|
2018-11-07 23:37:55 +01:00 |
|
Thomas Wolf
|
1a5bbd83dc
|
Updating run_squad information in readme
|
2018-11-06 08:53:01 +01:00 |
|
Thomas Wolf
|
79e1b95e75
|
fix link in readme
|
2018-11-06 08:38:02 +01:00 |
|
Knut Ole Sjøli
|
886f595c37
|
Fix typo in subheader (#4)
|
2018-11-05 18:34:18 -05:00 |
|
Thomas Wolf
|
59d4cc5f2b
|
typos
|
2018-11-05 22:47:24 +01:00 |
|
Thomas Wolf
|
d983eecdd3
|
more readme typo fixes
|
2018-11-05 21:29:04 +01:00 |
|
Thomas Wolf
|
8f91b4de91
|
more typo fixes
|
2018-11-05 21:24:14 +01:00 |
|
Thomas Wolf
|
7316b0d6d0
|
fix typo
|
2018-11-05 21:22:45 +01:00 |
|
Clement
|
d130cb5139
|
typos
|
2018-11-05 15:09:24 -05:00 |
|
Clement
|
2a8fee495b
|
typos
|
2018-11-05 15:04:06 -05:00 |
|
Clement
|
f968b11657
|
typo
|
2018-11-05 14:59:44 -05:00 |
|
thomwolf
|
88e793f31a
|
fix typos
|
2018-11-05 16:14:19 +01:00 |
|
thomwolf
|
3914eed505
|
update readme
|
2018-11-05 16:09:27 +01:00 |
|
thomwolf
|
7394eb47a5
|
update readme
|
2018-11-05 15:35:44 +01:00 |
|
thomwolf
|
6cc651778a
|
update readme
|
2018-11-04 21:26:03 +01:00 |
|
thomwolf
|
d6418c5ef3
|
tweaking the readme
|
2018-11-03 23:52:35 +01:00 |
|
thomwolf
|
3b70b270e0
|
update readme
|
2018-11-03 23:39:55 +01:00 |
|
thomwolf
|
f8276008df
|
update readme, file names, removing TF code, moving tests
|
2018-11-03 23:35:14 +01:00 |
|
VictorSanh
|
5889765a7c
|
Update README.md
|
2018-11-03 09:18:44 -04:00 |
|
VictorSanh
|
844b2f0e6f
|
Small update Readme
|
2018-11-02 08:57:15 -04:00 |
|
VictorSanh
|
72d69a4ef4
|
Update README
|
2018-11-02 03:37:39 -04:00 |
|
VictorSanh
|
bf65d4dbb7
|
Begin Updating the README.md
|
2018-11-02 02:51:07 -04:00 |
|
thomwolf
|
13ee61e4de
|
switch to full google code
|
2018-10-31 18:46:03 +01:00 |
|
thomwolf
|
12e013dbac
|
added wordpiece - updated readme
|
2018-10-30 23:09:09 +01:00 |
|
Thomas Wolf
|
43badf217d
|
Initial commit
|
2018-10-29 14:56:02 +01:00 |
|