Bilal Khan
5ce8d29abe
Change tensorboard imports to use built-in tensorboard if available
2019-10-08 16:29:43 -05:00
Julien Chaumond
d688af19e5
Update link to swift-coreml-transformers
...
cc @lysandrejik
2019-10-08 16:37:52 -04:00
thomwolf
45dc04f33d
tf model [WIP]
2019-10-08 17:37:17 +02:00
thomwolf
248314772f
fix tokenization
2019-10-08 17:19:28 +02:00
thomwolf
03c2c762a6
update tokenizer
2019-10-08 17:12:03 +02:00
thomwolf
3edfa1d6aa
update model to use past
2019-10-08 17:11:58 +02:00
Rémi Louf
f4d41fe33e
Merge pull request #1448 from huggingface/contributing
...
add contribution guidelines
2019-10-08 16:55:34 +02:00
Rémi Louf
45de313a9e
add bullet point on modifying an existing PR
2019-10-08 11:54:10 +02:00
Rémi Louf
ade05b6cef
add code contribution
2019-10-07 23:20:25 +02:00
Rémi Louf
e9c09052a4
add issues and requests guidelines
2019-10-07 22:30:55 +02:00
LysandreJik
8fcc6507ce
Multilingual
2019-10-07 15:02:42 -04:00
Rémi Louf
6e3e1c959e
Merge pull request #1447 from huggingface/dev-requirements
...
Provide requirements.txt for development dependencies
2019-10-07 18:49:26 +02:00
VictorSanh
7ce83b4931
update weights for distilgpt2
2019-10-07 12:30:27 -04:00
VictorSanh
9f81f1cba8
fix convert pt_to_tf2 for custom weights
2019-10-07 12:30:19 -04:00
Rémi Louf
7afd00a661
freeze dev requirements
2019-10-07 17:58:13 +02:00
thomwolf
bd5363cc83
update CTRL configuration
2019-10-07 15:37:30 +02:00
thomwolf
dc89441167
update CTRL pytorch model
2019-10-07 15:37:25 +02:00
thomwolf
320b7a7e01
fix #1416
2019-10-07 14:26:59 +02:00
Thomas Wolf
1615360c71
Merge pull request #1438 from SeanBE/master
...
fix pytorch-transformers migration description in README
2019-10-07 05:02:23 -04:00
seanBE
6dc6c716c5
fix pytorch-transformers migration description in README
2019-10-07 09:59:54 +01:00
Christopher Goh
904158ac4d
Rephrase forward method to reduce ambiguity
2019-10-06 23:40:52 -04:00
Christopher Goh
0f65d8cbbe
Fix some typos in README
2019-10-06 23:40:52 -04:00
LysandreJik
f3e0218fbb
Correct device assignment in run_generation
2019-10-05 21:05:16 -04:00
thomwolf
78ef1a9930
fixes
2019-10-04 17:59:44 -04:00
thomwolf
6c1d0bc066
update encode_plus - add truncation strategies
2019-10-04 17:38:38 -04:00
VictorSanh
0820bb0555
unecessary carriage return
2019-10-04 17:23:15 -04:00
VictorSanh
f5891c3821
run_squad --> run_squad_w_distillation
2019-10-04 17:23:15 -04:00
VictorSanh
764a7923ec
add distillation+finetuning option in run_squad
2019-10-04 17:23:15 -04:00
Lysandre Debut
bb464289ce
New model addition issue template
2019-10-04 16:41:26 -04:00
thomwolf
92c0f2fb90
Merge remote-tracking branch 'origin/julien_multiple-choice' into encoding-qol
2019-10-04 15:48:06 -04:00
Julien Chaumond
9e136ff57c
Honor args.overwrite_cache (h/t @erenup)
2019-10-04 15:00:56 -04:00
LysandreJik
7bddb45a6f
Decode documentaton
2019-10-04 14:27:38 -04:00
keskarnitish
dbed1c5d94
Adding CTRL (squashed commit)
...
adding conversion script
adding first draft of modeling & tokenization
adding placeholder for test files
bunch of changes
registering the tokenizer/model/etc
tests
change link; something is very VERY wrong here
weird end-of-word thingy going on
i think the tokenization works now ; wrote the unit tests
overall structure works;load w next
the monster is alive!
works after some cleanup as well
adding emacs autosave to gitignore
currently only supporting the 48 layer one; seems to infer fine on my macbook
cleanup
fixing some documentation
fixing some documentation
tests passing?
now works on CUDA also
adding greedy?
adding greedy sampling
works well
2019-10-03 22:29:03 -07:00
Thomas Wolf
b3cfd97946
Merge pull request #1373 from TimYagan/fix-css
...
Fixed critical css font-family issues
2019-10-03 19:04:02 -04:00
Lysandre Debut
81a1e12469
Merge pull request #1313 from enzoampil/master
...
Add option to use a 'stop token'
2019-10-03 22:43:57 +00:00
Lysandre Debut
d3f24dfad7
Merge branch 'master' into master
2019-10-03 22:43:09 +00:00
LysandreJik
ecc4f1bdfa
XLM use_lang_embedding flag in run_generation
2019-10-03 17:42:16 -04:00
LysandreJik
c2c2ca0fdb
Added XLM to run_generation, with prompt language selection.
2019-10-03 17:18:48 -04:00
Thomas Wolf
1569610f2d
Merge pull request #1296 from danai-antoniou/add-duplicate-tokens-error
...
Added ValueError for duplicates in list of added tokens
2019-10-03 17:06:17 -04:00
drc10723
e1b2949ae6
DistillBert Documentation Code Example fixes
2019-10-03 15:51:33 -04:00
Simon Layton
899883644f
Fix test fails and warnings
...
Attention output was in bnij ordering instead of ijbn which everything
else will expect. This was an oversight on my part, and keeps the
attention inputs/outputs identical to the original code.
Also moved back from tensor slicing to index_select in rel_shift_bnij to
make the tracer happy.
2019-10-03 12:05:15 -04:00
VictorSanh
e2ae9c0b73
fix links in doc index
2019-10-03 11:42:21 -04:00
LysandreJik
aebd83230f
Update naming + remove f string in run_lm_finetuning example
2019-10-03 11:31:36 -04:00
LysandreJik
651bfb7ad5
always_truncate by default
2019-10-03 11:31:36 -04:00
LysandreJik
5ed50a93fb
LM finetuning won't mask special tokens anymore
2019-10-03 11:31:36 -04:00
LysandreJik
cc412edd42
Supports already existing special tokens
2019-10-03 11:31:36 -04:00
LysandreJik
2f259b228e
Sequence IDS
2019-10-03 11:31:36 -04:00
LysandreJik
7c789c337d
Always truncate argument in the encode method
2019-10-03 11:31:36 -04:00
Brian Ma
7af0777910
Update run_glue.py
...
add DistilBert model shortcut into ALL_MODELS
2019-10-03 15:31:11 +00:00
VictorSanh
c1689ac301
fix name
2019-10-03 10:56:39 -04:00