Lysandre
|
ce9eade29c
|
Initializer range using BertPreTrainedModel
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
5680a11063
|
Activation function managed from the config file
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
1e5b31c388
|
Several fixes and improvements
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
ee20201d33
|
Tokenization tests + fixes + init
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
e3ea5d1d8d
|
Docstrings
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
fedac786d4
|
Tokenization + small fixes
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
67b422662c
|
Documentation + improved AlbertForMaskedLM
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
1b92564330
|
Reorganize and cleanup
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
12290c0d5c
|
Handles multi layer and multi groups
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
139affaa8d
|
Albert layer/layer groups
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
91ccbae788
|
Accepts multiple sizes
|
2019-11-26 13:08:12 -05:00 |
|
Lysandre
|
c0c2088333
|
ALBERT model
|
2019-11-26 13:08:12 -05:00 |
|
v_sboliu
|
8e5d84fcc1
|
Fixed typo
|
2019-11-26 09:01:32 -05:00 |
|
Lysandre
|
0669c1fcd1
|
SQuAD v2 BERT + XLNet
|
2019-11-25 19:22:21 -05:00 |
|
manansanghi
|
5d3b8daad2
|
Minor bug fixes on run_ner.py
|
2019-11-25 16:48:03 -05:00 |
|
İbrahim Ethem Demirci
|
aa92a184d2
|
resize model when special tokenizer present
|
2019-11-25 15:06:32 -05:00 |
|
Bilal Khan
|
07bf43074f
|
Fix GPT2 docstring
|
2019-11-25 11:32:00 -05:00 |
|
Evpok Padding
|
fa963ecc59
|
if→elif
|
2019-11-25 10:21:03 -05:00 |
|
Evpok Padding
|
c8eb8157b8
|
fix docstrings
|
2019-11-25 10:21:03 -05:00 |
|
Evpok Padding
|
99f750d64e
|
add Camembert models to modeling_auto
|
2019-11-25 10:21:03 -05:00 |
|
Lysandre
|
7485caefb0
|
fix #1894
|
2019-11-25 09:33:39 -05:00 |
|
Julien Chaumond
|
afaa335851
|
[doc] Fix assets urls
|
2019-11-23 11:34:45 -05:00 |
|
Julien Chaumond
|
176cd1ce1b
|
[doc] homogenize instructions slightly
|
2019-11-23 11:18:54 -05:00 |
|
Nikolay Korolev
|
041a901f32
|
Fix typo in documentation. toto -> to
|
2019-11-23 10:55:16 -05:00 |
|
Lysandre
|
e0e55bc550
|
Manage training example & refactor the refactor
|
2019-11-22 16:27:45 -05:00 |
|
Lysandre
|
c3ba645237
|
Works for XLNet
|
2019-11-22 16:27:37 -05:00 |
|
LysandreJik
|
a5a8a6175f
|
Works for BERT
|
2019-11-22 16:27:31 -05:00 |
|
LysandreJik
|
a7dafe2f41
|
Padding strategy (left and right) rather than boolean flag
|
2019-11-22 16:27:25 -05:00 |
|
LysandreJik
|
9f374c8252
|
encode and encode_plus handle attention masks and padding
|
2019-11-22 16:27:15 -05:00 |
|
Lysandre
|
72e506b22e
|
wip
|
2019-11-22 16:26:00 -05:00 |
|
Lysandre
|
ea52f82455
|
Moved some SQuAD logic to /data
|
2019-11-22 16:25:52 -05:00 |
|
Rémi Louf
|
26db31e0c0
|
update the documentation
|
2019-11-21 14:41:19 -05:00 |
|
Rémi Louf
|
6f70bb8c69
|
add instructions to run the examples
|
2019-11-21 14:41:19 -05:00 |
|
Juha Kiili
|
05d4232f63
|
Add valohai.yaml
|
2019-11-21 12:38:17 +02:00 |
|
Aarni Koskela
|
aac3551407
|
Add download_glue_data.py from kamalkraj/ALBERT-TF2.0
Original source: fa90194e5f/download_glue_data.py
Original license: fa90194e5f/LICENSE (Apache-2.0)
|
2019-11-21 12:37:41 +02:00 |
|
Juha Kiili
|
2cf3447e0a
|
Glue: log in Valohai-compatible JSON format too
|
2019-11-21 12:35:25 +02:00 |
|
Thomas Wolf
|
0cdfcca24b
|
Merge pull request #1860 from stefan-it/camembert-for-token-classification
[WIP] Add support for CamembertForTokenClassification
|
2019-11-21 10:56:07 +01:00 |
|
Jin Young Sohn
|
e70cdf083d
|
Cleanup TPU bits from run_glue.py
TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py.
We plan to upstream this directly into `huggingface/transformers`
(either `master` or `tpu`) branch once it's been more thoroughly tested.
|
2019-11-20 17:54:34 -05:00 |
|
Lysandre
|
454455c695
|
fix #1879
|
2019-11-20 09:42:48 -05:00 |
|
Lysandre
|
3de31f8d28
|
mean does not exist in TF2
|
2019-11-19 18:14:14 -05:00 |
|
Stefan Schweter
|
da06afafc8
|
tree-wide: add trailing comma in configuration maps
|
2019-11-19 21:57:00 +01:00 |
|
Stefan Schweter
|
2e2c0375c3
|
distilbert: add German distilbert model to positional embedding sizes map
|
2019-11-19 20:41:18 +01:00 |
|
Stefan Schweter
|
e7cf2ccd15
|
distillation: add German distilbert model
|
2019-11-19 19:55:19 +01:00 |
|
Stefan Schweter
|
e631383d4f
|
docs: add new German distilbert model to pretrained models
|
2019-11-19 19:52:40 +01:00 |
|
Stefan Schweter
|
f21dfe36ba
|
distilbert: add vocab for new German distilbert model
|
2019-11-19 19:51:31 +01:00 |
|
Stefan Schweter
|
22333945fb
|
distilbert: add pytorch model for new German distilbert model
|
2019-11-19 19:51:01 +01:00 |
|
Stefan Schweter
|
337802783f
|
distilbert: add configuration for new German distilbert model
|
2019-11-19 19:50:32 +01:00 |
|
alexzubiaga
|
4193aa9f81
|
add TFXLNetForTokenClassification implementation and unit test
add XLNetForTokenClassification implementation and unit tests
|
2019-11-19 12:47:54 +01:00 |
|
Kazutoshi Shinoda
|
f3386d9383
|
typo "deay" -> "decay"
|
2019-11-18 11:50:06 -05:00 |
|
Stefan Schweter
|
56c84863a1
|
camembert: add support for CamemBERT in run_ner example
|
2019-11-18 17:06:57 +01:00 |
|