Julien Chaumond | bdfe21ab24 | Change param order for consistency | 2019-11-26 13:08:12 -05:00
LysandreJik | c536c2a480 | ALBERT Input Embeds | 2019-11-26 13:08:12 -05:00
LysandreJik | f873b55e43 | Warning for ALBERT-v2 models | 2019-11-26 13:08:12 -05:00
Lysandre | c9cb7f8a0f | Torch 1.1.0 compatibility + FP16 O1 + TF checkpoints; Co-authored-by: wassname | 2019-11-26 13:08:12 -05:00
Lysandre | b18509c208 | Tests for ALBERT in TF2 + fixes | 2019-11-26 13:08:12 -05:00
Lysandre | 7bddbf5961 | TFAlbertForSequenceClassification | 2019-11-26 13:08:12 -05:00
Lysandre | f6f382532b | ALBERT in TF2 | 2019-11-26 13:08:12 -05:00
Lysandre | d9daad98c7 | Re-ordering of group_idx/layer_idx + Python 2 tests | 2019-11-26 13:08:12 -05:00
Lysandre | 9d5c49546f | Tests for AlbertForQuestionAnswering AlbertForSequenceClassification | 2019-11-26 13:08:12 -05:00
Lysandre | 16263f9685 | Headmasking | 2019-11-26 13:08:12 -05:00
Lysandre | abb23a78ba | Head pruning for ALBERT | 2019-11-26 13:08:12 -05:00
Lysandre | 4374eaea78 | ALBERT for SQuAD | 2019-11-26 13:08:12 -05:00
Lysandre | 70d99980de | ALBERT-V2 | 2019-11-26 13:08:12 -05:00
Lysandre | c110c41fdb | Run GLUE and remove LAMB | 2019-11-26 13:08:12 -05:00
Lysandre | 6637a77f80 | AlbertForSequenceClassification | 2019-11-26 13:08:12 -05:00
Lysandre | 0d07a23c04 | LAMB implementation | 2019-11-26 13:08:12 -05:00
Lysandre | c987545592 | Converting script | 2019-11-26 13:08:12 -05:00
Lysandre | 4f3a54bfc8 | ALBERT can load pre-trained models. Doesn't inherit from BERT anymore. | 2019-11-26 13:08:12 -05:00
Lysandre | c4403006b8 | External MLM head | 2019-11-26 13:08:12 -05:00
Lysandre | b21402fc86 | Python 2 tests + licence | 2019-11-26 13:08:12 -05:00
Lysandre | c14a22272f | ALBERT passes all tests | 2019-11-26 13:08:12 -05:00
Lysandre | 870320a24e | Early tests | 2019-11-26 13:08:12 -05:00
Lysandre | 25a31953e8 | Output Attentions + output hidden states | 2019-11-26 13:08:12 -05:00
Lysandre | ce9eade29c | Initializer range using BertPreTrainedModel | 2019-11-26 13:08:12 -05:00
Lysandre | 5680a11063 | Activation function managed from the config file | 2019-11-26 13:08:12 -05:00
Lysandre | 1e5b31c388 | Several fixes and improvements | 2019-11-26 13:08:12 -05:00
Lysandre | ee20201d33 | Tokenization tests + fixes + init | 2019-11-26 13:08:12 -05:00
Lysandre | e3ea5d1d8d | Docstrings | 2019-11-26 13:08:12 -05:00
Lysandre | fedac786d4 | Tokenization + small fixes | 2019-11-26 13:08:12 -05:00
Lysandre | 67b422662c | Documentation + improved AlbertForMaskedLM | 2019-11-26 13:08:12 -05:00
Lysandre | 1b92564330 | Reorganize and cleanup | 2019-11-26 13:08:12 -05:00
Lysandre | 12290c0d5c | Handles multi layer and multi groups | 2019-11-26 13:08:12 -05:00
Lysandre | 139affaa8d | Albert layer/layer groups | 2019-11-26 13:08:12 -05:00
Lysandre | 91ccbae788 | Accepts multiple sizes | 2019-11-26 13:08:12 -05:00
Lysandre | c0c2088333 | ALBERT model | 2019-11-26 13:08:12 -05:00
v_sboliu | 8e5d84fcc1 | Fixed typo | 2019-11-26 09:01:32 -05:00
Lysandre | 0669c1fcd1 | SQuAD v2 BERT + XLNet | 2019-11-25 19:22:21 -05:00
manansanghi | 5d3b8daad2 | Minor bug fixes on run_ner.py | 2019-11-25 16:48:03 -05:00
İbrahim Ethem Demirci | aa92a184d2 | resize model when special tokenizer present | 2019-11-25 15:06:32 -05:00
Bilal Khan | 07bf43074f | Fix GPT2 docstring | 2019-11-25 11:32:00 -05:00
Evpok Padding | fa963ecc59 | if→elif | 2019-11-25 10:21:03 -05:00
Evpok Padding | c8eb8157b8 | fix docstrings | 2019-11-25 10:21:03 -05:00
Evpok Padding | 99f750d64e | add Camembert models to modeling_auto | 2019-11-25 10:21:03 -05:00
Lysandre | 7485caefb0 | fix #1894 | 2019-11-25 09:33:39 -05:00
Julien Chaumond | afaa335851 | [doc] Fix assets urls | 2019-11-23 11:34:45 -05:00
Julien Chaumond | 176cd1ce1b | [doc] homogenize instructions slightly | 2019-11-23 11:18:54 -05:00
Nikolay Korolev | 041a901f32 | Fix typo in documentation. toto -> to | 2019-11-23 10:55:16 -05:00
Lysandre | e0e55bc550 | Manage training example & refactor the refactor | 2019-11-22 16:27:45 -05:00
Lysandre | c3ba645237 | Works for XLNet | 2019-11-22 16:27:37 -05:00
LysandreJik | a5a8a6175f | Works for BERT | 2019-11-22 16:27:31 -05:00