transformers/docs/source/model_doc
Lysandre Debut d5d7d88612
ELECTRA (#3257)
* Electra wip

* helpers

* Electra wip

* Electra v1

* ELECTRA may be saved/loaded

* Generator & Discriminator

* Embedding size instead of halving the hidden size

* ELECTRA Tokenizer

* Revert BERT helpers

* ELECTRA Conversion script

* Archive maps

* PyTorch tests

* Start fixing tests

* Tests pass

* Same configuration for both models

* Compatible with base + large

* Simplification + weight tying

* Archives

* Auto + Renaming to standard names

* ELECTRA is uncased

* Tests

* Slight API changes

* Update tests

* wip

* ElectraForTokenClassification

* temp

* Simpler arch + tests

Removed ElectraForPreTraining which will be in a script

* Conversion script

* Auto model

* Update links to S3

* Split ElectraForPreTraining and ElectraForTokenClassification

* Actually test PreTraining model

* Remove num_labels from configuration

* wip

* wip

* From discriminator and generator to electra

* Slight API changes

* Better naming

* TensorFlow ELECTRA tests

* Accurate conversion script

* Added to conversion script

* Fast ELECTRA tokenizer

* Style

* Add ELECTRA to README

* Modeling Pytorch Doc + Real style

* TF Docs

* Docs

* Correct links

* Correct model intialized

* random fixes

* style

* Addressing Patrick's and Sam's comments

* Correct links in docs
2020-04-03 14:10:54 -04:00
..
albert.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
auto.rst adding in the doc 2020-01-27 14:27:07 -05:00
bart.rst Rename BartForMaskedLM -> BartForConditionalGeneration (#3114) 2020-03-05 17:41:18 -05:00
bert.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
camembert.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
ctrl.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
distilbert.rst Tips + whitespaces 2020-01-23 09:38:45 -05:00
electra.rst ELECTRA (#3257) 2020-04-03 14:10:54 -04:00
flaubert.rst FlauBERT documentation 2020-01-30 10:04:18 -05:00
gpt.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
gpt2.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
roberta.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
t5.rst [T5] Add training documenation (#3507) 2020-03-30 13:35:53 +02:00
transformerxl.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
xlm.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
xlmroberta.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00
xlnet.rst Documentation (#2989) 2020-02-25 18:43:36 -05:00