transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-15 02:28:24 +06:00

History

Lysandre Debut d5d7d88612 ELECTRA (#3257 ) * Electra wip * helpers * Electra wip * Electra v1 * ELECTRA may be saved/loaded * Generator & Discriminator * Embedding size instead of halving the hidden size * ELECTRA Tokenizer * Revert BERT helpers * ELECTRA Conversion script * Archive maps * PyTorch tests * Start fixing tests * Tests pass * Same configuration for both models * Compatible with base + large * Simplification + weight tying * Archives * Auto + Renaming to standard names * ELECTRA is uncased * Tests * Slight API changes * Update tests * wip * ElectraForTokenClassification * temp * Simpler arch + tests Removed ElectraForPreTraining which will be in a script * Conversion script * Auto model * Update links to S3 * Split ElectraForPreTraining and ElectraForTokenClassification * Actually test PreTraining model * Remove num_labels from configuration * wip * wip * From discriminator and generator to electra * Slight API changes * Better naming * TensorFlow ELECTRA tests * Accurate conversion script * Added to conversion script * Fast ELECTRA tokenizer * Style * Add ELECTRA to README * Modeling Pytorch Doc + Real style * TF Docs * Docs * Correct links * Correct model intialized * random fixes * style * Addressing Patrick's and Sam's comments * Correct links in docs		2020-04-03 14:10:54 -04:00
..
albert.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
auto.rst	adding in the doc	2020-01-27 14:27:07 -05:00
bart.rst	Rename BartForMaskedLM -> BartForConditionalGeneration (#3114 )	2020-03-05 17:41:18 -05:00
bert.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
camembert.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
ctrl.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
distilbert.rst	Tips + whitespaces	2020-01-23 09:38:45 -05:00
electra.rst	ELECTRA (#3257 )	2020-04-03 14:10:54 -04:00
flaubert.rst	FlauBERT documentation	2020-01-30 10:04:18 -05:00
gpt.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
gpt2.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
roberta.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
t5.rst	[T5] Add training documenation (#3507 )	2020-03-30 13:35:53 +02:00
transformerxl.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
xlm.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
xlmroberta.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00
xlnet.rst	Documentation (#2989 )	2020-02-25 18:43:36 -05:00