transformers/docs/source/model_doc
Anton Lozhkov d472bd7b18
Wav2Vec2 Pretraining (#11306)
* Working quantizer forward

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Remove custom outputs from the shared ones

* correct conversion

* correct bug

* add first pretrain script

* save intermediate

* static shapes

* save intermediate

* finish first pretrain script version

* more refactor

* remove wanddb

* refactor more

* improve test

* correct perplexity compute bug

* finish model implementation

* add to docs

* finish docs

* finish pretraining script

* finish pretraining script

* remove wandb

* finish PR for merge

* finish config

* finish

* make deepspeed work

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

* fix flaky test

Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-06-09 18:40:56 +01:00
..
albert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
auto.rst FlaxGPT2 (#11556) 2021-05-18 22:50:51 +01:00
bart.rst [docs] fix xref to PreTrainedModel.generate (#11049) 2021-06-02 09:21:05 -07:00
barthez.rst Examples reorg (#11350) 2021-04-21 11:11:20 -04:00
bert_japanese.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
bert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
bertgeneration.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
bertweet.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
bigbird_pegasus.rst Add BigBirdPegasus (#10991) 2021-05-07 09:27:43 +02:00
bigbird.rst Big Bird Fast Tokenizer implementation (#11075) 2021-05-10 03:01:23 -04:00
blenderbot_small.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
blenderbot.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
bort.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
byt5.rst ByT5 model (#11971) 2021-06-01 19:07:37 +01:00
camembert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
clip.rst Add FlaxCLIP (#11883) 2021-06-01 09:44:31 +05:30
convbert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
cpm.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
ctrl.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
deberta_v2.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
deberta.rst Implement Fast Tokenization for Deberta (#11387) 2021-04-30 08:08:15 -04:00
deit.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
detr.rst Add DETR (#11653) 2021-06-09 11:51:13 -04:00
dialogpt.rst ADD BORT (#9813) 2021-01-27 21:25:11 +03:00
distilbert.rst Examples reorg (#11350) 2021-04-21 11:11:20 -04:00
dpr.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
electra.rst [Flax] Add Electra models (#11426) 2021-05-04 20:56:09 +02:00
encoderdecoder.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
flaubert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
fsmt.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
funnel.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
gpt_neo.rst Added Sequence Classification class in GPTNeo (#11906) 2021-05-28 06:27:02 -04:00
gpt.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
gpt2.rst FlaxGPT2 (#11556) 2021-05-18 22:50:51 +01:00
herbert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
ibert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
layoutlm.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
led.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
longformer.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
luke.rst Add LUKE (#11223) 2021-05-03 09:07:29 -04:00
lxmert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
m2m_100.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
marian.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
mbart.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
megatron_bert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
megatron_gpt2.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
mobilebert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
mpnet.rst MPNet copyright files (#9015) 2020-12-10 09:29:38 -05:00
mt5.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
pegasus.rst Typo in usage example, changed to device instead of torch_device (#11979) 2021-06-01 14:58:49 -04:00
phobert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
prophetnet.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
rag.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
reformer.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
retribert.rst Examples reorg (#11350) 2021-04-21 11:11:20 -04:00
roberta.rst [FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) 2021-05-04 19:57:59 +02:00
roformer.rst Add new model RoFormer (use rotary position embedding ) (#11684) 2021-05-20 08:00:34 -04:00
speech_to_text.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
squeezebert.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
t5.rst [docs] fix xref to PreTrainedModel.generate (#11049) 2021-06-02 09:21:05 -07:00
tapas.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
transformerxl.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
visual_bert.rst VisualBERT (#10534) 2021-06-02 18:13:08 +05:30
vit.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
wav2vec2.rst Wav2Vec2 Pretraining (#11306) 2021-06-09 18:40:56 +01:00
xlm.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
xlmprophetnet.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
xlmroberta.rst Honor contributors to models (#11329) 2021-04-21 09:47:27 -04:00
xlnet.rst Examples reorg (#11350) 2021-04-21 11:11:20 -04:00
xlsr_wav2vec2.rst [XLSR-Wav2Vec2] Add multi-lingual Wav2Vec2 models (#10648) 2021-03-11 17:44:18 +03:00