transformers/docs/source
Patrick von Platen 640e6fe190
[Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054)
* save intermediate

* save intermediate

* save intermediate

* correct flax bert model file

* new module / model naming

* make style

* almost finish BERT

* finish roberta

* make fix-copies

* delete keys file

* last refactor

* fixes in run_mlm_flax.py

* remove pooled from run_mlm_flax.py`

* fix gelu | gelu_new

* remove Module from inits

* splits

* dirty print

* preventing warmup_steps == 0

* smaller splits

* make fix-copies

* dirty print

* dirty print

* initial_evaluation argument

* declaration order fix

* proper model initialization/loading

* proper initialization

* run_mlm_flax improvements: improper model inputs bugfix + automatic dataset splitting + tokenizers parallelism warning + avoiding warmup_steps=0 bug

* removed tokenizers warning hack, fixed model re-initialization

* reverted training_args.py changes

* fix flax from pretrained

* improve test in flax

* apply sylvains tips

* update init

* make 0.3.0 compatible

* revert tevens changes

* revert tevens changes 2

* finalize revert

* fix bug

* add docs

* add pretrained to init

* Update src/transformers/modeling_flax_utils.py

* fix copies

* final improvements

Co-authored-by: TevenLeScao <teven.lescao@gmail.com>
2020-12-16 13:03:32 +01:00
..
_static Fix link to stable version in the doc navbar (#9007) 2020-12-09 09:11:39 -05:00
imgs Guide to fixed-length model perplexity evaluation (#5449) 2020-07-07 16:04:15 -06:00
internal Fix T5 and BART for TF (#9063) 2020-12-14 18:47:00 +01:00
main_classes [Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054) 2020-12-16 13:03:32 +01:00
model_doc [WIP] Tapas v4 (tres) (#9117) 2020-12-15 17:08:49 -05:00
benchmarks.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
bertology.rst Fixed a broken link in documentation (#9101) 2020-12-14 09:12:27 -05:00
conf.py Copyright (#8970) 2020-12-07 18:36:34 -05:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
custom_datasets.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
favicon.ico Adding usage examples for common tasks (#2850) 2020-02-25 13:48:24 -05:00
glossary.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
index.rst [WIP] Tapas v4 (tres) (#9117) 2020-12-15 17:08:49 -05:00
installation.md Copyright (#8970) 2020-12-07 18:36:34 -05:00
migration.md Copyright (#8970) 2020-12-07 18:36:34 -05:00
model_sharing.rst [model_cards] Migrate cards from this repo to model repos on huggingface.co (#9013) 2020-12-11 18:24:42 -05:00
model_summary.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
multilingual.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
perplexity.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
philosophy.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
preprocessing.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
pretrained_models.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
quicktour.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
serialization.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
task_summary.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
testing.rst Testing Experimental CI Features (#9070) 2020-12-14 10:34:59 -05:00
tokenizer_summary.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
training.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00