transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-08 07:10:06 +06:00

Patrick von Platen 640e6fe190 [Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054)		2020-12-16 13:03:32 +01:00

* save intermediate
* save intermediate
* save intermediate
* correct flax bert model file
* new module / model naming
* make style
* almost finish BERT
* finish roberta
* make fix-copies
* delete keys file
* last refactor
* fixes in run_mlm_flax.py
* remove pooled from run_mlm_flax.py
* fix gelu | gelu_new
* remove Module from inits
* splits
* dirty print
* preventing warmup_steps == 0
* smaller splits
* make fix-copies
* dirty print
* dirty print
* initial_evaluation argument
* declaration order fix
* proper model initialization/loading
* proper initialization
* run_mlm_flax improvements: improper model inputs bugfix + automatic dataset splitting + tokenizers parallelism warning + avoiding warmup_steps=0 bug
* removed tokenizers warning hack, fixed model re-initialization
* reverted training_args.py changes
* fix flax from pretrained
* improve test in flax
* apply sylvains tips
* update init
* make 0.3.0 compatible
* revert tevens changes
* revert tevens changes 2
* finalize revert
* fix bug
* add docs
* add pretrained to init
* Update src/transformers/modeling_flax_utils.py
* fix copies
* final improvements

Co-authored-by: TevenLeScao <teven.lescao@gmail.com>

_static	Fix link to stable version in the doc navbar (#9007)	2020-12-09 09:11:39 -05:00
imgs	Guide to fixed-length model perplexity evaluation (#5449)	2020-07-07 16:04:15 -06:00
internal	Fix T5 and BART for TF (#9063)	2020-12-14 18:47:00 +01:00
main_classes	[Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054)	2020-12-16 13:03:32 +01:00
model_doc	[WIP] Tapas v4 (tres) (#9117)	2020-12-15 17:08:49 -05:00
benchmarks.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
bertology.rst	Fixed a broken link in documentation (#9101)	2020-12-14 09:12:27 -05:00
conf.py	Copyright (#8970)	2020-12-07 18:36:34 -05:00
contributing.md	Update installation page and add contributing to the doc (#5084)	2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
custom_datasets.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
examples.md	per_device instead of per_gpu/error thrown when argument unknown (#4618)	2020-05-27 11:36:55 -04:00
favicon.ico	Adding usage examples for common tasks (#2850)	2020-02-25 13:48:24 -05:00
glossary.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
index.rst	[WIP] Tapas v4 (tres) (#9117)	2020-12-15 17:08:49 -05:00
installation.md	Copyright (#8970)	2020-12-07 18:36:34 -05:00
migration.md	Copyright (#8970)	2020-12-07 18:36:34 -05:00
model_sharing.rst	[model_cards] Migrate cards from this repo to model repos on huggingface.co (#9013)	2020-12-11 18:24:42 -05:00
model_summary.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
multilingual.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
notebooks.md	Update notebooks (#3620)	2020-04-06 14:32:39 -04:00
perplexity.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
philosophy.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
preprocessing.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
pretrained_models.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
quicktour.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
serialization.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
task_summary.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
testing.rst	Testing Experimental CI Features (#9070)	2020-12-14 10:34:59 -05:00
tokenizer_summary.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
training.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00