transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 22:00:09 +06:00

Author	SHA1	Message	Date
Suraj Patil	85a4bda4f4	bump flax version (#14343 )	2021-11-09 22:15:22 +05:30
Suraj Patil	7db2a79b38	[examples/flax] use Repository API for push_to_hub (#13672 ) * use Repository for push_to_hub * update readme * update other flax scripts * update readme * update qa example * fix push_to_hub call * fix typo * fix more typos * update readme * use abosolute path to get repo name * fix glue script	2021-09-30 16:38:07 +05:30
Chungman Lee	75b8990d90	fix typo in example/text-classification README (#12974 ) * fix typo in example/text-classification README * add space to align the table	2021-08-02 12:58:43 +02:00
Patrick von Platen	2d70c91206	[Flax] Adapt flax examples to include `push_to_hub` (#12391 ) * fix_torch_device_generate_test * remove @ * finish * correct summary writer * correct push to hub * fix indent * finish * finish * finish * finish * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-28 19:23:35 +01:00
Stas Bekman	4a872caef4	remove extra white space from log format (#12360 )	2021-06-25 13:20:14 -07:00
Avital Oliver	9b393240a2	Use a released version of optax rather than installing from Git. (#12173 ) Use a released version of optax rather than installing from Git	2021-06-15 16:42:51 +05:30
Patrick von Platen	16c0efca2c	Add mlm pretraining xla torch readme (#12011 ) * fix_torch_device_generate_test * remove @ * upload * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * Update examples/flax/language-modeling/README.md * add more info * finish * fix Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-14 10:31:21 +01:00
Nicholas Vadivelu	4674061b2a	Fix weight decay masking in `run_flax_glue.py` (#11964 ) * Fix weight decay masking in `run_flax_glue.py` Issues with the previous implementation: - The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods. - `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped. - Flax's LayerNorm calls the scale parameter `scale` not `weight` * Fix formatting with black * adapt results Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-03 11:35:26 +01:00
Nicholas Vadivelu	1ab147d648	Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920 ) * Remove redundant `nn.log_softmax` in `run_flax_glue.py` `optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference. * Remove unused 'flax.linen' import	2021-05-31 15:29:04 +01:00
Patrick von Platen	82335185fe	[Flax] Small fixes in `run_flax_glue.py` (#11820 ) * fix_torch_device_generate_test * remove @ * correct best seed for flax fine-tuning Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-21 16:52:23 +01:00
Patrick von Platen	bd9871657b	[Flax] Align GLUE training script with mlm training script (#11778 ) * speed up flax glue * remove unnecessary line * remove folder * remove run in loop Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-21 09:36:56 +01:00
Patrick von Platen	00440e350f	[Flax MLM] Refactor run mlm with optax (#11745 ) * refactor * update * update * update * refactor run mlm * finalize * refactor more * fix typo * update * finish refactor * modify run mlm * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * small fixes * upload * upload * finish run mlm script Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-19 12:00:58 +01:00
Marc van Zee	726e953d44	Improvements to Flax finetuning script (#11727 ) * Add Cloud details to README * Flax script and readme updates * Some simplifications of Flax script	2021-05-17 09:26:33 +01:00
Marc van Zee	94a2348706	Add Cloud details to README (#11706 ) * Add Cloud details to README * Flax script and readme updates	2021-05-14 14:51:25 +01:00
Patrick von Platen	113eaa7575	correct example script (#11726 )	2021-05-14 12:02:57 +01:00
Marc van Zee	6797cdc077	Updates README and fixes bug (#11701 )	2021-05-12 13:52:52 +01:00
Marc van Zee	4ce6bcc310	Adds Flax BERT finetuning example on GLUE (#11564 ) * Adds Flax BERT finetuning example * fix traced jax tensor type * Use Optax losses and learning schedulers * Add 1GPU training results * merge into master & make style * fix input * del file * Fix bug in loss and add torch runs * finish bert flax fine-tune * Update examples/flax/text-classification/README.md * Update examples/flax/text-classification/run_flax_glue.py * add requirements * finalize * finalize Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-11 19:02:59 +01:00

1 2

67 Commits