transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 13:20:12 +06:00

Author	SHA1	Message	Date
Kamal Raj	d1f5ca1afd	[FLAX] glue training example refactor (#13815 ) * refactor run_flax_glue.py * updated readme * rm unused import and args typo fix * refactor * make consistent arg name across task * has_tensorboard check * argparse -> argument dataclasses * refactor according to review * fix	2022-01-19 12:04:51 +01:00
Suraj Patil	75ae287aec	fix flax examples tests (#14646 ) * make tensorboard optional * update test_fetcher for flax examples * make the tests slow	2021-12-07 00:34:27 +05:30
Suraj Patil	c5bd732ac6	Add Flax example tests (#14599 ) * add test for glue * add tests for clm * fix clm test * add summrization tests * more tests * fix few tests * add test for t5 mlm * fix t5 mlm test * fix tests for multi device * cleanup * ci job * fix metric file name * make t5 more robust	2021-12-06 10:48:58 +05:30
Suraj Patil	7db2a79b38	[examples/flax] use Repository API for push_to_hub (#13672 ) * use Repository for push_to_hub * update readme * update other flax scripts * update readme * update qa example * fix push_to_hub call * fix typo * fix more typos * update readme * use abosolute path to get repo name * fix glue script	2021-09-30 16:38:07 +05:30
Patrick von Platen	2d70c91206	[Flax] Adapt flax examples to include `push_to_hub` (#12391 ) * fix_torch_device_generate_test * remove @ * finish * correct summary writer * correct push to hub * fix indent * finish * finish * finish * finish * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-28 19:23:35 +01:00
Stas Bekman	4a872caef4	remove extra white space from log format (#12360 )	2021-06-25 13:20:14 -07:00
Nicholas Vadivelu	4674061b2a	Fix weight decay masking in `run_flax_glue.py` (#11964 ) * Fix weight decay masking in `run_flax_glue.py` Issues with the previous implementation: - The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods. - `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped. - Flax's LayerNorm calls the scale parameter `scale` not `weight` * Fix formatting with black * adapt results Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-03 11:35:26 +01:00
Nicholas Vadivelu	1ab147d648	Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920 ) * Remove redundant `nn.log_softmax` in `run_flax_glue.py` `optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference. * Remove unused 'flax.linen' import	2021-05-31 15:29:04 +01:00
Patrick von Platen	82335185fe	[Flax] Small fixes in `run_flax_glue.py` (#11820 ) * fix_torch_device_generate_test * remove @ * correct best seed for flax fine-tuning Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-21 16:52:23 +01:00
Patrick von Platen	bd9871657b	[Flax] Align GLUE training script with mlm training script (#11778 ) * speed up flax glue * remove unnecessary line * remove folder * remove run in loop Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-21 09:36:56 +01:00
Marc van Zee	726e953d44	Improvements to Flax finetuning script (#11727 ) * Add Cloud details to README * Flax script and readme updates * Some simplifications of Flax script	2021-05-17 09:26:33 +01:00
Patrick von Platen	113eaa7575	correct example script (#11726 )	2021-05-14 12:02:57 +01:00
Marc van Zee	6797cdc077	Updates README and fixes bug (#11701 )	2021-05-12 13:52:52 +01:00
Marc van Zee	4ce6bcc310	Adds Flax BERT finetuning example on GLUE (#11564 ) * Adds Flax BERT finetuning example * fix traced jax tensor type * Use Optax losses and learning schedulers * Add 1GPU training results * merge into master & make style * fix input * del file * Fix bug in loss and add torch runs * finish bert flax fine-tune * Update examples/flax/text-classification/README.md * Update examples/flax/text-classification/run_flax_glue.py * add requirements * finalize * finalize Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-11 19:02:59 +01:00

14 Commits