Karim Foda
24a85cca61
Add use_auth_token to load_dataset for private datasets to PT and TF examples ( #16521 )
...
* fix formatting and remove use_auth
* Add use_auth_token to Flax examples
2022-04-04 10:27:45 -04:00
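A minimal sketch of the pattern this commit threads through the example scripts: pass an auth token to `datasets.load_dataset` so a private Hub dataset can be read. The dataset name below is hypothetical, and the token is assumed to have been stored by `huggingface-cli login`.

```python
# Hedged sketch, not the example scripts' exact code.
from datasets import load_dataset

raw_datasets = load_dataset(
    "my-org/private-corpus",   # hypothetical private dataset on the Hub
    use_auth_token=True,       # reuse the token saved by `huggingface-cli login`
)
```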
Yongrae Jo
8049dfa427
Update run_t5_mlm_flax.py ( #16421 )
...
Fix typo in comment: proprocessed -> preprocessed
2022-03-28 06:00:53 -04:00
Sylvain Gugger
4975002df5
Reorganize file utils ( #16264 )
...
* Split file_utils in several submodules
* Fixes
* Add back more objects
* More fixes
* Who exactly decided to import that from there?
* Second suggestion from code review
* Revert wrong move
* Fix imports
* Adapt all imports
* Adapt all imports everywhere
* Revert this import, will fix in a separate commit
2022-03-23 10:26:33 -04:00
Yeb Havinga
91fb62d01c
Speedup training by using numpy instead of jnp for batch shuffling ( #15963 )
...
Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
2022-03-08 12:18:38 +01:00
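A minimal sketch of the idea behind this speedup: the per-epoch batch order is just an index permutation, so it can be drawn with NumPy on the host instead of `jax.numpy` on the device. The function name below is illustrative, not the script's own.

```python
import numpy as np

def shuffled_batch_indices(seed: int, num_samples: int, batch_size: int) -> np.ndarray:
    rng = np.random.default_rng(seed)
    perm = rng.permutation(num_samples)              # host-side shuffle, no device round trip
    num_batches = num_samples // batch_size
    return perm[: num_batches * batch_size].reshape((num_batches, batch_size))
```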
Patrick von Platen
10b76987fc
[FlaxT5 Example] fix flax t5 example pretraining ( #15835 )
2022-03-04 17:04:43 +01:00
Stas Bekman
762416ffa8
[examples/flax/language-modeling] set loglevel ( #15129 )
2022-01-13 15:17:28 +01:00
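A minimal sketch of what setting the log level in a Flax example typically looks like: configure Python's root logger and raise the `transformers` verbosity. This is a generic pattern, not the exact lines from the commit.

```python
import logging
import sys

import transformers

logging.basicConfig(
    format="%(asctime)s - %(levelname)s - %(name)s - %(message)s",
    datefmt="%m/%d/%Y %H:%M:%S",
    handlers=[logging.StreamHandler(sys.stdout)],
    level=logging.INFO,
)
transformers.utils.logging.set_verbosity_info()
```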
Suraj Patil
6a025487a6
[Flax examples] remove dependency on PyTorch training args ( #14636 )
...
* use custom training arguments
* update tests
2021-12-12 09:19:12 +05:30
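A minimal sketch of the approach: declare the training arguments as a plain dataclass and parse them with `HfArgumentParser`, so the Flax scripts no longer import the PyTorch `TrainingArguments`. The fields below are illustrative; the real class carries many more.

```python
from dataclasses import dataclass, field

from transformers import HfArgumentParser

@dataclass
class TrainingArguments:          # Flax-only replacement, no torch import needed
    output_dir: str = field(metadata={"help": "Where checkpoints are written."})
    learning_rate: float = 5e-5
    num_train_epochs: float = 3.0
    per_device_train_batch_size: int = 8

# explicit argv here only so the sketch runs standalone
(training_args,) = HfArgumentParser(TrainingArguments).parse_args_into_dataclasses(
    args=["--output_dir", "./out"]
)
```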
Suraj Patil
c5bd732ac6
Add Flax example tests ( #14599 )
...
* add test for glue
* add tests for clm
* fix clm test
* add summarization tests
* more tests
* fix few tests
* add test for t5 mlm
* fix t5 mlm test
* fix tests for multi device
* cleanup
* ci job
* fix metric file name
* make t5 more robust
2021-12-06 10:48:58 +05:30
Rahul Nadkarni
8332327dca
Fix sentinel token IDs in data collator for Flax T5 pretraining script ( #14477 )
2021-11-29 17:30:17 +01:00
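A minimal sketch of the convention this fix relies on: T5's sentinel tokens sit at the top of the vocabulary, with `<extra_id_0>` holding the largest id, so the i-th masked span should map to `vocab_size - 1 - i`. The function name and signature below are illustrative, not the data collator's.

```python
import numpy as np

def sentinel_ids_for_spans(span_starts: np.ndarray, vocab_size: int) -> np.ndarray:
    """Map the i-th masked span to <extra_id_i>, counting down from the top of the vocab."""
    span_number = np.cumsum(span_starts)                     # 1 for the first span, 2 for the next, ...
    return np.where(span_starts == 1, vocab_size - span_number, 0)

starts = np.array([0, 1, 0, 0, 1, 0])
print(sentinel_ids_for_spans(starts, vocab_size=32100))      # [0 32099 0 0 32098 0]
```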
Nicholas Broad
69e16abf98
Switch from using sum for flattening lists of lists in group_texts ( #14472 )
...
* remove sum for list flattening
* change to chain(*)
* make chain object a list
* delete empty lines
per sgugger's suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-11-22 16:17:26 -05:00
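A minimal sketch of the change: flatten the lists of token lists with `itertools.chain` instead of `sum()`, which rebuilds the accumulator on every addition and is quadratic in the number of lists.

```python
from itertools import chain

examples = {"input_ids": [[1, 2, 3], [4, 5], [6]]}

# before: concatenated = sum(examples["input_ids"], [])
concatenated = list(chain(*examples["input_ids"]))   # [1, 2, 3, 4, 5, 6]
```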
Suraj Patil
7db2a79b38
[examples/flax] use Repository API for push_to_hub ( #13672 )
...
* use Repository for push_to_hub
* update readme
* update other flax scripts
* update readme
* update qa example
* fix push_to_hub call
* fix typo
* fix more typos
* update readme
* use absolute path to get repo name
* fix glue script
2021-09-30 16:38:07 +05:30
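A minimal sketch of the `Repository` pattern: clone (or create) the target repo into the output directory once, save checkpoints there during training, and push by committing the local clone. The repo id below is hypothetical.

```python
from huggingface_hub import Repository

repo = Repository("output_dir", clone_from="my-user/flax-mlm-demo")   # hypothetical repo id
# ... training loop saves model/tokenizer files into "output_dir" ...
repo.push_to_hub(commit_message="Training in progress", blocking=False)
```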
Patrick von Platen
2e4082364e
[Flax T5] Speed up t5 training ( #13012 )
...
* fix_torch_device_generate_test
* remove @
* update
* up
* fix
* remove f-strings
* correct readme
* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
2021-08-06 11:21:37 +02:00
fgaim
66197adc98
Flax MLM: Allow validation split when loading dataset from local file ( #12689 )
...
* Allow validation split when loading dataset from local file
* Flax clm & t5, enable validation split for datasets loaded from local file
2021-07-20 13:38:25 +02:00
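A minimal sketch of the behaviour added here: when only a local training file is given, carve out a validation split with percentage slicing. The file name and split percentage below are illustrative.

```python
from datasets import load_dataset

data_files = {"train": "train.txt"}      # hypothetical local file
split_pct = 5                            # validation_split_percentage
raw_datasets = {
    "validation": load_dataset("text", data_files=data_files, split=f"train[:{split_pct}%]"),
    "train": load_dataset("text", data_files=data_files, split=f"train[{split_pct}%:]"),
}
```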
Nick Doiron
5803a2a7ac
Add ByT5 option to example run_t5_mlm_flax.py ( #12634 )
...
* Allow ByT5 type in Flax T5 script
* use T5TokenizerFast
* change up tokenizer config
* model_args
* reorder imports
* Update run_t5_mlm_flax.py
2021-07-13 13:39:57 +01:00
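From the user's side the option amounts to pointing the script at a ByT5 checkpoint instead of a SentencePiece T5 one; a minimal sketch with public Hub checkpoints:

```python
from transformers import AutoConfig, AutoTokenizer

model_name = "google/byt5-small"          # byte-level T5; "t5-small" works the same way
config = AutoConfig.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
print(tokenizer("hello").input_ids)       # ByT5 ids are shifted UTF-8 bytes plus EOS
```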
Patrick von Platen
deecdd4939
[Flax] Fix cur step flax examples ( #12608 )
...
* fix_torch_device_generate_test
* remove @
* fix save problem
2021-07-09 13:51:28 +01:00
Sylvain Gugger
6f1adc4334
Fix group_lengths for short datasets ( #12558 )
2021-07-08 07:23:41 -04:00
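A simplified sketch of `group_texts` with the guard this fix adds: only drop the trailing remainder when the concatenated corpus contains at least one full block, so tiny datasets are not reduced to zero examples.

```python
from itertools import chain

def group_texts(examples, block_size=128):
    concatenated = {k: list(chain(*examples[k])) for k in examples}
    total_length = len(next(iter(concatenated.values())))
    if total_length >= block_size:                       # the fix: keep short datasets intact
        total_length = (total_length // block_size) * block_size
    return {
        k: [t[i : i + block_size] for i in range(0, total_length, block_size)]
        for k, t in concatenated.items()
    }
```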
Ibraheem Moosa
122d7dc34f
Remove logging of GPU count etc. ( #12569 )
...
Logging this successfully requires PyTorch, which this script does not use.
2021-07-07 23:05:47 +01:00
Patrick von Platen
7d321b7689
[Flax] Allow retraining from save checkpoint ( #12559 )
...
* fix_torch_device_generate_test
* remove @
* finish
2021-07-07 19:13:43 +05:30
Suraj Patil
2d42915abe
[examples/flax] add adafactor optimizer ( #12544 )
...
* add adafactor
* Update examples/flax/language-modeling/run_mlm_flax.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-07-07 11:50:30 +05:30
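A minimal sketch of what the option selects, using `optax`; the learning rate and weight decay values below are placeholders.

```python
import optax

use_adafactor = True
optimizer = (
    optax.adafactor(learning_rate=5e-3)                          # memory-light, T5-style
    if use_adafactor
    else optax.adamw(learning_rate=5e-3, weight_decay=0.01)      # AdamW otherwise
)
```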
Patrick von Platen
208df208bf
[Flax] Adapt examples to be able to use eval_steps and save_steps ( #12543 )
...
* fix_torch_device_generate_test
* remove @
* up
* up
* correct
* upload
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
2021-07-06 19:41:51 +01:00
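A minimal sketch of step-based hooks inside a training loop: evaluate every `eval_steps` optimizer steps and checkpoint every `save_steps`, instead of only at epoch boundaries. All numbers are placeholders.

```python
eval_steps, save_steps = 500, 1_000
cur_step = 0
for epoch in range(3):
    for _batch in range(2_000):          # stand-in for the train dataloader
        cur_step += 1
        # ... one optimizer step ...
        if cur_step % eval_steps == 0:
            pass                          # run the eval loop, log metrics
        if cur_step % save_steps == 0:
            pass                          # save a checkpoint / push to the Hub
```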
Patrick von Platen
4605b2b8ec
[Flax] Fix another bug in logging steps ( #12516 )
...
* fix_torch_device_generate_test
* remove @
* up
2021-07-05 18:35:22 +01:00
Patrick von Platen
d0f7508abe
[Flax] Correct logging steps flax ( #12515 )
...
* fix_torch_device_generate_test
* remove @
* push
2021-07-05 18:21:00 +01:00
Patrick von Platen
bb4ac2b5a8
[Flax] Correct flax training scripts ( #12514 )
...
* fix_torch_device_generate_test
* remove @
* add logging steps
* correct training scripts
* correct readme
* correct
2021-07-05 18:14:50 +01:00
Patrick von Platen
813328682e
[Flax] Example scripts - correct weight decay ( #12409 )
...
* fix_torch_device_generate_test
* remove @
* finish
* finish
* correct style
2021-06-29 12:01:08 +01:00
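A minimal sketch of the usual correction: mask AdamW's weight decay so it skips bias and layer-norm parameters. The exact name checks are model-specific; this version simply skips anything named `bias` or `scale`.

```python
import optax
from flax.traverse_util import flatten_dict, unflatten_dict

def decay_mask_fn(params):
    flat = flatten_dict(params)
    mask = {path: path[-1] not in ("bias", "scale") for path in flat}
    return unflatten_dict(mask)

optimizer = optax.adamw(learning_rate=1e-4, weight_decay=0.01, mask=decay_mask_fn)
```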
Patrick von Platen
31c3e7e75b
[Flax] Add T5 pretraining script ( #12355 )
...
* fix_torch_device_generate_test
* remove @
* add length computation
* finish masking
* finish
* upload
* fix some bugs
* finish
* fix dependency table
* correct tensorboard
* Apply suggestions from code review
* correct processing
* slight change init
* correct some more mistakes
* apply suggestions
* improve readme
* fix indent
* Apply suggestions from code review
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
* correct tokenizer
* finish
* finish
* finish
* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
2021-06-28 20:11:29 +01:00
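A minimal sketch of the length computation mentioned in the commit body: under the T5 span-corruption defaults (15% noise, mean span length 3), a raw token count determines the fixed input and target lengths after masking, which is what lets the script pack batches to static shapes. Simplified from the original T5 helpers; the function name is illustrative.

```python
def span_corruption_lengths(tokens_length: int,
                            noise_density: float = 0.15,
                            mean_span_length: float = 3.0):
    """Return (inputs_length, targets_length) for `tokens_length` raw tokens."""
    num_noise_tokens = int(round(tokens_length * noise_density))
    num_noise_spans = int(round(num_noise_tokens / mean_span_length))
    inputs_length = tokens_length - num_noise_tokens + num_noise_spans + 1   # non-noise + sentinels + EOS
    targets_length = num_noise_tokens + num_noise_spans + 1                  # noise + sentinels + EOS
    return inputs_length, targets_length

print(span_corruption_lengths(568))   # (512, 114) with the defaults above
```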