transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 11:11:05 +06:00

Author	SHA1	Message	Date
Patrick von Platen	a76dd7ee82	Update README.md	2021-07-16 00:16:30 +01:00
Patrick von Platen	2e9fb13fb1	[Wav2Vec2] Correctly pad mask indices for PreTraining (#12748 ) * fix_torch_device_generate_test * remove @ * start adding tests * correct wav2vec2 pretraining * up * up Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-07-15 21:40:25 +01:00
Suraj Patil	44f5b260fe	flax model parallel training (#12590 ) * update scripts * add copyright * add logging * cleanup * add z loss * add readme * shard description * update readme	2021-07-14 22:55:44 +05:30
Matt	f9ac677eba	Update TF examples README (#12703 ) * Update Transformers README, rename token_classification example to token-classification to be consistent with the others * Update examples/tensorflow/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add README for TF token classification * Update examples/tensorflow/token-classification/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/tensorflow/token-classification/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-07-14 15:15:25 +01:00
Patrick von Platen	f4399ec570	Update README.md	2021-07-14 12:54:31 +01:00
Matt	65bf05cd18	Adding TF translation example (#12667 ) * Adding TF translation example * Fixes and style pass for TF translation example * Remove unused postprocess_text copied from run_summarization * Adding README * Review fixes * Move changes to model.config to after we've initialized the model	2021-07-13 19:08:25 +01:00
Nick Doiron	5803a2a7ac	Add ByT5 option to example run_t5_mlm_flax.py (#12634 ) * Allow ByT5 type in Flax T5 script * use T5TokenizerFast * change up tokenizer config * model_args * reorder imports * Update run_t5_mlm_flax.py	2021-07-13 13:39:57 +01:00
Omar Sanseviero	c523b241c2	Update timeline for Flax event evaluation	2021-07-12 21:24:58 +02:00
Matt	379f649434	TF summarization example (#12617 ) * Adding a TF summarization example * Style pass * Style fixes * Updates for review comments * Adding README * Style pass * Remove unused import	2021-07-12 15:58:38 +01:00
Eduardo Gonzalez Ponferrada	2dd9440d08	Point to the right file for hybrid CLIP (#12599 )	2021-07-12 12:16:22 +05:30
Bhadresh Savani	de23ecea36	added test file (#12630 )	2021-07-12 12:15:14 +05:30
Patrick von Platen	deecdd4939	[Flax] Fix cur step flax examples (#12608 ) * fix_torch_device_generate_test * remove @ * fix save problem	2021-07-09 13:51:28 +01:00
Omar Sanseviero	8fe836af5a	Add Flax sprint project evaluation section (#12592 )	2021-07-09 08:52:30 +02:00
Sylvain Gugger	6f1adc4334	Fix group_lengths for short datasets (#12558 )	2021-07-08 07:23:41 -04:00
Ibraheem Moosa	122d7dc34f	Remove logging of GPU count etc logging. (#12569 ) Successfully logging this requires Pytorch. For the purposes of this script we are not using Pytorch.	2021-07-07 23:05:47 +01:00
Suraj Patil	d7e156bd1a	fix loading clip vision model (#12566 )	2021-07-07 22:50:27 +05:30
Patrick von Platen	7d321b7689	[Flax] Allow retraining from save checkpoint (#12559 ) * fix_torch_device_generate_test * remove @ * finish	2021-07-07 19:13:43 +05:30
Souvic Chakraborty	1d6623c6a2	MLM training fails with no validation file(same as #12406 for pytorch now) (#12517 ) * Validation split percentage to be used for custom data files also Issue same as https://github.com/huggingface/transformers/issues/12406 fixed for pytorch branch run_mlm.py * Validation split added in the right place * Update run_clm.py * validation split added for custom files * Validation split added for custom files * Update run_plm.py * fixed validation split for custom files as input for pytorch examples in lm * Update run_clm_no_trainer.py * args modified	2021-07-07 09:05:44 -04:00
Suraj Patil	2d42915abe	[examples/flax] add adafactor optimizer (#12544 ) * add adafactor * Update examples/flax/language-modeling/run_mlm_flax.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-07-07 11:50:30 +05:30
Patrick von Platen	208df208bf	[Flax] Adapt examples to be able to use eval_steps and save_steps (#12543 ) * fix_torch_device_generate_test * remove @ * up * up * correct * upload Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-07-06 19:41:51 +01:00
SaulLu	09af5bdea3	Replace `nn.Moudle` by `nn.Module` (#12541 )	2021-07-06 11:31:45 -04:00
Patrick von Platen	f42a0abf4b	Update README.md	2021-07-06 15:14:48 +01:00
Suzana Ilić	029b9d3f40	Update README (#12540 )	2021-07-06 16:12:16 +02:00
Suraj Patil	f5b0c1ecf0	[Flax] Fix hybrid clip (#12519 ) * fix saving and loading * update readme	2021-07-06 11:12:47 +05:30
Patrick von Platen	7d6285a921	[Wav2Vec2] Flax - Adapt wav2vec2 script (#12520 ) * fix_torch_device_generate_test * remove @ * adapt flax pretrain script	2021-07-05 23:49:47 +01:00
Patrick von Platen	4605b2b8ec	[Flax] Fix another bug in logging steps (#12516 ) * fix_torch_device_generate_test * remove @ * up	2021-07-05 18:35:22 +01:00
Patrick von Platen	d0f7508abe	[Flax] Correct logging steps flax (#12515 ) * fix_torch_device_generate_test * remove @ * push	2021-07-05 18:21:00 +01:00
Patrick von Platen	bb4ac2b5a8	[Flax] Correct flax training scripts (#12514 ) * fix_torch_device_generate_test * remove @ * add logging steps * correct training scripts * correct readme * correct	2021-07-05 18:14:50 +01:00
Matt	ea55675024	NER example for Tensorflow (#12469 ) * NER example for Tensorflow * Style pass * Style pass * Added metric computation on the evaluation set * Style pass * Fixed label masking * Style pass * Style pass	2021-07-05 15:42:18 +01:00
Patrick von Platen	9b90810558	[Flax] Dataset streaming example (#12470 ) * fix_torch_device_generate_test * remove @ * upload * finish dataset streaming * adapt readme * finish * up * up * up * up * Apply suggestions from code review * finish * make style * make style2 * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-07-05 15:13:10 +01:00
Navjot	eceb1042c1	flax.linen.apply takes state as the first param, followed by the input (#12510 )	2021-07-05 19:33:14 +05:30
Suraj Patil	f1c81d6b92	[Flax] ViT training example (#12300 ) * begin script * clean example, add readme * update readme * remove decay mask * remove masking * update readme & make flake happy	2021-07-05 18:23:03 +05:30
Akmal	e799e0f1ed	[Flax] Fix wav2vec2 pretrain arguments (#12498 )	2021-07-05 13:35:20 +01:00
Suraj Patil	23ab0b6980	[examples/flax] clip style image-text training example (#12491 ) * clip style example * fix post init * add requirements * update readme, few small fixes	2021-07-05 13:26:44 +05:30
Lysandre Debut	89a8739f0c	Add `Repository` import to the FLAX example script (#12501 )	2021-07-05 03:51:11 -04:00
Patrick von Platen	2df63282e0	Update README.md	2021-07-04 13:16:29 +01:00
Omar Sanseviero	a76eebfc80	Add guide on how to build demos for the Flax sprint (#12468 )	2021-07-02 20:35:17 +02:00
Patrick von Platen	b21905e03d	Update README.md	2021-07-02 14:12:47 +01:00
Patrick von Platen	d24a523130	Update README.md	2021-07-02 13:41:14 +01:00
Patrick von Platen	e3fce2f868	Update README.md Thanks a lot @BirgerMoell	2021-07-02 12:12:54 +01:00
Matthew LeMay	b4ecc6bef2	fixed typo in flax-projects readme (#12466 )	2021-07-02 12:27:39 +05:30
Souvic Chakraborty	d5b8fe3b90	Validation split added: custom data files @sgugger, @patil-suraj (#12407 ) * Validation split added: custom data files Validation split added in case of no validation file and loading custom data * Updated documentation with custom file usage Updated documentation with custom file usage * Update README.md * Update README.md * Update README.md * Made some suggested stylistic changes * Used logger instead of print. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Made similar changes to add validation split In case of a missing validation file, a validation split will be used now. * max_train_samples to be used for training only max_train_samples got misplaced, now corrected so that it is applied on training data only, not whole data. * styled * changed ordering * Improved language of documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improved language of documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fixed styling issue * Update run_mlm.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-07-01 13:22:42 -04:00
Patrick von Platen	7f87bfc910	Add TPU README (#12463 ) * Add TPU README * Apply suggestions from code review * Update examples/research_projects/jax-projects/README.md * Update examples/research_projects/jax-projects/README.md Co-authored-by: Stefan Schweter <stefan@schweter.it> Co-authored-by: Stefan Schweter <stefan@schweter.it>	2021-07-01 17:11:54 +01:00
Patrick von Platen	1457839fc5	Update README.md	2021-07-01 15:52:11 +01:00
Suzana Ilić	c18af5d40c	Added talk details (#12465 )	2021-07-01 16:19:23 +02:00
Patrick von Platen	b655f16d4e	[Flax community event] How to use hub during training (#12447 ) * fix_torch_device_generate_test * remove @ * upload * finish doc * Apply suggestions from code review Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Julien Chaumond <chaumond@gmail.com> * finish Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2021-07-01 11:41:22 +01:00
Patrick von Platen	0d1f67e651	[Flax] Add wav2vec2 (#12271 ) * fix_torch_device_generate_test * remove @ * start flax wav2vec2 * save intermediate * forward pass has correct shape * add weight norm * add files * finish ctc * make style * finish gumbel quantizer * correct docstrings * correct some more files * fix vit * finish quality * correct tests * correct docstring * correct tests * start wav2vec2 pretraining script * save intermediate * start pretraining script * finalize pretraining script * finish * finish * small typo * finish * correct * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com> * make style * push Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2021-06-30 18:44:23 +01:00
Suraj Patil	3f36a2c064	[JAX/Flax readme] add philosophy doc (#12419 ) * add philosophy doc * fix typos * update doc * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * address Patricks suggestions * add a training example and fix typos * jit the training step * jit train step * fix example code * typo * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-06-30 21:40:12 +05:30
Suzana Ilić	1ad1c4a864	Add to talks section (#12442 )	2021-06-30 16:58:03 +02:00
Suzana Ilić	90d69456eb	Added to talks section (#12433 ) Added one more confirmed speaker, zoom links and gcal event links	2021-06-30 13:14:11 +02:00

1 2 3 4 5 ...

1718 Commits