transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 11:11:05 +06:00

Author	SHA1	Message	Date
Bhavitvya Malik	d2753dcbec	add relevant description to tqdm in examples (#11927 ) * add relevant `desc` in examples * require_version datasets>=1.8.0	2021-06-10 15:59:55 -04:00
Matt	bebbdd0fc9	Appending label2id and id2label to models to ensure inference works properly (#12102 )	2021-06-10 15:25:04 +01:00
Matt	4cda08decb	Minor style edits	2021-06-10 15:10:57 +01:00
Matt	7f08dbd10a	Update README.md to cover the TF GLUE example.	2021-06-10 14:33:42 +01:00
Sylvain Gugger	d72e5a3a6d	Fix quality	2021-06-10 09:27:11 -04:00
Matt	73a532651a	New TF GLUE example (#12028 ) * Pushing partially-complete new GLUE example * First draft of the new TF GLUE example! Needs a little more testing to be sure but it's almost ready. * Fix to the fit() call * Bugfixes, making sure TPU and multi-GPU support is ready * Remove logger line that depends on Pytorch * Style pass * Deleting old TF GLUE example * Include label2id and id2label in the saved model config * Don't clobber the existing model.config.label2id * Style fixes * Update examples/tensorflow/text-classification/run_glue.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-10 14:14:37 +01:00
kumapo	472a867626	Add text_column_name and label_column_name to run_ner and run_ner_no_trainer args (#12083 ) * Add text_column_name and label_column_name to run_ner args * Minor fix: grouping for text and label column name	2021-06-10 08:03:20 -04:00
Stas Bekman	61e191987d	rm require_version_examples (#12088 )	2021-06-09 11:02:52 -07:00
Suraj Patil	d1500d9151	pass decay_mask fn to optimizer (#12087 )	2021-06-09 18:49:27 +01:00
Anton Lozhkov	d472bd7b18	Wav2Vec2 Pretraining (#11306 ) * Working quantizer forward * Working quantizer forward * Clean up unused model parts, test reproducibility * Working quantizer forward * Clean up unused model parts, test reproducibility * Remove custom outputs from the shared ones * correct conversion * correct bug * add first pretrain script * save intermediate * static shapes * save intermediate * finish first pretrain script version * more refactor * remove wanddb * refactor more * improve test * correct perplexity compute bug * finish model implementation * add to docs * finish docs * finish pretraining script * finish pretraining script * remove wandb * finish PR for merge * finish config * finish * make deepspeed work * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions * fix flaky test Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-09 18:40:56 +01:00
Stas Bekman	d14e0af274	sync LayerDrop for Wav2Vec2Encoder + tests (#12076 )	2021-06-09 13:21:03 +01:00
Koichi Yasuoka	82a2b76c95	Update run_ner.py with id2label config (#12001 )	2021-06-09 07:27:05 -04:00
Stas Bekman	11d86d3de4	[Deepspeed Wav2vec2] integration (#11638 ) * wip * wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044 * cleanup * workaround * working 5/8 modes * solve fp32 distributed zero3 * style * sync * sync * rework * deprecation * cleanup * https://github.com/microsoft/DeepSpeed/pull/1044 pr was merged * clean up * add a guide * more prose * more prose * fix * more prose * sub_group_size was too big * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor * bug fix * make the true check explicit * new deepspeed release Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-08 12:32:03 -07:00
Sylvain Gugger	fd6902838a	Properly indent block_size (#12070 )	2021-06-08 10:27:02 -04:00
cdleong	49bee0aea4	Add torch to requirements.txt in language-modeling (#12040 ) * Add torch to requirements.txt in language-modeling * Update examples/pytorch/language-modeling/requirements.txt Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-08 09:02:35 -04:00
Mario Šaško	f5eec0d8e9	Replace legacy tensor.Tensor with torch.tensor/torch.empty (#12027 ) * Replace legacy torch.Tensor constructor with torch.{tensor, empty} * Remove torch.Tensor in examples	2021-06-08 13:58:38 +01:00
Shamane Siri	e33085d648	updated the original RAG implementation to be compatible with latest Pytorch-Lightning (#11806 ) * updated the original RAG implementation to be compatible with the latest PL version * updated the requirements.txt file * execute make style * code quality test * code quality * conflix resolved in requirement.txt * code quality * changed the MyDDP class name to CustomDDP	2021-06-08 13:42:49 +01:00
Russell Klopfer	e363e1d936	adds metric prefix. (#12057 ) * adds metric prefix. * update tests to include prefix	2021-06-07 22:34:10 -04:00
Patrick von Platen	242ec31aa5	[Flax] Refactor MLM (#12013 ) * fix_torch_device_generate_test * remove @ * finish refactor Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-03 16:31:32 +01:00
Nicholas Vadivelu	4674061b2a	Fix weight decay masking in `run_flax_glue.py` (#11964 ) * Fix weight decay masking in `run_flax_glue.py` Issues with the previous implementation: - The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods. - `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped. - Flax's LayerNorm calls the scale parameter `scale` not `weight` * Fix formatting with black * adapt results Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-06-03 11:35:26 +01:00
dependabot[bot]	6db3a87de2	Bump urllib3 from 1.25.8 to 1.26.5 in /examples/research_projects/lxmert (#11983 ) Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.8 to 1.26.5. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.25.8...1.26.5) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-06-02 03:40:20 -04:00
Fan Zhang	7e73601f32	modify qa-trainer (#11872 ) * modify qa-trainer * fix flax model	2021-06-01 08:28:41 -04:00
Shamane Siri	9ec0f01b6c	RAG-2nd2end-revamp (#11893 ) * initial * code quality test * code quality * added test functions in test_modeling_rag.py and test_retrieval_rag.py to test end2end retreiver * minor change in test_modeling_rag * fixed tests * Update examples/research_projects/rag-end2end-retriever/README.md typo corrected as suggested by lhoestq Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> * Update examples/research_projects/rag-end2end-retriever/finetune_rag.py type change suggested by lhoestq Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> * Update src/transformers/models/rag/retrieval_rag.py Adding this change as mentioned by lhoestq. Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> * completed the minor changes suggested by the reviewers Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>	2021-06-01 07:32:26 +01:00
Philip May	cfca638acb	Add MT5ForConditionalGeneration as supported arch. to summarization README (#11961 ) * Add MT5ForConditionalGeneration as supported arch. * Update README.md	2021-05-31 21:24:33 +05:30
Nicholas Vadivelu	1ab147d648	Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920 ) * Remove redundant `nn.log_softmax` in `run_flax_glue.py` `optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference. * Remove unused 'flax.linen' import	2021-05-31 15:29:04 +01:00
Avital Oliver	2df546918e	Link official Cloud TPU JAX docs (#11892 )	2021-05-26 15:44:40 -04:00
Stas Bekman	1b6530104d	[Examples] create model with custom config on the fly (#11798 ) * create custom model on the flight * better wording * add update_from_string * cleanup * cleanup * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * more bool options * style * fix logger * add test * add the doc * assert on conflict of options Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-05-25 10:40:49 -07:00
Stas Bekman	6287c929c1	[lm examples] fix overflow in perplexity calc (#11855 ) * fix overflow in perplexity calc * use inf * fix	2021-05-25 08:11:26 -07:00
Sylvain Gugger	f086652b16	Add option to log only once in multinode training (#11819 ) * Add option to log only once in multinode training * Use an alternate property	2021-05-25 08:03:43 -04:00
Wang Ran (汪然)	b8344a274f	typo (#11858 )	2021-05-25 04:23:46 -04:00
Patrick von Platen	f580604157	[Flax] Fix PyTorch import error (#11839 ) * fix_torch_device_generate_test * remove @ * change pytorch import to flax import	2021-05-24 10:41:10 +01:00
Patrick von Platen	da22245ed9	Add flax text class colab (#11824 ) * fix_torch_device_generate_test * remove @ * add flax glue link	2021-05-21 23:11:58 +01:00
Patrick von Platen	82335185fe	[Flax] Small fixes in `run_flax_glue.py` (#11820 ) * fix_torch_device_generate_test * remove @ * correct best seed for flax fine-tuning Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-21 16:52:23 +01:00
Patrick von Platen	bd9871657b	[Flax] Align GLUE training script with mlm training script (#11778 ) * speed up flax glue * remove unnecessary line * remove folder * remove run in loop Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-21 09:36:56 +01:00
Keren Fuentes	223943872e	Fix failing test on Windows Platform (#11589 ) * add separator for windows * fixes test_is_copy_consistent on Windows * fixing writing encoding issue on extended test (for Windows) * resolving comments	2021-05-20 19:54:23 -04:00
Patrick von Platen	00440e350f	[Flax MLM] Refactor run mlm with optax (#11745 ) * refactor * update * update * update * refactor run mlm * finalize * refactor more * fix typo * update * finish refactor * modify run mlm * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * small fixes * upload * upload * finish run mlm script Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-19 12:00:58 +01:00
Tomy Hsieh	eb3e072a3b	Fix a small error in summarization example (#11762 )	2021-05-18 14:38:36 -04:00
Avital Oliver	77f9bd18af	Add Flax Examples and Cloud TPU README (#11753 ) * Add Flax Examples README * Apply suggestions from code review * Update examples/flax/README.md * add nice table * fix * fix * apply suggestions * upload * finish flax readme.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-05-18 17:45:16 +01:00
Philipp Schmid	04e25c6286	add `dataset_name` to data_args and added accuracy metric (#11760 ) * add `dataset_name` to data_args and added accuracy metric * added documentation for dataset_name * spelling correction	2021-05-18 16:27:29 +02:00
Patrick von Platen	cebb96f53a	Add more subsections to main doc (#11758 ) * add headers to main doc * Apply suggestions from code review * update * upload	2021-05-18 14:38:56 +01:00
Tommy Chiang	da7e73b721	Fix incorrect newline in #11650 (#11757 )	2021-05-18 15:28:13 +02:00
Sylvain Gugger	936b57158a	Use new evaluation loop in TrainerQA (#11746 )	2021-05-17 10:10:13 -04:00
Marc van Zee	726e953d44	Improvements to Flax finetuning script (#11727 ) * Add Cloud details to README * Flax script and readme updates * Some simplifications of Flax script	2021-05-17 09:26:33 +01:00
Marc van Zee	94a2348706	Add Cloud details to README (#11706 ) * Add Cloud details to README * Flax script and readme updates	2021-05-14 14:51:25 +01:00
Patrick von Platen	113eaa7575	correct example script (#11726 )	2021-05-14 12:02:57 +01:00
Lysandre	d77eb0cf92	Docs for v4.7.0.dev0	2021-05-12 17:08:35 +02:00
Lysandre	64e78564a5	Release: v4.6.0	2021-05-12 17:03:03 +02:00
Philip May	77f4c46b50	remove defaults to None if optional (#11703 )	2021-05-12 09:11:10 -04:00
Marc van Zee	6797cdc077	Updates README and fixes bug (#11701 )	2021-05-12 13:52:52 +01:00
Marc van Zee	4ce6bcc310	Adds Flax BERT finetuning example on GLUE (#11564 ) * Adds Flax BERT finetuning example * fix traced jax tensor type * Use Optax losses and learning schedulers * Add 1GPU training results * merge into master & make style * fix input * del file * Fix bug in loss and add torch runs * finish bert flax fine-tune * Update examples/flax/text-classification/README.md * Update examples/flax/text-classification/run_flax_glue.py * add requirements * finalize * finalize Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-11 19:02:59 +01:00
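
Commit 6287c929c1 (#11855) above describes fixing an overflow in the perplexity calculation of the language-modeling examples and mentions using `inf`. A minimal sketch of that kind of guard follows, assuming perplexity is computed as the exponential of the evaluation loss. The helper name `safe_perplexity` is hypothetical and is not taken from the example scripts.

```python
import math


def safe_perplexity(eval_loss: float) -> float:
    """Return exp(eval_loss), falling back to infinity if the result overflows a float.

    Hypothetical helper for illustration; the name is not from the repository.
    """
    try:
        return math.exp(eval_loss)
    except OverflowError:
        # math.exp raises OverflowError for very large inputs,
        # so report an infinite perplexity instead of crashing.
        return float("inf")


print(safe_perplexity(2.0))     # a finite perplexity
print(safe_perplexity(1000.0))  # too large for a float, reported as inf
```

Returning infinity keeps the evaluation report going for a badly diverged model while still making the failure obvious in the logged metrics.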

1623 Commits
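
Commit 4674061b2a (#11964) above lists three points about weight decay masking in `run_flax_glue.py`: flattened parameter keys are tuples of strings, `optax.masked` applies the transformation wherever the mask is True, and Flax's LayerNorm names its scale parameter `scale`. The sketch below illustrates those points in plain Python. The `flatten_dict` function here is a simplified stand-in for `flax.traverse_util.flatten_dict`, and the parameter tree and the `decay_mask` helper are made up for the example.

```python
def flatten_dict(tree, prefix=()):
    """Simplified stand-in for flax.traverse_util.flatten_dict (an assumption of this sketch).

    Nested dict keys become tuples of strings such as ("encoder", "dense", "kernel").
    """
    flat = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten_dict(value, path))
        else:
            flat[path] = value
    return flat


def decay_mask(params):
    """Map each parameter path to True where weight decay should be applied.

    Biases and LayerNorm scales are excluded, so they map to False.
    """
    flat = flatten_dict(params)
    return {
        path: path[-1] != "bias" and path[-2:] != ("LayerNorm", "scale")
        for path in flat
    }


# Hypothetical parameter tree; values are placeholders for real arrays.
params = {
    "encoder": {
        "dense": {"kernel": 1.0, "bias": 0.0},
        "LayerNorm": {"scale": 1.0, "bias": 0.0},
    }
}
mask = decay_mask(params)
for path, apply_decay in mask.items():
    print(".".join(path), apply_decay)
```

Because the keys are tuples, the checks compare tuple elements (`path[-1]`, `path[-2:]`) rather than searching a dotted string. In an actual training script the flat mask would still need to be converted back to the nested parameter structure before being handed to the optimizer, which is not shown here.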