transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-23 06:20:22 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	a135f59536	Auto modelcard (#11599 ) * Autogenerate model cards from the Trainer * ModelCard deprecated * Fix test * Style * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Address review comments * Quality * With all metadata * Metadata * Post-merge conflict mess * Data args and all examples * Default license and languages when possible Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-05-11 11:30:34 -04:00
Jonathan Chang	64232bc0df	Add --text_column to run_summarization_no_trainer (#11673 )	2021-05-11 07:58:38 -04:00
Matt	ef8d32c5ea	Fix suggested by @bhadreshpsavani (#11660 )	2021-05-10 14:28:04 +01:00
Quentin Lhoest	1a0b41781d	Update requirements.txt (#11634 )	2021-05-10 11:19:52 +05:30
Tommy Chiang	7e406f4a65	[Examples] Fix invalid links after reorg (#11650 )	2021-05-10 11:16:48 +05:30
Tommy Chiang	f2ffcaf49f	[Examples] Check key exists in datasets first (#11503 )	2021-05-09 15:42:38 -04:00
Stas Bekman	ba0d50f214	[examples] fix sys.path in conftest.py (#11636 ) * restore conftest.py * fix conftest and make copies * remove unneeded parts * remove unwanted files	2021-05-07 14:44:22 -07:00
Jonathan Chang	6f40e31766	Fix comment in run_clm_no_trainer.py (#11624 )	2021-05-07 12:32:30 +05:30
Vipul Raheja	f594090a93	fix typo in command (#11605 )	2021-05-06 12:32:54 +05:30
Patrick von Platen	3e3e41ae20	Pytorch - Lazy initialization of models (#11471 ) * lazy_init_weights * remove ipdb * save int * add necessary code * remove unnecessary utils * Update src/transformers/models/t5/modeling_t5.py * clean * add tests * correct * finish tests * finish tests * fix some more tests * fix xlnet & transfo-xl * fix more tests * make sure tests are independent * fix tests more * finist tests * final touches * Update src/transformers/modeling_utils.py * Apply suggestions from code review * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * clean tests * give arg positive name * add more mock weights to xlnet Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-05-05 17:22:20 +02:00
Sylvain Gugger	6b241e0e3b	Reproducible checkpoint (#11582 ) * Set generator in dataloader * Use generator in all random samplers * Checkpoint all RNG states * Final version * Quality * Test * Address review comments * Quality * Remove debug util * Add python and numpy RNGs * Split states in different files in distributed * Quality * local_rank for TPUs * Only use generator when accepted * Add test * Set seed to avoid flakiness * Make test less flaky * Quality	2021-05-04 16:20:56 -04:00
Patrick von Platen	084a187da3	[FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470 ) * add flax roberta * make style * correct initialiazation * modify model to save weights * fix copied from * fix copied from * correct some more code * add more roberta models * Apply suggestions from code review * merge from master * finish * finish docs Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-05-04 19:57:59 +02:00
Sylvain Gugger	87dd1a00ef	Fix metric computation in `run_glue_no_trainer` (#11569 )	2021-05-03 11:42:55 -04:00
Bhadresh Savani	84326a28f8	[Examples] Added support for test-file in QA examples with no trainer (#11510 ) * added support for test-file * fixed typo * added suggested changes * reformatted code * modifed files * fix post processing error * Trigger CI * removed extra lines	2021-04-30 09:02:50 -04:00
Suraj Patil	57c8e822f7	reszie token embeds (#11524 )	2021-04-30 08:47:01 -04:00
Matt	20d6931e32	Update TF text classification example (#11496 ) Big refactor, fixes and multi-GPU/TPU support	2021-04-30 13:45:33 +01:00
Manuel Romero	58c789e3d2	Update README.md (#11489 ) Add link to code	2021-04-30 04:29:59 -04:00
Sylvain Gugger	b29eb247d3	Split checkpoint from model_name_or_path in examples (#11492 ) * Split checkpoint from model_name_or_path in examples * Address review comments * Address review comments	2021-04-29 18:33:47 -04:00
Jaimeen Ahn	0661abc545	Variable Correction for Consistency in Distillation Example (#11444 ) As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively, the correction makes the example work	2021-04-26 13:30:48 -04:00
Bhadresh Savani	1d30ec95c7	[Examples] Fixes inconsistency around eval vs val and predict vs test (#11380 ) * added changes for uniformity * modified files * corrected typo * fixed qa scripts * fix typos * fixed predict typo in qa no trainer * fixed test file * reverted trainer changes * reverted trainer changes in custom exmaples * updated readme * added changes in deepspeed test * added changes for predict and eval	2021-04-26 09:24:31 -07:00
Amine Abdaoui	e3e70f9551	docs(examples): fix link to TPU launcher script (#11427 )	2021-04-26 09:08:43 -04:00
Patrick von Platen	32dbb2d954	make style (#11442 )	2021-04-26 13:50:34 +02:00
Sylvain Gugger	1ef152eb48	Default to accuracy metric (#11405 )	2021-04-23 14:49:59 -04:00
Sylvain Gugger	bf2e0cf70b	Trainer push to hub (#11328 ) * Initial support for upload to hub * push -> upload * Fixes + examples * Fix torchhub test * Torchhub test I hate you * push_model_to_hub -> push_to_hub * Apply mixin to other pretrained models * Remove ABC inheritance * Add tests * Typo * Run tests * Install git-lfs * Change approach * Add push_to_hub to all * Staging test suite * Typo * Maybe like this? * More deps * Cache * Adapt name * Quality * MOAR tests * Put it in testing_utils * Docs + torchhub last hope * Styling * Wrong method * Typos * Update src/transformers/file_utils.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * Address review comments * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-04-23 09:17:37 -04:00
Yoshitomo Matsubara	c3d6f33918	fixed typos (#11391 )	2021-04-23 07:48:42 -04:00
Max Del	a90d3f1862	Fix typo in text (#11396 )	2021-04-23 07:37:19 -04:00
Patrick von Platen	b48cf7124c	correct typo (#11393 )	2021-04-23 11:34:59 +02:00
Matt	2617396094	Correctly cast num_train_epochs to int (#11379 )	2021-04-22 13:49:59 +01:00
johnson7788	5b5e4ca366	[run_translation.py] fix typo (#11372 ) fix typo Co-authored-by: johnson <johnson@github.com>	2021-04-22 17:47:11 +05:30
Matt	6fe79e57d7	Move old TF text classification script to legacy (#11361 ) And update README to explain the work-in-progress!	2021-04-21 17:36:18 +01:00
Matt	ac588594e2	Merge new TF example script (#11360 ) First of the new and more idiomatic TF examples!	2021-04-21 17:04:55 +01:00
Sylvain Gugger	dabeb15292	Examples reorg (#11350 ) * Base move * Examples reorganization * Update references * Put back test data * Move conftest * More fixes * Move test data to test fixtures * Update path * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments and clean Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-21 11:11:20 -04:00
Sylvain Gugger	f1b938fda8	Update to use datasets remove_cloumns method (#11343 ) * Update to use datasets remove_cloumns method * Quality	2021-04-20 14:12:01 -04:00
rajvi-k	bfd83c17a7	Added translation example script (#11196 ) * initial changes * modified evaluation * updated evaluation * updated evaluation on text translation example script * added translation example script * Formatted translation example script * Reformatted translation example * Fixed evaluation bug and added support for other tokenisers * Fixed evaluation bug and added support for other tokenisers * Added translation example script * Formatted summarization example script * Removed typos from summarization example script	2021-04-20 07:18:47 -04:00
Sudharsan S T	f25444cb22	Close open files to suppress ResourceWarning (#11240 ) Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>	2021-04-14 10:31:04 -04:00
Nithin Holla	653076ca30	Save the Wav2Vec2 processor before training starts (#10910 ) Co-authored-by: nithin19 <nithin@amberscript.com>	2021-04-14 14:52:06 +03:00
Philipp Schmid	9fa2995993	added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220 )	2021-04-13 18:35:18 +02:00
Takuya Makino	cb251ba619	Fix typo (#11188 )	2021-04-12 17:35:32 -04:00
Masatoshi TSUCHIYA	ef102c4886	model_path should be ignored as the checkpoint path (#11157 ) * model_path is refered as the path of the trainer, and should be ignored as the checkpoint path. * Improved according to Sgugger's comment.	2021-04-12 09:06:41 -04:00
Stas Bekman	07f0bb691d	[examples run_clm] fix _LazyModule hasher error (#11168 ) * fix _LazyModule hasher error * reword	2021-04-09 11:39:12 -07:00
Suraj Patil	c161dd56df	[examples/translation] support mBART-50 and M2M100 fine-tuning (#11170 ) * keep a list of multilingual tokenizers * add forced_bos_token argument	2021-04-09 23:58:42 +05:30
Saviour Owolabi	6060746570	Update README.md (#11161 ) Corrected a typo ('Downlowd' to 'Download')	2021-04-09 11:52:21 -04:00
Stas Bekman	66446909b2	[tests] relocate core integration tests (#11146 ) * relocate core integration tests * add sys.path context manager * cleanup * try * try2 * fix path * doc * style * add dep * add 2 more deps	2021-04-08 13:13:17 -07:00
Andrea Cappelli	6c40e49712	Run mlm pad to multiple for fp16 (#11128 ) * Add mlm collator pad to multiple option (#10627) * Use padding to 8x in run mlm (#10627)	2021-04-08 16:12:49 -04:00
Stas Bekman	c6d664849b	[DeepSpeed] ZeRO Stage 3 (#10753 ) * synced gpus * fix * fix * need to use t5-small for quality tests * notes * complete merge * fix a disappearing std stream problem * start zero3 tests * wip * tune params * sorting out the pre-trained model loading * reworking generate loop wip * wip * style * fix tests * split the tests * refactor tests * wip * parameterized * fix * workout the resume from non-ds checkpoint pass + test * cleanup * remove no longer needed code * split getter/setter functions * complete the docs * suggestions * gpus and their compute capabilities link * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * style * remove invalid paramgd * automatically configure zero3 params that rely on hidden size * make _get_resized_embeddings zero3-aware * add test exercising resize_token_embeddings() * add docstring Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-08 09:53:01 -07:00
Stas Bekman	acc851e1ff	[run_clm] clarify why we get the tokenizer warning on long input (#11145 ) * clarify why we get the warning here * Update examples/language-modeling/run_clm.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * wording * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-08 09:46:28 -07:00
Stas Bekman	424419f549	[examples] fix white space (#11099 ) these get concatenated without whitespace, so fix it	2021-04-07 09:20:58 -04:00
Stas Bekman	c9035e4537	fix: The 'warn' method is deprecated (#11105 ) * The 'warn' method is deprecated * fix test	2021-04-07 09:20:06 -04:00
Sylvain Gugger	fd338abdeb	Style	2021-04-06 19:54:13 -04:00
SHYAM SUNDER KUMAR	aef4cf8c52	accelerate question answering examples with no trainer (#11091 ) * accelerate question answering examples with no trainer * removed train and eval flags also fixed fill np array function * Update examples/question-answering/run_qa_beam_search_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/question-answering/run_qa_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-06 19:35:21 -04:00

1 2 3 4 5 ...

1573 Commits