transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 22:30:09 +06:00

Author	SHA1	Message	Date
Lysandre	0daadc1919	Docs for v4.8.0	2021-06-17 18:17:42 +02:00
Lysandre	7a6c9fab8e	Release: v4.7.0	2021-06-17 17:57:42 +02:00
Sylvain Gugger	7d7ceca396	Model card defaults (#12122 ) * [WIP] Model card defaults * finetuned_from default value * Add all mappings to the mapping file * Be more defensive on finetuned_from arg * Add default task tag * Separate tags from tasks * Edge case for dataset * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-06-15 16:01:37 -04:00
kumapo	955b2b97a6	Enable add_prefix_space if model_type is roberta or gpt2 (#12116 )	2021-06-15 09:33:21 -04:00
Kumar Abhishek	9de62cfbce	[lm examples] Replicate --config_overrides addition to other LM examples (#12135 ) * [lm examples] Replicate --config_overrides addition to other LM examples * Removing no trainer files changes * Update README Co-authored-by: Kumar Abhishek <kabhishek@expedia.com>	2021-06-14 08:12:22 -04:00
Nicholas Broad	cd7961b632	Use text_column_name variable instead of "text" (#12132 ) * Use text_column_name variable instead of "text" `text_column_name` was already defined above where I made the changes and it was also used below where I made changes. This is a very minor change. If a dataset does not use "text" as the column name, then the `tokenize_function` will now use whatever column is assigned to `text_column_name`. `text_column_name` is just the first column name if "text" is not a column name. It makes the function a little more robust, though I would assume that 90% + of datasets use "text" anyway. * black formatting * make style Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>	2021-06-14 08:11:13 -04:00
Sylvain Gugger	b8ab541340	Don't log anything before logging is setup in examples (#12121 ) * Don't log anything before logging is setup in examples * Last example	2021-06-14 08:03:33 -04:00
Bhavitvya Malik	d2753dcbec	add relevant description to tqdm in examples (#11927 ) * add relevant `desc` in examples * require_version datasets>=1.8.0	2021-06-10 15:59:55 -04:00
Matt	bebbdd0fc9	Appending label2id and id2label to models to ensure inference works properly (#12102 )	2021-06-10 15:25:04 +01:00
Sylvain Gugger	d72e5a3a6d	Fix quality	2021-06-10 09:27:11 -04:00
kumapo	472a867626	Add text_column_name and label_column_name to run_ner and run_ner_no_trainer args (#12083 ) * Add text_column_name and label_column_name to run_ner args * Minor fix: grouping for text and label column name	2021-06-10 08:03:20 -04:00
Koichi Yasuoka	82a2b76c95	Update run_ner.py with id2label config (#12001 )	2021-06-09 07:27:05 -04:00
Sylvain Gugger	fd6902838a	Properly indent block_size (#12070 )	2021-06-08 10:27:02 -04:00
cdleong	49bee0aea4	Add torch to requirements.txt in language-modeling (#12040 ) * Add torch to requirements.txt in language-modeling * Update examples/pytorch/language-modeling/requirements.txt Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-08 09:02:35 -04:00
Russell Klopfer	e363e1d936	adds metric prefix. (#12057 ) * adds metric prefix. * update tests to include prefix	2021-06-07 22:34:10 -04:00
Fan Zhang	7e73601f32	modify qa-trainer (#11872 ) * modify qa-trainer * fix flax model	2021-06-01 08:28:41 -04:00
Philip May	cfca638acb	Add MT5ForConditionalGeneration as supported arch. to summarization README (#11961 ) * Add MT5ForConditionalGeneration as supported arch. * Update README.md	2021-05-31 21:24:33 +05:30
Stas Bekman	1b6530104d	[Examples] create model with custom config on the fly (#11798 ) * create custom model on the flight * better wording * add update_from_string * cleanup * cleanup * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * more bool options * style * fix logger * add test * add the doc * assert on conflict of options Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-05-25 10:40:49 -07:00
Stas Bekman	6287c929c1	[lm examples] fix overflow in perplexity calc (#11855 ) * fix overflow in perplexity calc * use inf * fix	2021-05-25 08:11:26 -07:00
Sylvain Gugger	f086652b16	Add option to log only once in multinode training (#11819 ) * Add option to long only once in multinode training * Use an alternate property	2021-05-25 08:03:43 -04:00
Wang Ran (汪然)	b8344a274f	typo (#11858 )	2021-05-25 04:23:46 -04:00
Keren Fuentes	223943872e	Fix failing test on Windows Platform (#11589 ) * add separator for windows * fixes test_is_copy_consistent on Windows * fixing writing encoding issue on extended test (for Windows) * resolving comments	2021-05-20 19:54:23 -04:00
Tomy Hsieh	eb3e072a3b	Fix a small error in summarization example (#11762 )	2021-05-18 14:38:36 -04:00
Philipp Schmid	04e25c6286	add `dataset_name` to data_args and added accuracy metric (#11760 ) * add `dataset_name` to data_args and added accuracy metric * added documentation for dataset_name * spelling correction	2021-05-18 16:27:29 +02:00
Patrick von Platen	cebb96f53a	Add more subsections to main doc (#11758 ) * add headers to main doc * Apply suggestions from code review * update * upload	2021-05-18 14:38:56 +01:00
Tommy Chiang	da7e73b721	Fix incorrect newline in #11650 (#11757 )	2021-05-18 15:28:13 +02:00
Sylvain Gugger	936b57158a	Use new evaluation loop in TrainerQA (#11746 )	2021-05-17 10:10:13 -04:00
Lysandre	d77eb0cf92	Docs for v4.7.0.dev0	2021-05-12 17:08:35 +02:00
Lysandre	64e78564a5	Release: v4.6.0	2021-05-12 17:03:03 +02:00
Sylvain Gugger	a135f59536	Auto modelcard (#11599 ) * Autogenerate model cards from the Trainer * ModelCard deprecated * Fix test * Style * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Address review comments * Quality * With all metadata * Metadata * Post-merge conflict mess * Data args and all examples * Default license and languages when possible Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-05-11 11:30:34 -04:00
Jonathan Chang	64232bc0df	Add --text_column to run_summarization_no_trainer (#11673 )	2021-05-11 07:58:38 -04:00
Tommy Chiang	7e406f4a65	[Examples] Fix invalid links after reorg (#11650 )	2021-05-10 11:16:48 +05:30
Tommy Chiang	f2ffcaf49f	[Examples] Check key exists in datasets first (#11503 )	2021-05-09 15:42:38 -04:00
Stas Bekman	ba0d50f214	[examples] fix sys.path in conftest.py (#11636 ) * restore conftest.py * fix conftest and make copies * remove unneeded parts * remove unwanted files	2021-05-07 14:44:22 -07:00
Jonathan Chang	6f40e31766	Fix comment in run_clm_no_trainer.py (#11624 )	2021-05-07 12:32:30 +05:30
Vipul Raheja	f594090a93	fix typo in command (#11605 )	2021-05-06 12:32:54 +05:30
Patrick von Platen	3e3e41ae20	Pytorch - Lazy initialization of models (#11471 ) * lazy_init_weights * remove ipdb * save int * add necessary code * remove unnecessary utils * Update src/transformers/models/t5/modeling_t5.py * clean * add tests * correct * finish tests * finish tests * fix some more tests * fix xlnet & transfo-xl * fix more tests * make sure tests are independent * fix tests more * finist tests * final touches * Update src/transformers/modeling_utils.py * Apply suggestions from code review * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * clean tests * give arg positive name * add more mock weights to xlnet Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-05-05 17:22:20 +02:00
Sylvain Gugger	6b241e0e3b	Reproducible checkpoint (#11582 ) * Set generator in dataloader * Use generator in all random samplers * Checkpoint all RNG states * Final version * Quality * Test * Address review comments * Quality * Remove debug util * Add python and numpy RNGs * Split states in different files in distributed * Quality * local_rank for TPUs * Only use generator when accepted * Add test * Set seed to avoid flakiness * Make test less flaky * Quality	2021-05-04 16:20:56 -04:00
Sylvain Gugger	87dd1a00ef	Fix metric computation in `run_glue_no_trainer` (#11569 )	2021-05-03 11:42:55 -04:00
Bhadresh Savani	84326a28f8	[Examples] Added support for test-file in QA examples with no trainer (#11510 ) * added support for test-file * fixed typo * added suggested changes * reformatted code * modifed files * fix post processing error * Trigger CI * removed extra lines	2021-04-30 09:02:50 -04:00
Suraj Patil	57c8e822f7	reszie token embeds (#11524 )	2021-04-30 08:47:01 -04:00
Sylvain Gugger	b29eb247d3	Split checkpoint from model_name_or_path in examples (#11492 ) * Split checkpoint from model_name_or_path in examples * Address review comments * Address review comments	2021-04-29 18:33:47 -04:00
Bhadresh Savani	1d30ec95c7	[Examples] Fixes inconsistency around eval vs val and predict vs test (#11380 ) * added changes for uniformity * modified files * corrected typo * fixed qa scripts * fix typos * fixed predict typo in qa no trainer * fixed test file * reverted trainer changes * reverted trainer changes in custom exmaples * updated readme * added changes in deepspeed test * added changes for predict and eval	2021-04-26 09:24:31 -07:00
Amine Abdaoui	e3e70f9551	docs(examples): fix link to TPU launcher script (#11427 )	2021-04-26 09:08:43 -04:00
Sylvain Gugger	1ef152eb48	Default to accuracy metric (#11405 )	2021-04-23 14:49:59 -04:00
Sylvain Gugger	bf2e0cf70b	Trainer push to hub (#11328 ) * Initial support for upload to hub * push -> upload * Fixes + examples * Fix torchhub test * Torchhub test I hate you * push_model_to_hub -> push_to_hub * Apply mixin to other pretrained models * Remove ABC inheritance * Add tests * Typo * Run tests * Install git-lfs * Change approach * Add push_to_hub to all * Staging test suite * Typo * Maybe like this? * More deps * Cache * Adapt name * Quality * MOAR tests * Put it in testing_utils * Docs + torchhub last hope * Styling * Wrong method * Typos * Update src/transformers/file_utils.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * Address review comments * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-04-23 09:17:37 -04:00
Yoshitomo Matsubara	c3d6f33918	fixed typos (#11391 )	2021-04-23 07:48:42 -04:00
Max Del	a90d3f1862	Fix typo in text (#11396 )	2021-04-23 07:37:19 -04:00
johnson7788	5b5e4ca366	[run_translation.py] fix typo (#11372 ) fix typo Co-authored-by: johnson <johnson@github.com>	2021-04-22 17:47:11 +05:30
Sylvain Gugger	dabeb15292	Examples reorg (#11350 ) * Base move * Examples reorganization * Update references * Put back test data * Move conftest * More fixes * Move test data to test fixtures * Update path * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments and clean Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-21 11:11:20 -04:00

500 Commits