transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 10:12:23 +06:00

Author	SHA1	Message	Date
Manuel Romero	58c789e3d2	Update README.md (#11489 ) Add link to code	2021-04-30 04:29:59 -04:00
Sylvain Gugger	b29eb247d3	Split checkpoint from model_name_or_path in examples (#11492 ) * Split checkpoint from model_name_or_path in examples * Address review comments * Address review comments	2021-04-29 18:33:47 -04:00
Jaimeen Ahn	0661abc545	Variable Correction for Consistency in Distillation Example (#11444 ) As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively, the correction makes the example work	2021-04-26 13:30:48 -04:00
Bhadresh Savani	1d30ec95c7	[Examples] Fixes inconsistency around eval vs val and predict vs test (#11380 ) * added changes for uniformity * modified files * corrected typo * fixed qa scripts * fix typos * fixed predict typo in qa no trainer * fixed test file * reverted trainer changes * reverted trainer changes in custom exmaples * updated readme * added changes in deepspeed test * added changes for predict and eval	2021-04-26 09:24:31 -07:00
Amine Abdaoui	e3e70f9551	docs(examples): fix link to TPU launcher script (#11427 )	2021-04-26 09:08:43 -04:00
Patrick von Platen	32dbb2d954	make style (#11442 )	2021-04-26 13:50:34 +02:00
Sylvain Gugger	1ef152eb48	Default to accuracy metric (#11405 )	2021-04-23 14:49:59 -04:00
Sylvain Gugger	bf2e0cf70b	Trainer push to hub (#11328 ) * Initial support for upload to hub * push -> upload * Fixes + examples * Fix torchhub test * Torchhub test I hate you * push_model_to_hub -> push_to_hub * Apply mixin to other pretrained models * Remove ABC inheritance * Add tests * Typo * Run tests * Install git-lfs * Change approach * Add push_to_hub to all * Staging test suite * Typo * Maybe like this? * More deps * Cache * Adapt name * Quality * MOAR tests * Put it in testing_utils * Docs + torchhub last hope * Styling * Wrong method * Typos * Update src/transformers/file_utils.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * Address review comments * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-04-23 09:17:37 -04:00
Yoshitomo Matsubara	c3d6f33918	fixed typos (#11391 )	2021-04-23 07:48:42 -04:00
Max Del	a90d3f1862	Fix typo in text (#11396 )	2021-04-23 07:37:19 -04:00
Patrick von Platen	b48cf7124c	correct typo (#11393 )	2021-04-23 11:34:59 +02:00
Matt	2617396094	Correctly cast num_train_epochs to int (#11379 )	2021-04-22 13:49:59 +01:00
johnson7788	5b5e4ca366	[run_translation.py] fix typo (#11372 ) fix typo Co-authored-by: johnson <johnson@github.com>	2021-04-22 17:47:11 +05:30
Matt	6fe79e57d7	Move old TF text classification script to legacy (#11361 ) And update README to explain the work-in-progress!	2021-04-21 17:36:18 +01:00
Matt	ac588594e2	Merge new TF example script (#11360 ) First of the new and more idiomatic TF examples!	2021-04-21 17:04:55 +01:00
Sylvain Gugger	dabeb15292	Examples reorg (#11350 ) * Base move * Examples reorganization * Update references * Put back test data * Move conftest * More fixes * Move test data to test fixtures * Update path * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments and clean Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-21 11:11:20 -04:00
Sylvain Gugger	f1b938fda8	Update to use datasets remove_cloumns method (#11343 ) * Update to use datasets remove_cloumns method * Quality	2021-04-20 14:12:01 -04:00
rajvi-k	bfd83c17a7	Added translation example script (#11196 ) * initial changes * modified evaluation * updated evaluation * updated evaluation on text translation example script * added translation example script * Formatted translation example script * Reformatted translation example * Fixed evaluation bug and added support for other tokenisers * Fixed evaluation bug and added support for other tokenisers * Added translation example script * Formatted summarization example script * Removed typos from summarization example script	2021-04-20 07:18:47 -04:00
Sudharsan S T	f25444cb22	Close open files to suppress ResourceWarning (#11240 ) Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>	2021-04-14 10:31:04 -04:00
Nithin Holla	653076ca30	Save the Wav2Vec2 processor before training starts (#10910 ) Co-authored-by: nithin19 <nithin@amberscript.com>	2021-04-14 14:52:06 +03:00
Philipp Schmid	9fa2995993	added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220 )	2021-04-13 18:35:18 +02:00
Takuya Makino	cb251ba619	Fix typo (#11188 )	2021-04-12 17:35:32 -04:00
Masatoshi TSUCHIYA	ef102c4886	model_path should be ignored as the checkpoint path (#11157 ) * model_path is refered as the path of the trainer, and should be ignored as the checkpoint path. * Improved according to Sgugger's comment.	2021-04-12 09:06:41 -04:00
Stas Bekman	07f0bb691d	[examples run_clm] fix _LazyModule hasher error (#11168 ) * fix _LazyModule hasher error * reword	2021-04-09 11:39:12 -07:00
Suraj Patil	c161dd56df	[examples/translation] support mBART-50 and M2M100 fine-tuning (#11170 ) * keep a list of multilingual tokenizers * add forced_bos_token argument	2021-04-09 23:58:42 +05:30
Saviour Owolabi	6060746570	Update README.md (#11161 ) Corrected a typo ('Downlowd' to 'Download')	2021-04-09 11:52:21 -04:00
Stas Bekman	66446909b2	[tests] relocate core integration tests (#11146 ) * relocate core integration tests * add sys.path context manager * cleanup * try * try2 * fix path * doc * style * add dep * add 2 more deps	2021-04-08 13:13:17 -07:00
Andrea Cappelli	6c40e49712	Run mlm pad to multiple for fp16 (#11128 ) * Add mlm collator pad to multiple option (#10627) * Use padding to 8x in run mlm (#10627)	2021-04-08 16:12:49 -04:00
Stas Bekman	c6d664849b	[DeepSpeed] ZeRO Stage 3 (#10753 ) * synced gpus * fix * fix * need to use t5-small for quality tests * notes * complete merge * fix a disappearing std stream problem * start zero3 tests * wip * tune params * sorting out the pre-trained model loading * reworking generate loop wip * wip * style * fix tests * split the tests * refactor tests * wip * parameterized * fix * workout the resume from non-ds checkpoint pass + test * cleanup * remove no longer needed code * split getter/setter functions * complete the docs * suggestions * gpus and their compute capabilities link * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * style * remove invalid paramgd * automatically configure zero3 params that rely on hidden size * make _get_resized_embeddings zero3-aware * add test exercising resize_token_embeddings() * add docstring Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-08 09:53:01 -07:00
Stas Bekman	acc851e1ff	[run_clm] clarify why we get the tokenizer warning on long input (#11145 ) * clarify why we get the warning here * Update examples/language-modeling/run_clm.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * wording * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-08 09:46:28 -07:00
Stas Bekman	424419f549	[examples] fix white space (#11099 ) these get concatenated without whitespace, so fix it	2021-04-07 09:20:58 -04:00
Stas Bekman	c9035e4537	fix: The 'warn' method is deprecated (#11105 ) * The 'warn' method is deprecated * fix test	2021-04-07 09:20:06 -04:00
Sylvain Gugger	fd338abdeb	Style	2021-04-06 19:54:13 -04:00
SHYAM SUNDER KUMAR	aef4cf8c52	accelerate question answering examples with no trainer (#11091 ) * accelerate question answering examples with no trainer * removed train and eval flags also fixed fill np array function * Update examples/question-answering/run_qa_beam_search_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/question-answering/run_qa_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-06 19:35:21 -04:00
Lysandre	9853c5dd58	Development on v4.6.0dev0	2021-04-06 12:53:25 -04:00
Lysandre	4906a29f7f	Release v4.5.0	2021-04-06 12:37:47 -04:00
Hemil Desai	6ab7d1a429	Add Readme for language modeling scripts with accelerate (#11073 )	2021-04-05 20:56:12 -04:00
Hemil Desai	b51b87c41d	Add `examples/language_modeling/run_clm_no_trainer.py` (#11026 ) * Initial draft for clm no trainer * Remove unwanted args * Fix bug * Update examples/language-modeling/run_clm_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-05 12:27:52 -04:00
Stas Bekman	3d39226a51	s|Pretrained|PreTrained| (#11048 )	2021-04-04 18:08:42 -07:00
versis	335c0ca35c	fixed typo: logging instead of logger (#11025 )	2021-04-02 09:22:22 -04:00
Hemil Desai	838f83d84c	Add `examples/language_modeling/run_mlm_no_trainer.py` (#11001 ) * Add initial script for finetuning MLM models with accelerate * Add evaluation metric calculation * Fix bugs * Use no_grad on evaluation * update script docstring * Update examples/language-modeling/run_mlm_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * PR feedback * Fix CI failure * Update examples/language-modeling/run_mlm_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-31 18:49:45 -04:00
Sylvain Gugger	acc3bd9d2a	Enforce string-formatting with f-strings (#10980 ) * First third * Styling and fix mistake * Quality * All the rest * Treat %s and %d * typo * Missing ) * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-31 10:00:27 -04:00
WybeKoper	645f45c462	Fixed some typos and removed legacy url (#10989 ) * Fixed typos * Removed legacy colab notebook from readme Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>	2021-03-31 16:53:15 +05:30
Yih-Dar	e031162a6b	fix md file to avoid evaluation crash (#10962 )	2021-03-30 21:26:22 +03:00
Philipp Schmid	3e09d813aa	[examples/s2s] added py7zr dep (#10971 ) * added py7zr * comment out check_min for sagemaker test * added min version again	2021-03-30 23:17:12 +05:30
Stas Bekman	05c966f24b	[vulnerability] dep fix (#10954 ) Fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pygments/open @LysandreJik	2021-03-29 17:25:47 -04:00
Daniel Stancl	5057213bcc	Add `examples/multiple-choice/run_swag_no_trainer.py` (#10934 ) * Initial commit * Another bunch of updates * make style quliaty + delete debug arg from bash script * Use compue_metrics func * Do a few fixes * Add copyright * Fix typos	2021-03-29 16:41:09 -04:00
Sylvain Gugger	4002f95eb6	Remove duplicate code	2021-03-29 15:27:12 -04:00
Daniel Stancl	d7b50ce469	Add `examples/run_ner_no_trainer.py` (#10902 ) * Add NER example with accelerate library * This commit contains the first (yet really unfinished) version of a script for showing how to train HuggingFace model with their new accelerate library. * Fix metric calculation * make style quality * mv ner_no_trainer to token-classification dir * Delete --debug flag from running script * hf_datasets -> raw_datasets * Make a few slight adjustments * Add an informative comment + rewrite a help comment * Change header * Fix a few things * Enforce to use fast tokenizers only * DataCollatorWithPadding -> DataCollatorForTokenClassification * Change bash script: python3 -> accelerate launch * make style * Add a few missing things (see below) * Add a max-lenghth padding to predictions and labels to enable accelerate gather functionality * Add PyTorch no trainer example to the example README.md * Remove --do-train from args as being redundant for now * DataCollatorWithPadding -> DataCollatorForTokenClassification * Remove some obsolete args.do_train conditions from the script * Delete --do_train from bash running script * Delete use_slow_tokenizer from args * Add unintentionally removed flag --label_all_tokens * Delete --debug flag from running script	2021-03-29 15:11:23 -04:00
WybeKoper	ddea8771c6	Updated colab links in readme of examples (#10932 ) Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>	2021-03-29 08:47:09 -04:00

1557 Commits