transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-07 06:40:04 +06:00

Author	SHA1	Message	Date
Qbiwan	8dcfaea08d	Update run_xnli.py to use Datasets library (#9829 ) * remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric * fix * fix * fix * push * fix * everything works * fix init * fix * special treatment for sepconv1d * style * 🙏🏽 * add doc and cleanup * fix doc * fix doc again * fix doc again * Apply suggestions from code review * make style * Proposal that should work * Remove needless code * Fix test * Apply suggestions from code review * remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric * amend README * removed data_args.task_name and replaced with task_name = "xnli"; use split function to load train and validation dataset separately; remove __post_init__; remove flag --task_name from README. * removed dict task_to_keys, use str "xnli" instead of variable task_name, change preprocess_function to use examples["premise"], examples["hypothesis"] directly, remove sentence1_key and sentence2_key, change compute_metrics function to cater only to accuracy metric, add condition for train_langauge is None when using dataset.load_dataset() * removed `torch.distributed.barrier()` and `import torch` as `from_pretrained` is able to do the work; amend README	2021-02-11 10:27:23 +05:30
Sylvain Gugger	b01483faa0	Truncate max length if needed in all examples (#10034 )	2021-02-08 05:03:55 -05:00
Stas Bekman	8ea412a86f	[examples] make run scripts executable (#10037 ) * make executable * make executable * same for the template * cleanup	2021-02-05 15:51:18 -08:00
Patrick von Platen	538b3b4607	[Tokenizer Utils Base] Make pad function more flexible (#9928 ) * change tokenizer requirement * split line * Correct typo from list to str * improve style * make other function pretty as well * add comment * correct typo * add new test * pass tests for tok without padding token * Apply suggestions from code review	2021-02-02 10:35:27 +03:00
Sylvain Gugger	b4e559cfa1	Deprecate model_path in Trainer.train (#9854 )	2021-01-28 08:32:46 -05:00
Sylvain Gugger	f2fabedbab	Setup logging with a stdout handler (#9816 )	2021-01-27 03:39:11 -05:00
Yusuke Mori	059bb25817	Fix a bug in run_glue.py (#9812 ) (#9815 )	2021-01-26 14:32:19 -05:00
Andrea Cappelli	10e5f28212	Improve pytorch examples for fp16 (#9796 ) * Pad to 8x for fp16 multiple choice example (#9752) * Pad to 8x for fp16 squad trainer example (#9752) * Pad to 8x for fp16 ner example (#9752) * Pad to 8x for fp16 swag example (#9752) * Pad to 8x for fp16 qa beam search example (#9752) * Pad to 8x for fp16 qa example (#9752) * Pad to 8x for fp16 seq2seq example (#9752) * Pad to 8x for fp16 glue example (#9752) * Pad to 8x for fp16 new ner example (#9752) * update script template #9752 * Update examples/multiple-choice/run_swag.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/question-answering/run_qa.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/question-answering/run_qa_beam_search.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * improve code quality #9752 Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-01-26 04:47:07 -05:00
Sylvain Gugger	caf4abf768	Auto-resume training from checkpoint (#9776 ) * Auto-resume training from checkpoint * Update examples/text-classification/run_glue.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Roll out to other examples Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-01-25 12:03:51 -05:00
Stefan Schweter	08b22722c7	examples: fix XNLI url (#9741 )	2021-01-22 18:13:52 +05:30
Yusuke Mori	eabad8fd9c	Update run_glue for do_predict with local test data (#9442 ) (#9486 ) * Update run_glue for do_predict with local test data (#9442) * Update run_glue (#9442): fix comments ('files' to 'a file') * Update run_glue (#9442): reflect the code review * Update run_glue (#9442): auto format * Update run_glue (#9442): reflect the code review	2021-01-13 07:48:35 -05:00
Pavel Tarashkevich	27d0e01d75	Fix classification script: enable dynamic padding with truncation (#9554 ) Co-authored-by: Pavel Tarashkevich <Pavel.Tarashkievich@orange.com>	2021-01-13 07:46:48 -05:00
Sylvain Gugger	453a70d4cb	Allow example to use a revision and work with private models (#9407 ) * Allow example to use a revision and work with private models * Copy to other examples and template * Styling	2021-01-06 06:49:23 -05:00
Yusuke Mori	57a6626929	[examples/text-classification] Fix a bug for using one's own dataset of a regression task (#9411 )	2021-01-05 08:15:06 -05:00
Sylvain Gugger	ec07da65e2	Update the README of the text classification example (#9237 ) * Update the README of the text classification example * Update examples/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Adapt comment from review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-12-21 15:23:40 -05:00
Stas Bekman	6b850b671d	[run_glue] add speed metrics (#9198 ) * add speed metrics * suggestions	2020-12-18 17:09:30 -08:00
Sylvain Gugger	783d7d2629	Reorganize examples (#9010 ) * Reorganize example folder * Continue reorganization * Change requirements for tests * Final cleanup * Finish regroup with tests all passing * Copyright * Requirements and readme * Make a full link for the documentation * Address review comments * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add symlink * Reorg again * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Adapt title * Update to new strucutre * Remove test * Update READMEs Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-12-11 10:07:02 -05:00
Ethan Perez	8dfc8c7221	Don't pass in token_type_ids to BART for GLUE (#8929 ) Without this fix, training a `BARTForSequenceClassification` model with `run_pl_glue.py` gives `TypeError: forward() got an unexpected keyword argument 'token_type_ids'`, because BART does not have token_type_ids. I've solved this issue in the same way as it's solved for the "distilbert" model, and I can train BART models on SNLI without errors now.	2020-12-05 09:52:16 -05:00
Julien Chaumond	042a6aa777	Tokenizers: ability to load from model subfolder (#8586 ) * <small>tiny typo</small> * Tokenizers: ability to load from model subfolder * use subfolder for local files as well * Uniformize model shortcut name => model id * from s3 => from huggingface.co Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>	2020-11-17 08:58:45 -05:00
Julien Plu	27b3ff316a	Try to understand and apply Sylvain's comments (#8458 )	2020-11-12 13:43:00 -05:00
Sylvain Gugger	cdc48ce92d	Finalize lm examples (#8188 ) * Finish the cleanup of the language-modeling examples * Update main README * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Propagate changes Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-10-30 14:20:18 -04:00
Sean Naren	5e24982e58	Upgrade PyTorch Lightning to 1.0.2 (#7852 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-10-28 14:59:14 -04:00
Sylvain Gugger	47dfa65b0c	New run_clm script (#8105 ) * New run_clm script * Formatting * More comments * Remove unused imports * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address review comments * Change link to the hub Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-10-28 10:38:58 -04:00
Sylvain Gugger	2e5052d4f1	New run glue script (#7917 ) * Start simplification * More progress * Finished script * Address comments and update tests instructions * Wrong test * Accept files as inputs and fix test * Update src/transformers/trainer_utils.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Fix labels and add combined score * Add special labels * Update TPU command * Revert to old label strategy * Use model labels * Fix for STT-B * Styling * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Code styling * Fix review comments Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-10-22 11:42:22 -04:00
Sylvain Gugger	bb9559a7f9	Don't use `store_xxx` on optional bools (#7786 ) * Don't use `store_xxx` on optional bools * Refine test * Refine test	2020-10-14 12:05:02 -04:00
Julien Plu	d9ffb87efb	Fix tf text class (#7724 ) * Fix test * fix generic text classification * fix test * Fix tests	2020-10-12 08:45:15 -04:00
Julien Plu	9ad830596d	Fix dataset cardinality (#7678 ) * Fix test * Fix cardinality issue * Fix test	2020-10-09 10:38:25 -04:00
Julien Plu	585217c87f	Add generic text classification example in TF (#5716 ) * Add new example with nlp * Update README * replace nlp by datasets * Update examples/text-classification/README.md Add Lysandre's suggestion. Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-09-22 12:05:05 -04:00
Lysandre	aae4edb5f0	Addressing review comment	2020-09-21 11:37:00 +02:00
Suraj Patil	43b9d93875	[example/glue] fix compute_metrics_fn for bart like models (#7248 ) * fix compute_metrics_fn * p.predictions -> preds * apply suggestions	2020-09-21 05:34:20 -04:00
Stas Bekman	b0cbcdb05b	[logging] remove no longer needed verbosity override (#7100 )	2020-09-15 04:01:14 -04:00
Lysandre	1650130b0f	Remove misleading docstring	2020-09-07 14:16:59 +02:00
Lysandre	a75c64d80c	Black 20 release	2020-08-26 17:20:22 +02:00
Suraj Patil	6f972e1423	update xnli-mt url (#6580 )	2020-08-18 13:10:47 -04:00
Sam Shleifer	84c265ffcc	[lightning_base] fix s2s logging, only make train_loader once (#6404 )	2020-08-16 22:49:41 -04:00
Stas Bekman	0203d6517f	[pl] restore lr logging behavior for glue, ner examples (#6314 )	2020-08-11 16:27:11 -04:00
Stas Bekman	7c6a085ebf	pl version: examples/requirements.txt is single source of truth (#6309 )	2020-08-11 10:58:54 -04:00
Stas Bekman	f6c0680d36	add pl_glue example test (#6034 ) * add pl_glue example test * for now just test that it runs, next validate results of eval or predict? * complete the run_pl_glue test to validate the actual outcome * worked on my machine, CI gets less accuracy - trying higher epochs * match run_pl.sh hparms * more epochs? * trying higher lr * for now just test that the script runs to a completion * correct the comment * if cuda is available, add --fp16 --gpus=1 to cover more bases * style	2020-08-11 03:16:52 -04:00
Sam Shleifer	9a5ef83748	[s2s] fix --gpus clarg collision (#6358 )	2020-08-08 21:51:37 -04:00
Stas Bekman	6695450a23	[examples] consistently use --gpus, instead of --n_gpu (#6315 )	2020-08-07 10:36:32 -04:00
Stas Bekman	175cd45e13	fix the shuffle agrument usage and the default (#6307 )	2020-08-06 20:32:28 -04:00
Bhashithe Abeysinghe	ffceef2042	[Fix] text-classification PL example (#6027 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-06 15:46:43 -04:00
xujiaze13	eb2bd8d6eb	Remove redundant line in run_pl_glue.py (#6305 )	2020-08-06 15:43:45 -04:00
Julien Plu	54f9fbeff8	Rework TF trainer (#6038 ) * Fully rework training/prediction loops * fix method name * Fix variable name * Fix property name * Fix scope * Fix method name * Fix tuple index * Fix tuple index * Fix indentation * Fix variable name * fix eval before log * Add drop remainder for test dataset * Fix step number + fix logging datetime * fix eval loss value * use global step instead of step + fix logging at step 0 * Fix logging datetime * Fix global_step usage * Fix breaking loop + logging datetime * Fix step in prediction loop * Fix step breaking * Fix train/test loops * Force TF at least 2.2 for the trainer * Use assert_cardinality to facilitate the dataset size computation * Log steps per epoch * Make tfds compliant with TPU * Make tfds compliant with TPU * Use TF dataset enumerate instead of the Python one * revert previous commit * Fix data_dir * Apply style * rebase on master * Address Sylvain's comments * Address Sylvain's and Lysandre comments * Trigger CI * Remove unused import	2020-07-29 14:32:01 -04:00
Sylvain Gugger	734a28a767	Clean up diffs in Trainer/TFTrainer (#5417 ) * Cleanup and unify Trainer/TFTrainer * Forgot to adapt TFTrainingArgs * In tf scripts n_gpu -> n_replicas * Update src/transformers/training_args.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments * Formatting * Fix typo Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-07-01 11:00:20 -04:00
Sam Shleifer	45e26125de	save_pretrained: mkdir(exist_ok=True) (#5258 ) * all save_pretrained methods mkdir if not os.path.exists	2020-06-28 14:53:47 -04:00
Sylvain Gugger	5e85b324ec	Use the script in utils (#5224 )	2020-06-24 07:55:58 -04:00
Sam Shleifer	f5c2a122e3	Upgrade examples to pl=0.8.1(#5146 )	2020-06-22 20:40:10 -04:00
Jason Phang	492b352ab6	Remove unnecessary model_type arg in example (#4771 )	2020-06-04 13:41:24 -04:00
Jin Young Sohn	b231a413f5	Add cache_dir to save features in GLUE + Differentiate match/mismatch for MNLI metrics (#4621 ) * Glue task cleaup * Enable writing cache to cache_dir in case dataset lives in readOnly filesystem. * Differentiate match vs mismatch for MNLI metrics. * Style * Fix pytype * Fix type * Use cache_dir in mnli mismatch eval dataset * Small Tweaks Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-02 13:40:14 -04:00

57 Commits