transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-16 19:18:24 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	4c19f3baab	Clean Trainer tests and datasets dep (#8268)	2020-11-03 15:50:55 -05:00
François Lagunas	e174bfeb34	TensorBoard/Wandb/optuna/raytune integration improvements. (#7935) Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.	2020-10-21 17:18:52 +02:00
Julien Rossi	a09fe140c1	Trainer with Iterable Dataset (#7858) * fix 5990 * accomodate iterable dataset without predefined length * set it as 1 use case: provide max_steps, and NO num_epochs * Is a merge of master and PR 5995 * fix trainer test under TF * fix only for torch * TF trainer untouched * trainer tests are skipped when no torch * address comments * fix quality checks * remove torch.dataset from test_trainer * unnecessary inheritance * RegressionDataset implements all needed methods __len__ and __getitem__ * fix quality checks * restore RegressionDataset * was wrongly under is_torch_available()	2020-10-19 11:57:39 -04:00
Thomas Wolf	ba8c4d0ac0	[Dependencies|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659) * splitting fast and slow tokenizers [WIP] * [WIP] splitting sentencepiece and tokenizers dependencies * update dummy objects * add name_or_path to models and tokenizers * prefix added to file names * prefix * styling + quality * spliting all the tokenizer files - sorting sentencepiece based ones * update tokenizer version up to 0.9.0 * remove hard dependency on sentencepiece 🎉 * and removed hard dependency on tokenizers 🎉 * update conversion script * update missing models * fixing tests * move test_tokenization_fast to main tokenization tests - fix bugs * bump up tokenizers * fix bert_generation * update ad fix several tokenizers * keep sentencepiece in deps for now * fix funnel and deberta tests * fix fsmt * fix marian tests * fix layoutlm * fix squeezebert and gpt2 * fix T5 tokenization * fix xlnet tests * style * fix mbart * bump up tokenizers to 0.9.2 * fix model tests * fix tf models * fix seq2seq examples * fix tests without sentencepiece * fix slow => fast conversion without sentencepiece * update auto and bert generation tests * fix mbart tests * fix auto and common test without tokenizers * fix tests without tokenizers * clean up tests lighten up when tokenizers + sentencepiece are both off * style quality and tests fixing * add sentencepiece to doc/examples reqs * leave sentencepiece on for now * style quality split hebert and fix pegasus * WIP Herbert fast * add sample_text_no_unicode and fix hebert tokenization * skip FSMT example test for now * fix style * fix fsmt in example tests * update following Lysandre and Sylvain's comments * Update src/transformers/testing_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/testing_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/tokenization_utils_base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/tokenization_utils_base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-10-18 20:51:24 +02:00
Sylvain Gugger	a1d1b332d0	Add predict step accumulation (#7767) * Add eval_accumulation_step and clean distributed eval * Add TPU test * Add TPU stuff * Fix arg name * Fix Seq2SeqTrainer * Fix total_size * Update src/transformers/trainer_pt_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Doc and add test to TPU * Add unit test * Adapt name Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-10-14 11:41:45 -04:00
Sylvain Gugger	7968051aba	Fix typo	2020-10-13 17:30:46 -04:00
Sylvain Gugger	c6e18de9f8	Fix flaky test in test_trainer (#7689)	2020-10-09 20:01:15 -04:00
Sylvain Gugger	d3adb985d1	Expand test to locate flakiness (#7580)	2020-10-05 09:45:47 -04:00
Sylvain Gugger	29baa8fabe	Clean the Trainer state (#7490) * Trainer should not modify its TrainingArguments * Trainer should not modify its TrainingArguments * Trainer should not modify its TrainingArguments * Add test of resumed training * Fixes * Non multiGPU test * Clean Trainer state * Add more to the state * Documentation * One last test * Make resume training test more complete * Unwanted changes	2020-10-01 13:07:04 -04:00
Sylvain Gugger	8546dc55c2	Fix Trainer tests in a multiGPU env (#7458)	2020-09-29 14:06:41 -04:00
Sylvain Gugger	52e8392b7e	Add automatic best model loading to Trainer (#7431) * Add automatic best model loading to Trainer * Some small fixes * Formatting	2020-09-29 10:41:18 -04:00
Marcin Zabłocki	4083a55ab0	Flos fix (#7384)	2020-09-28 04:09:26 -04:00
Sylvain Gugger	1ee2194fb6	Mark big downloads slow (#7325) * Make big downloads as slow * Add import * Right order for slow decorator * More slow tests	2020-09-22 12:21:52 -04:00
Sylvain Gugger	492bb6aa48	Trainer multi label (#7191) * Trainer accep multiple labels * Missing import * Fix dosctrings	2020-09-17 08:15:37 -04:00
Yih-Dar	4c62c6021a	fix ZeroDivisionError and epoch counting (#7125) * fix ZeroDivisionError and epoch counting * Add test for num_train_epochs calculation in trainer.py * Remove @require_non_multigpu for test_num_train_epochs_in_training	2020-09-15 11:51:50 -04:00
Sylvain Gugger	7186ca6240	Multi predictions trainer (#7126) * Allow multiple outputs * Formatting * Move the unwrapping before metrics * Fix typo * Add test for non-supported config options	2020-09-15 10:27:24 -04:00
Sylvain Gugger	2bf70e2150	Fix reproducible tests in Trainer (#7119) * Fix reproducible tests in Trainer * Deal with multiple GPUs	2020-09-15 03:32:44 -04:00
Lysandre Debut	bb3106f741	Temporarily skip failing tests due to dependency change (#7118) * Temporarily skip failing tests due to dependency change * Remove trace	2020-09-14 07:42:13 -04:00
Stas Bekman	8fcbe486e1	these tests require non-multigpu env (#7059) * these tests require non-multigpu env * cleanup * clarify	2020-09-10 18:52:55 -04:00
Sylvain Gugger	514486739c	Fix CI with change of name of nlp (#7054) * nlp -> datasets * More nlp -> datasets * Woopsie * More nlp -> datasets * One last	2020-09-10 14:51:08 -04:00
Stas Bekman	e6b811f0a7	[testing] replace hardcoded paths to allow running tests from anywhere (#6523) * [testing] replace hardcoded paths to allow running tests from anywhere * fix the merge conflict	2020-08-27 12:22:18 -04:00
Lysandre	a75c64d80c	Black 20 release	2020-08-26 17:20:22 +02:00
Sylvain Gugger	abc0202194	More tests to Trainer (#6699) * More tests to Trainer * Add warning in the doc	2020-08-25 07:07:36 -04:00
Sylvain Gugger	573bdb0a5d	Add tests to Trainer (#6605) * Add tests to Trainer * Test if removing long breaks everything * Remove ugly hack * Fix distributed test * Use float for number of epochs	2020-08-20 11:13:50 -04:00
Pradhy729	290b6e18ac	Trainer support for iterabledataset (#5834) * Don't pass sampler for iterable dataset * Added check for test and eval dataloaders. * Formatting * Don't pass sampler for iterable dataset * Added check for test and eval dataloaders. * Formatting * Cleaner if nesting. * Added test for trainer and iterable dataset * Formatting for test * Fixed import when torch is available only. * Added require torch decorator to helper class * Moved dataset class inside unittest * Removed nested if and changed model in test * Checking torch availability for IterableDataset	2020-07-20 09:07:37 -04:00
Shashank Gupta	3dcb748e31	Added data collator for permutation (XLNet) language modeling and related calls (#5522) * Added data collator for XLNet language modeling and related calls Added DataCollatorForXLNetLanguageModeling in data/data_collator.py to generate necessary inputs for language modeling training with XLNetLMHeadModel. Also added related arguments, logic and calls in examples/language-modeling/run_language_modeling.py. Resolves: #4739, #2008 (partially) * Changed name to `DataCollatorForPermutationLanguageModeling` Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModelling`. Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use. CTRL uses a CLM loss just like GPT and GPT-2, so should work out of the box with this script (provided `past` is taken care of similar to `mems` for XLNet). Changed calls and imports appropriately. * Added detailed comments, changed variable names Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain working. Also cleaned up variable names and made them more informative. * Added tests for new data collator Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences. * Fixed styling issues	2020-07-07 10:17:37 +02:00
Joe Davison	35befd9ce3	Fix tensor label type inference in default collator (#5250) * allow tensor label inputs to default collator * replace try/except with type check	2020-07-01 10:40:14 -06:00
Sam Shleifer	13deb95a40	Move tests/utils.py -> transformers/testing_utils.py (#5350)	2020-07-01 10:31:17 -04:00
Sylvain Gugger	5f721ad6e4	Fix #5114 (#5122)	2020-06-18 19:20:04 -04:00
Sylvain Gugger	20fa828984	Make default_data_collator more flexible and deprecate old behavior (#5060) * Make default_data_collator more flexible * Accept tensors for all features * Document code * Refactor * Formatting	2020-06-17 15:24:51 -04:00
Sylvain Gugger	1affde2f10	Make DataCollator a callable (#5015) * Make DataCollator a callable * Update src/transformers/data/data_collator.py Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-15 11:58:33 -04:00
Sylvain Gugger	4dd5cf2207	Fix argument label (#4792) * Fix argument label * Fix test	2020-06-05 15:20:29 -04:00
Zhangyx	49296533ca	Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463) * Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website. * Use Split enum + always output the label name Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-21 09:17:44 -04:00
Julien Chaumond	241759101e	(v2) Improvements to the wandb integration (#4324) * Improvements to the wandb integration * small reorg + no global necessary * feat(trainer): log epoch and final metrics * Simplify logging a bit * Fixup * Fix crash when just running eval Co-authored-by: Chris Van Pelt <vanpelt@gmail.com> Co-authored-by: Boris Dayma <boris.dayma@gmail.com>	2020-05-12 21:52:01 -04:00
Julien Chaumond	0ae96ff8a7	BIG Reorganize examples (#4213) * Created using Colaboratory * [examples] reorganize files * remove run_tpu_glue.py as superseded by TPU support in Trainer * Bugfix: int, not tuple * move files around	2020-05-07 13:48:44 -04:00
Julien Chaumond	dd9d483d03	Trainer (#3800) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00

36 Commits