transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 01:02:25 +06:00

Author	SHA1	Message	Date
Julien Chaumond	dd9d483d03	Trainer (#3800) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00
Andrey Kulagin	b1ff0b2ae7	Fix bug in examples: double wrap into DataParallel during eval	2020-04-20 19:37:44 -04:00
Ethan Perez	e52d1258e0	Fix RoBERTa/XLNet Pad Token in run_multiple_choice.py (#3631) * Fix RoBERTa/XLNet Pad Token in run_multiple_choice.py `convert_examples_to_features` sets `pad_token=0` by default, which is correct for BERT but incorrect for RoBERTa (`pad_token=1`) and XLNet (`pad_token=5`). I think the other arguments to `convert_examples_to_features` are correct, but it might be helpful if someone checked who is more familiar with this part of the codebase. * Simplifying change to match recent commits	2020-04-06 16:52:22 -04:00
Julien Chaumond	50e15c825c	Tokenizers: Start cleaning examples a little (#3455) * Start cleaning examples * Fixup	2020-04-01 07:13:40 -04:00
Victor SANH	6b1ff25084	fix n_gpu count when no_cuda flag is activated (#3077) * fix n_gpu count when no_cuda flag is activated * someone was left behind	2020-03-02 10:20:21 -05:00
Lysandre	335dd5e68a	Default save steps 50 to 500 in all scripts	2020-01-28 09:42:11 -05:00
alberduris	81d6841b4b	GPU text generation: Moved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	81422c4e6d	Remove unused variables in examples.	2019-12-23 22:29:02 +01:00
Aymeric Augustin	d6eaf4e6d2	Update comments mentioning Python 2.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	c824d15aa1	Remove __future__ imports.	2019-12-22 17:47:54 +01:00
Aymeric Augustin	fa2ccbc081	Fix E266 flake8 warning (x90).	2019-12-22 10:59:08 +01:00
Aymeric Augustin	631be27078	Fix E722 flake8 warnings (x26).	2019-12-22 10:59:07 +01:00
Aymeric Augustin	357db7098c	Fix E712 flake8 warning (x1).	2019-12-22 10:59:07 +01:00
Aymeric Augustin	158e82e061	Sort imports with isort. This is the result of: $ isort --recursive examples templates transformers utils hubconf.py setup.py	2019-12-22 10:57:46 +01:00
Aymeric Augustin	fa84ae26d6	Reformat source code with black. This is the result of: $ black --line-length 119 examples templates transformers utils hubconf.py setup.py There's a lot of fairly long lines in the project. As a consequence, I'm picking the longest widely accepted line length, 119 characters. This is also Thomas' preference, because it allows for explicit variable names, to make the code easier to understand.	2019-12-21 17:52:29 +01:00
VictorSanh	48cbf267c9	Use full dataset for eval (SequentialSampler in Distributed setting)	2019-12-03 11:01:37 -05:00
Thomas Wolf	9629e2c676	Merge pull request #1804 from ronakice/master fix multi-gpu eval in torch examples	2019-11-14 22:24:05 +01:00
Rémi Louf	2276bf69b7	update the examples, docs and template	2019-11-14 20:38:02 +01:00
ronakice	2e31176557	fix multi-gpu eval	2019-11-12 05:55:11 -05:00
thomwolf	89d6272898	Fix #1623	2019-11-04 16:21:12 +01:00
Thomas Wolf	6596e3d566	Merge pull request #1454 from bkkaggle/pytorch-built-in-tensorboard Change tensorboard imports to use built-in tensorboard if available	2019-10-10 11:56:55 +02:00
Lysandre Debut	e84470ef81	Merge pull request #1384 from huggingface/encoding-qol Quality of life enhancements in encoding + patch MLM masking	2019-10-09 11:18:24 -04:00
Bilal Khan	5ce8d29abe	Change tensorboard imports to use built-in tensorboard if available	2019-10-08 16:29:43 -05:00
Julien Chaumond	9e136ff57c	Honor args.overwrite_cache (h/t @erenup)	2019-10-04 15:00:56 -04:00
Brian Ma	2195c0d5f9	Evaluation result.txt path changing #1286	2019-10-03 12:49:12 +08:00
Julien Chaumond	f5bcde0b2f	[multiple-choice] Simplify and use tokenizer.encode_plus	2019-09-30 16:04:55 -04:00
thomwolf	31c23bd5ee	[BIG] pytorch-transformers => transformers	2019-09-26 10:15:53 +02:00
erenup	8960988f35	fixed to find best dev acc	2019-09-19 01:10:05 +08:00
erenup	46ffc28329	Merge branch 'master' into run_multiple_choice_merge	2019-09-18 21:43:46 +08:00
erenup	15143fbad6	move run_multiple_choice.py and utils_multiple_choice.py to examples	2019-09-18 21:18:46 +08:00

31 Commits