transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-27 08:18:58 +06:00

Author	SHA1	Message	Date
Patrick von Platen	fc38d4c86f	Improve special_token_id logic in run_generation.py and add tests (#2885 ) * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * improving generation * finalized special token behaviour for no_beam_search generation * solved modeling_utils merge conflict * solve merge conflicts in modeling_utils.py * add run_generation improvements from PR #2749 * adapted language generation to not use hardcoded -1 if no padding token is available * remove the -1 removal as hard coded -1`s are not necessary anymore * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown * add slow language generation tests for pretrained models using hardcoded output with pytorch seed * delete ipdb * check that all generated tokens are valid * renaming * renaming Generation -> Generate * make style * updated so that generate_beam_search has same token behavior than generate_no_beam_search * consistent return format for run_generation.py * deleted pretrain lm generate tests -> will be added in another PR * cleaning of unused if statements and renaming * run_generate will always return an iterable * make style * consistent renaming * improve naming, make sure generate function always returns the same tensor, add docstring * add slow tests for all lmhead models * make style and improve example comments modeling_utils * better naming and refactoring in modeling_utils * changed fast random lm generation testing design to more general one * delete in old testing design in gpt2 * correct old variable name * temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed * adapted all fast random generate tests to new design * better warning description in modeling_utils * better comment * better comment and error message Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-02-21 12:09:59 -05:00
Lysandre	d18d47be67	run_generation style	2020-01-31 12:05:48 -05:00
Lysandre	7365f01d43	do_sample should be set to True in run_generation.py	2020-01-31 11:49:32 -05:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	c824d15aa1	Remove __future__ imports.	2019-12-22 17:47:54 +01:00
Aymeric Augustin	2ab78325f0	Fix F821 flake8 warning (x47). Ignore warnings related to Python 2, because it's going away soon.	2019-12-22 10:59:07 +01:00
Aymeric Augustin	158e82e061	Sort imports with isort. This is the result of: $ isort --recursive examples templates transformers utils hubconf.py setup.py	2019-12-22 10:57:46 +01:00
Aymeric Augustin	fa84ae26d6	Reformat source code with black. This is the result of: $ black --line-length 119 examples templates transformers utils hubconf.py setup.py There's a lot of fairly long lines in the project. As a consequence, I'm picking the longest widely accepted line length, 119 characters. This is also Thomas' preference, because it allows for explicit variable names, to make the code easier to understand.	2019-12-21 17:52:29 +01:00
thomwolf	cfa0380515	Merge branch 'master' into generation_sampler	2019-12-21 14:12:52 +01:00
thomwolf	300ec3003c	fixing run_generation example - using torch.no_grad	2019-12-21 14:02:19 +01:00
thomwolf	1c37746892	fixing run_generation	2019-12-21 13:52:49 +01:00
thomwolf	3d2096f516	further cleanup	2019-12-18 11:50:54 +01:00
Lysandre	18a879f475	fix #2180	2019-12-16 16:44:29 -05:00
Rémi Louf	07bc8efbc3	add greedy decoding and sampling	2019-12-10 17:27:50 +01:00
Julien Chaumond	f96ce1c241	[run_generation] Fix generation with batch_size>1	2019-10-31 18:27:11 +00:00
Julien Chaumond	3c1b6f594e	Merge branch 'master' into fix_top_k_top_p_filtering	2019-10-31 13:53:51 -04:00
Thomas Wolf	36174696cc	Merge branch 'master' into clean-roberta	2019-10-30 16:51:06 +01:00
Julien Chaumond	ef1b8b2ae5	[CTRL] warn if generation prompt does not start with a control code see also https://github.com/salesforce/ctrl/pull/50	2019-10-22 21:30:32 +00:00
Lysandre	7d709e55ed	Remove	2019-10-22 14:12:33 -04:00
leo-du	ecd15667f3	fix repetition penalty	2019-10-17 14:47:14 -04:00
thomwolf	177a721205	move back to simple space spliting	2019-10-10 11:45:47 +02:00
thomwolf	a5997dd81a	better error messages	2019-10-10 11:31:01 +02:00
thomwolf	d9e60f4f0d	Merge branch 'master' into pr/1383	2019-10-09 17:25:08 +02:00
LysandreJik	f3e0218fbb	Correct device assignment in run_generation	2019-10-05 21:05:16 -04:00
keskarnitish	dbed1c5d94	Adding CTRL (squashed commit) adding conversion script adding first draft of modeling & tokenization adding placeholder for test files bunch of changes registering the tokenizer/model/etc tests change link; something is very VERY wrong here weird end-of-word thingy going on i think the tokenization works now ; wrote the unit tests overall structure works;load w next the monster is alive! works after some cleanup as well adding emacs autosave to gitignore currently only supporting the 48 layer one; seems to infer fine on my macbook cleanup fixing some documentation fixing some documentation tests passing? now works on CUDA also adding greedy? adding greedy sampling works well	2019-10-03 22:29:03 -07:00
Lysandre Debut	d3f24dfad7	Merge branch 'master' into master	2019-10-03 22:43:09 +00:00
LysandreJik	ecc4f1bdfa	XLM use_lang_embedding flag in run_generation	2019-10-03 17:42:16 -04:00
LysandreJik	c2c2ca0fdb	Added XLM to run_generation, with prompt language selection.	2019-10-03 17:18:48 -04:00
thomwolf	31c23bd5ee	[BIG] pytorch-transformers => transformers	2019-09-26 10:15:53 +02:00
mataney	a9f24a16bc	[FIX] fix run_generation.py to work with batch_size > 1	2019-09-25 15:53:29 +03:00
Lorenzo Ampil	4b543c3007	Add option to use a 'stop token' which will be used to truncate the output text to everything till right before the 'stop token'	2019-09-22 21:38:38 +08:00
thomwolf	e691fc0963	update QA models tests + run_generation	2019-07-15 17:45:24 +02:00
thomwolf	2397f958f9	updating examples and doc	2019-07-14 23:20:10 +02:00
thomwolf	7d4b200e40	good quality generation example for GPT, GPT-2, Transfo-XL, XLNet	2019-07-13 15:25:03 +02:00

35 Commits