When supplied by Keras deserialization, the config parameter to initializers
will be a dict. So intercept it and convert to PretrainedConfig object (and
store in instance attribute for get_config to get at it) before passing to the
actual initializer. To accomplish this, and repeat as little code as possible,
use a class decorator on TF*MainLayer classes.
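A minimal sketch of what such a decorator can look like (the `config_class` and `_config` attribute names follow transformers conventions; the real decorator may differ in details):

```python
def keras_serializable(cls):
    # Wrap the original __init__ so a dict coming from Keras
    # deserialization is converted back into a PretrainedConfig.
    original_init = cls.__init__

    def wrapped_init(self, config, *args, **kwargs):
        if isinstance(config, dict):
            config = cls.config_class.from_dict(config)
        # Store the config object so get_config() can re-serialize it.
        self._config = config
        original_init(self, config, *args, **kwargs)

    def get_config(self):
        config = super(cls, self).get_config()
        config["config"] = self._config.to_dict()
        return config

    cls.__init__ = wrapped_init
    cls.get_config = get_config
    return cls
```

Applied as `@keras_serializable` on each TF*MainLayer class, the conversion logic lives in one place instead of being repeated per model.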
* add first copy-paste test to TF 2 generate
* add tf top_k_top_p_filter fn
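A sketch of the technique behind that function, assuming 2-D `[batch, vocab]` logits (the function added in this PR may differ in edge-case handling):

```python
import tensorflow as tf

def top_k_top_p_filter(logits, top_k=0, top_p=1.0, filter_value=-float("inf")):
    if top_k > 0:
        # Mask every logit smaller than the k-th largest one.
        kth_largest = tf.math.top_k(logits, k=top_k)[0][..., -1, None]
        logits = tf.where(logits < kth_largest, filter_value, logits)
    if top_p < 1.0:
        sorted_indices = tf.argsort(logits, direction="DESCENDING")
        sorted_logits = tf.gather(logits, sorted_indices, batch_dims=1)
        cumulative_probs = tf.math.cumsum(
            tf.nn.softmax(sorted_logits, axis=-1), axis=-1
        )
        # Remove tokens once cumulative probability exceeds top_p, shifted
        # right by one so the first token above the threshold survives.
        remove_sorted = cumulative_probs > top_p
        remove_sorted = tf.concat(
            [tf.zeros_like(remove_sorted[:, :1]), remove_sorted[:, :-1]], axis=-1
        )
        # Map the mask back from sorted order to vocabulary order.
        inverse_perm = tf.argsort(sorted_indices, axis=-1)
        remove = tf.gather(remove_sorted, inverse_perm, batch_dims=1)
        logits = tf.where(remove, filter_value, logits)
    return logits
```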
* add generate function for TF
* implemented generate for all models except transfoXL
* make style
* change permissions of test file to the correct ones
* delete ipdb
* fix bug and finish simple gpt2 integration test
* clean test file
* make style
* change import style
* make style
* add decorators
* fix TF CTRL bug: dim => axis in TF
* make style
* refactored test file
* take out test_torch_tf_conversion if nothing is defined
* remove useless files
* fix merge conflicts
* delete ipdb
* exposed top_k_top_p_filtering fns
* delete weirdly created w! file
* add comment to test tf common modeling
* fix conflicts
* make style
* merge conflicts
* make style
* change tf.tensor.shape to shape_list(tensor)
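Context for that change: a tensor's static `.shape` can contain `None` dimensions inside `tf.function`/graph mode, while `tf.shape(tensor)` is purely dynamic. A minimal sketch of a `shape_list` utility that combines both, preferring static dimensions where known:

```python
import tensorflow as tf

def shape_list(tensor):
    # Static shape, with None for dimensions unknown at trace time.
    static = tensor.shape.as_list()
    # Dynamic shape, always defined, but only as a tensor.
    dynamic = tf.shape(tensor)
    # Use the static value when available, the dynamic one otherwise.
    return [dynamic[i] if dim is None else dim for i, dim in enumerate(static)]
```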
* Pipeline doc initial commit
* pipeline abstraction
* Remove modelcard argument from pipeline
* Task-specific pipelines can be instantiated with no model or tokenizer
* All pipelines doc
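For illustration, instantiating a task-specific pipeline without a model or tokenizer now falls back to a default pretrained checkpoint for that task:

```python
from transformers import pipeline

# No model or tokenizer given: a task-default pretrained model is loaded.
nlp = pipeline("sentiment-analysis")
print(nlp("Transformers pipelines are easy to use."))
```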
* Added support for Albert when fine-tuning for NER
* Added support for Albert in NER pipeline
* Added command-line options to examples/ner/run_ner.py to better control tokenization
* Added class AlbertForTokenClassification
* Changed output for NerPipeline to use .convert_ids_to_tokens(...) instead of .decode(...) to better reflect tokens
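The difference this makes, sketched (the exact word pieces depend on the tokenizer):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
ids = tokenizer.encode("Huggingface", add_special_tokens=False)

print(tokenizer.decode(ids))                 # merges pieces back, e.g. "Huggingface"
print(tokenizer.convert_ids_to_tokens(ids))  # one entry per id, e.g. ["Hugging", "##face"]
```

Keeping one token per predicted id is what lets NerPipeline align its per-token labels with the output.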
* Added ,
* Now passes style guide enforcement
* Changes from reviews.
* Code now passes style enforcement
* Added test for AlbertForTokenClassification
* Renamed file generated by tokenizers when calling save_pretrained to match Python.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Added save_vocabulary tests.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Remove python quick and dirty fix for clean Rust impl.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Bump tokenizers dependency to 0.5.1
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* TransfoXLTokenizerFast uses a json vocabulary file + warning about incompatibility between Python and Rust
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Added some save_pretrained / from_pretrained unittests.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Update tokenizers to 0.5.2
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Quality and format.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* flake8
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Making sure there is really a bug in unittest
* Fix TransfoXL constructor vocab_file / pretrained_vocab_file mixin.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Testing that encode_plus and batch_encode_plus behave the same way
Spoiler alert: they don't
* Testing rest of arguments in batch_encode_plus
* Test tensor return in batch_encode_plus
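The property those tests assert, roughly, in a sketch using the transformers 2.x API (where `pad_to_max_length` controlled padding):

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-cased")
sentences = ["First sentence.", "A second, slightly longer sentence."]

# Encoding one by one and as a batch should give the same input_ids.
one_by_one = [tokenizer.encode_plus(s)["input_ids"] for s in sentences]
batched = tokenizer.batch_encode_plus(sentences)["input_ids"]
assert one_by_one == batched

# With padding enabled, every sequence is padded up to max_length.
padded = tokenizer.encode_plus(sentences[0], max_length=16, pad_to_max_length=True)
assert len(padded["input_ids"]) == 16
```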
* Addressing Sam's comments
* flake8
* Simplified with `num_added_tokens`
* enable_padding should pad up to max_length if set.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Added more testing on padding.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* improving generation
* finalized special token behaviour for no_beam_search generation
* solved modeling_utils merge conflict
* solve merge conflicts in modeling_utils.py
* add run_generation improvements from PR #2749
* adapted language generation to not use hardcoded -1 if no padding token is available
* remove the -1 removal as hard-coded -1s are not necessary anymore
* add lightweight language generation testing for randomly initialized models - just checking whether no errors are thrown
* add slow language generation tests for pretrained models using hardcoded output with pytorch seed
* delete ipdb
* check that all generated tokens are valid
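i.e., roughly this kind of assertion (sketch; the helper name is illustrative):

```python
import torch

def all_generated_ids_valid(output_ids: torch.Tensor, vocab_size: int) -> bool:
    # Every generated id must be a valid index into the vocabulary.
    return bool(((output_ids >= 0) & (output_ids < vocab_size)).all())
```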
* renaming
* renaming Generation -> Generate
* make style
* updated so that generate_beam_search has the same token behavior as generate_no_beam_search
* consistent return format for run_generation.py
* deleted pretrain lm generate tests -> will be added in another PR
* cleaning of unused if statements and renaming
* run_generate will always return an iterable
* make style
* consistent renaming
* improve naming, make sure generate function always returns the same tensor, add docstring
* add slow tests for all lmhead models
* make style and improve example comments modeling_utils
* better naming and refactoring in modeling_utils
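Resulting usage, for illustration (the sampling parameters shown are the standard `generate` arguments; defaults may differ per model):

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

input_ids = tokenizer.encode("The quick brown fox", return_tensors="pt")
# generate() always returns a tensor of token ids, whatever sampling
# options are used.
output_ids = model.generate(
    input_ids, max_length=20, do_sample=True, top_k=50, top_p=0.95
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```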
* changed fast random lm generation testing design to a more general one
* delete old testing design in gpt2
* correct old variable name
* temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed
* adapted all fast random generate tests to new design
* better warning description in modeling_utils
* better comment
* better comment and error message
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
* Correctly return the tuple of generated file(s) when calling save_pretrained
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
* Quality and format.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>