transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 11:11:05 +06:00

Author	SHA1	Message	Date
Lysandre	b5d330d118	Fix #1784	2019-11-11 10:15:14 -05:00
eukaryote	90f6e73a35	Add DialoGPT support for Pytorch->TF	2019-11-09 16:46:19 +00:00
eukaryote	ef99852961	from_pretrained: convert DialoGPT format DialoGPT checkpoints have "lm_head.decoder.weight" instead of "lm_head.weight". (see: https://www.reddit.com/r/MachineLearning/comments/dt5woy/p_dialogpt_state_of_the_art_conversational_model/f6vmwuy?utm_source=share&utm_medium=web2x)	2019-11-09 16:32:40 +00:00
Adrian Bauer	7a9aae1044	Fix run_bertology.py Make imports and args.overwrite_cache match run_glue.py	2019-11-08 16:28:40 -05:00
Rémi Louf	cd286c2145	add condition around mask transformation	2019-11-08 11:31:16 +01:00
Rémi Louf	28d0ba35d7	only init encoder_attention_mask if stack is decoder We currently initialize `encoder_attention_mask` when it is `None`, whether the stack is that of an encoder or a decoder. Since this may lead to bugs that are difficult to tracks down, I added a condition that assesses whether the current stack is a decoder.	2019-11-08 11:22:19 +01:00
Diganta Misra	070dcf1c02	Added Mish Activation Function Mish is a new activation function proposed here - https://arxiv.org/abs/1908.08681 It has seen some recent success and has been adopted in SpaCy, Thic, TensorFlow Addons and FastAI-dev. All benchmarks recorded till now (including against ReLU, Swish and GELU) is present in the repository - https://github.com/digantamisra98/Mish Might be a good addition to experiment with especially in the Bert Model.	2019-11-07 03:45:43 +05:30
Julien Chaumond	1c542df7e5	Add RoBERTa-based GPT-2 Output Detector from OpenAI converted from https://github.com/openai/gpt-2-output-dataset/tree/master/detector Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-Authored-By: Jong Wook Kim <jongwook@nyu.edu> Co-Authored-By: Jeff Wu <wuthefwasthat@gmail.com>	2019-11-06 16:26:31 -05:00
Julien Chaumond	2f3a421018	Fix other PyTorch models	2019-11-06 14:03:47 -05:00
Julien Chaumond	d5319793c4	Fix BERT	2019-11-06 14:03:47 -05:00
Julien Chaumond	27e015bd54	[tests] Flag to test on cuda	2019-11-06 14:03:47 -05:00
Julien Chaumond	13d9135fa5	[tests] get rid of warning cf. https://docs.pytest.org/en/latest/example/simple.html	2019-11-06 14:03:47 -05:00
Julien Chaumond	f88c104d8f	[run_tf_glue] Add comment for context	2019-11-05 19:56:43 -05:00
Julien Chaumond	30968d70af	misc doc	2019-11-05 19:06:12 -05:00
Dom Hudson	de890ae67d	Updating docblocks in optimizers.py	2019-11-05 17:31:29 -05:00
Lysandre	d7d36181fd	GPT-2 XL	2019-11-05 13:31:58 -05:00
LysandreJik	151e4ab4e7	Fix CTRL past	2019-11-05 16:26:51 +00:00
Julien Chaumond	7daacf00df	Merge pull request #1695 from huggingface/models_inputs_embeds model forwards can take an inputs_embeds param	2019-11-05 09:55:28 -05:00
Clement	a44f112fb9	add authors for models	2019-11-05 08:48:26 -05:00
Thomas Wolf	e99071f105	Merge pull request #1734 from orena1/patch-1 add progress bar to convert_examples_to_features	2019-11-05 11:34:20 +01:00
Thomas Wolf	ba973342e3	Merge pull request #1553 from WilliamTambellini/timeSquadInference Add speed log to examples/run_squad.py	2019-11-05 11:13:12 +01:00
Thomas Wolf	237fad339c	Merge pull request #1709 from oneraghavan/master Fixing mode in evaluate during training	2019-11-05 10:55:33 +01:00
thomwolf	f1e4db2aa8	Fix #1686	2019-11-05 09:38:00 +01:00
Oren Amsalem	d7906165a3	add progress bar for convert_examples_to_features It takes considerate amount of time (~10 min) to parse the examples to features, it is good to have a progress-bar to track this	2019-11-05 10:34:27 +02:00
Thomas Wolf	d2e2577dd3	Merge pull request #1723 from huggingface/fix-1623 Fix #1623	2019-11-05 08:36:30 +01:00
Julien Chaumond	00337e9687	[inputs_embeds] All PyTorch models	2019-11-05 00:39:18 +00:00
Julien Chaumond	9eddf44b7a	docstring + check	2019-11-04 17:19:15 +00:00
Julien Chaumond	8e11de0e86	model forwards can take an inputs_embeds param	2019-11-04 16:56:26 +00:00
Lysandre	68f7064a3e	Add `model.train()` line to ReadMe training example Co-Authored-By: Santosh-Gupta <San.Gupta.ML@gmail.com>	2019-11-04 11:52:35 -05:00
thomwolf	8d6b9d717c	fix #1532 and encode_plus	2019-11-04 17:07:51 +01:00
Thomas Wolf	c8f2712199	Merge pull request #1721 from huggingface/common_attributes Add common getter and setter for input_embeddings & output_embeddings	2019-11-04 16:21:52 +01:00
thomwolf	89d6272898	Fix #1623	2019-11-04 16:21:12 +01:00
thomwolf	b340a910ed	fix tests - flagged as slow all the tests downloading from AWS	2019-11-04 16:03:36 +01:00
thomwolf	f02805da6f	fix tests	2019-11-04 15:42:23 +01:00
Thomas Wolf	1d4d070256	Merge pull request #1549 from hlums/master Fix token order in xlnet preprocessing for SQuAD	2019-11-04 15:37:15 +01:00
thomwolf	1724cee8c4	switch from properties to methods	2019-11-04 15:34:10 +01:00
thomwolf	9b45d0f878	Add common properties input_embeddings and output_embeddings	2019-11-04 12:28:56 +01:00
Thomas Wolf	9a3b173cd3	Merge branch 'master' into master	2019-11-04 11:41:26 +01:00
thomwolf	ad90868627	Update example readme	2019-11-04 11:27:22 +01:00
Raghavan	e5b1048bae	Fixing mode in evaluate during training	2019-11-03 16:14:46 +05:30
Thomas Wolf	8a62835577	Merge pull request #1679 from cregouby/master Fix https://github.com/huggingface/transformers/issues/1673	2019-11-01 22:02:24 +01:00
Julien Chaumond	93d2fff071	Close #1654	2019-11-01 09:47:38 -04:00
Lysandre	1a2b40cb53	run_tf_glue MRPC evaluation only for MRPC	2019-10-31 18:00:51 -04:00
Timothy Liu	be36cf92fb	Added mixed precision support to benchmarks.py	2019-10-31 17:24:37 -04:00
Julien Chaumond	2a5663c280	Merge branch 'mataney-fix_top_k_top_p_filtering'	2019-10-31 18:28:34 +00:00
Julien Chaumond	f96ce1c241	[run_generation] Fix generation with batch_size>1	2019-10-31 18:27:11 +00:00
Julien Chaumond	3c1b6f594e	Merge branch 'master' into fix_top_k_top_p_filtering	2019-10-31 13:53:51 -04:00
Sergey Mironov	0e4cc050d6	Add support for resumable downloads for HTTP protocol.	2019-10-31 18:25:34 +03:00
cregouby	ac29353abe	Fix https://github.com/huggingface/transformers/issues/1673	2019-10-31 10:04:40 +01:00
Victor SANH	fa735208c9	update readme - fix example command distil*	2019-10-30 14:27:28 -04:00

... 3 4 5 6 7 ...

2329 Commits