transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 21:30:07 +06:00

Author	SHA1	Message	Date
thomwolf	a3274ac40b	adding attention outputs in bert	2019-06-03 16:11:45 -05:00
VictorSanh	826496580b	Revert "add output_attentions for BertModel" This reverts commit `de5e5682a1`.	2019-06-03 17:10:25 -04:00
VictorSanh	de5e5682a1	add output_attentions for BertModel	2019-06-03 17:05:24 -04:00
Thomas Wolf	e6cf62d499	Merge pull request #488 from dhpollack/fix_multichoice fixed BertForMultipleChoice model init and forward pass	2019-04-25 21:04:16 +02:00
Thomas Wolf	d76a57b0ba	Merge pull request #506 from ailzhang/hubconf Hubconf	2019-04-24 20:59:21 +02:00
thomwolf	80f995a141	revert BertForMultipleChoice linear classifier	2019-04-24 16:51:54 +02:00
Ailing Zhang	bfd6f6b257	fix from_pretrained positional args	2019-04-17 16:31:40 -07:00
Thomas Wolf	3d78e226e6	Merge pull request #489 from huggingface/tokenization_serialization Better serialization for Tokenizers and Configuration classes - Also fix #466	2019-04-16 08:49:54 +02:00
thomwolf	df5d9c3551	load all models on cpu	2019-04-15 15:43:01 +02:00
thomwolf	60ea6c59d2	added best practices for serialization in README and examples	2019-04-15 15:00:33 +02:00
thomwolf	9761aa4845	add to_json_file method to configuration classes	2019-04-15 14:12:08 +02:00
David Pollack	38ba7b439b	fixed BertForMultipleChoice model init and forward pass	2019-04-15 10:38:01 +02:00
Martin Boyanov	34cf67fd6c	Extend the BertForSequenceClassification docs to mention the special CLS token.	2019-04-12 21:30:28 +03:00
thomwolf	19666dcb3b	Should fix #438	2019-04-03 11:01:01 +02:00
Mike Arpaia	8b5c63e4de	Fixes to the TensorFlow conversion tool	2019-04-01 13:17:54 -06:00
Ikuya Yamada	0401317b23	Remove padding_idx from position_embeddings and token_type_embeddings	2019-03-26 21:56:35 +09:00
Thomas Wolf	9b03d67b83	Merge pull request #362 from Bharat123rox/patch-1 Make the hyperlink of NVIDIA Apex clickable	2019-03-11 09:08:51 +01:00
Bharat Raghunathan	f91ce0b803	Make the hyperlink of NVIDIA Apex clickable	2019-03-09 20:05:39 +05:30
Haozhe Ji	72fa8d03a7	add 'padding_idx=0' for BertEmbeddings	2019-03-07 20:02:55 +08:00
thomwolf	5c85fc3977	fix typo - logger info	2019-03-06 10:05:21 +01:00
Dan Hendrycks	434d15da8e	Update activation function docstring	2019-02-16 12:17:52 -08:00
thomwolf	5ee4f17234	adding option to load on cpu	2019-02-08 10:37:40 +01:00
thomwolf	edcb56fd96	more explicit variable name	2019-02-08 09:54:49 +01:00
Thomas Wolf	848aae49e1	Merge branch 'master' into python_2	2019-02-06 00:13:20 +01:00
thomwolf	448937c00d	python 2 compatibility	2019-02-06 00:07:46 +01:00
thomwolf	822915142b	fix docstring	2019-02-05 16:34:32 +01:00
Thibault Fevry	f3bda2352a	Only keep the active part of the loss for token classification	2019-02-04 11:46:36 -05:00
thomwolf	98c96fb1a7	splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests	2019-01-29 10:31:42 +01:00
thomwolf	5456d82311	more versatile model loading	2019-01-29 09:54:18 +01:00
thomwolf	bd3b3aee9c	update	2019-01-28 17:47:29 +01:00
thomwolf	b12616fd8e	updating code organization to fix imports	2019-01-28 17:03:39 +01:00
thomwolf	d77dd62ff8	directly load from TF checkpoints + code cleanup	2019-01-28 16:50:23 +01:00
thomwolf	e5c78c6684	update readme and few typos	2019-01-10 01:40:00 +01:00
thomwolf	ab90d4cddd	adding docs and example for OpenAI GPT	2019-01-09 00:12:43 +01:00
thomwolf	3cf12b235a	added tests + fixed losses	2019-01-08 16:24:23 +01:00
thomwolf	eed51c5bdf	add OpenAI GPT	2019-01-08 12:26:58 +01:00
thomwolf	793dcd236b	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT into fifth-release	2019-01-07 13:37:55 +01:00
thomwolf	93f563b8a8	adding OpenAI GPT	2019-01-07 12:55:36 +01:00
Thomas Wolf	e048c7f1c8	Merge pull request #171 from donglixp/patch-1 LayerNorm initialization	2019-01-07 12:44:46 +01:00
Thomas Wolf	bcd607542c	Merge pull request #145 from wlhgtc/master Correct the wrong note	2019-01-07 12:23:05 +01:00
Li Dong	d0d9b384f2	LayerNorm initialization: the LayerNorm gamma and beta should be initialized by .fill_(1.0) and .zero_(). Reference links: `989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2298)` `989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2308)`	2019-01-07 15:51:33 +08:00
wlhgtc	e626eecc25	Update modeling.py	2018-12-22 20:26:05 +08:00
Grégory Châtel	7176674849	Fixing various class documentations.	2018-12-20 13:11:17 +01:00
thomwolf	4a4b0e5783	remove logging.basicConfig from library code	2018-12-14 14:46:25 +01:00
thomwolf	ae88eb88a4	set encoding to 'utf-8' in calls to open	2018-12-14 13:48:58 +01:00
thomwolf	52c53f39d0	clean up apex integration	2018-12-13 13:02:17 +01:00
thomwolf	d23eed85bb	model loading apex modification	2018-12-13 12:53:17 +01:00
thomwolf	93f335ef86	add pretrained loading from state_dict	2018-12-13 12:48:13 +01:00
Thomas Wolf	91aab2a6d3	Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex Change to use apex for better fp16 and multi-gpu support	2018-12-13 12:32:37 +01:00
Deyu Fu	3b0a14b761	add fallback path for apex used in modeling.py	2018-12-12 15:05:45 -08:00

63 Commits