transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 21:30:07 +06:00

Author	SHA1	Message	Date
thomwolf	99ae5ab883	update config tests and circle-ci	2019-07-02 12:40:39 +02:00
thomwolf	1484d67de9	[LARGE] updating all tests and API	2019-07-02 12:13:17 +02:00
thomwolf	4d47f4985d	slight refactoring, add abstract class for model loading	2019-06-26 12:52:44 +02:00
thomwolf	7e3070ae4f	add from_pretrained method to all configuration classes	2019-06-26 11:12:00 +02:00
thomwolf	45709d7532	model running with simple inputs	2019-06-21 00:28:42 +02:00
thomwolf	34d706a0e1	pruning in bertology	2019-06-19 15:25:49 +02:00
thomwolf	64e0adda81	better error message	2019-06-18 10:51:31 +02:00
thomwolf	382e2d1e50	spliting config and weight files for bert also	2019-06-18 10:37:16 +02:00
thomwolf	33d3db5c43	updating head masking, readme and docstrings	2019-06-17 15:51:28 +02:00
thomwolf	965f172de6	output all hidden layers states in GPT/GPT-2	2019-06-17 14:34:12 +02:00
thomwolf	f12007e421	add head masking and pruning to openai GPT	2019-06-17 14:19:40 +02:00
thomwolf	8415a38b23	better error messages	2019-06-17 13:03:48 +02:00
Thomas Wolf	ff276fc00c	Merge branch 'master' into finish_torchhub_interfaces	2019-06-14 16:59:07 +02:00
Thomas Wolf	35e6baab37	Merge branch 'master' into attention	2019-06-14 16:41:56 +02:00
VictorSanh	8f97f6c57f	fix typo cc @thomwolf	2019-06-01 17:29:07 -04:00
VictorSanh	0c5a4fe9c9	modify from_pretrained for OpenAIGPT	2019-05-31 00:27:18 -04:00
thomwolf	0efc4ab632	adding dropout to GPT-2 and embedding dropout to GPT	2019-05-08 10:41:35 +02:00
thomwolf	ce86336545	add predict_special_tokens option to GPT also	2019-05-07 16:47:22 +02:00
thomwolf	e211785ada	extract attention weights from GPT	2019-05-02 18:31:26 +02:00
thomwolf	c30139a013	add special tokens to gpt-2	2019-04-30 10:45:26 +02:00
Thomas Wolf	3d78e226e6	Merge pull request #489 from huggingface/tokenization_serialization Better serialization for Tokenizers and Configuration classes - Also fix #466	2019-04-16 08:49:54 +02:00
thomwolf	df5d9c3551	load all models on cpu	2019-04-15 15:43:01 +02:00
thomwolf	60ea6c59d2	added best practices for serialization in README and examples	2019-04-15 15:00:33 +02:00
thomwolf	9761aa4845	add to_json_file method to configuration classes	2019-04-15 14:12:08 +02:00
thomwolf	fe2756ff41	update double head model	2019-04-15 10:04:05 +02:00
thomwolf	b509bf7655	updating loss computation	2019-04-12 12:12:33 +02:00
thomwolf	1d203a34c0	back to simple indexing	2019-04-11 23:51:03 +02:00
thomwolf	074c869bbe	fix OpenAIGPTMultipleChoiceHead	2019-04-11 20:53:50 +02:00
thomwolf	a05fad8dce	fix typo	2019-04-11 13:16:17 +02:00
thomwolf	4a82f4f856	update special token addition	2019-04-11 13:11:22 +02:00
thomwolf	991b8e65f4	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2019-04-11 11:43:15 +02:00
thomwolf	e99b2014cc	fixes #471	2019-04-11 11:43:13 +02:00
Catalin Voss	01520d5412	Remove my unhelpful comments :)	2019-03-27 10:45:28 -07:00
Catalin Voss	fda2f62395	Fix test failures due to old torch issue with non-contiguous view	2019-03-24 14:37:13 -07:00
Catalin Voss	0dd796e359	Also fix loss function issue with the double head models	2019-03-24 14:35:55 -07:00
Catalin Voss	472857c47f	Fix typo syntax err (sorry, c/p from my repo)	2019-03-24 14:14:49 -07:00
Catalin Voss	2e6f5ffb96	Fix GPT language model loss here as well	2019-03-24 14:14:44 -07:00
thomwolf	e5f2d9122c	adding absolute imports to gpt2, openai and transfo-xl	2019-03-14 09:55:01 +01:00
Philipp Glock	6190e8ce4c	Fix: use dropout layer	2019-03-07 10:12:45 +01:00
thomwolf	5c85fc3977	fix typo - logger info	2019-03-06 10:05:21 +01:00
thomwolf	009ee86a19	fix tests - bump up version	2019-02-17 23:57:23 +01:00
thomwolf	1320e4ec0c	mc_token_mask => mc_token_ids	2019-02-09 16:58:53 +01:00
thomwolf	80607874c1	fix layer norm epsilon in OpenAI GPT	2019-02-08 21:49:05 +01:00
thomwolf	777459b471	run openai example running	2019-02-08 10:33:14 +01:00
thomwolf	edcb56fd96	more explicit variable name	2019-02-08 09:54:49 +01:00
thomwolf	9c3c24800b	split saved model in config & weights	2019-02-07 17:06:17 +01:00
thomwolf	448937c00d	python 2 compatibility	2019-02-06 00:07:46 +01:00
thomwolf	3a848111e6	update config, docstrings and readme to switch to seperated tokens and position embeddings	2019-01-29 11:00:11 +01:00
thomwolf	98c96fb1a7	splitting position and tokens embeddings in OpenAI GPT - updating tf imports - tests	2019-01-29 10:31:42 +01:00
thomwolf	5456d82311	more versatile model loading	2019-01-29 09:54:18 +01:00

1 2

59 Commits