transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-18 03:58:25 +06:00

Author	SHA1	Message	Date
VictorSanh	0c5a4fe9c9	modify from_pretrained for OpenAIGPT	2019-05-31 00:27:18 -04:00
Victor SANH	96592b544b	default in __init__s for classification BERT models (#650 )	2019-05-30 15:53:13 -04:00
Thomas Wolf	701bd59b8b	Merge pull request #585 from huntzhan/master Make the epsilon of LayerNorm configurable.	2019-05-08 16:56:38 +02:00
Thomas Wolf	303b5e2b92	Merge pull request #545 from ailzhang/cache_dir move pytroch_pretrained_bert cache folder under same path as torch	2019-05-08 16:55:27 +02:00
huntzhan	101ab4dd8e	Make the epsilon of LayerNorm configurable.	2019-05-06 00:26:21 +08:00
Ben Mann	74f7906db4	Fix #537	2019-04-30 19:48:22 -07:00
Ailing Zhang	3963d57c89	move pytroch_pretrained_bert cache folder under same path as torch	2019-04-27 11:09:11 -07:00
thomwolf	b832d5bb8a	Release: 0.6.2	2019-04-25 21:37:47 +02:00
Thomas Wolf	e6cf62d499	Merge pull request #488 from dhpollack/fix_multichoice fixed BertForMultipleChoice model init and forward pass	2019-04-25 21:04:16 +02:00
lukovnikov	704037ad51	- updated docs for new LR API - added some images for illustration - updated comments in optimization	2019-04-25 15:59:39 +02:00
Thomas Wolf	d76a57b0ba	Merge pull request #506 from ailzhang/hubconf Hubconf	2019-04-24 20:59:21 +02:00
thomwolf	80f995a141	revert BertForMultipleChoice linear classifier	2019-04-24 16:51:54 +02:00
lukovnikov	69850b4011	python 2 compat	2019-04-21 14:02:38 +02:00
lukovnikov	bb7557d3ab	- removed __all__ in optimization - removed unused plotting code - using ABC for LRSchedule - added some schedule object init tests	2019-04-21 13:48:33 +02:00
lukovnikov	34ccc8ebf4	Merge remote-tracking branch 'upstream/master'	2019-04-21 13:16:15 +02:00
Ailing Zhang	bfd6f6b257	fix from_pretrained positional args	2019-04-17 16:31:40 -07:00
thomwolf	23d4554ec0	is python 2 happy now	2019-04-17 14:48:34 +02:00
thomwolf	265550ec34	relax network connection requirements	2019-04-17 14:22:35 +02:00
thomwolf	fa76520240	fix file_utils on python 2	2019-04-17 13:32:22 +02:00
thomwolf	bcde2c61cb	fix #497	2019-04-17 12:35:38 +02:00
Thomas Wolf	2e153930cf	Merge pull request #495 from SudoSharma/patch-2 Fix gradient overflow issue during attention mask	2019-04-17 11:10:36 +02:00
thomwolf	5afa497cbf	fix GPT-2 tokenization to work also on python 3...	2019-04-17 11:04:41 +02:00
thomwolf	bc70779bf0	fixed GPT-2 tokenization on python 2	2019-04-17 10:56:15 +02:00
Abhi Sharma	9e666aaa29	Fix gradient overflow issue during attention mask This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!	2019-04-16 11:42:34 -07:00
thomwolf	bdaba1897c	updating GPT tokenization	2019-04-16 17:44:06 +02:00
thomwolf	18a8a15f78	improving GPT2 tokenization and adding tests	2019-04-16 17:00:55 +02:00
Thomas Wolf	3d78e226e6	Merge pull request #489 from huggingface/tokenization_serialization Better serialization for Tokenizers and Configuration classes - Also fix #466	2019-04-16 08:49:54 +02:00
Thomas Wolf	64b6ef4db0	Merge pull request #490 from huggingface/better_finetuning_GPT_GPT-2 Clean up GPT and GPT-2 losses computation	2019-04-15 16:14:50 +02:00
thomwolf	d616022455	fix openai special tokens loading	2019-04-15 16:07:45 +02:00
thomwolf	df5d9c3551	load all models on cpu	2019-04-15 15:43:01 +02:00
thomwolf	60ea6c59d2	added best practices for serialization in README and examples	2019-04-15 15:00:33 +02:00
thomwolf	b3c6ee0ac1	tokenization updates	2019-04-15 14:24:52 +02:00
thomwolf	9761aa4845	add to_json_file method to configuration classes	2019-04-15 14:12:08 +02:00
thomwolf	e8568a3b17	fixing tests	2019-04-15 12:55:38 +02:00
thomwolf	870b734bfd	added tokenizers serialization tests	2019-04-15 12:03:56 +02:00
thomwolf	3e65f255dc	add serialization semantics to tokenizers - fix transfo-xl tokenizer	2019-04-15 11:47:25 +02:00
David Pollack	38ba7b439b	fixed BertForMultipleChoice model init and forward pass	2019-04-15 10:38:01 +02:00
thomwolf	fe2756ff41	update double head model	2019-04-15 10:04:05 +02:00
Martin Boyanov	34cf67fd6c	Extend the BertForSequenceClassification docs to mention the special CLS token.	2019-04-12 21:30:28 +03:00
thomwolf	b509bf7655	updating loss computation	2019-04-12 12:12:33 +02:00
thomwolf	1d203a34c0	back to simple indexing	2019-04-11 23:51:03 +02:00
thomwolf	074c869bbe	fix OpenAIGPTMultipleChoiceHead	2019-04-11 20:53:50 +02:00
thomwolf	a05fad8dce	fix typo	2019-04-11 13:16:17 +02:00
thomwolf	4a82f4f856	update special token addition	2019-04-11 13:11:22 +02:00
thomwolf	991b8e65f4	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2019-04-11 11:43:15 +02:00
thomwolf	e99b2014cc	fixes #471	2019-04-11 11:43:13 +02:00
lukovnikov	fc7693adc3	schedule fix	2019-04-03 18:16:47 +02:00
lukovnikov	20686b78fc	schedule fix	2019-04-03 18:13:52 +02:00
lukovnikov	5fed5bb3d6	schedule fix	2019-04-03 17:20:29 +02:00
lukovnikov	91a073f804	schedule fix	2019-04-03 17:10:08 +02:00

1 2 3 4 5

228 Commits