thomwolf | d77dd62ff8 | directly load from TF checkpoints + code cleanup | 2019-01-28 16:50:23 +01:00
thomwolf | 9c35c132fa | apex LayerNorm | 2019-01-17 09:19:19 +01:00
thomwolf | b9c77b98d5 | fix transposition in model conversion and memory initialization | 2019-01-17 00:33:21 +01:00
thomwolf | 009101de12 | fix loading bug and check full conversion of model | 2019-01-16 12:16:20 +01:00
thomwolf | fea15cc9f5 | update model conversion | 2019-01-16 11:54:54 +01:00
thomwolf | c03c12687f | fix __main__ entry script | 2019-01-16 10:55:22 +01:00
thomwolf | 8831c68803 | fixing various parts of model conversion, loading and weights sharing | 2019-01-16 10:31:16 +01:00
thomwolf | a69ec2c722 | improved corpus and tokenization conversion - added evaluation script | 2019-01-15 23:17:46 +01:00
thomwolf | 7d03c53718 | conversion working | 2019-01-15 16:07:25 +01:00
thomwolf | 3a9c88377f | adding Transformer XL | 2019-01-15 12:59:38 +01:00
nhatchan | cd30565aed | Fix importing unofficial TF models | 2019-01-14 13:35:40 +09:00
    Importing unofficial TF models seems to be working well, at least for me. This PR resolves #50.
thomwolf | e5c78c6684 | update readme and few typos | 2019-01-10 01:40:00 +01:00
thomwolf | ab90d4cddd | adding docs and example for OpenAI GPT | 2019-01-09 00:12:43 +01:00
thomwolf | dc5df92fa8 | added LM head for OpenAI | 2019-01-08 17:18:47 +01:00
thomwolf | 3cf12b235a | added tests + fixed losses | 2019-01-08 16:24:23 +01:00
thomwolf | eed51c5bdf | add OpenAI GPT | 2019-01-08 12:26:58 +01:00
WrRan | 3f60a60eed | text in never_split should not lowercase | 2019-01-08 13:33:57 +08:00
WrRan | 751beb9e73 | never split some text | 2019-01-08 10:54:51 +08:00
thomwolf | 793dcd236b | Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT into fifth-release | 2019-01-07 13:37:55 +01:00
thomwolf | 93f563b8a8 | adding OpenAI GPT | 2019-01-07 12:55:36 +01:00
Thomas Wolf | e048c7f1c8 | Merge pull request #171 from donglixp/patch-1 | 2019-01-07 12:44:46 +01:00
    LayerNorm initialization
Thomas Wolf | bcd607542c | Merge pull request #145 from wlhgtc/master | 2019-01-07 12:23:05 +01:00
    Correct the wrong note
Li Dong | d0d9b384f2 | LayerNorm initialization | 2019-01-07 15:51:33 +08:00
    The LayerNorm gamma and beta should be initialized by .fill_(1.0) and .zero_().
    reference links:
    989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2298)
    989e78c412/tensorflow/contrib/layers/python/layers/layers.py (L2308)
wlhgtc | e626eecc25 | Update modeling.py | 2018-12-22 20:26:05 +08:00
Grégory Châtel | 7176674849 | Fixing various class documentations. | 2018-12-20 13:11:17 +01:00
Thomas Wolf | 7fb94ab934 | Merge pull request #127 from patrick-s-h-lewis/tokenizer-error-on-long-seqs | 2018-12-19 10:29:17 +01:00
    raises value error for bert tokenizer for long sequences
Patrick Sodré | 87c1244c7d | Convert scripts into entry_points | 2018-12-19 02:26:08 +00:00
    The recommended approach to create launch scripts is to use entry_points and console_scripts.
    xref: https://packaging.python.org/guides/distributing-packages-using-setuptools/#scripts
Julien Chaumond | d57763f582 | Fix typos | 2018-12-18 19:23:22 -05:00
Patrick Lewis | 78cf7b4ab4 | added code to raise value error for bert tokenizer for covert_tokens_to_indices | 2018-12-18 14:41:30 +00:00
thomwolf | 4a4b0e5783 | remove logging.basicConfig from library code | 2018-12-14 14:46:25 +01:00
thomwolf | ae88eb88a4 | set encoding to 'utf-8' in calls to open | 2018-12-14 13:48:58 +01:00
thomwolf | 52c53f39d0 | clean up apex integration | 2018-12-13 13:02:17 +01:00
thomwolf | d23eed85bb | model loading apex modification | 2018-12-13 12:53:17 +01:00
thomwolf | 1cbb32a542 | include version number + comment in setup.py | 2018-12-13 12:50:44 +01:00
thomwolf | ce52177638 | added version in __init__.py | 2018-12-13 12:50:44 +01:00
thomwolf | 93f335ef86 | add pretrained loading from state_dict | 2018-12-13 12:48:13 +01:00
thomwolf | 13bf0d4659 | fixing Adam weights skip in TF convert script | 2018-12-13 12:48:13 +01:00
Thomas Wolf | 91aab2a6d3 | Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex | 2018-12-13 12:32:37 +01:00
    Change to use apex for better fp16 and multi-gpu support
Thomas Wolf | 32a227f507 | Merge pull request #113 from hzhwcmhf/master | 2018-12-13 12:15:15 +01:00
    fix compatibility with python 3.5.2
Thomas Wolf | ffe9075f48 | Merge pull request #96 from rodgzilla/multiple-choice-code | 2018-12-13 12:05:11 +01:00
    BertForMultipleChoice and Swag dataset example.
Deyu Fu | 3b0a14b761 | add fallback path for apex used in modeling.py | 2018-12-12 15:05:45 -08:00
Deyu Fu | c8ea286048 | change to apex for better fp16 and multi-gpu support | 2018-12-11 17:13:58 -08:00
hzhwcmhf | 485adde742 | add pathlib support for file_utils.py on python 3.5 | 2018-12-11 22:49:19 +08:00
hzhwcmhf | bc659f86ad | fix compatibility with python 3.5.2; convert path to str | 2018-12-11 20:18:56 +08:00
thomwolf | 1df6f26214 | Merge branch 'fourth-release' of https://github.com/huggingface/pytorch-pretrained-BERT into fourth-release | 2018-12-11 12:20:31 +01:00
thomwolf | 770f805ae5 | include version number + comment in setup.py | 2018-12-11 12:20:22 +01:00
thomwolf | ed3b62cd3b | added version in __init__.py | 2018-12-11 12:12:08 +01:00
Thomas Wolf | 632f2d2df9 | Merge branch 'master' into fourth-release | 2018-12-11 06:00:53 -05:00
thomwolf | 270fa2f20b | add pretrained loading from state_dict | 2018-12-11 11:50:38 +01:00
Li Li | 81e1e2489f | Fix optimizer to work with horovod | 2018-12-10 02:08:38 -08:00
thomwolf | 68f77303b2 | fixing Adam weights skip in TF convert script | 2018-12-09 16:17:11 -05:00
Grégory Châtel | fc5a38ac92 | Adding the BertForMultipleChoiceClass. | 2018-12-06 18:42:23 +01:00
thomwolf | 511bce58bd | update new token classification model | 2018-11-30 22:56:02 +01:00
thomwolf | d787c6be8c | improve docstrings and fix new token classification model | 2018-11-30 22:55:26 +01:00
thomwolf | ed302a73f4 | add new token classification model | 2018-11-30 22:55:03 +01:00
thomwolf | d6f06c03f4 | fixed loading pre-trained tokenizer from directory | 2018-11-30 14:09:06 +01:00
thomwolf | 532a81d3d6 | fixed doc_strings | 2018-11-30 13:57:01 +01:00
thomwolf | 296f006132 | added BertForTokenClassification model | 2018-11-30 13:56:53 +01:00
thomwolf | 298107fed7 | Added new bert models | 2018-11-30 13:56:02 +01:00
thomwolf | 32167cdf4b | remove convert_to_unicode and printable_text from examples | 2018-11-26 23:33:22 +01:00
thomwolf | 05053d163c | update cache_dir in readme and examples | 2018-11-26 10:45:13 +01:00
thomwolf | 63ae5d2134 | added cache_dir option in from_pretrained | 2018-11-26 10:21:56 +01:00
thomwolf | ebaacba38b | fixing typo in docstring | 2018-11-26 09:55:15 +01:00
thomwolf | 870d71636e | fixing target size in crossentropy losses | 2018-11-26 09:51:34 +01:00
thomwolf | 982339d829 | fixing unicode error | 2018-11-23 12:22:12 +01:00
weiyumou | 37b6c9b21b | Fixed UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 3793: ordinal not in range(128) | 2018-11-19 23:01:28 -05:00
thomwolf | 757750d6f6 | fix tests | 2018-11-17 11:58:14 +01:00
thomwolf | 886cb49792 | updating readme and notebooks | 2018-11-16 14:31:15 +01:00
thomwolf | 1de35b624b | preparing for first release | 2018-11-15 20:56:10 +01:00