transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-14 01:58:22 +06:00

Author	SHA1	Message	Date
thomwolf	1484d67de9	[LARGE] updating all tests and API	2019-07-02 12:13:17 +02:00
Lei Mao	64b2a828c0	fix evaluation bug	2019-07-01 14:56:24 -07:00
thomwolf	4f8b5f687c	add fix for serialization of tokenizer	2019-06-29 23:35:21 +02:00
thomwolf	d9184620f9	fix tests and new API	2019-06-29 23:10:40 +02:00
Thomas Wolf	dad3c7a485	Merge pull request #723 from tonianelope/master Update Adam optimizer to follow pytorch convention for betas parameter (#510)	2019-06-28 17:28:25 +02:00
Thomas Wolf	e296d5bef1	Merge pull request #704 from deepset-ai/master Adjust s3 german Bert file storage	2019-06-28 17:10:58 +02:00
Thomas Wolf	c68b4eceed	Merge pull request #718 from Rocketknight1/master Incorrect docstring for BertForMaskedLM	2019-06-28 17:08:51 +02:00
thomwolf	213981d8cb	updating bert API	2019-06-28 16:45:24 +02:00
thomwolf	2b56e98892	standardizing API across models - XLNetForSeqClass working	2019-06-28 16:35:09 +02:00
thomwolf	3a00674cbf	fix imports	2019-06-27 17:18:46 +02:00
thomwolf	d939d6fd02	fix hidden-state extraction	2019-06-27 09:39:44 +02:00
thomwolf	0c2ff34815	extracting double hidden-state from xlnet	2019-06-27 09:27:50 +02:00
Mayhul Arora	08ff056c43	Added option to use multiple workers to create training data for lm fine tuning	2019-06-26 16:16:12 -07:00
thomwolf	3deea56c07	fixing loading function	2019-06-26 13:41:12 +02:00
thomwolf	f56b8033f0	more versatile loading	2019-06-26 13:13:15 +02:00
thomwolf	4d47f4985d	slight refactoring, add abstract class for model loading	2019-06-26 12:52:44 +02:00
thomwolf	59cefd4f98	fix #726 - get_lr in examples	2019-06-26 11:28:27 +02:00
thomwolf	ddc2cc61a6	fix python2 tests	2019-06-26 11:17:42 +02:00
thomwolf	7e3070ae4f	add from_pretrained method to all configuration classes	2019-06-26 11:12:00 +02:00
thomwolf	93e9971c54	fix tests	2019-06-26 10:02:45 +02:00
thomwolf	092dacfd62	changing is_regression to unified API	2019-06-26 09:54:05 +02:00
thomwolf	e55d4c4ede	various updates to conversion, models and examples	2019-06-26 00:57:53 +02:00
thomwolf	603c513b35	update main conversion script and readme	2019-06-25 10:45:07 +02:00
thomwolf	7de1740490	add ability to restore fine-tuned TF model	2019-06-25 10:27:58 +02:00
tonianelope	c9885903a1	update betas to follow pytorch convention	2019-06-25 09:23:12 +01:00
thomwolf	7334bf6c21	pad on left for xlnet	2019-06-24 15:05:11 +02:00
thomwolf	c888663f18	overwrite output directories if needed	2019-06-24 14:38:24 +02:00
thomwolf	62d78aa37e	updating GLUE utils for compatibility with XLNet	2019-06-24 14:36:11 +02:00
thomwolf	24ed0b9346	updating run_xlnet_classifier	2019-06-24 12:00:09 +02:00
thomwolf	f6081f2255	add xlnetforsequence classif and run_classifier example for xlnet	2019-06-24 10:01:07 +02:00
Rocketknight1	8d6a118aee	Incorrect docstring for the head_mask argument to BertForMaskedLM	2019-06-23 18:47:05 +01:00
Matt	06716d7536	Merge pull request #3 from huggingface/master Catch up with main repo	2019-06-23 18:46:03 +01:00
thomwolf	c946bb51a6	fix xlnet tokenizer and python2	2019-06-22 22:28:49 +02:00
Thomas Wolf	98dc30b21e	Merge pull request #714 from papower1/master Correct a broken link on README	2019-06-22 21:29:41 +02:00
Thomas Wolf	eae5d3819d	Merge pull request #715 from Rocketknight1/master Include a reference for LM finetuning	2019-06-22 21:29:19 +02:00
Rocketknight1	c7b2808ed7	Update LM finetuning README to include a literature reference	2019-06-22 15:04:01 +01:00
Matt	7c59e32d47	Merge pull request #2 from huggingface/master Updating my fork to the latest version	2019-06-22 14:59:47 +01:00
Chang-Uk Shin	ada0d8fec7	Merge pull request #1 from papower1/papower1-patch-1 Correct a broken link and its context.	2019-06-22 20:34:45 +09:00
Chang-Uk Shin	fcc706343f	Correct a broken link and its context. Correct a broken link(run_lm_finetuning.py) and its context.	2019-06-22 20:33:48 +09:00
thomwolf	181075635d	updating model loading and adding special tokens ids	2019-06-21 23:23:37 +02:00
thomwolf	ebd2cb8d74	update from_pretrained to load XLNetModel as well	2019-06-21 21:08:44 +02:00
thomwolf	483cbc36a9	test deviation with tf model: max ~1e-3 should be ok	2019-06-21 16:38:01 +02:00
thomwolf	24d8068982	weights loading script ok	2019-06-21 12:33:44 +02:00
thomwolf	32da75486b	add tokenizer and tests	2019-06-21 11:09:51 +02:00
thomwolf	45709d7532	model running with simple inputs	2019-06-21 00:28:42 +02:00
thomwolf	b407972e27	update gitignore	2019-06-20 13:52:56 +02:00
thomwolf	c2ea5aef77	work in progress on xlnet	2019-06-20 13:52:21 +02:00
thomwolf	de713fa9b4	starting	2019-06-20 10:54:19 +02:00
thomwolf	c304593d8f	BERTology details in readme	2019-06-20 10:05:06 +02:00
Thomas Wolf	12e892e174	Merge pull request #697 from huggingface/updating_examples Updating examples	2019-06-20 09:58:24 +02:00

19383 Commits