transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-18 03:58:25 +06:00

Author	SHA1	Message	Date
piero	821de121e8	Minor changes	2019-12-03 10:14:02 -05:00
w4nderlust	7469d03b1c	Fixed minor bug when running training on cuda	2019-12-03 10:14:02 -05:00
piero	0b51fba20b	Added script for training a discriminator for pplm to use	2019-12-03 10:14:02 -05:00
Piero Molino	34a83faabe	Let's make PPLM great again	2019-12-03 10:14:02 -05:00
Julien Chaumond	d5faa74cd6	tokenizer white space: revert to previous behavior	2019-12-03 10:14:02 -05:00
Julien Chaumond	0b77d66a6d	rm extraneous import	2019-12-03 10:14:02 -05:00
Rosanne Liu	83b1e6ac9e	fix the loss backward issue (cherry picked from commit 566468cc984c6ec7e10dfc62b5b4191781a99cd2)	2019-12-03 10:14:02 -05:00
Julien Chaumond	572c24cfa2	PPLM (squashed) Co-authored-by: piero <piero@uber.com> Co-authored-by: Rosanne Liu <mimosavvy@gmail.com>	2019-12-03 10:14:02 -05:00
Thomas Wolf	f19a78a634	Merge pull request #1903 from valohai/master Valohai integration	2019-12-03 16:13:01 +01:00
Thomas Wolf	d100ad99c0	Merge pull request #2014 from aaugustin/mark-tf-auto-model-test-as-slow Mark tests in TFAutoModelTest as slow.	2019-12-03 16:03:48 +01:00
Juha Kiili	66fc8d25a5	Change ref to original GLUE downloader script	2019-12-03 10:49:50 +02:00
LysandreJik	fbaf05bd92	Remove annoying tokenization message	2019-12-02 18:23:00 -05:00
Lysandre	e85855f2c4	Fix ALBERT exports with pretraining + sp classifier; Fix naming for ALBERT TF models	2019-12-02 18:00:19 -05:00
Lysandre	b3d834ae11	Reorganize ALBERT conversion script	2019-12-02 15:01:52 -05:00
thomwolf	f3776df0f3	WIP debugging	2019-12-02 15:47:00 +01:00
Aymeric Augustin	5ab93083e4	Mark tests in TFAutoModelTest as slow. Each test forces downloading the same 536MB file, which is slow even with a decent internet connection.	2019-12-01 18:25:15 +01:00
Aditya Soni	c356290c8d	typo fix as per Pytorch v1.1+	2019-12-01 14:08:14 +05:30
Rostislav Nedelchev	76c0bc06d5	[XLNet] Changed post-processing of attention w.r.t to target_mapping Whenever target_mapping is provided to the input, XLNet outputs two different attention streams. Based on that the attention output would be on of the two: - a list of tensors (usual case for most transformers) - a list of 2-tuples of tensors, one tesor for each of attention streams Docs and unit-tests have been updated	2019-11-30 21:01:04 +01:00
Rostislav Nedelchev	b90791e950	fixed XLNet attenttion output for both attention streams	2019-11-30 15:57:51 +01:00
maxvidal	b0ee7c7df3	Added Camembert to available models	2019-11-29 14:17:02 -05:00
Elad Segal	ecf15ebf3b	Add ALBERT to AutoClasses	2019-11-29 11:25:37 -05:00
thomwolf	4a666885b5	reducing my level of enthousiasm	2019-11-29 09:40:50 -05:00
thomwolf	adb5c79ff2	update all tf.shape and tensor.shape to shape_list	2019-11-29 09:40:50 -05:00
Juha Kiili	2421e54f8c	Add link to original source and license to download_glue.data.py	2019-11-29 15:39:28 +02:00
Juha Kiili	41aa0e8003	Refactor logs and fix loss bug	2019-11-29 15:33:25 +02:00
Thomas Wolf	1ab8dc44b3	Merge pull request #1876 from huggingface/mean-fix Mean does not exist in TF2	2019-11-29 09:26:33 +01:00
Thomas Wolf	f0d22b6363	Merge pull request #1873 from stefan-it/distilbert-german German DistilBERT	2019-11-29 09:25:47 +01:00
Lysandre	1e9ac5a7cf	New -> normal	2019-11-28 17:43:47 -05:00
Lysandre	0b84b9fd8a	Add processors to __init__	2019-11-28 17:38:52 -05:00
Lysandre	f671997ef7	Interface with TFDS	2019-11-28 17:17:20 -05:00
Lysandre	bd41e8292a	Cleanup & Evaluation now works	2019-11-28 16:03:56 -05:00
Thomas Wolf	d49c43ff78	Merge pull request #1778 from eukaryote31/patch-2 from_pretrained: convert DialoGPT format	2019-11-28 16:08:37 +01:00
Thomas Wolf	91caf2462c	Merge pull request #1770 from huggingface/initi-encoder-mask Only init encoder_attention_mask if stack is decoder	2019-11-28 16:06:55 +01:00
Thomas Wolf	49a69d5b78	Merge pull request #1753 from digantamisra98/patch-1 Added Mish Activation Function	2019-11-28 15:24:08 +01:00
Thomas Wolf	96e7ee7238	Merge pull request #1740 from huggingface/fix-ctrl-past Fix CTRL past	2019-11-27 23:28:30 +01:00
thomwolf	8da47b078d	fix merge tests	2019-11-27 23:11:37 +01:00
Stefan Schweter	8c276b9c92	Merge branch 'master' into distilbert-german	2019-11-27 18:11:49 +01:00
Yao Lu	3c28a2daac	add add_special_tokens=True for input examples	2019-11-27 12:05:23 -05:00
Thomas Wolf	a36f981d1b	Merge branch 'master' into fix-ctrl-past	2019-11-27 17:25:46 +01:00
Thomas Wolf	5afca00b47	Merge pull request #1724 from huggingface/fix_encode_plus Fix encode_plus	2019-11-27 17:14:49 +01:00
Thomas Wolf	49108288ba	Merge pull request #1624 from Huawei-MRC-OSI/resumable_http Add support for resumable downloads for HTTP protocol.	2019-11-27 17:11:07 +01:00
Thomas Wolf	5340d1f21f	Merge branch 'master' into resumable_http	2019-11-27 17:10:36 +01:00
VictorSanh	10bd1ddb39	soft launch distilbert multilingual	2019-11-27 11:07:22 -05:00
VictorSanh	d5478b939d	add distilbert + update run_xnli wrt run_glue	2019-11-27 11:07:22 -05:00
VictorSanh	07ab8d7af6	fix bug	2019-11-27 11:07:22 -05:00
VictorSanh	d474022639	cleaning simple_accuracy since not used anymore	2019-11-27 11:07:22 -05:00
VictorSanh	bcd8dc6b48	move xnli_compute_metrics to data/metrics	2019-11-27 11:07:22 -05:00
VictorSanh	73fe2e7385	remove fstrings	2019-11-27 11:07:22 -05:00
VictorSanh	3e7656f7ac	update readme	2019-11-27 11:07:22 -05:00
VictorSanh	abd397e954	uniformize w/ the cache_dir update	2019-11-27 11:07:22 -05:00

... 253 254 255 256 257 ...

15053 Commits