transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-23 22:38:58 +06:00

Author	SHA1	Message	Date
Matthew Carrigan	b8e2a9c584	Made --reduce_memory actually do something in finetune_on_pregenerated	2019-04-22 14:01:48 +01:00
Matt	af8a0384fc	Merge pull request #1 from huggingface/master Pulling commits from main repo	2019-04-22 13:56:47 +01:00
Sangwhan Moon	14b1f719f4	Fix indentation weirdness in GPT-2 example.	2019-04-22 02:20:22 +09:00
lukovnikov	69850b4011	python 2 compat	2019-04-21 14:02:38 +02:00
lukovnikov	bb7557d3ab	removed __all__ in optimization; removed unused plotting code; using ABC for LRSchedule; added some schedule object init tests	2019-04-21 13:48:33 +02:00
lukovnikov	34ccc8ebf4	Merge remote-tracking branch 'upstream/master'	2019-04-21 13:16:15 +02:00
Ailing Zhang	bfd6f6b257	fix from_pretrained positional args	2019-04-17 16:31:40 -07:00
Ailing Zhang	ae4c9fee73	add hubconf	2019-04-17 13:34:34 -07:00
Thomas Wolf	68a889ee43	Merge pull request #500 from huggingface/network Updating network handling	2019-04-17 15:22:14 +02:00
thomwolf	34ae5bf838	small clean up in tests	2019-04-17 14:52:12 +02:00
thomwolf	23d4554ec0	is python 2 happy now	2019-04-17 14:48:34 +02:00
thomwolf	265550ec34	relax network connection requirements	2019-04-17 14:22:35 +02:00
thomwolf	fa76520240	fix file_utils on python 2	2019-04-17 13:32:22 +02:00
thomwolf	bcde2c61cb	fix #497	2019-04-17 12:35:38 +02:00
thomwolf	929579f3b5	fix #497	2019-04-17 12:35:08 +02:00
thomwolf	31d387604c	adding s3 model tests with --runslow	2019-04-17 11:58:27 +02:00
Thomas Wolf	8407429d74	Merge pull request #494 from SudoSharma/patch-1 Fix indentation for unconditional generation	2019-04-17 11:11:36 +02:00
Thomas Wolf	2e153930cf	Merge pull request #495 from SudoSharma/patch-2 Fix gradient overflow issue during attention mask	2019-04-17 11:10:36 +02:00
Thomas Wolf	46078e1b46	Merge pull request #496 from 8enmann/patch-1 [run_gpt2.py] temperature should be a float, not int	2019-04-17 11:08:54 +02:00
Thomas Wolf	b8686130ca	Merge pull request #498 from huggingface/GPT2_tokenization Gpt2 tokenization	2019-04-17 11:06:41 +02:00
thomwolf	5afa497cbf	fix GPT-2 tokenization to work also on python 3...	2019-04-17 11:04:41 +02:00
thomwolf	bc70779bf0	fixed GPT-2 tokenization on python 2	2019-04-17 10:56:15 +02:00
Ben Mann	87677fcc4d	[run_gpt2.py] temperature should be a float, not int	2019-04-16 15:23:21 -07:00
Abhi Sharma	9e666aaa29	Fix gradient overflow issue during attention mask This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!	2019-04-16 11:42:34 -07:00
Abhi Sharma	07154dadb4	Fix indentation for unconditional generation	2019-04-16 11:11:49 -07:00
thomwolf	bdaba1897c	updating GPT tokenization	2019-04-16 17:44:06 +02:00
thomwolf	18a8a15f78	improving GPT2 tokenization and adding tests	2019-04-16 17:00:55 +02:00
Thomas Wolf	3d78e226e6	Merge pull request #489 from huggingface/tokenization_serialization Better serialization for Tokenizers and Configuration classes - Also fix #466	2019-04-16 08:49:54 +02:00
thomwolf	3571187ef6	fix saving models in distributed setting examples	2019-04-15 16:43:56 +02:00
Thomas Wolf	64b6ef4db0	Merge pull request #490 from huggingface/better_finetuning_GPT_GPT-2 Clean up GPT and GPT-2 losses computation	2019-04-15 16:14:50 +02:00
thomwolf	d616022455	fix openai special tokens loading	2019-04-15 16:07:45 +02:00
thomwolf	df5d9c3551	load all models on cpu	2019-04-15 15:43:01 +02:00
thomwolf	2499b0a5fc	add ptvsd to run_squad	2019-04-15 15:33:04 +02:00
thomwolf	7816f7921f	clean up distributed training logging in run_squad example	2019-04-15 15:27:10 +02:00
thomwolf	1135f2384a	clean up logger in examples for distributed case	2019-04-15 15:22:40 +02:00
thomwolf	cc43307023	update readme	2019-04-15 15:06:10 +02:00
thomwolf	60ea6c59d2	added best practices for serialization in README and examples	2019-04-15 15:00:33 +02:00
thomwolf	179a2c2ff6	update example to work with new serialization semantic	2019-04-15 14:33:23 +02:00
thomwolf	b3c6ee0ac1	tokenization updates	2019-04-15 14:24:52 +02:00
thomwolf	20577d8a7c	add configuration serialization to readme	2019-04-15 14:21:41 +02:00
thomwolf	9761aa4845	add to_json_file method to configuration classes	2019-04-15 14:12:08 +02:00
thomwolf	b17963d82f	update readme	2019-04-15 13:44:30 +02:00
thomwolf	e8568a3b17	fixing tests	2019-04-15 12:55:38 +02:00
thomwolf	870b734bfd	added tokenizers serialization tests	2019-04-15 12:03:56 +02:00
thomwolf	3e65f255dc	add serialization semantics to tokenizers - fix transfo-xl tokenizer	2019-04-15 11:47:25 +02:00
Thomas Wolf	6b35cfd28f	Merge pull request #423 from dhanajitb/master making unconditional generation work	2019-04-15 11:01:53 +02:00
Thomas Wolf	aff44f0c08	Merge branch 'master' into master	2019-04-15 10:58:34 +02:00
Thomas Wolf	7e7e4753c8	Merge pull request #480 from mboyanov/docs/cls_token_info Extend the BertForSequenceClassification docs to mention the special CLS token.	2019-04-15 10:57:25 +02:00
Thomas Wolf	bb61b747df	Merge pull request #474 from jiesutd/master Fix tsv read error in Windows	2019-04-15 10:56:48 +02:00
Thomas Wolf	7873d76464	Merge pull request #478 from Rocketknight1/master Added a helpful error for users with single-document corpuses - fixes # 452	2019-04-15 10:55:57 +02:00

807 Commits