transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Catalin Voss	01520d5412	Remove my unhelpful comments :)	2019-03-27 10:45:28 -07:00
Catalin Voss	fda2f62395	Fix test failures due to old torch issue with non-contiguous view	2019-03-24 14:37:13 -07:00
Catalin Voss	0dd796e359	Also fix loss function issue with the double head models	2019-03-24 14:35:55 -07:00
Catalin Voss	472857c47f	Fix typo syntax err (sorry, c/p from my repo)	2019-03-24 14:14:49 -07:00
Catalin Voss	2e6f5ffb96	Fix GPT language model loss here as well	2019-03-24 14:14:44 -07:00
Catalin Voss	5938f31fa7	Fix c/p typo from my experiment code	2019-03-24 14:14:40 -07:00
Catalin Voss	7797d21b8d	Fix GPT2 language modeling loss computation	2019-03-24 14:14:35 -07:00
Thomas Wolf	f3e5404880	Merge pull request #381 from tseretelitornike/master Added missing imports.	2019-03-15 12:54:40 +01:00
tseretelitornike	83857ffeaa	Added missing imports.	2019-03-15 12:45:48 +01:00
Thomas Wolf	d5c037c3ed	Merge pull request #380 from yongbowin/patch-3 typo in annotation	2019-03-14 15:56:40 +01:00
Yongbo Wang	d1e4fa98a9	typo in annotation modify `heruistic` to `heuristic` in line 660, `charcter` to `character` in line 661.	2019-03-14 17:32:15 +08:00
Thomas Wolf	59e2bdd086	Merge pull request #379 from yongbowin/patch-2 typo	2019-03-14 10:17:18 +01:00
Yongbo Wang	3d6452163d	typo modify `mull` to `null` in line 474 annotation.	2019-03-14 17:03:38 +08:00
Thomas Wolf	76906372b0	Merge pull request #378 from huggingface/absolute_imports Add absolute imports to GPT, GPT-2, Transfo-XL and and fix empty nbest_predictions.json	2019-03-14 10:00:47 +01:00
thomwolf	a98dfe4ced	fixing #377 (empty nbest_predictions.json)	2019-03-14 09:57:06 +01:00
thomwolf	e5f2d9122c	adding absolute imports to gpt2, openai and transfo-xl	2019-03-14 09:55:01 +01:00
Thomas Wolf	eecaaa734a	Merge pull request #371 from yongbowin/patch-1 Simplify code, delete redundancy line	2019-03-14 09:03:32 +01:00
Yongbo Wang	22a465a91f	Simplify code, delete redundancy line delete redundancy line `if args.train`, simplify code.	2019-03-13 09:42:06 +08:00
Thomas Wolf	9b03d67b83	Merge pull request #362 from Bharat123rox/patch-1 Make the hyperlink of NVIDIA Apex clickable	2019-03-11 09:08:51 +01:00
Thomas Wolf	8435d78f0c	Merge pull request #361 from junjieqian/jqian/updateReadme Correct line number in README for classes	2019-03-11 09:08:27 +01:00
Thomas Wolf	80790705e0	Merge pull request #359 from elonmuskceo/fix-typo Update run_gpt2.py	2019-03-11 09:07:56 +01:00
Thomas Wolf	13aa13dbc0	Merge pull request #358 from cdjhz/patch-1 add 'padding_idx=0' for BertEmbeddings	2019-03-11 09:06:55 +01:00
Thomas Wolf	c0660df5dd	Merge pull request #357 from pglock/feature/354-use-dropout-layer-gpt Use Dropout Layer in OpenAIGPTMultipleChoiceHead	2019-03-11 09:06:27 +01:00
Bharat Raghunathan	f91ce0b803	Make the hyperlink of NVIDIA Apex clickable	2019-03-09 20:05:39 +05:30
Junjie Qian	d648a02203	Correct line number in README for classes	2019-03-08 16:28:03 -08:00
Elon Musk	66d8206809	Update run_gpt2.py	2019-03-08 11:59:08 -05:00
Haozhe Ji	72fa8d03a7	add 'padding_idx=0' for BertEmbeddings	2019-03-07 20:02:55 +08:00
Philipp Glock	6190e8ce4c	Fix: use dropout layer	2019-03-07 10:12:45 +01:00
thomwolf	7cc35c3104	fix openai gpt example and updating readme	2019-03-06 11:43:21 +01:00
thomwolf	906b638efa	updating readme	2019-03-06 10:24:19 +01:00
thomwolf	994d86609b	fixing PYTORCH_PRETRAINED_BERT_CACHE use in examples	2019-03-06 10:21:24 +01:00
thomwolf	2dd8f524f5	removing test for long sequences error following #337	2019-03-06 10:10:41 +01:00
thomwolf	5c85fc3977	fix typo - logger info	2019-03-06 10:05:21 +01:00
Thomas Wolf	8e36da7acb	Merge pull request #347 from jplehmann/feature/sst2-processor Processor for SST-2 task	2019-03-06 09:48:27 +01:00
Thomas Wolf	21c88a07b7	Merge pull request #341 from potatochip/patch-1 catch exception if pathlib not install	2019-03-06 09:48:01 +01:00
Thomas Wolf	3c01dfb775	Merge pull request #338 from CatalinVoss/patch-3 Fix top k generation for k != 0	2019-03-06 09:47:33 +01:00
Thomas Wolf	477ec4b6cc	Merge pull request #337 from CatalinVoss/patch-2 Allow tokenization of sequences > 512 for caching	2019-03-06 09:45:49 +01:00
Thomas Wolf	7b9e5a54b5	Merge pull request #327 from lukovnikov/master Issue#324: warmup linear fixes	2019-03-06 09:44:56 +01:00
Thomas Wolf	4784b04f47	Merge pull request #325 from john-hewitt/master add BertTokenizer flag to skip basic tokenization	2019-03-06 09:37:11 +01:00
Catalin Voss	4a49c22584	Warn instead of raising in BERT and GPT-2 tokenizers as well, to allow for pre-caching of tokens	2019-03-05 12:31:45 -08:00
Catalin Voss	e99bc87e4d	Merge branch 'patch-1' into patch-2	2019-03-05 12:24:18 -08:00
John Lehmann	0f96d4b1f7	Run classifier processor for SST-2.	2019-03-05 13:38:28 -06:00
Aaron Mangum	0c970caa4a	catch exception if pathlib not install	2019-03-04 14:30:19 -08:00
Catalin Voss	4b4b079272	Fix top k generation for k != 0	2019-03-02 21:54:44 -08:00
Catalin Voss	9775b2eb27	Allow tokenization of sequences > 512 for caching For many applications requiring randomized data access, it's easier to cache the tokenized representations than the words. So why not turn this into a warning?	2019-03-02 16:30:21 -08:00
Catalin Voss	c0cf0a04d5	Fix typo	2019-02-27 18:01:06 -08:00
John Hewitt	4d1ad83236	update docstring of BERT tokenizer to reflect do_wordpiece_only	2019-02-27 14:50:41 -08:00
lukovnikov	35410da758	added warning	2019-02-27 17:11:42 +01:00
lukovnikov	4d79e0d386	added warning	2019-02-27 16:50:05 +01:00
lukovnikov	66a84b63b0	added warning	2019-02-27 16:38:00 +01:00

1 2 3 4 5 ...

605 Commits