thomwolf
1135f2384a
clean up logger in examples for distributed case
2019-04-15 15:22:40 +02:00
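For reference, a minimal sketch of the pattern such a cleanup typically settles on (the exact code in the examples is not shown here, so treat the details as an assumption): only the main process logs at INFO in the distributed case, while the other ranks are raised to WARN so progress lines print once, not once per GPU.
```
import logging

local_rank = -1  # -1 means single-process training; otherwise set from --local_rank

# Only the main process (local_rank -1 or 0) logs at INFO; other workers are
# raised to WARN so each progress line is printed once, not once per GPU.
logging.basicConfig(
    format="%(asctime)s - %(levelname)s - %(name)s - %(message)s",
    datefmt="%m/%d/%Y %H:%M:%S",
    level=logging.INFO if local_rank in (-1, 0) else logging.WARN,
)
logger = logging.getLogger(__name__)
logger.info("local_rank: %d", local_rank)
```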
thomwolf
60ea6c59d2
added best practices for serialization in README and examples
2019-04-15 15:00:33 +02:00
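A hedged sketch of the save/reload recipe these serialization commits document (the constant names and `save_vocabulary` follow the conventions this commit series introduces; treat the exact helpers as assumptions rather than confirmed API):
```
import os
import torch
from pytorch_pretrained_bert import BertForSequenceClassification, BertTokenizer
from pytorch_pretrained_bert.modeling import WEIGHTS_NAME, CONFIG_NAME

output_dir = "./model_save"
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# Unwrap (Distributed)DataParallel before saving, then write weights, config,
# and vocabulary into one directory.
model_to_save = model.module if hasattr(model, "module") else model
os.makedirs(output_dir, exist_ok=True)
torch.save(model_to_save.state_dict(), os.path.join(output_dir, WEIGHTS_NAME))
with open(os.path.join(output_dir, CONFIG_NAME), "w") as f:
    f.write(model_to_save.config.to_json_string())
tokenizer.save_vocabulary(output_dir)  # assumed: added by the tokenizer commit below

# Everything can now be reloaded from that single directory.
model = BertForSequenceClassification.from_pretrained(output_dir)
tokenizer = BertTokenizer.from_pretrained(output_dir)
```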
thomwolf
179a2c2ff6
update example to work with new serialization semantic
2019-04-15 14:33:23 +02:00
thomwolf
3e65f255dc
add serialization semantics to tokenizers - fix transfo-xl tokenizer
2019-04-15 11:47:25 +02:00
Thomas Wolf
616743330e
Merge pull request #462 from 8enmann/master
...
fix run_gpt2.py
2019-04-11 21:54:46 +02:00
Thomas Wolf
2cdfb8b254
Merge pull request #467 from yaroslavvb/patch-2
...
Update README.md
2019-04-11 21:53:23 +02:00
thomwolf
4bc4c69af9
finetuning any BERT model - fixes #455
2019-04-11 16:57:59 +02:00
Yaroslav Bulatov
8fffba5f47
Update README.md
...
Fix for:
```
04/09/2019 21:39:38 - INFO - __main__ - device: cuda n_gpu: 1, distributed training: False, 16-bits training: False
Traceback (most recent call last):
  File "/home/ubuntu/pytorch-pretrained-BERT/examples/lm_finetuning/simple_lm_finetuning.py", line 642, in <module>
    main()
  File "/home/ubuntu/pytorch-pretrained-BERT/examples/lm_finetuning/simple_lm_finetuning.py", line 502, in main
    raise ValueError("Training is currently the only implemented execution option. Please set `do_train`.")
ValueError: Training is currently the only implemented execution option. Please set `do_train`.
```
2019-04-09 14:45:47 -07:00
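For context, a minimal reconstruction of the guard the traceback above points to, together with the fix the README needed (pass `--do_train`); everything beyond the error string itself is assumed:
```
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--do_train", action="store_true", help="Whether to run training.")

# The failing invocation passed no flags; the README fix is to add --do_train.
args = parser.parse_args(["--do_train"])

# Reconstruction of the check at simple_lm_finetuning.py line 502:
if not args.do_train:
    raise ValueError("Training is currently the only implemented execution option. "
                     "Please set `do_train`.")
print("training would start here")
```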
Benjamin Mann
fd8a3556f0
fix run_gpt2.py
2019-04-08 17:20:35 -07:00
Thomas Wolf
9ca25ce828
Merge pull request #427 from jeonsworld/patch-1
...
fix sample_doc
2019-04-03 11:26:58 +02:00
thomwolf
846b1fd6f8
Fix #419
2019-04-03 10:50:38 +02:00
Thomas Wolf
2f80dbbc0d
Merge pull request #430 from MottoX/master
...
Fix typo in example code
2019-04-02 10:41:56 +02:00
Mike Arpaia
8b5c63e4de
Fixes to the TensorFlow conversion tool
2019-04-01 13:17:54 -06:00
Weixin Wang
d07db28f52
Fix typo in example code
...
Modify 'unambigiously' to 'unambiguously'
2019-03-31 01:20:18 +08:00
jeonsworld
60005f464d
Update pregenerate_training_data.py
...
If randint returns the value of rand_end itself, np.searchsorted returns a sampled_doc_index equal to current_idx, i.e. the document that should have been excluded.
Example:
cumsum_max = 30
doc_cumsum = [5, 7, 11, 19, 30]
doc_lengths = [5, 2, 4, 8, 11]
With current_idx = 1, rand_start = 7 and rand_end = 35, so sentence_index = randint(7, 35) % cumsum_max.
If randint returns 35, sentence_index becomes 5, and np.searchsorted then returns 1, which equals current_idx.
2019-03-30 14:50:17 +09:00
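A runnable reconstruction of the off-by-one this commit describes (the numbers and variable names come from the message above; the surrounding script code is assumed). Python's random.randint is inclusive on both ends, so the upper bound must shrink by one:
```
import numpy as np
from random import randint

doc_lengths = [5, 2, 4, 8, 11]
doc_cumsum = np.cumsum(doc_lengths)        # [ 5  7 11 19 30]
cumsum_max = int(doc_cumsum[-1])           # 30
current_idx = 1                            # document we must NOT resample

rand_start = int(doc_cumsum[current_idx])                      # 7
rand_end = rand_start + cumsum_max - doc_lengths[current_idx]  # 35

# Bug: random.randint includes both endpoints, so it can return 35,
# and 35 % 30 == 5 maps straight back into document 1.
assert np.searchsorted(doc_cumsum, 35 % cumsum_max, side="right") == current_idx

# Fix: reduce the upper bound by one so the draw can never wrap onto current_idx.
sentence_index = randint(rand_start, rand_end - 1) % cumsum_max
sampled_doc_index = int(np.searchsorted(doc_cumsum, sentence_index, side="right"))
assert sampled_doc_index != current_idx
```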
Thomas Wolf
694e2117f3
Merge pull request #388 from ananyahjha93/master
...
Added remaining GLUE tasks to 'run_classifier.py'
2019-03-28 09:06:53 +01:00
Thomas Wolf
cc8c2d2332
Merge pull request #396 from IndexFziQ/IndexFziQ
...
add tqdm to the process of eval in examples/run_swag.py
2019-03-27 12:03:26 +01:00
thomwolf
361aff6de5
typos
2019-03-27 11:54:59 +01:00
thomwolf
cea8ba1d59
adjusted formatting and some wording in the readme
2019-03-27 11:53:44 +01:00
Matthew Carrigan
24e67fbf75
Minor README update
2019-03-25 12:33:30 +00:00
Matthew Carrigan
8d1d1ffde2
Corrected the displayed loss when gradient_accumulation_steps > 1
2019-03-25 12:15:19 +00:00
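A small sketch of the display correction (variable names assumed): the per-step loss is pre-divided by gradient_accumulation_steps so that backward() accumulates a correct average gradient, and the logged mean therefore has to multiply that factor back in:
```
gradient_accumulation_steps = 4
step_losses = [2.0, 1.8, 1.6, 1.4]  # stand-ins for loss.item() on each batch

tr_loss, nb_tr_steps = 0.0, 0
for loss in step_losses:
    loss = loss / gradient_accumulation_steps  # pre-scaled before backward()
    tr_loss += loss
    nb_tr_steps += 1
    # The fix: multiply the scale factor back in before displaying.
    mean_loss = tr_loss * gradient_accumulation_steps / nb_tr_steps
    print(f"Loss: {mean_loss:.5f}")  # 2.00000, 1.90000, 1.80000, 1.70000
```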
Matthew Carrigan
abb7d1ff6d
Added proper context management to ensure cleanup happens in the right
...
order.
2019-03-21 17:50:03 +00:00
Matthew Carrigan
06a30cfdf3
Added a --reduce_memory option to the training script to keep training
...
data on disk as a memmap rather than in memory
2019-03-21 17:04:12 +00:00
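A minimal sketch of the --reduce_memory idea (shapes, dtype, and file name are assumptions): pregenerated features live in a disk-backed numpy memmap, so the training process pages rows in on demand instead of holding the whole corpus in RAM.
```
import numpy as np

num_samples, seq_len = 100_000, 128

# mode="w+" creates the backing file on disk; writes go through to it
# rather than into an ever-growing in-memory list.
input_ids = np.memmap("input_ids.memmap", dtype=np.int32,
                      mode="w+", shape=(num_samples, seq_len))
input_ids[0] = np.arange(seq_len, dtype=np.int32)
input_ids.flush()

# The training run reopens the same file read-only; rows are paged in on demand.
input_ids_ro = np.memmap("input_ids.memmap", dtype=np.int32,
                         mode="r", shape=(num_samples, seq_len))
print(input_ids_ro[0][:5])  # [0 1 2 3 4]
```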
Matthew Carrigan
7d1ae644ef
Added a --reduce_memory option to the training script to keep training
...
data on disk as a memmap rather than in memory
2019-03-21 17:02:18 +00:00
Matthew Carrigan
2bba7f810e
Added a --reduce_memory option to shelve docs to disk instead of keeping them in memory.
2019-03-21 16:50:16 +00:00
Matthew Carrigan
8733ffcb5e
Removing a couple of other old unnecessary comments
2019-03-21 14:09:57 +00:00
Matthew Carrigan
8a861048dd
Fixed up the notes on a possible future low-memory path
2019-03-21 14:08:39 +00:00
Matthew Carrigan
a8a577ba93
Greatly reduced memory usage when pregenerating the data by writing it
...
out on the fly without shuffling - the Sampler in the finetuning script
will shuffle for us.
2019-03-21 14:05:52 +00:00
Matthew Carrigan
0ae59e662d
Greatly reduced memory usage when pregenerating the data by writing it
...
out on the fly without shuffling - the Sampler in the finetuning script
will shuffle for us.
2019-03-21 14:04:17 +00:00
Matthew Carrigan
6a9038ba53
Removed an old irrelevant comment
2019-03-21 13:36:41 +00:00
Yuqiang Xie
77944d1b31
add tqdm to the process of eval
...
Maybe better.
2019-03-21 20:59:33 +08:00
Matthew Carrigan
29a392fbcf
Small README changes
2019-03-20 17:35:17 +00:00
Matthew Carrigan
832b2b0058
Adding README
2019-03-20 17:31:49 +00:00
Matthew Carrigan
934d3f4d2f
Syncing up argument names between the scripts
2019-03-20 17:23:23 +00:00
Matthew Carrigan
f19ba35b2b
Move old finetuning script into the new folder
2019-03-20 16:47:06 +00:00
Matthew Carrigan
7de5c6aa5e
PEP8 and formatting cleanups
2019-03-20 16:44:04 +00:00
Matthew Carrigan
1798e98e5a
Added final TODOs
2019-03-20 16:42:37 +00:00
Matthew Carrigan
c64c2fc4c2
Fixed embarrassing indentation problem
2019-03-20 15:42:57 +00:00
Matthew Carrigan
0540d360f2
Fixed logging
2019-03-20 15:36:51 +00:00
Matthew Carrigan
976554a472
First commit of the new LM finetuning
2019-03-20 14:23:51 +00:00
Ananya Harsh Jha
e5b63fb542
Merge branch 'master' of https://github.com/ananyahjha93/pytorch-pretrained-BERT
...
pull current master to local
2019-03-17 08:30:13 -04:00
Ananya Harsh Jha
8a4e90ff40
corrected folder creation error for MNLI-MM, verified GLUE results
2019-03-17 08:16:50 -04:00
Ananya Harsh Jha
e0bf01d9a9
added hack for mismatched MNLI
2019-03-16 14:10:48 -04:00
Ananya Harsh Jha
4c721c6b6a
added eval time metrics for GLUE tasks
2019-03-15 23:21:24 -04:00
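A hedged sketch of GLUE-style eval metrics like the ones added here (the task-to-metric routing is the standard GLUE convention, not copied from the commit): CoLA is scored by Matthews correlation, MRPC/QQP by accuracy plus F1, and STS-B by Pearson/Spearman correlations.
```
import numpy as np
from scipy.stats import pearsonr, spearmanr
from sklearn.metrics import f1_score, matthews_corrcoef

def simple_accuracy(preds, labels):
    return (preds == labels).mean()

def acc_and_f1(preds, labels):
    # MRPC/QQP report both accuracy and F1.
    return {"acc": simple_accuracy(preds, labels),
            "f1": f1_score(y_true=labels, y_pred=preds)}

def pearson_and_spearman(preds, labels):
    # STS-B is a regression task scored by correlation.
    return {"pearson": pearsonr(preds, labels)[0],
            "spearmanr": spearmanr(preds, labels)[0]}

preds = np.array([1, 0, 1, 1])
labels = np.array([1, 0, 0, 1])
print({"mcc": matthews_corrcoef(labels, preds)}, acc_and_f1(preds, labels))
```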
tseretelitornike
83857ffeaa
Added missing imports.
2019-03-15 12:45:48 +01:00
Yongbo Wang
d1e4fa98a9
typo in annotation
...
modify `heruistic` to `heuristic` in line 660, `charcter` to `character` in line 661.
2019-03-14 17:32:15 +08:00
Yongbo Wang
3d6452163d
typo
...
modify `mull` to `null` in line 474 annotation.
2019-03-14 17:03:38 +08:00
thomwolf
a98dfe4ced
fixing #377 (empty nbest_predictions.json)
2019-03-14 09:57:06 +01:00
Ananya Harsh Jha
043c8781ef
added code for all glue task processors
2019-03-14 04:24:04 -04:00
Yongbo Wang
22a465a91f
Simplify code, delete redundant line
...
delete redundant line `if args.train`, simplify code.
2019-03-13 09:42:06 +08:00