Commit Graph

255 Commits

thomwolf
edfe91c36e first version bertology ok 2019-06-19 23:43:04 +02:00
thomwolf
7766ce66dd update bertology 2019-06-19 22:29:51 +02:00
thomwolf
e4b46d86ce update head pruning 2019-06-19 22:16:30 +02:00
thomwolf
0f40e8d6a6 debugger 2019-06-19 15:38:46 +02:00
thomwolf
0e1e8128bf more logging 2019-06-19 15:35:49 +02:00
thomwolf
909d4f1af2 cuda again 2019-06-19 15:32:10 +02:00
thomwolf
14f0e8e557 fix cuda 2019-06-19 15:29:28 +02:00
thomwolf
34d706a0e1 pruning in bertology 2019-06-19 15:25:49 +02:00
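The two bertology commits above (e4b46d86ce, 34d706a0e1) revolve around pruning attention heads. As a rough illustration of the mechanic (not the repository's actual implementation), a per-head 0/1 mask can be multiplied into the attention probabilities so a pruned head contributes nothing to the layer output; all names here are hypothetical:

```python
import torch

def apply_head_mask(attention_probs: torch.Tensor, head_mask: torch.Tensor) -> torch.Tensor:
    """Zero out pruned attention heads.

    attention_probs: (batch, num_heads, seq_len, seq_len)
    head_mask:       (num_heads,), 1.0 for kept heads, 0.0 for pruned ones
    """
    return attention_probs * head_mask[None, :, None, None]

# Illustrative usage: prune the first 6 of 12 heads.
probs = torch.softmax(torch.randn(2, 12, 8, 8), dim=-1)
mask = torch.ones(12)
mask[:6] = 0.0
masked_probs = apply_head_mask(probs, mask)
```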
thomwolf
dc8e0019b7 updating examples 2019-06-19 13:23:20 +02:00
thomwolf
68ab9599ce small fix and updates to readme 2019-06-19 09:38:38 +02:00
thomwolf
f7e2ac01ea update barrier 2019-06-18 22:43:35 +02:00
thomwolf
4d8c4337ae test barrier in distrib training 2019-06-18 22:41:28 +02:00
thomwolf
3359955622 updating run_classif 2019-06-18 22:23:10 +02:00
thomwolf
29b7b30eaa updating evaluation on a single gpu 2019-06-18 22:20:21 +02:00
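The barrier and single-GPU-evaluation commits above (f7e2ac01ea, 4d8c4337ae, 29b7b30eaa) point at a common pattern: run evaluation on the main process only and hold the other ranks at a barrier. A minimal sketch, assuming the usual local_rank convention where -1 means non-distributed and evaluate() stands in for the real evaluation loop:

```python
import torch.distributed as dist

def evaluate_on_main_process(model, local_rank, evaluate):
    # Only rank 0 (or a non-distributed run) evaluates; the other ranks wait
    # at the barrier instead of duplicating the single-GPU evaluation.
    if local_rank in (-1, 0):
        evaluate(model)
    if local_rank != -1:
        dist.barrier()  # all ranks re-synchronize before training resumes
```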
thomwolf
7d2001aa44 overwrite_output_dir 2019-06-18 22:13:30 +02:00
thomwolf
16a1f338c4 fixing 2019-06-18 17:06:31 +02:00
thomwolf
92e0ad5aba no numpy 2019-06-18 17:00:52 +02:00
thomwolf
4e6edc3274 hop 2019-06-18 16:57:15 +02:00
thomwolf
f55b60b9ee fixing again 2019-06-18 16:56:52 +02:00
thomwolf
8bd9118294 quick fix 2019-06-18 16:54:41 +02:00
thomwolf
3e847449ad fix out_label_ids 2019-06-18 16:53:31 +02:00
thomwolf
aad3a54e9c fix paths 2019-06-18 16:48:04 +02:00
thomwolf
40dbda6871 updating classification example 2019-06-18 16:45:52 +02:00
thomwolf
7388c83b60 update run_classifier for distributed eval 2019-06-18 16:32:49 +02:00
thomwolf
9727723243 fix pickle 2019-06-18 16:02:42 +02:00
thomwolf
9710b68dbc fix pickles 2019-06-18 16:01:15 +02:00
thomwolf
15ebd67d4e cache in run_classifier + various fixes to the examples 2019-06-18 15:58:22 +02:00
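The caching commit above (15ebd67d4e) suggests the familiar preprocess-once pattern. A minimal sketch, assuming a hypothetical build_features() that performs the expensive tokenization:

```python
import os
import torch

def load_or_build_features(cache_path, build_features):
    # Serialize preprocessed features on the first run, then reload them on
    # later runs instead of re-tokenizing the whole dataset.
    if os.path.exists(cache_path):
        return torch.load(cache_path)
    features = build_features()
    torch.save(features, cache_path)
    return features
```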
thomwolf
e6e5f19257 fix 2019-06-18 14:45:14 +02:00
thomwolf
a432b3d466 distributed training t_total 2019-06-18 14:39:09 +02:00
thomwolf
c5407f343f split squad example in two 2019-06-18 14:29:03 +02:00
thomwolf
335f57baf8 only on main process 2019-06-18 14:03:46 +02:00
thomwolf
326944d627 add tensorboard to run_squad 2019-06-18 14:02:42 +02:00
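For the TensorBoard commit above (326944d627), a hedged sketch of the usual logging hookup; examples of that era typically used tensorboardX, while the writer below ships with modern PyTorch. The tag name and the loss value are illustrative:

```python
from torch.utils.tensorboard import SummaryWriter

writer = SummaryWriter(log_dir="runs/squad")
for global_step in range(3):
    loss = 1.0 / (global_step + 1)  # placeholder for the training loss
    writer.add_scalar("loss", loss, global_step)
writer.close()
```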
thomwolf
d82e5deeb1 set find_unused_parameters=True in DDP 2019-06-18 12:13:14 +02:00
thomwolf
a59abedfb5 DDP update 2019-06-18 12:06:26 +02:00
thomwolf
2ef5e0de87 switch to pytorch DistributedDataParallel 2019-06-18 12:03:13 +02:00
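The three DDP commits above (2ef5e0de87, a59abedfb5, d82e5deeb1) move the examples to PyTorch's native DistributedDataParallel and enable find_unused_parameters=True, which DDP needs when some parameters receive no gradient in a given forward pass. A minimal sketch, assuming one process per GPU launched with a --local_rank argument (e.g. via torch.distributed.launch):

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel

def wrap_model_ddp(model: torch.nn.Module, local_rank: int) -> torch.nn.Module:
    dist.init_process_group(backend="nccl")  # launcher provides rendezvous env vars
    torch.cuda.set_device(local_rank)
    model.cuda(local_rank)
    return DistributedDataParallel(
        model,
        device_ids=[local_rank],
        output_device=local_rank,
        # Needed when parts of the model (e.g. pruned heads or unused task
        # heads) produce no gradient in a forward pass.
        find_unused_parameters=True,
    )
```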
thomwolf
9ce37af99b oops 2019-06-18 11:47:54 +02:00
thomwolf
a40955f071 no need to duplicate models anymore 2019-06-18 11:46:14 +02:00
thomwolf
382e2d1e50 splitting config and weight files for bert also 2019-06-18 10:37:16 +02:00
Thomas Wolf
cad88e19de
Merge pull request #672 from oliverguhr/master
Add vocabulary and model config to the finetune output
2019-06-14 17:02:47 +02:00
Thomas Wolf
460d9afd45
Merge pull request #640 from Barqawiz/master
Support latest multi language bert fine tune
2019-06-14 16:57:02 +02:00
Thomas Wolf
277c77f1c5
Merge pull request #630 from tguens/master
Update run_squad.py
2019-06-14 16:56:26 +02:00
Thomas Wolf
659af2cbd0
Merge pull request #604 from samuelbroscheit/master
Fixing issue "Training beyond specified 't_total' steps with schedule 'warmup_linear'" reported in #556
2019-06-14 16:49:24 +02:00
Meet Pragnesh Shah
e02ce4dc79
[hotfix] Fix frozen pooler parameters in SWAG example. 2019-06-11 15:13:53 -07:00
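One plausible reading of this hotfix (an assumption, not a quote of the repository's code): if optimizer parameter groups are built by filtering parameter names and the pooler is left out, its weights silently never update. A hypothetical sketch of the unfreezing, with a stand-in model:

```python
import torch
from torch import nn

class TinyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Linear(4, 4)
        self.pooler = nn.Linear(4, 4)  # stands in for BERT's pooler

model = TinyModel()
for name, param in model.named_parameters():
    if "pooler" in name:
        param.requires_grad = True  # make sure the pooler is trainable
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=5e-5
)
```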
Oliver Guhr
5c08c8c273 adds the tokenizer + model config to the output 2019-06-11 13:46:33 +02:00
jeonsworld
a3a604cefb
Update pregenerate_training_data.py
applies the Whole Word Masking technique,
following [create_pretraining_data.py](https://github.com/google-research/bert/blob/master/create_pretraining_data.py)
2019-06-10 12:17:23 +09:00
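Whole Word Masking changes the masking unit from single WordPiece tokens to whole words: a piece beginning with "##" is grouped with the word preceding it, and the whole group is masked together. A self-contained sketch of that grouping (the token sequence is illustrative):

```python
import random

def whole_word_candidates(tokens):
    """Group WordPiece tokens into whole-word index lists: a "##" piece
    joins the preceding word's group instead of starting a new one."""
    candidates = []
    for i, token in enumerate(tokens):
        if token in ("[CLS]", "[SEP]"):
            continue
        if candidates and token.startswith("##"):
            candidates[-1].append(i)
        else:
            candidates.append([i])
    return candidates

tokens = ["[CLS]", "the", "embed", "##ding", "layer", "[SEP]"]
words = whole_word_candidates(tokens)  # [[1], [2, 3], [4]]
random.shuffle(words)
masked = list(tokens)
for i in words[0]:          # masking a word masks all of its pieces
    masked[i] = "[MASK]"
```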
Ahmad Barqawi
c4fe56dcc0 support latest multi language bert fine tune
fixes an issue with bert-base-multilingual and adds support for the uncased multilingual model
2019-05-27 11:27:41 +02:00
tguens
9e7bc51b95
Update run_squad.py
Indentation change so that the output "nbest_predictions.json" is not empty.
2019-05-22 17:27:59 +08:00
samuelbroscheit
94247ad6cb Make num_train_optimization_steps int 2019-05-13 12:38:22 +02:00
samuel.broscheit
49a77ac16f Clean up a little bit 2019-05-12 00:31:10 +02:00
samuel.broscheit
3bf3f9596f Fixing the issues reported in https://github.com/huggingface/pytorch-pretrained-BERT/issues/556
The reason for the issue was that the number of optimization steps was computed from the example count, which differs from the actual length of the dataloader when an example is chunked into multiple training instances.

The solution in this pull request is to compute num_optimization_steps directly from len(data_loader).
2019-05-12 00:13:45 +02:00
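A minimal sketch of the fix this commit describes, assuming the training script exposes the dataloader, the gradient-accumulation factor, and the epoch count (the names are illustrative):

```python
def num_train_optimization_steps(train_dataloader, gradient_accumulation_steps, num_train_epochs):
    # len(train_dataloader) counts batches per epoch and therefore already
    # reflects how many instances each (possibly chunked) example produced,
    # unlike the raw example count that made warmup_linear overshoot t_total.
    return (len(train_dataloader) // gradient_accumulation_steps) * num_train_epochs
```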