transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
focox@qq.com	bd847ce7d7	fixed the bug raised by "tmp_eval_loss += tmp_eval_loss.item()" when parallelly using multi-gpu.	2019-10-23 20:27:13 +08:00
Julien Chaumond	ef1b8b2ae5	[CTRL] warn if generation prompt does not start with a control code see also https://github.com/salesforce/ctrl/pull/50	2019-10-22 21:30:32 +00:00
Julián Peller (dataista)	e16d46843a	Fix architectures count	2019-10-22 15:13:47 -04:00
Lysandre	1cfd974868	Option to benchmark only one of the two libraries	2019-10-22 13:32:23 -04:00
Lysandre	777faa8ae7	Fix #1597	2019-10-22 11:26:42 -04:00
Thomas Wolf	b8c9ea0010	Merge pull request #1580 from pminervini/master Gradient norm clipping should be done right before calling the optimiser	2019-10-22 13:59:20 +02:00
Pasquale Minervini	abd7110e21	gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well	2019-10-21 19:56:52 +01:00
thomwolf	4d456542e9	Fix citation	2019-10-21 16:34:14 +02:00
Thomas Wolf	0e64fec1ab	Merge pull request #1568 from daemon/patch-1 Fix hanging when loading pretrained models	2019-10-21 14:31:57 +02:00
Pasquale Minervini	3775550c4b	gradient norm clipping should be done right before calling the optimiser	2019-10-20 22:33:56 +01:00
Pasquale Minervini	bf2c36a920	Merge pull request #1 from huggingface/master update	2019-10-20 23:30:45 +02:00
Ralph Tang	a2c8c8ef00	Fix hanging when loading pretrained models - Fix hanging when loading pretrained models from the cache without having internet access. This is a widespread issue on supercomputers whose internal compute nodes are firewalled.	2019-10-19 16:19:20 -04:00
LysandreJik	82f6abd98a	Benchmark section added to the documentation	2019-10-18 17:27:10 -04:00
LysandreJik	7dd29ed2f1	Benchmarks example script	2019-10-18 10:53:04 -04:00
Lysandre Debut	8efc0ec91a	Add Benchmarks to issue templates	2019-10-18 10:45:44 -04:00
VictorSanh	fd97761c5a	soft launch distilroberta	2019-10-17 15:28:58 -04:00
leo-du	ecd15667f3	fix repetition penalty	2019-10-17 14:47:14 -04:00
Lysandre Debut	c544194611	Remove `special_tokens_mask` from inputs in README Co-authored-by: Thomas Wolf @thomwolf	2019-10-16 11:05:13 -04:00
thomwolf	be916cb3fb	Merge branch 'master' of https://github.com/huggingface/transformers	2019-10-15 10:37:13 +02:00
thomwolf	5875aaf762	install tensorboard	2019-10-15 10:36:46 +02:00
Thomas Wolf	40f14ff545	Merge pull request #1513 from slayton58/amp_fp16_einsum Force einsum to run in fp16	2019-10-15 10:25:00 +02:00
Thomas Wolf	e703e4dfe1	Merge pull request #1509 from julian-pani/patch-3 remove leftover usage of DUMMY_INPUTS	2019-10-15 10:24:13 +02:00
thomwolf	898ce064f8	add tests on TF2.0 & PT checkpoint => model convertion functions	2019-10-15 10:04:19 +02:00
Thomas Wolf	d147671c6c	Merge pull request #1508 from tlkh/master Added performance enhancements (XLA, AMP) to examples	2019-10-15 09:57:18 +02:00
thomwolf	2c1d5564ad	add readme information	2019-10-15 09:56:52 +02:00
Thomas Wolf	08bd8f9f39	Merge pull request #1505 from e-budur/master Fixed the sample code in the title 'Quick tour'.	2019-10-15 09:50:36 +02:00
Thomas Wolf	8aa3b753bd	Merge pull request #1434 from bryant1410/patch-1 Remove unnecessary use of FusedLayerNorm in XLNet	2019-10-15 09:44:19 +02:00
Thomas Wolf	621e7a2529	Merge pull request #1275 from stecklin/ner-fine-tuning Implement fine-tuning BERT on CoNLL-2003 named entity recognition task	2019-10-15 09:35:24 +02:00
thomwolf	c55badcee0	Add NER finetuning details by @stefan-it in example readme	2019-10-15 09:33:52 +02:00
Julien Chaumond	788e632622	[ner] Honor args.overwrite_cache	2019-10-15 09:17:31 +02:00
thomwolf	0f9ebb0b43	add seqeval as requirement for examples	2019-10-15 09:17:31 +02:00
thomwolf	66adb71734	update to transformers	2019-10-15 09:17:31 +02:00
Marianne Stecklina	5ff9cd158a	Add option to predict on test set	2019-10-15 09:17:31 +02:00
Marianne Stecklina	7f5367e0b1	Add cli argument for configuring labels	2019-10-15 09:17:31 +02:00
Marianne Stecklina	e1d4179b64	Make file reading more robust	2019-10-15 09:17:31 +02:00
Marianne Stecklina	383ef96747	Implement fine-tuning BERT on CoNLL-2003 named entity recognition task	2019-10-15 09:17:31 +02:00
Marianne Stecklina	5adb39e757	Add option to predict on test set	2019-10-15 09:14:53 +02:00
Marianne Stecklina	99b189df6d	Add cli argument for configuring labels	2019-10-15 09:14:53 +02:00
Marianne Stecklina	3e9420add1	Make file reading more robust	2019-10-15 09:14:53 +02:00
Marianne Stecklina	cde42c4354	Implement fine-tuning BERT on CoNLL-2003 named entity recognition task	2019-10-15 09:14:53 +02:00
Thomas Wolf	80889a0226	Merge pull request #1512 from louismartin/fix-roberta-convert Fix import error in script to convert faisreq roberta checkpoints	2019-10-14 17:40:32 +02:00
Simon Layton	4e6a55751a	Force einsum to fp16	2019-10-14 11:12:41 -04:00
Thomas Wolf	f62f992cf7	Merge pull request #1502 from jeffxtang/master the working example code to use BertForQuestionAnswering	2019-10-14 16:14:52 +02:00
Louis MARTIN	49cba6e543	Fix import error in script to convert faisreq roberta checkpoints	2019-10-14 01:38:57 -07:00
JulianPani	0993586758	remove usage of DUMMY_INPUTS Hey @thomwolf This change `da26bae61b (diff-8ddce309e88e8eb5b4d02228fd8881daL28-L29)` removed the constant, but one usage of that constant remains in the code.	2019-10-14 02:09:53 +03:00
Timothy Liu	376e65a674	Added automatic mixed precision and XLA options to run_tf_glue.py	2019-10-13 13:19:06 +00:00
Timothy Liu	86f23a1944	Minor enhancements to run_tf_glue.py	2019-10-13 10:21:35 +00:00
Emrah Budur	5a8c6e771a	Fixed the sample code in the title 'Quick tour'.	2019-10-12 14:17:17 +03:00
jeffxtang	e76d71521c	the working example code to use BertForQuestionAnswering and get an answer from a text and a question	2019-10-11 17:04:02 -07:00
VictorSanh	d844db4005	Add citation bibtex	2019-10-11 16:55:42 -04:00

1 2 3 4 5 ...

1966 Commits