transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Rémi Louf	a88a0e4413	add tests to encoder-decoder model	2019-10-30 16:06:29 +01:00
Rémi Louf	3f07cd419c	update test on Bert to include decoder mode	2019-10-30 15:09:53 +01:00
Thomas Wolf	55fbfea369	Update CONTRIBUTING.md Co-Authored-By: Stefan Schweter <stefan.schweter@bsb-muenchen.de>	2019-10-30 12:25:40 +01:00
Thomas Wolf	cef2a8f900	Update CONTRIBUTING.md Co-Authored-By: Stefan Schweter <stefan.schweter@bsb-muenchen.de>	2019-10-30 12:25:31 +01:00
thomwolf	328a86d2af	adding links to the templates in readme and contributing	2019-10-30 11:37:55 +01:00
thomwolf	7f4226f9e6	adding templates	2019-10-30 11:31:56 +01:00
Rémi Louf	070507df1f	format utils for summarization	2019-10-30 11:24:12 +01:00
Rémi Louf	da10de8466	fix bug with padding mask + add corresponding test	2019-10-30 11:19:58 +01:00
Rémi Louf	3b0d2fa30e	rename seq2seq to encoder_decoder	2019-10-30 10:54:46 +01:00
Rémi Louf	9c1bdb5b61	revert renaming of lm_labels to ltr_lm_labels	2019-10-30 10:43:13 +01:00
Timothy Liu	842f3bf049	Fixed training for TF XLM	2019-10-30 01:32:15 +00:00
Rémi Louf	098a89f312	update docstrings; rename lm_labels to more explicit ltr_lm_labels	2019-10-29 20:08:03 +01:00
Rémi Louf	dfce409691	resolve PR comments	2019-10-29 17:10:20 +01:00
altsoph	079bfb32fb	Evaluation fixed.	2019-10-28 10:18:58 -04:00
altsoph	438f2730a0	Evaluation code fixed.	2019-10-28 10:18:58 -04:00
Rémi Louf	4c3ac4a7d8	here's one big commit	2019-10-28 10:49:50 +01:00
Rémi Louf	932543f77e	fix test of truncation function	2019-10-28 10:49:49 +01:00
Rémi Louf	a67413ccc8	extend works in-place	2019-10-28 10:49:49 +01:00
Rémi Louf	cb26b035c6	remove potential UndefinedError	2019-10-28 10:49:49 +01:00
Rémi Louf	b915ba9dfe	pad sequence with 0, mask with -1	2019-10-28 10:49:49 +01:00
Rémi Louf	dc580dd4c7	add lm_labels for the LM cross-entropy	2019-10-28 10:49:49 +01:00
Rémi Louf	f873a3edb2	the decoder attends to the output of the encoder stack (last layer)	2019-10-28 10:49:00 +01:00
Lysandre	beaf66b1f3	Remove break	2019-10-24 21:43:28 +00:00
Lysandre	bab6ad01aa	run_tf_glue works with all tasks	2019-10-24 21:41:45 +00:00
Matt Maybeno	ae1d03fc51	Add roberta to doc	2019-10-24 14:32:48 -04:00
Matt Maybeno	4e5f88b74f	Add Roberta to run_ner.py	2019-10-24 14:32:48 -04:00
Matt Maybeno	b92d68421d	Use roberta model and update doc strings	2019-10-24 14:32:48 -04:00
Matt Maybeno	66085a1321	RoBERTa token classification [WIP] copy paste bert token classification for roberta	2019-10-24 14:32:48 -04:00
Lysandre	b82bfbd0c3	Updated README to show all available documentation	2019-10-24 15:55:31 +00:00
VictorSanh	5b6cafb11b	[release] fix table weirdness	2019-10-23 10:35:16 -04:00
VictorSanh	8ad5c591cd	[RELEASE] DistilRoBERTa	2019-10-23 10:29:47 -04:00
focox@qq.com	bd847ce7d7	fixed the bug raised by "tmp_eval_loss += tmp_eval_loss.item()" when parallelly using multi-gpu.	2019-10-23 20:27:13 +08:00
Lysandre Debut	6e85bccafc	Fixed typo	2019-10-22 18:07:01 -04:00
Lysandre	fbcc5ff9fb	Change branch to master	2019-10-22 18:01:10 -04:00
Lysandre	69eba0ab19	Edit script path	2019-10-22 17:53:52 -04:00
Lysandre	bc3e57d551	Multi version doc deployment	2019-10-22 17:51:30 -04:00
Julien Chaumond	ef1b8b2ae5	[CTRL] warn if generation prompt does not start with a control code see also https://github.com/salesforce/ctrl/pull/50	2019-10-22 21:30:32 +00:00
Julián Peller (dataista)	e16d46843a	Fix architectures count	2019-10-22 15:13:47 -04:00
Lysandre	7d709e55ed	Remove	2019-10-22 14:12:33 -04:00
Lysandre	44286b94d3	RoBERTa doesn't print a warning when no special tokens are passed.	2019-10-22 13:46:48 -04:00
Lysandre	1cfd974868	Option to benchmark only one of the two libraries	2019-10-22 13:32:23 -04:00
Lysandre	777faa8ae7	Fix #1597	2019-10-22 11:26:42 -04:00
Thomas Wolf	b8c9ea0010	Merge pull request #1580 from pminervini/master Gradient norm clipping should be done right before calling the optimiser	2019-10-22 13:59:20 +02:00
Pasquale Minervini	abd7110e21	gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well	2019-10-21 19:56:52 +01:00
thomwolf	4d456542e9	Fix citation	2019-10-21 16:34:14 +02:00
Thomas Wolf	0e64fec1ab	Merge pull request #1568 from daemon/patch-1 Fix hanging when loading pretrained models	2019-10-21 14:31:57 +02:00
Pasquale Minervini	3775550c4b	gradient norm clipping should be done right before calling the optimiser	2019-10-20 22:33:56 +01:00
Pasquale Minervini	bf2c36a920	Merge pull request #1 from huggingface/master update	2019-10-20 23:30:45 +02:00
Ralph Tang	a2c8c8ef00	Fix hanging when loading pretrained models - Fix hanging when loading pretrained models from the cache without having internet access. This is a widespread issue on supercomputers whose internal compute nodes are firewalled.	2019-10-19 16:19:20 -04:00
LysandreJik	82f6abd98a	Benchmark section added to the documentation	2019-10-18 17:27:10 -04:00

1 2 3 4 5 ...

2119 Commits