transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 17:22:25 +06:00

Author	SHA1	Message	Date
LysandreJik	7e7fc53da5	Fixing run_glue example with RoBERTa	2019-08-16 11:53:10 -04:00
LysandreJik	715534800a	BERT + RoBERTa masking tokens handling + GPU device update.	2019-08-16 10:10:21 -04:00
LysandreJik	339e556feb	CLM for BERT, beginning of CLM fot RoBERTa; still needs a better masking token mechanism.	2019-08-16 10:10:20 -04:00
LysandreJik	5c18825a18	Removed dataset limit	2019-08-16 10:10:20 -04:00
LysandreJik	3e3e145497	Added GPT to the generative fine-tuning.	2019-08-16 10:10:20 -04:00
LysandreJik	47975ed53e	Language Modeling fine-tuning using GPT-2.	2019-08-16 10:10:20 -04:00
LysandreJik	ab05280666	Order of strings in AutoModel/AutoTokenizer updated.	2019-08-16 09:53:26 -04:00
wangfei	b8ff56896c	Fix bug of multi-gpu training in lm finetuning	2019-08-16 12:11:05 +08:00
LysandreJik	9d0029e215	Added RoBERTa example to README	2019-08-15 17:17:35 -04:00
LysandreJik	83dba0b67b	Added RoBERTa tokenizer to AutoTokenizer	2019-08-15 17:07:07 -04:00
LysandreJik	e24e19ce3b	Added RoBERTa to AutoModel/AutoConfig	2019-08-15 14:02:11 -04:00
LysandreJik	fe02e45e48	Release: 1.1.0	2019-08-15 11:15:08 -04:00
Lysandre Debut	88efc65bac	Merge pull request #964 from huggingface/RoBERTa RoBERTa: model conversion, inference, tests 🔥	2019-08-15 11:11:10 -04:00
LysandreJik	8308170156	Warning for RoBERTa sequences encoded without special tokens.	2019-08-15 10:29:04 -04:00
LysandreJik	572dcfd1db	Doc	2019-08-14 14:56:14 -04:00
Julien Chaumond	c4ef103447	[RoBERTa] First 4 authors cf. https://github.com/huggingface/pytorch-transformers/pull/964#discussion_r313574354 Co-Authored-By: Myle Ott <myleott@fb.com>	2019-08-14 12:31:09 -04:00
Rabeeh KARIMI	3d47a7f8ab	loads the tokenizer for each checkpoint, to solve the reproducability issue	2019-08-14 10:58:26 +02:00
samvelyan	9ce36e3e4b	Re-implemented tokenize() iteratively in PreTrainedTokenizer.	2019-08-14 08:57:09 +00:00
LysandreJik	39f426be65	Added special tokens <pad> and <mask> to RoBERTa.	2019-08-13 15:19:50 -04:00
Julien Chaumond	baf08ca1d4	[RoBERTa] run_glue: correct pad_token + reorder labels	2019-08-13 12:51:15 -04:00
LysandreJik	3d87991f60	Fixed error with encoding	2019-08-13 12:00:24 -04:00
tuvuumass	ba4bce2581	fix issue #824	2019-08-13 11:26:27 -04:00
LysandreJik	634a3172d8	Added integration tests for sequence builders.	2019-08-12 15:14:15 -04:00
LysandreJik	22ac004a7c	Added documentation and changed parameters for special_tokens_sentences_pair.	2019-08-12 15:13:53 -04:00
Julien Chaumond	912fdff899	[RoBERTa] Update `run_glue` for RoBERTa	2019-08-12 13:49:50 -04:00
Julien Chaumond	b3d83d68db	Fixup `9d0603148b`	2019-08-12 12:28:55 -04:00
carefree0910	a7b4cfe919	Update README.md I assume that it should test the `re-load` functionality after testing the `save` functionality, however I'm also surprised that nobody points this out after such a long time, so maybe I've misunderstood the purpose. This PR is just in case :)	2019-08-12 09:53:05 -04:00
erenup	b219029c45	refactoring old run_swag. This script is mainly refatored from run_squad in pytorch_transformers	2019-08-11 15:20:37 +08:00
thomwolf	aaedfc35a8	Merge branch 'master' of https://github.com/huggingface/pytorch-transformers	2019-08-10 20:04:37 +02:00
thomwolf	c683c3d5a5	fix #993	2019-08-10 20:04:35 +02:00
Kevin Trebing	7060766490	Corrected logger.error info Signed-off-by: Kevin Trebing <Kevin.Trebing@gmx.net>	2019-08-09 19:36:44 -04:00
LysandreJik	75d5f98fd2	Roberta tokenization + fixed tests (py3 + py2).	2019-08-09 15:02:13 -04:00
LysandreJik	14e970c271	Tokenization encode/decode class-based sequence handling	2019-08-09 15:01:38 -04:00
LysandreJik	3566d27919	Clarified PreTrainedModel.from_pretrained warning messages in documentation.	2019-08-08 19:04:34 -04:00
LysandreJik	fbd746bd06	Updated test architecture	2019-08-08 18:21:34 -04:00
LysandreJik	6c41a8f5dc	Encode and Decode are back in the superclass. They now handle sentence pairs special tokens.	2019-08-08 18:20:32 -04:00
Julien Chaumond	e367ac469c	[RoBERTa] Re-apply `39d72bcc7b` cc @lysandrejik	2019-08-08 11:26:11 -04:00
Julien Chaumond	9d0603148b	[RoBERTa] RobertaForSequenceClassification + conversion	2019-08-08 11:24:54 -04:00
LysandreJik	f2b300df6b	fix #976	2019-08-08 10:38:57 -04:00
LysandreJik	7df303f5ad	fix #971	2019-08-08 10:36:26 -04:00
LysandreJik	d2cc6b101e	Merge branch 'master' into RoBERTa	2019-08-08 09:42:05 -04:00
LysandreJik	39d72bcc7b	Fixed the RoBERTa checkpoint conversion script according to the LM head refactoring.	2019-08-07 14:21:57 -04:00
LysandreJik	770043eea2	Sentence-pair tasks handling. Using common tests on RoBERTa. Forced push to fix indentation.	2019-08-07 12:53:19 -04:00
Thomas Wolf	7729ef7381	Merge pull request #955 from FeiWang96/master Fix comment typo	2019-08-07 10:11:25 +02:00
Thomas Wolf	5c6ecf37e7	Merge pull request #958 from saket404/typo-fix Fixed small typo	2019-08-07 10:10:20 +02:00
Thomas Wolf	b4f9464f90	Merge pull request #960 from ethanjperez/patch-1 Fixing unused weight_decay argument	2019-08-07 10:09:55 +02:00
Thomas Wolf	822d6768eb	Merge pull request #962 from guotong1988/patch-1 Update modeling_xlnet.py	2019-08-07 10:09:20 +02:00
Thomas Wolf	7e6102ce74	Merge pull request #963 from guotong1988/patch-2 Update modeling_bert.py	2019-08-07 10:09:04 +02:00
Thomas Wolf	3773ba44f0	Merge pull request #977 from chrisgzf/master Fixed typo in migration guide	2019-08-07 10:08:45 +02:00
Thomas Wolf	a80aa03bda	Merge pull request #973 from FeiWang96/bert_config Fix examples of loading pretrained models in docstring	2019-08-07 10:08:22 +02:00

... 150 151 152 153 154 ...

8821 Commits