transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 01:02:25 +06:00

Author	SHA1	Message	Date
Stefan Schweter	3e89fca543	readme: add XLM-RoBERTa to model architecture list	2019-12-18 19:44:23 +01:00
Gunnlaugur Thor Briem	d303f84e7b	fix: wrong architecture count in README Just say “the following” so that this intro doesn't so easily fall out of date :) )	2019-12-17 16:18:00 +00:00
Julien Chaumond	3f5ccb183e	[doc] Clarify uploads cf `855ff0e91d (commitcomment-36452545)`	2019-12-16 18:20:29 -05:00
Julien Chaumond	855ff0e91d	[doc] Model upload and sharing ping @lysandrejik @thomwolf Is this clear enough? Anything we should add?	2019-12-16 12:42:22 -05:00
Thomas Wolf	e92bcb7eb6	Merge pull request #1739 from huggingface/t5 [WIP] Adding Google T5 model	2019-12-14 09:40:43 +01:00
Lysandre	7bd11dda6f	Release: v2.2.2	2019-12-13 16:45:30 -05:00
thomwolf	0558c9cb9b	Merge branch 'master' into t5	2019-12-10 12:58:48 +01:00
Suvrat Bhooshan	df3961121f	Add MMBT Model to Transformers Repo	2019-12-09 18:36:48 -08:00
Pierric Cistac	5c877fe94a	fix albert links	2019-12-09 18:53:00 -05:00
Aymeric Augustin	35401fe50f	Remove dependency on pytest for running tests (#2055 ) * Switch to plain unittest for skipping slow tests. Add a RUN_SLOW environment variable for running them. * Switch to plain unittest for PyTorch dependency. * Switch to plain unittest for TensorFlow dependency. * Avoid leaking open files in the test suite. This prevents spurious warnings when running tests. * Fix unicode warning on Python 2 when running tests. The warning was: UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal * Support running PyTorch tests on a GPU. Reverts `27e015bd`. * Tests no longer require pytest. * Make tests pass on cuda	2019-12-06 13:57:38 -05:00
VictorSanh	552c44a9b1	release distilm-bert	2019-12-05 10:14:58 -05:00
LysandreJik	8101924a68	Patch: v2.2.1	2019-12-03 11:20:26 -05:00
Julien Chaumond	b5d884d25c	Uniformize #1952	2019-11-27 11:05:55 -05:00
Lysandre	cf26a0c85e	Fix pretrained models table	2019-11-26 15:40:03 -05:00
Lysandre Debut	b632145273	Update master documentation link in README	2019-11-26 14:27:15 -05:00
Lysandre	ae98d45991	Release: v2.2.0	2019-11-26 14:12:44 -05:00
Julien Chaumond	176cd1ce1b	[doc] homogenize instructions slightly	2019-11-23 11:18:54 -05:00
Rémi Louf	6f70bb8c69	add instructions to run the examples	2019-11-21 14:41:19 -05:00
Julien Chaumond	3916b334a8	[camembert] Acknowledge the full author list	2019-11-18 09:29:11 -05:00
Sebastian Stabinger	44455eb5b6	Adds CamemBERT to Model architectures list	2019-11-18 09:23:14 -05:00
Thomas Wolf	df99f8c5a1	Merge pull request #1832 from huggingface/memory-leak-schedulers replace LambdaLR scheduler wrappers by function	2019-11-14 22:10:31 +01:00
Rémi Louf	2276bf69b7	update the examples, docs and template	2019-11-14 20:38:02 +01:00
thomwolf	8aba81a0b6	fix #1789	2019-11-12 08:52:43 +01:00
thomwolf	f03c0c1423	adding models in readme and auto classes	2019-11-08 11:49:46 +01:00
Lysandre	68f7064a3e	Add `model.train()` line to ReadMe training example Co-Authored-By: Santosh-Gupta <San.Gupta.ML@gmail.com>	2019-11-04 11:52:35 -05:00
Thomas Wolf	7f84fc571a	Merge pull request #1670 from huggingface/templates Templates and explanation for adding a new model and example script	2019-10-30 17:05:58 +01:00
Thomas Wolf	5c6a19a94a	Merge pull request #1604 from huggingface/deploy_doc Versioning in documentation	2019-10-30 17:03:14 +01:00
thomwolf	328a86d2af	adding links to the templates in readme and contributing	2019-10-30 11:37:55 +01:00
Lysandre	b82bfbd0c3	Updated README to show all available documentation	2019-10-24 15:55:31 +00:00
Julien Chaumond	ef1b8b2ae5	[CTRL] warn if generation prompt does not start with a control code see also https://github.com/salesforce/ctrl/pull/50	2019-10-22 21:30:32 +00:00
Julián Peller (dataista)	e16d46843a	Fix architectures count	2019-10-22 15:13:47 -04:00
thomwolf	4d456542e9	Fix citation	2019-10-21 16:34:14 +02:00
Lysandre Debut	c544194611	Remove `special_tokens_mask` from inputs in README Co-authored-by: Thomas Wolf @thomwolf	2019-10-16 11:05:13 -04:00
Emrah Budur	5a8c6e771a	Fixed the sample code in the title 'Quick tour'.	2019-10-12 14:17:17 +03:00
thomwolf	4b8f3e8f32	adding citation	2019-10-11 16:18:16 +02:00
thomwolf	d9e60f4f0d	Merge branch 'master' into pr/1383	2019-10-09 17:25:08 +02:00
Julien Chaumond	d688af19e5	Update link to swift-coreml-transformers cc @lysandrejik	2019-10-08 16:37:52 -04:00
seanBE	6dc6c716c5	fix pytorch-transformers migration description in README	2019-10-07 09:59:54 +01:00
Christopher Goh	904158ac4d	Rephrase forward method to reduce ambiguity	2019-10-06 23:40:52 -04:00
Christopher Goh	0f65d8cbbe	Fix some typos in README	2019-10-06 23:40:52 -04:00
keskarnitish	dbed1c5d94	Adding CTRL (squashed commit) adding conversion script adding first draft of modeling & tokenization adding placeholder for test files bunch of changes registering the tokenizer/model/etc tests change link; something is very VERY wrong here weird end-of-word thingy going on i think the tokenization works now ; wrote the unit tests overall structure works;load w next the monster is alive! works after some cleanup as well adding emacs autosave to gitignore currently only supporting the 48 layer one; seems to infer fine on my macbook cleanup fixing some documentation fixing some documentation tests passing? now works on CUDA also adding greedy? adding greedy sampling works well	2019-10-03 22:29:03 -07:00
VictorSanh	35071007cb	incoming release 🔥 update links to arxiv preprint	2019-10-03 10:27:11 -04:00
DenysNahurnyi	6971556ab8	Fix syntax typo in README.md	2019-10-01 14:59:31 -04:00
Santosh Gupta	5c3b32d44d	Update README.md Lines 183 - 200, fixed indentation. Line 198, replaced `tokenizer_class` with `BertTokenizer`, since `tokenizer_class` is not defined in the loop it belongs to.	2019-09-30 18:48:01 +00:00
wangfei	60f791631b	Fix link in readme	2019-09-28 16:20:17 +08:00
BramVanroy	15749bfc10	Add small note about the output of hidden states	2019-09-27 10:01:36 +02:00
thomwolf	6c3b131516	typo in readme/doc	2019-09-26 16:23:28 +02:00
thomwolf	4e63c90720	update installation instructions in readme	2019-09-26 16:14:21 +02:00
Lysandre Debut	0f92f76ca3	CircleCI reference in README	2019-09-26 08:59:52 -04:00
thomwolf	9676d1a2a8	update readme and setup.py	2019-09-26 13:47:58 +02:00
thomwolf	4dde31cb76	update readme	2019-09-26 12:18:26 +02:00
thomwolf	4ddc31ff40	update readme with migration change	2019-09-26 12:00:38 +02:00
thomwolf	f47f7f4611	add logo	2019-09-26 11:28:44 +02:00
thomwolf	9fabc0b6a9	wip readme	2019-09-26 11:21:34 +02:00
thomwolf	31c23bd5ee	[BIG] pytorch-transformers => transformers	2019-09-26 10:15:53 +02:00
Julien Chaumond	62760baf46	tiny fixes	2019-09-17 18:29:15 -04:00
Julien Chaumond	f9453d15e5	Fix broken link	2019-09-05 12:35:22 -04:00
Julien Chaumond	f7ee2e5d20	[README] link to Write With Transformer	2019-09-05 12:33:46 -04:00
Thomas Wolf	50e615f43d	Merge branch 'master' into improved_testing	2019-08-30 13:40:35 +02:00
thomwolf	306af132d7	update readme to mention add_special_tokens more clearly in example	2019-08-30 11:30:51 +02:00
LysandreJik	75bc2a03cc	Updated article link	2019-08-28 10:05:15 -04:00
thomwolf	912a377e90	dilbert -> distilbert	2019-08-28 13:59:42 +02:00
thomwolf	4ce5f36f78	update readmes	2019-08-28 12:14:31 +02:00
VictorSanh	497f73c964	add DilBERT to master REAME	2019-08-28 07:16:30 +00:00
thomwolf	e00b4ff1de	fix #1017	2019-08-21 22:22:17 +02:00
Nikolay Korolev	ad6e62cd82	Fix typo. configuratoin -> configuration	2019-08-20 15:43:06 +03:00
Christophe Bourguignat	189ff9b664	Update README after RoBERTa addition	2019-08-17 13:18:37 -04:00
LysandreJik	9d0029e215	Added RoBERTa example to README	2019-08-15 17:17:35 -04:00
Lysandre Debut	88efc65bac	Merge pull request #964 from huggingface/RoBERTa RoBERTa: model conversion, inference, tests 🔥	2019-08-15 11:11:10 -04:00
Julien Chaumond	c4ef103447	[RoBERTa] First 4 authors cf. https://github.com/huggingface/pytorch-transformers/pull/964#discussion_r313574354 Co-Authored-By: Myle Ott <myleott@fb.com>	2019-08-14 12:31:09 -04:00
carefree0910	a7b4cfe919	Update README.md I assume that it should test the `re-load` functionality after testing the `save` functionality, however I'm also surprised that nobody points this out after such a long time, so maybe I've misunderstood the purpose. This PR is just in case :)	2019-08-12 09:53:05 -04:00
LysandreJik	d2cc6b101e	Merge branch 'master' into RoBERTa	2019-08-08 09:42:05 -04:00
Christopher Goh	a6f412da01	Fixed typo in migration guide	2019-08-07 02:19:14 +08:00
Thomas Wolf	d43dc48b34	Merge branch 'master' into auto_models	2019-08-05 19:17:35 +02:00
thomwolf	7223886dc9	fix #944	2019-08-05 17:16:56 +02:00
thomwolf	58830807d1	inidicate we only support pytorch 1.0.0+ now	2019-08-05 14:38:59 +02:00
thomwolf	328afb7097	cleaning up tokenizer tests structure (at last) - last remaining ppb refs	2019-08-05 14:08:56 +02:00
Julien Chaumond	05c083520a	[RoBERTa] model conversion, inference, tests 🔥	2019-08-04 21:39:21 -04:00
thomwolf	009273dbdd	big doc update [WIP]	2019-08-04 12:14:57 +02:00
Julien Chaumond	44dd941efb	link to `swift-coreml-transformers`	2019-08-01 09:50:30 -04:00
Anthony MOI	f2a3eb987e	Fix small typos	2019-07-31 11:05:06 -04:00
Pierric Cistac	97091acb8c	Small spelling fix	2019-07-31 10:37:56 -04:00
Grégory Châtel	769bb643ce	Fixing a broken link.	2019-07-31 10:22:41 -04:00
Thomas Wolf	fec76a481d	Update readme	2019-07-23 16:05:29 +02:00
thomwolf	ba52fe69d5	update breaking change section regarding from_pretrained keyword arguments	2019-07-23 15:10:02 +02:00
rish-16	2f869dc665	Fixed typo	2019-07-21 11:05:36 -04:00
Thomas Wolf	dbecfcf321	Merge pull request #815 from praateekmahajan/update-readme-link Update Readme link for Fine Tune/Usage section	2019-07-18 18:30:32 +02:00
Peiqin Lin	acc48a0cc9	typos	2019-07-18 09:54:04 -04:00
Praateek Mahajan	0d46b17553	Update Readme Incorrect link for `Quick tour: Fine-tuning/usage scripts`	2019-07-17 22:50:10 -07:00
thomwolf	c5b3d86a91	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2019-07-16 21:21:05 +02:00
thomwolf	6b70760204	typos	2019-07-16 21:21:03 +02:00
Thomas Wolf	b33a385091	update readme	2019-07-16 16:18:37 +02:00
thomwolf	6a72d9aa52	updated examples in readme	2019-07-16 16:09:29 +02:00
thomwolf	b59043bf8f	update readme	2019-07-16 16:03:48 +02:00
thomwolf	edc79acb3b	simpler quick tour	2019-07-16 16:02:32 +02:00
thomwolf	5c82d3488f	indicate default evaluation in breaking changes	2019-07-16 15:45:58 +02:00
thomwolf	4acaa65068	model in evaluation mode by default after from_pretrained	2019-07-16 15:41:57 +02:00
thomwolf	1849aa7d39	update readme and pretrained model weight files	2019-07-16 15:11:29 +02:00
thomwolf	43e0e8fa04	updates to readme and doc	2019-07-16 13:56:47 +02:00
thomwolf	352e3ff998	added migration guide to readme	2019-07-16 09:03:49 +02:00
thomwolf	8ad7e5b4f2	indeed	2019-07-16 00:29:15 +02:00
thomwolf	064d0a0b76	update readme	2019-07-16 00:21:33 +02:00
thomwolf	3b8b0e01bb	update readme	2019-07-16 00:12:55 +02:00
thomwolf	2397f958f9	updating examples and doc	2019-07-14 23:20:10 +02:00
thomwolf	6135de2fa3	readme update	2019-07-11 15:39:49 +02:00
thomwolf	e468192e2f	Merge branch 'pytorch-transformers' into xlnet	2019-07-09 17:05:37 +02:00
LysandreJik	ab30651802	Hugging Face theme.	2019-07-08 16:05:26 -04:00
thomwolf	eb91f6437e	update readme and setup	2019-07-05 12:30:15 +02:00
thomwolf	0231ba291e	circle-ci	2019-07-05 11:59:04 +02:00
thomwolf	0bab55d5d5	[BIG] name change	2019-07-05 11:55:36 +02:00
thomwolf	93e9971c54	fix tests	2019-06-26 10:02:45 +02:00
thomwolf	e55d4c4ede	various updates to conversion, models and examples	2019-06-26 00:57:53 +02:00
thomwolf	603c513b35	update main conversion script and readme	2019-06-25 10:45:07 +02:00
thomwolf	62d78aa37e	updating GLUE utils for compatibility with XLNet	2019-06-24 14:36:11 +02:00
thomwolf	c304593d8f	BERTology details in readme	2019-06-20 10:05:06 +02:00
thomwolf	34d706a0e1	pruning in bertology	2019-06-19 15:25:49 +02:00
thomwolf	dc8e0019b7	updating examples	2019-06-19 13:23:20 +02:00
thomwolf	68ab9599ce	small fix and updates to readme	2019-06-19 09:38:38 +02:00
thomwolf	4d8c4337ae	test barrier in distrib training	2019-06-18 22:41:28 +02:00
thomwolf	15ebd67d4e	cache in run_classifier + various fixes to the examples	2019-06-18 15:58:22 +02:00
thomwolf	d82e5deeb1	set find_unused_parameters=True in DDP	2019-06-18 12:13:14 +02:00
thomwolf	f964753090	explanation on the current location of the caching folder	2019-06-18 11:36:28 +02:00
thomwolf	382e2d1e50	spliting config and weight files for bert also	2019-06-18 10:37:16 +02:00
thomwolf	4447f270b2	updating hub	2019-06-17 16:21:28 +02:00
thomwolf	33d3db5c43	updating head masking, readme and docstrings	2019-06-17 15:51:28 +02:00
thomwolf	34858ae1d9	adding bert whole words, bertgerman and gpt-2 medium models, head masking	2019-06-17 11:02:39 +02:00
timoeller	16af9ff7b0	Add German Bert model to code, update readme	2019-06-14 17:42:46 +02:00
Colanim	1eba8b9d96	Fix link in README	2019-05-30 14:01:46 +09:00
lukovnikov	331a46ff04	- replaced OpenAIGPTAdam with OpenAIAdam in docs	2019-04-25 16:04:37 +02:00
lukovnikov	704037ad51	- updated docs for new LR API - added some images for illustration - updated comments in optimization	2019-04-25 15:59:39 +02:00
thomwolf	18a8a15f78	improving GPT2 tokenization and adding tests	2019-04-16 17:00:55 +02:00
thomwolf	1135f2384a	clean up logger in examples for distributed case	2019-04-15 15:22:40 +02:00
thomwolf	cc43307023	update readme	2019-04-15 15:06:10 +02:00
thomwolf	60ea6c59d2	added best practices for serialization in README and examples	2019-04-15 15:00:33 +02:00
thomwolf	20577d8a7c	add configuration serialization to readme	2019-04-15 14:21:41 +02:00
thomwolf	b17963d82f	update readme	2019-04-15 13:44:30 +02:00
Weixin Wang	f26ce6992e	Fix links in README	2019-04-02 17:20:32 +08:00
Sepehr Sameni	b588ff362a	fix lm_finetuning's link	2019-03-29 12:39:24 +04:30
Thomas Wolf	694e2117f3	Merge pull request #388 from ananyahjha93/master Added remaining GLUE tasks to 'run_classifier.py'	2019-03-28 09:06:53 +01:00
Thomas Wolf	bbff03fbfc	Merge pull request #394 from desireevl/master Minor change in README	2019-03-27 12:03:00 +01:00
thomwolf	34561e61a5	update main readme also	2019-03-27 12:00:04 +01:00
Ananya Harsh Jha	f471979167	added GLUE dev set results and details on how to run GLUE tasks	2019-03-21 15:38:30 -04:00
Desiree Vogt-Lee	d52f914e24	weigths to weights	2019-03-21 15:02:59 +10:00
Junjie Qian	d648a02203	Correct line number in README for classes	2019-03-08 16:28:03 -08:00
thomwolf	7cc35c3104	fix openai gpt example and updating readme	2019-03-06 11:43:21 +01:00
thomwolf	906b638efa	updating readme	2019-03-06 10:24:19 +01:00
John Hewitt	e14c6b52e3	add BertTokenizer flag to skip basic tokenization	2019-02-26 20:11:24 -08:00
Joel Grus	8722e9eb3b	finish updating docstrings	2019-02-23 06:31:59 -08:00
Stanislas Polu	ff22b3acc0	Few small nits in GPT-2's code examples	2019-02-21 09:15:27 +00:00
Tong Guo	09efcece75	Update README.md	2019-02-21 11:25:33 +08:00

1 2 3 4 5 ...

372 Commits