transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-07 14:50:07 +06:00

Author	SHA1	Message	Date
Jared T Nielsen	64070cbb88	Fix TF input docstrings to refer to tf.Tensor rather than torch.FloatTensor. (#4051)	2020-04-30 14:28:56 +02:00
Clement	6ba254ee54	quick fix wording readme for community models (#3900)	2020-04-23 14:19:45 -04:00
Julien Chaumond	dd9d483d03	Trainer (#3800 ) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00
Patrick von Platen	a21d4fa410	add "by" to ReadMe	2020-04-18 18:07:17 +02:00
Patrick von Platen	d22894dfd4	[Docs] Add DialoGPT (#3755 ) * add dialoGPT * update README.md * fix conflict * update readme * add code links to docs * Update README.md * Update dialo_gpt2.rst * Update pretrained_models.rst * Update docs/source/model_doc/dialo_gpt2.rst Co-Authored-By: Julien Chaumond <chaumond@gmail.com> * change filename of dialogpt Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-16 09:04:32 +02:00
Julien Chaumond	cbad305ce6	[docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738)	2020-04-10 12:34:04 -04:00
Julien Chaumond	83703cd077	Update doc for {Summarization,Translation}Pipeline and other tweaks	2020-04-08 09:45:00 -04:00
Lysandre Debut	d5d7d88612	ELECTRA (#3257 ) * Electra wip * helpers * Electra wip * Electra v1 * ELECTRA may be saved/loaded * Generator & Discriminator * Embedding size instead of halving the hidden size * ELECTRA Tokenizer * Revert BERT helpers * ELECTRA Conversion script * Archive maps * PyTorch tests * Start fixing tests * Tests pass * Same configuration for both models * Compatible with base + large * Simplification + weight tying * Archives * Auto + Renaming to standard names * ELECTRA is uncased * Tests * Slight API changes * Update tests * wip * ElectraForTokenClassification * temp * Simpler arch + tests Removed ElectraForPreTraining which will be in a script * Conversion script * Auto model * Update links to S3 * Split ElectraForPreTraining and ElectraForTokenClassification * Actually test PreTraining model * Remove num_labels from configuration * wip * wip * From discriminator and generator to electra * Slight API changes * Better naming * TensorFlow ELECTRA tests * Accurate conversion script * Added to conversion script * Fast ELECTRA tokenizer * Style * Add ELECTRA to README * Modeling Pytorch Doc + Real style * TF Docs * Docs * Correct links * Correct model intialized * random fixes * style * Addressing Patrick's and Sam's comments * Correct links in docs	2020-04-03 14:10:54 -04:00
Thomas Wolf	2187c49f5c	CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186 ) * memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaning up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to python 3.6+ * fix quality	2020-03-17 10:17:11 -04:00
Sam Shleifer	087465b943	add BART to README (#3255)	2020-03-12 19:38:05 -04:00
Julien Chaumond	d6de6423ba	[doc] --organization tweak Co-Authored-By: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-03-10 16:52:44 -04:00
Julien Chaumond	0e56dc3078	[doc] Document the new --organization flag of CLI	2020-03-10 16:42:01 -04:00
Santiago Castro	976e9afece	Add syntax highlighting to the BibTeX in README	2020-02-20 10:06:15 -05:00
Lysandre	59c23ad9c9	README link + better instructions for release	2020-02-19 11:57:17 -05:00
VictorSanh	ee5a6856ca	distilbert-base-cased weights + Readmes + omissions	2020-02-07 15:28:13 -05:00
Clement	c069932f5d	Add contributors snapshot powered by https://github.com/sourcerer-io/hall-of-fame	2020-02-06 15:25:47 -05:00
Julien Chaumond	eae8ee0389	[doc] model sharing: mention README.md + tweaks cc @lysandrejik @thomwolf	2020-02-05 14:20:03 -05:00
Arnaud	3a21d6da6b	Typo on markdown link in README.md	2020-01-31 10:58:49 -05:00
Lysandre	0aa40e9569	v2.4.0 documentation	2020-01-31 09:55:34 -05:00
Julien Chaumond	9fa836a73f	fill_mask helper (#2576 ) * fill_mask helper * [poc] FillMaskPipeline * Revert "[poc] FillMaskPipeline" This reverts commit `67eeea55b0`. * Revert "fill_mask helper" This reverts commit `cacc17b884`. * README: clarify that Pipelines can also do text-classification cf. question at the AI&ML meetup last week, @mfuntowicz * Fix test: test feature-extraction pipeline * Test tweaks * Slight refactor of existing pipeline (in preparation of new FillMaskPipeline) * Extraneous doc * More robust way of doing this @mfuntowicz as we don't rely on the model name anymore (see AutoConfig) * Also add RobertaConfig as a quickfix for wrong token_type_ids * cs * [BIG] FillMaskPipeline	2020-01-30 18:15:42 -05:00
Hang Le	f0a4fc6cd6	Add Flaubert	2020-01-30 10:04:18 -05:00
Julien Chaumond	119dc50e2a	Doc tweak on model sharing	2020-01-22 22:40:38 -05:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Julien Chaumond	78528742f1	Fix syntax + link to community page	2020-01-05 12:43:39 -05:00
Clement	12e0aa4368	Proposition to include community models in readme	2020-01-05 12:37:11 -05:00
Julien Chaumond	9b2badf3c9	[cli] Update doc	2019-12-27 22:54:29 -05:00
Aymeric Augustin	3233b58ad4	Quote square brackets in shell commands. This ensures compatibility with zsh. Fix #2316.	2019-12-27 08:50:25 +01:00
Aymeric Augustin	a8d34e534e	Remove [--editable] in install instructions. Use -e only in docs targeted at contributors. If a user copy-pastes command line with [--editable], they will hit an error. If they don't know the --editable option, we're giving them a choice to make before they can move forwards, but this isn't a choice they need to make right now.	2019-12-24 08:46:08 +01:00
Aymeric Augustin	70373a5f7c	Update contribution instructions. Also provide shortcuts in a Makefile.	2019-12-23 21:05:30 +01:00
Aymeric Augustin	45841eaf7b	Remove references to Python 2 in documentation.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	b6ea0f43ae	Remove duplicate -v flag.	2019-12-22 17:47:27 +01:00
Aymeric Augustin	ced0a94204	Switch test files to the standard test_*.py scheme.	2019-12-22 14:15:13 +01:00
Aymeric Augustin	067395d5c5	Move tests outside of library.	2019-12-22 13:47:17 +01:00
Aymeric Augustin	698f9e3d7a	Remove trailing whitespace in README.	2019-12-22 13:29:58 +01:00
thomwolf	1ab25c49d3	Merge branch 'master' into pr/2115	2019-12-21 14:54:30 +01:00
Thomas Wolf	6e7102cfb3	Merge pull request #2203 from gthb/patch-1 fix: wrong architecture count in README	2019-12-21 14:31:44 +01:00
Lysandre	a436574bfd	Release: v2.3.0	2019-12-20 16:22:20 -05:00
thomwolf	71883b6ddc	update link in readme	2019-12-20 19:40:23 +01:00
Morgan Funtowicz	b98ff88544	Added pipelines quick tour in README	2019-12-20 15:52:50 +01:00
Stefan Schweter	3e89fca543	readme: add XLM-RoBERTa to model architecture list	2019-12-18 19:44:23 +01:00
Gunnlaugur Thor Briem	d303f84e7b	fix: wrong architecture count in README Just say “the following” so that this intro doesn't so easily fall out of date :) )	2019-12-17 16:18:00 +00:00
Julien Chaumond	3f5ccb183e	[doc] Clarify uploads cf `855ff0e91d (commitcomment-36452545)`	2019-12-16 18:20:29 -05:00
Julien Chaumond	855ff0e91d	[doc] Model upload and sharing ping @lysandrejik @thomwolf Is this clear enough? Anything we should add?	2019-12-16 12:42:22 -05:00
Thomas Wolf	e92bcb7eb6	Merge pull request #1739 from huggingface/t5 [WIP] Adding Google T5 model	2019-12-14 09:40:43 +01:00
Lysandre	7bd11dda6f	Release: v2.2.2	2019-12-13 16:45:30 -05:00
thomwolf	0558c9cb9b	Merge branch 'master' into t5	2019-12-10 12:58:48 +01:00
Suvrat Bhooshan	df3961121f	Add MMBT Model to Transformers Repo	2019-12-09 18:36:48 -08:00
Pierric Cistac	5c877fe94a	fix albert links	2019-12-09 18:53:00 -05:00
Aymeric Augustin	35401fe50f	Remove dependency on pytest for running tests (#2055 ) * Switch to plain unittest for skipping slow tests. Add a RUN_SLOW environment variable for running them. * Switch to plain unittest for PyTorch dependency. * Switch to plain unittest for TensorFlow dependency. * Avoid leaking open files in the test suite. This prevents spurious warnings when running tests. * Fix unicode warning on Python 2 when running tests. The warning was: UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal * Support running PyTorch tests on a GPU. Reverts `27e015bd`. * Tests no longer require pytest. * Make tests pass on cuda	2019-12-06 13:57:38 -05:00
VictorSanh	552c44a9b1	release distilm-bert	2019-12-05 10:14:58 -05:00
LysandreJik	8101924a68	Patch: v2.2.1	2019-12-03 11:20:26 -05:00
Julien Chaumond	b5d884d25c	Uniformize #1952	2019-11-27 11:05:55 -05:00
Lysandre	cf26a0c85e	Fix pretrained models table	2019-11-26 15:40:03 -05:00
Lysandre Debut	b632145273	Update master documentation link in README	2019-11-26 14:27:15 -05:00
Lysandre	ae98d45991	Release: v2.2.0	2019-11-26 14:12:44 -05:00
Julien Chaumond	176cd1ce1b	[doc] homogenize instructions slightly	2019-11-23 11:18:54 -05:00
Rémi Louf	6f70bb8c69	add instructions to run the examples	2019-11-21 14:41:19 -05:00
Julien Chaumond	3916b334a8	[camembert] Acknowledge the full author list	2019-11-18 09:29:11 -05:00
Sebastian Stabinger	44455eb5b6	Adds CamemBERT to Model architectures list	2019-11-18 09:23:14 -05:00
Thomas Wolf	df99f8c5a1	Merge pull request #1832 from huggingface/memory-leak-schedulers replace LambdaLR scheduler wrappers by function	2019-11-14 22:10:31 +01:00
Rémi Louf	2276bf69b7	update the examples, docs and template	2019-11-14 20:38:02 +01:00
thomwolf	8aba81a0b6	fix #1789	2019-11-12 08:52:43 +01:00
thomwolf	f03c0c1423	adding models in readme and auto classes	2019-11-08 11:49:46 +01:00
Lysandre	68f7064a3e	Add `model.train()` line to ReadMe training example Co-Authored-By: Santosh-Gupta <San.Gupta.ML@gmail.com>	2019-11-04 11:52:35 -05:00
Thomas Wolf	7f84fc571a	Merge pull request #1670 from huggingface/templates Templates and explanation for adding a new model and example script	2019-10-30 17:05:58 +01:00
Thomas Wolf	5c6a19a94a	Merge pull request #1604 from huggingface/deploy_doc Versioning in documentation	2019-10-30 17:03:14 +01:00
thomwolf	328a86d2af	adding links to the templates in readme and contributing	2019-10-30 11:37:55 +01:00
Lysandre	b82bfbd0c3	Updated README to show all available documentation	2019-10-24 15:55:31 +00:00
Julien Chaumond	ef1b8b2ae5	[CTRL] warn if generation prompt does not start with a control code see also https://github.com/salesforce/ctrl/pull/50	2019-10-22 21:30:32 +00:00
Julián Peller (dataista)	e16d46843a	Fix architectures count	2019-10-22 15:13:47 -04:00
thomwolf	4d456542e9	Fix citation	2019-10-21 16:34:14 +02:00
Lysandre Debut	c544194611	Remove `special_tokens_mask` from inputs in README Co-authored-by: Thomas Wolf @thomwolf	2019-10-16 11:05:13 -04:00
Emrah Budur	5a8c6e771a	Fixed the sample code in the title 'Quick tour'.	2019-10-12 14:17:17 +03:00
thomwolf	4b8f3e8f32	adding citation	2019-10-11 16:18:16 +02:00
thomwolf	d9e60f4f0d	Merge branch 'master' into pr/1383	2019-10-09 17:25:08 +02:00
Julien Chaumond	d688af19e5	Update link to swift-coreml-transformers cc @lysandrejik	2019-10-08 16:37:52 -04:00
seanBE	6dc6c716c5	fix pytorch-transformers migration description in README	2019-10-07 09:59:54 +01:00
Christopher Goh	904158ac4d	Rephrase forward method to reduce ambiguity	2019-10-06 23:40:52 -04:00
Christopher Goh	0f65d8cbbe	Fix some typos in README	2019-10-06 23:40:52 -04:00
keskarnitish	dbed1c5d94	Adding CTRL (squashed commit) adding conversion script adding first draft of modeling & tokenization adding placeholder for test files bunch of changes registering the tokenizer/model/etc tests change link; something is very VERY wrong here weird end-of-word thingy going on i think the tokenization works now ; wrote the unit tests overall structure works;load w next the monster is alive! works after some cleanup as well adding emacs autosave to gitignore currently only supporting the 48 layer one; seems to infer fine on my macbook cleanup fixing some documentation fixing some documentation tests passing? now works on CUDA also adding greedy? adding greedy sampling works well	2019-10-03 22:29:03 -07:00
VictorSanh	35071007cb	incoming release 🔥 update links to arxiv preprint	2019-10-03 10:27:11 -04:00
DenysNahurnyi	6971556ab8	Fix syntax typo in README.md	2019-10-01 14:59:31 -04:00
Santosh Gupta	5c3b32d44d	Update README.md Lines 183 - 200, fixed indentation. Line 198, replaced `tokenizer_class` with `BertTokenizer`, since `tokenizer_class` is not defined in the loop it belongs to.	2019-09-30 18:48:01 +00:00
wangfei	60f791631b	Fix link in readme	2019-09-28 16:20:17 +08:00
BramVanroy	15749bfc10	Add small note about the output of hidden states	2019-09-27 10:01:36 +02:00
thomwolf	6c3b131516	typo in readme/doc	2019-09-26 16:23:28 +02:00
thomwolf	4e63c90720	update installation instructions in readme	2019-09-26 16:14:21 +02:00
Lysandre Debut	0f92f76ca3	CircleCI reference in README	2019-09-26 08:59:52 -04:00
thomwolf	9676d1a2a8	update readme and setup.py	2019-09-26 13:47:58 +02:00
thomwolf	4dde31cb76	update readme	2019-09-26 12:18:26 +02:00
thomwolf	4ddc31ff40	update readme with migration change	2019-09-26 12:00:38 +02:00
thomwolf	f47f7f4611	add logo	2019-09-26 11:28:44 +02:00
thomwolf	9fabc0b6a9	wip readme	2019-09-26 11:21:34 +02:00
thomwolf	31c23bd5ee	[BIG] pytorch-transformers => transformers	2019-09-26 10:15:53 +02:00
Julien Chaumond	62760baf46	tiny fixes	2019-09-17 18:29:15 -04:00
Julien Chaumond	f9453d15e5	Fix broken link	2019-09-05 12:35:22 -04:00
Julien Chaumond	f7ee2e5d20	[README] link to Write With Transformer	2019-09-05 12:33:46 -04:00
Thomas Wolf	50e615f43d	Merge branch 'master' into improved_testing	2019-08-30 13:40:35 +02:00
thomwolf	306af132d7	update readme to mention add_special_tokens more clearly in example	2019-08-30 11:30:51 +02:00
LysandreJik	75bc2a03cc	Updated article link	2019-08-28 10:05:15 -04:00
thomwolf	912a377e90	dilbert -> distilbert	2019-08-28 13:59:42 +02:00
thomwolf	4ce5f36f78	update readmes	2019-08-28 12:14:31 +02:00
VictorSanh	497f73c964	add DilBERT to master REAME	2019-08-28 07:16:30 +00:00
thomwolf	e00b4ff1de	fix #1017	2019-08-21 22:22:17 +02:00
Nikolay Korolev	ad6e62cd82	Fix typo. configuratoin -> configuration	2019-08-20 15:43:06 +03:00
Christophe Bourguignat	189ff9b664	Update README after RoBERTa addition	2019-08-17 13:18:37 -04:00
LysandreJik	9d0029e215	Added RoBERTa example to README	2019-08-15 17:17:35 -04:00
Lysandre Debut	88efc65bac	Merge pull request #964 from huggingface/RoBERTa RoBERTa: model conversion, inference, tests 🔥	2019-08-15 11:11:10 -04:00
Julien Chaumond	c4ef103447	[RoBERTa] First 4 authors cf. https://github.com/huggingface/pytorch-transformers/pull/964#discussion_r313574354 Co-Authored-By: Myle Ott <myleott@fb.com>	2019-08-14 12:31:09 -04:00
carefree0910	a7b4cfe919	Update README.md I assume that it should test the `re-load` functionality after testing the `save` functionality, however I'm also surprised that nobody points this out after such a long time, so maybe I've misunderstood the purpose. This PR is just in case :)	2019-08-12 09:53:05 -04:00
LysandreJik	d2cc6b101e	Merge branch 'master' into RoBERTa	2019-08-08 09:42:05 -04:00
Christopher Goh	a6f412da01	Fixed typo in migration guide	2019-08-07 02:19:14 +08:00
Thomas Wolf	d43dc48b34	Merge branch 'master' into auto_models	2019-08-05 19:17:35 +02:00
thomwolf	7223886dc9	fix #944	2019-08-05 17:16:56 +02:00
thomwolf	58830807d1	inidicate we only support pytorch 1.0.0+ now	2019-08-05 14:38:59 +02:00
thomwolf	328afb7097	cleaning up tokenizer tests structure (at last) - last remaining ppb refs	2019-08-05 14:08:56 +02:00
Julien Chaumond	05c083520a	[RoBERTa] model conversion, inference, tests 🔥	2019-08-04 21:39:21 -04:00
thomwolf	009273dbdd	big doc update [WIP]	2019-08-04 12:14:57 +02:00
Julien Chaumond	44dd941efb	link to `swift-coreml-transformers`	2019-08-01 09:50:30 -04:00
Anthony MOI	f2a3eb987e	Fix small typos	2019-07-31 11:05:06 -04:00
Pierric Cistac	97091acb8c	Small spelling fix	2019-07-31 10:37:56 -04:00
Grégory Châtel	769bb643ce	Fixing a broken link.	2019-07-31 10:22:41 -04:00
Thomas Wolf	fec76a481d	Update readme	2019-07-23 16:05:29 +02:00
thomwolf	ba52fe69d5	update breaking change section regarding from_pretrained keyword arguments	2019-07-23 15:10:02 +02:00
rish-16	2f869dc665	Fixed typo	2019-07-21 11:05:36 -04:00
Thomas Wolf	dbecfcf321	Merge pull request #815 from praateekmahajan/update-readme-link Update Readme link for Fine Tune/Usage section	2019-07-18 18:30:32 +02:00
Peiqin Lin	acc48a0cc9	typos	2019-07-18 09:54:04 -04:00
Praateek Mahajan	0d46b17553	Update Readme Incorrect link for `Quick tour: Fine-tuning/usage scripts`	2019-07-17 22:50:10 -07:00
thomwolf	c5b3d86a91	Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT	2019-07-16 21:21:05 +02:00
thomwolf	6b70760204	typos	2019-07-16 21:21:03 +02:00
Thomas Wolf	b33a385091	update readme	2019-07-16 16:18:37 +02:00
thomwolf	6a72d9aa52	updated examples in readme	2019-07-16 16:09:29 +02:00
thomwolf	b59043bf8f	update readme	2019-07-16 16:03:48 +02:00
thomwolf	edc79acb3b	simpler quick tour	2019-07-16 16:02:32 +02:00
thomwolf	5c82d3488f	indicate default evaluation in breaking changes	2019-07-16 15:45:58 +02:00
thomwolf	4acaa65068	model in evaluation mode by default after from_pretrained	2019-07-16 15:41:57 +02:00
thomwolf	1849aa7d39	update readme and pretrained model weight files	2019-07-16 15:11:29 +02:00
thomwolf	43e0e8fa04	updates to readme and doc	2019-07-16 13:56:47 +02:00
thomwolf	352e3ff998	added migration guide to readme	2019-07-16 09:03:49 +02:00
thomwolf	8ad7e5b4f2	indeed	2019-07-16 00:29:15 +02:00
thomwolf	064d0a0b76	update readme	2019-07-16 00:21:33 +02:00
thomwolf	3b8b0e01bb	update readme	2019-07-16 00:12:55 +02:00
thomwolf	2397f958f9	updating examples and doc	2019-07-14 23:20:10 +02:00
thomwolf	6135de2fa3	readme update	2019-07-11 15:39:49 +02:00
thomwolf	e468192e2f	Merge branch 'pytorch-transformers' into xlnet	2019-07-09 17:05:37 +02:00
LysandreJik	ab30651802	Hugging Face theme.	2019-07-08 16:05:26 -04:00
thomwolf	eb91f6437e	update readme and setup	2019-07-05 12:30:15 +02:00
thomwolf	0231ba291e	circle-ci	2019-07-05 11:59:04 +02:00
thomwolf	0bab55d5d5	[BIG] name change	2019-07-05 11:55:36 +02:00
thomwolf	93e9971c54	fix tests	2019-06-26 10:02:45 +02:00
thomwolf	e55d4c4ede	various updates to conversion, models and examples	2019-06-26 00:57:53 +02:00
thomwolf	603c513b35	update main conversion script and readme	2019-06-25 10:45:07 +02:00
thomwolf	62d78aa37e	updating GLUE utils for compatibility with XLNet	2019-06-24 14:36:11 +02:00
thomwolf	c304593d8f	BERTology details in readme	2019-06-20 10:05:06 +02:00
thomwolf	34d706a0e1	pruning in bertology	2019-06-19 15:25:49 +02:00
thomwolf	dc8e0019b7	updating examples	2019-06-19 13:23:20 +02:00
thomwolf	68ab9599ce	small fix and updates to readme	2019-06-19 09:38:38 +02:00
thomwolf	4d8c4337ae	test barrier in distrib training	2019-06-18 22:41:28 +02:00
thomwolf	15ebd67d4e	cache in run_classifier + various fixes to the examples	2019-06-18 15:58:22 +02:00
thomwolf	d82e5deeb1	set find_unused_parameters=True in DDP	2019-06-18 12:13:14 +02:00
thomwolf	f964753090	explanation on the current location of the caching folder	2019-06-18 11:36:28 +02:00
thomwolf	382e2d1e50	spliting config and weight files for bert also	2019-06-18 10:37:16 +02:00
thomwolf	4447f270b2	updating hub	2019-06-17 16:21:28 +02:00
thomwolf	33d3db5c43	updating head masking, readme and docstrings	2019-06-17 15:51:28 +02:00
thomwolf	34858ae1d9	adding bert whole words, bertgerman and gpt-2 medium models, head masking	2019-06-17 11:02:39 +02:00
timoeller	16af9ff7b0	Add German Bert model to code, update readme	2019-06-14 17:42:46 +02:00
Colanim	1eba8b9d96	Fix link in README	2019-05-30 14:01:46 +09:00
lukovnikov	331a46ff04	- replaced OpenAIGPTAdam with OpenAIAdam in docs	2019-04-25 16:04:37 +02:00
lukovnikov	704037ad51	- updated docs for new LR API - added some images for illustration - updated comments in optimization	2019-04-25 15:59:39 +02:00
thomwolf	18a8a15f78	improving GPT2 tokenization and adding tests	2019-04-16 17:00:55 +02:00
thomwolf	1135f2384a	clean up logger in examples for distributed case	2019-04-15 15:22:40 +02:00
thomwolf	cc43307023	update readme	2019-04-15 15:06:10 +02:00
thomwolf	60ea6c59d2	added best practices for serialization in README and examples	2019-04-15 15:00:33 +02:00
thomwolf	20577d8a7c	add configuration serialization to readme	2019-04-15 14:21:41 +02:00
thomwolf	b17963d82f	update readme	2019-04-15 13:44:30 +02:00
Weixin Wang	f26ce6992e	Fix links in README	2019-04-02 17:20:32 +08:00
Sepehr Sameni	b588ff362a	fix lm_finetuning's link	2019-03-29 12:39:24 +04:30
Thomas Wolf	694e2117f3	Merge pull request #388 from ananyahjha93/master Added remaining GLUE tasks to 'run_classifier.py'	2019-03-28 09:06:53 +01:00
Thomas Wolf	bbff03fbfc	Merge pull request #394 from desireevl/master Minor change in README	2019-03-27 12:03:00 +01:00
thomwolf	34561e61a5	update main readme also	2019-03-27 12:00:04 +01:00
Ananya Harsh Jha	f471979167	added GLUE dev set results and details on how to run GLUE tasks	2019-03-21 15:38:30 -04:00
Desiree Vogt-Lee	d52f914e24	weigths to weights	2019-03-21 15:02:59 +10:00
Junjie Qian	d648a02203	Correct line number in README for classes	2019-03-08 16:28:03 -08:00
thomwolf	7cc35c3104	fix openai gpt example and updating readme	2019-03-06 11:43:21 +01:00
thomwolf	906b638efa	updating readme	2019-03-06 10:24:19 +01:00
John Hewitt	e14c6b52e3	add BertTokenizer flag to skip basic tokenization	2019-02-26 20:11:24 -08:00
Joel Grus	8722e9eb3b	finish updating docstrings	2019-02-23 06:31:59 -08:00
Stanislas Polu	ff22b3acc0	Few small nits in GPT-2's code examples	2019-02-21 09:15:27 +00:00
Tong Guo	09efcece75	Update README.md	2019-02-21 11:25:33 +08:00
Tony Lin	5b0e0b61f0	fix typo in readme	2019-02-19 20:34:18 +08:00
Davide Fiocco	0ae8eece55	MInor README typos corrected	2019-02-18 21:28:28 +01:00
sam-qordoba	1cb9c76ec5	Fix typo in `GPT2Model` code sample Typo prevented code from running	2019-02-18 09:27:26 -08:00
Thomas Wolf	a25d056b7a	update readme	2019-02-18 15:30:11 +01:00
Thomas Wolf	517d7c8624	update readme	2019-02-18 14:39:55 +01:00
Thomas Wolf	ada22a1c9e	more details in GPT-2 usage example	2019-02-18 14:37:41 +01:00
Thomas Wolf	522733f6cb	readme typo fixes	2019-02-18 14:32:10 +01:00
thomwolf	d44db1145c	update readme	2019-02-18 11:12:09 +01:00
Thomas Wolf	0e774e57a6	Update readme Adding details on how to extract a full list of hidden states for the Transformer-XL	2019-02-14 08:39:58 +01:00
Thomas Wolf	4e56da38d9	Merge pull request #268 from wangxiaodiu/master fixed a minor bug in README.md	2019-02-13 10:19:25 +01:00
thomwolf	67376c02e2	update readme for tokenizers	2019-02-13 10:11:11 +01:00
Liang Niu	e1b3cfb504	fixed a minor bug in README.md	2019-02-12 15:54:23 +04:00
Thomas Wolf	3c33499f87	fix typo in readme	2019-02-12 10:22:54 +01:00
thomwolf	1e71f11dec	Release: 0.5.0	2019-02-11 14:16:27 +01:00
thomwolf	eebc8abbe2	clarify and unify model saving logic in examples	2019-02-11 14:04:19 +01:00
thomwolf	81c7e3ec9f	fix typo in readme	2019-02-11 13:37:12 +01:00
thomwolf	884ca81d87	transposing the inputs of Transformer-XL to have a unified interface	2019-02-11 13:19:59 +01:00
thomwolf	32fea876bb	add distant debugging to run_transfo_xl	2019-02-11 12:53:32 +01:00
thomwolf	b31ba23913	cuda on in the examples by default	2019-02-11 12:15:43 +01:00
thomwolf	2071a9b86e	fix python 2.7 imports	2019-02-11 10:35:36 +01:00
thomwolf	b514a60c36	added tests for OpenAI GPT and Transformer-XL tokenizers	2019-02-11 10:17:16 +01:00
thomwolf	9f9909ea2f	update readme	2019-02-09 16:59:21 +01:00
thomwolf	0c1a6f9b1d	update readme	2019-02-08 22:32:25 +01:00
thomwolf	009b581316	updated readme	2019-02-07 23:15:05 +01:00
thomwolf	f99f2fb661	docstrings	2019-02-07 17:07:22 +01:00
Thomas Wolf	848aae49e1	Merge branch 'master' into python_2	2019-02-06 00:13:20 +01:00
thomwolf	ba37ddc5ce	fix run_lm_modeling example command line	2019-02-06 00:07:08 +01:00
Girishkumar	0dd2b750ca	Minor update in README Update links to classes in `modeling.py`	2019-01-30 23:49:15 +05:30
thomwolf	3a848111e6	update config, docstrings and readme to switch to seperated tokens and position embeddings	2019-01-29 11:00:11 +01:00
Davide Fiocco	35115eaf93	(very) minor update to README	2019-01-16 21:05:24 +01:00
nhatchan	8edc898f63	Fix documentation (missing backslashes) This PR adds missing backslashes in LM Fine-tuning subsection in README.md.	2019-01-13 21:23:19 +09:00
thomwolf	e5c78c6684	update readme and few typos	2019-01-10 01:40:00 +01:00
thomwolf	fa5222c296	update readme	2019-01-10 01:25:28 +01:00
Thomas Wolf	c18bdb4433	Merge pull request #124 from deepset-ai/master Add example for fine tuning BERT language model	2019-01-07 12:03:51 +01:00
Julien Chaumond	8da280ebbe	Setup CI	2018-12-20 16:33:39 -05:00
tholor	e5fc98c542	add exemplary training data. update to nvidia apex. refactor 'item -> line in doc' mapping. add warning for unknown word.	2018-12-20 18:30:52 +01:00
tholor	67f4dd56a3	update readme for run_lm_finetuning	2018-12-19 09:22:37 +01:00
Julien Chaumond	d57763f582	Fix typos	2018-12-18 19:23:22 -05:00
Thomas Wolf	786cc41299	Typos in readme	2018-12-17 09:22:18 +01:00
Daniel Khashabi	8b1b93947f	Minor fix.	2018-12-14 14:10:36 -05:00
Thomas Wolf	8809eb6c93	update readme with information on NVIDIA's apex	2018-12-14 16:59:39 +01:00
thomwolf	d821358884	update readme	2018-12-14 15:15:17 +01:00
thomwolf	087798b7fa	fix reloading model for evaluation in examples	2018-12-13 14:48:12 +01:00
thomwolf	0f544625f4	fix swag example for work with apex	2018-12-13 13:35:59 +01:00
thomwolf	4946c2c500	run_swag example in readme	2018-12-13 13:02:07 +01:00
Thomas Wolf	91aab2a6d3	Merge pull request #116 from FDecaYed/deyuf/fp16_with_apex Change to use apex for better fp16 and multi-gpu support	2018-12-13 12:32:37 +01:00
Thomas Wolf	ffe9075f48	Merge pull request #96 from rodgzilla/multiple-choice-code BertForMultipleChoice and Swag dataset example.	2018-12-13 12:05:11 +01:00
Grégory Châtel	dcb50eaa4b	Swag example readme section update with gradient accumulation run.	2018-12-12 18:17:46 +01:00
Deyu Fu	c8ea286048	change to apex for better fp16 and multi-gpu support	2018-12-11 17:13:58 -08:00
Thomas Wolf	a3a3180c86	Bump up requirements to Python 3.6	2018-12-11 11:29:45 +01:00
Grégory Châtel	0876b77f7f	Change to the README file to add SWAG results.	2018-12-10 15:34:19 +01:00
Davide Fiocco	c9f67e037c	Adding --do_lower_case for all uncased BERTs I had missed those, it should make sense to use them	2018-12-07 20:40:56 +01:00
Grégory Châtel	150f3cd9fa	Few typos in README.md	2018-12-06 19:22:07 +01:00
Grégory Châtel	4fa7892d64	Wrong line number link to modeling file.	2018-12-06 19:18:29 +01:00
Grégory Châtel	6a26e19ea3	Updating README.md with SWAG example informations.	2018-12-06 19:15:08 +01:00
Grégory Châtel	0a7c8bdcac	Fixing badly formatted links.	2018-12-04 13:43:56 +01:00
Grégory Châtel	3113e967db	Adding links to examples files.	2018-12-04 13:40:38 +01:00
Davide Fiocco	8a8aa59d8c	Update finetuning example adding --do_lower_case Should be consistent with the fact that an uncased model is used	2018-12-01 01:00:05 +01:00
thomwolf	f9f3bdd60b	update readme	2018-11-30 23:05:18 +01:00
thomwolf	52ff0590ff	tup => tpu	2018-11-30 23:01:10 +01:00
thomwolf	296f006132	added BertForTokenClassification model	2018-11-30 13:56:53 +01:00
thomwolf	298107fed7	Added new bert models	2018-11-30 13:56:02 +01:00
Davide Fiocco	ec2c339b53	Updated quick-start example with `BertForMaskedLM` As `convert_ids_to_tokens` returns a list, the code in the README currently throws an `AssertionError`, so I propose I quick fix.	2018-11-28 14:53:46 +01:00
thomwolf	05053d163c	update cache_dir in readme and examples	2018-11-26 10:45:13 +01:00
thomwolf	029bdc0d50	fixing readme examples	2018-11-26 09:56:41 +01:00
Thomas Wolf	60e01ac427	fix link in readme	2018-11-21 12:08:30 +01:00
Thomas Wolf	fd32ebed81	Merge pull request #42 from weiyumou/master Fixed UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2	2018-11-20 10:09:50 +01:00
thomwolf	eed255a58d	fixing CLI typo in readme	2018-11-20 10:02:57 +01:00
weiyumou	9ff2b7d86d	Fixed README typo	2018-11-19 23:13:10 -05:00
Thomas Wolf	da73925f6a	fix typos	2018-11-19 20:58:48 +01:00
Joel Grus	dd56cfd89a	update pip package name	2018-11-19 09:50:34 -08:00
Thomas Wolf	956c917344	fix typos in readme	2018-11-17 23:25:23 +01:00
Thomas Wolf	7c91e51c26	update links in readme	2018-11-17 22:54:15 +01:00
Thomas Wolf	e113101702	fix typos in readme	2018-11-17 12:36:35 +01:00
thomwolf	47a7d4ec14	update examples from master	2018-11-17 12:21:35 +01:00
thomwolf	c8cba67742	clean up readme and examples	2018-11-17 12:19:16 +01:00
thomwolf	757750d6f6	fix tests	2018-11-17 11:58:14 +01:00
thomwolf	d0673c7dbd	fix links	2018-11-17 08:59:29 +01:00
thomwolf	68b937aa40	sub section overviews	2018-11-17 08:55:56 +01:00
thomwolf	c54d8b1847	fixing links in readme	2018-11-17 08:46:17 +01:00
thomwolf	f920eff8c3	update readme	2018-11-17 08:42:45 +01:00
thomwolf	886cb49792	updating readme and notebooks	2018-11-16 14:31:15 +01:00
thomwolf	1de35b624b	preparing for first release	2018-11-15 20:56:10 +01:00
Thomas Wolf	278fd28a32	added results for 16-bit fine-tuning in readme	2018-11-13 09:34:49 +01:00
thomwolf	d940eeda54	typo	2018-11-12 15:26:46 +01:00
thomwolf	1cf0a16c67	cleaning up readme	2018-11-12 15:24:47 +01:00
thomwolf	66b0090877	add fp16 training	2018-11-12 15:15:02 +01:00
Thomas Wolf	5dfd19060a	fix typo in readme	2018-11-12 12:39:57 +01:00
Thomas Wolf	fa1aa81f26	fix typo in readme bach examples	2018-11-12 08:37:43 +01:00
Thomas Wolf	6d6b916f48	update to BERT-large results	2018-11-11 17:00:49 +01:00
Thomas Wolf	c4bfc646f5	Add results of fine-tuning BERT-large on GPUs	2018-11-11 16:59:35 +01:00
thomwolf	ea85cca8ab	adding optimize_on_cpu explanation in readme	2018-11-09 11:42:37 +01:00
Thomas Wolf	0c24db9d5f	update results for SQuAD	2018-11-09 09:11:59 +01:00
thomwolf	2c5d993ba4	update readme - fix SQuAD model on multi-GPU	2018-11-08 21:22:22 +01:00
Gopal Krishna	4850ec5888	fixed small typos in the README.md (#8)	2018-11-08 15:00:02 -05:00
Thomas Wolf	3bfbc21376	updating pytest command	2018-11-08 00:44:17 +01:00
Thomas Wolf	0ed7696191	Updated MRPC results	2018-11-08 00:39:42 +01:00
Thomas Wolf	d92a7f7721	Removing note on run_squad.py example	2018-11-07 23:37:55 +01:00
Thomas Wolf	1a5bbd83dc	Updating run_squad information in readme	2018-11-06 08:53:01 +01:00
Thomas Wolf	79e1b95e75	fix link in readme	2018-11-06 08:38:02 +01:00
Knut Ole Sjøli	886f595c37	Fix typo in subheader (#4)	2018-11-05 18:34:18 -05:00
Thomas Wolf	59d4cc5f2b	typos	2018-11-05 22:47:24 +01:00
Thomas Wolf	d983eecdd3	more readme typo fixes	2018-11-05 21:29:04 +01:00
Thomas Wolf	8f91b4de91	more typo fixes	2018-11-05 21:24:14 +01:00
Thomas Wolf	7316b0d6d0	fix typo	2018-11-05 21:22:45 +01:00
Clement	d130cb5139	typos	2018-11-05 15:09:24 -05:00
Clement	2a8fee495b	typos	2018-11-05 15:04:06 -05:00
Clement	f968b11657	typo	2018-11-05 14:59:44 -05:00
thomwolf	88e793f31a	fix typos	2018-11-05 16:14:19 +01:00
thomwolf	3914eed505	update readme	2018-11-05 16:09:27 +01:00
thomwolf	7394eb47a5	update readme	2018-11-05 15:35:44 +01:00
thomwolf	6cc651778a	update readme	2018-11-04 21:26:03 +01:00
thomwolf	d6418c5ef3	tweaking the readme	2018-11-03 23:52:35 +01:00
thomwolf	3b70b270e0	update readme	2018-11-03 23:39:55 +01:00
thomwolf	f8276008df	update readme, file names, removing TF code, moving tests	2018-11-03 23:35:14 +01:00
VictorSanh	5889765a7c	Update README.md	2018-11-03 09:18:44 -04:00
VictorSanh	844b2f0e6f	Small update Readme	2018-11-02 08:57:15 -04:00
VictorSanh	72d69a4ef4	Update README	2018-11-02 03:37:39 -04:00
VictorSanh	bf65d4dbb7	Begin Updating the README.md	2018-11-02 02:51:07 -04:00
thomwolf	13ee61e4de	switch to full google code	2018-10-31 18:46:03 +01:00
thomwolf	12e013dbac	added wordpiece - updated readme	2018-10-30 23:09:09 +01:00
Thomas Wolf	43badf217d	Initial commit	2018-10-29 14:56:02 +01:00

712 Commits