transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 06:10:04 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	b2747af543	Improvements to PretrainedConfig documentation (#5642 ) * Update PretrainedConfig doc * Formatting * Small fixes * Forgotten args and more cleanup	2020-07-10 10:31:47 -04:00
Sylvain Gugger	760f726e51	Add forum link in the docs (#5637 )	2020-07-09 15:13:22 -04:00
Lysandre Debut	1158e56551	Correct extension (#5631 )	2020-07-09 11:03:07 -04:00
Stas Bekman	fa5423b169	doc fixes (#5613 )	2020-07-08 19:52:44 -04:00
Joe Davison	b4b33fdf25	Guide to fixed-length model perplexity evaluation (#5449 ) * add first draft ppl guide * upload imgs * expand on strides * ref typo * rm superfluous past var * add tokenization disclaimer	2020-07-07 16:04:15 -06:00
Sam Shleifer	353b8f1e7a	Add mbart-large-cc25, support translation finetuning (#5129 ) improve unittests for finetuning, especially w.r.t testing frozen parameters fix freeze_embeds for T5 add streamlit setup.cfg	2020-07-07 13:23:01 -04:00
Suraj Patil	33e43edddc	[docs] fix model_doc links in model summary (#5566 ) * fix model_doc links * update model links	2020-07-07 11:06:12 -04:00
Quentin Lhoest	fbd8792195	Add DPR model (#5279 ) * beginning of dpr modeling * wip * implement forward * remove biencoder + better init weights * export dpr model to embed model for nlp lib * add new api * remove old code * make style * fix dumb typo * don't load bert weights * docs * docs * style * move the `k` parameter * fix init_weights * add pretrained configs * minor * update config names * style * better config * style * clean code based on PR comments * change Dpr to DPR * fix config * switch encoder config to a dict * style * inheritance -> composition * add messages in assert startements * add dpr reader tokenizer * one tokenizer per model * fix base_model_prefix * fix imports * typo * add convert script * docs * change tokenizers conf names * style * change tokenizers conf names * minor * minor * fix wrong names * minor * remove unused convert functions * rename convert script * use return_tensors in tokenizers * remove n_questions dim * move generate logic to tokenizer * style * add docs * docs * quality * docs * add tests * style * add tokenization tests * DPR full tests * Stay true to the attention mask building * update docs * missing param in bert input docs * docs * style Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-07-07 08:56:12 -04:00
Lysandre	1d2332861f	Post v3.0.2 release commit	2020-07-06 18:56:47 -04:00
Lysandre	b0892fa0e8	Release: v3.0.2	2020-07-06 18:49:44 -04:00
Arnav Sharma	b2309cc6bf	Typo fix in `training` doc (#5495 )	2020-07-06 09:15:22 -04:00
ELanning	7ecff0ccbb	Fix typo in training (#5510 )	2020-07-06 09:14:57 -04:00
Sylvain Gugger	6b735a7253	Tokenizer summary (#5467 ) * Work on tokenizer summary * Finish tutorial * Link to it * Apply suggestions from code review Co-authored-by: Anthony MOI <xn1t0x@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add vocab definition Co-authored-by: Anthony MOI <xn1t0x@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-07-02 17:07:42 -04:00
George Ho	84e56669af	Fix typo in glossary (#5466 )	2020-07-02 09:19:33 -04:00
Patrick von Platen	d16e36c7e5	[Reformer] Add Masked LM Reformer (#5426 ) * fix conflicts * fix * happy rebasing	2020-07-01 22:43:18 +02:00
Patrick von Platen	fe81f7d12c	finish reformer qa head (#5433 )	2020-07-01 12:27:14 -04:00
Sylvain Gugger	6c55e9fc32	Fix dropdown bug in searches (#5440 ) * Trigger CI * Fix dropdown bug in searches	2020-07-01 11:02:59 -04:00
Sylvain Gugger	4ade7491f4	Fix examples titles and optimization doc page (#5408 )	2020-07-01 08:11:25 -04:00
Sylvain Gugger	87716a6d07	Documentation for the Trainer API (#5383 ) * Documentation for the Trainer API * Address review comments * Address comments	2020-06-30 11:43:43 -04:00
Sylvain Gugger	0607b88945	How to share model cards with the CLI (#5374 ) * How to share model cards * Switch the two options * Fix bad copy/cut * Julien's suggestion	2020-06-30 08:59:32 -04:00
Lysandre Debut	b9ee87f5c7	Doc for v3.0.0 (#5366 ) * Doc for v3.0.0 * Update docs/source/_static/js/custom.js Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/_static/js/custom.js Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-06-29 11:08:54 -04:00
Lysandre	b62ca59527	Release: v3.0.0	2020-06-29 10:40:13 -04:00
Patrick von Platen	4bcc35cd69	[Docs] Benchmark docs (#5360 ) * first doc version * add benchmark docs * fix typos * improve README * Update docs/source/benchmarks.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * fix naming and docs Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-29 16:08:57 +02:00
Julien Chaumond	c950fef545	[docs] Small tweaks to #5323	2020-06-29 14:24:33 +02:00
Sylvain Gugger	1af58c0706	New model sharing tutorial (#5323 )	2020-06-27 11:10:02 -04:00
Thomas Wolf	601d4d699c	[tokenizers] Updates data processors, docstring, examples and model cards to the new API (#5308 ) * remove references to old API in docstring - update data processors * style * fix tests - better type checking error messages * better type checking * include awesome fix by @LysandreJik for #5310 * updated doc and examples	2020-06-26 19:48:14 +02:00
Joe Davison	2ffef0d0c7	Training & fine-tuning quickstart (#5034 ) * add initial fine-tuning guide * split code blocks to smaller segments * fix up trianer section of fine-tune doc * a few last typos * Update usage -> task summary link Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-06-25 15:11:11 -06:00
Lysandre Debut	364a5ae1f0	Refactor Code samples; Test code samples (#5036 ) * Refactor code samples * Test docstrings * Style * Tokenization examples * Run rust of tests * First step to testing source docs * Style and BART comment * Test the remainder of the code samples * Style * let to const * Formatting fixes * Ready for merge * Fix fixture + Style * Fix last tests * Update docs/source/quicktour.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Addressing @sgugger's comments + Fix MobileBERT in TF Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-06-25 16:46:00 -04:00
Sylvain Gugger	d12ceb48ba	Tokenization tutorial (#5257 ) * All done * Link to the tutorial * Typo fixes Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Add metnion of the return_xxx args Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-06-24 18:43:20 -04:00
Sylvain Gugger	6894b486d0	Fix version controller links (for realsies) (#5251 )	2020-06-24 12:13:43 -04:00
Sylvain Gugger	609e0c583f	Fix links (#5248 )	2020-06-24 11:35:55 -04:00
Sylvain Gugger	7c41057d50	Add hugs (#5225 )	2020-06-24 07:56:14 -04:00
Sylvain Gugger	173528e368	Add version control menu (#5222 ) * Add version control menu * Constify things Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-06-23 17:05:12 -04:00
Sylvain Gugger	417e492f1e	Quick tour (#5145 ) * Quicktour part 1 * Update * All done * Typos Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address comments in quick tour * Update docs/source/quicktour.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update from feedback Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-22 16:08:09 -04:00
Sylvain Gugger	1262495a91	Add TF auto model to the docs + fix sphinx warnings (#5187 )	2020-06-22 14:43:52 -04:00
Sylvain Gugger	eb0ca71ef6	Update glossary (#5148 ) * Update glossary * Update docs/source/glossary.rst Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-06-22 08:30:49 -04:00
Vasily Shamporov	9a3f91088c	Add MobileBert (#4901 ) * Add MobileBert * Quality + Conversion script * style * Update src/transformers/modeling_mobilebert.py * Links to S3 * Style * TFMobileBert Slight fixes to the pytorch MobileBert Style * MobileBertForMaskedLM (PT + TF) * MobileBertForNextSentencePrediction (PT + TF) * MobileFor{MultipleChoice, TokenClassification} (PT + TF) ss * Tests + Auto * Doc * Tests * Addressing @sgugger's comments * Adressing @patrickvonplaten's comments * Style * Style * Integration test * style * Model card Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-19 16:38:36 -04:00
Suraj Patil	18177a1a60	lm_labels => labels (#5080 )	2020-06-18 09:16:29 +02:00
Sylvain Gugger	204ebc25e6	Update installation page and add contributing to the doc (#5084 ) * Update installation page and add contributing to the doc * Remove mention of symlinks	2020-06-17 14:01:10 -04:00
Sylvain Gugger	7291ea0bff	Reorganize documentation (#5064 ) * Reorganize topics and add all models	2020-06-17 07:55:20 -04:00
Sylvain Gugger	011cc0be51	Fix all sphynx warnings (#5068 )	2020-06-16 16:50:02 -04:00
Yacine Jernite	49c5202522	Eli5 examples (#4968 ) * add eli5 examples * add dense query script * query_di * merging * merging * add_utils * adds nearest neighbor wikipedia * batch queries * training_retriever * new notebooks * moved retriever traiing script * finished wiki40b * max_len_fix * train_s2s * retriever_batch_checkpointing * cleanup * merge * dim_fix * fix_indexer * fix_wiki40b_snippets * fix_embed_for_r * fp32 index * fix_sparse_q * joint_training * remove obsolete datasets * add_passage_nn_results * add_passage_nn_results * add_batch_nn * add_batch_nn * add_data_scripts * notebook * notebook * notebook * fix_multi_gpu * add_app * full_caching * full_caching * notebook * sparse_done * images * notebook * add_image_gif * with_Gif * add_contr_image * notebook * notebook * notebook * train_functions * notebook * min_retrieval_length * pandas_option * notebook * min_retrieval_length * notebook * notebook * eval_Retriever * notebook * images * notebook * add_example * add_example * notebook * fireworks * notebook * notebook * joe's notebook comments * app_update * notebook * notebook_link * captions * notebook * assing RetriBert model * add RetriBert to Auto * change AutoLMHead to AutoSeq2Seq * notebook downloads from hf models * style_black * style_black * app_update * app_update * fix_app_update * style * style * isort * Delete WikiELI5training.ipynb * Delete evaluate_eli5.py * Delete WikiELI5explore.ipynb * Delete ExploreWikiELI5Support.html * Delete explainlikeimfive.py * Delete wiki_snippets.py * children before parent * children before parent * style_black * style_black_only * isort * isort_new * Update src/transformers/modeling_retribert.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * typo fixes * app_without_asset * cleanup * Delete ELI5animation.gif * Delete ELI5contrastive.svg * Delete ELI5wiki_index.svg * Delete choco_bis.svg * Delete fireworks.gif * Delete huggingface_logo.jpg * Delete huggingface_logo.svg * Delete Long_Form_Question_Answering_with_ELI5_and_Wikipedia.ipynb * Delete eli5_app.py * Delete eli5_utils.py * readme * Update README.md * unused imports * moved_info * default_beam * ftuned model * disclaimer * Update src/transformers/modeling_retribert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * black * add_doc * names * isort_Examples * isort_Examples * Add doc to index Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-06-16 16:36:58 -04:00
Sylvain Gugger	439aa1d6e9	Remove old section + caching in install (#5027 )	2020-06-16 13:03:41 -04:00
Sylvain Gugger	f9f8a5312e	Add DistilBertForMultipleChoice (#5032 ) * Add `DistilBertForMultipleChoice`	2020-06-15 18:31:41 -04:00
Anthony MOI	36434220fc	[HUGE] Refactoring tokenizers backend - padding - truncation - pre-tokenized pipeline - fast tokenizers - tests (#4510 ) * Use tokenizers pre-tokenized pipeline * failing pretrokenized test * Fix is_pretokenized in python * add pretokenized tests * style and quality * better tests for batched pretokenized inputs * tokenizers clean up - new padding_strategy - split the files * [HUGE] refactoring tokenizers - padding - truncation - tests * style and quality * bump up requied tokenizers version to 0.8.0-rc1 * switched padding/truncation API - simpler better backward compat * updating tests for custom tokenizers * style and quality - tests on pad * fix QA pipeline * fix backward compatibility for max_length only * style and quality * Various cleans up - add verbose * fix tests * update docstrings * Fix tests * Docs reformatted * __call__ method documented Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-06-15 17:12:51 -04:00
Sam Shleifer	a9f1fc6c94	Add bart-base (#5014 )	2020-06-15 13:29:26 -04:00
Suraj Patil	e93ccb3290	BartForQuestionAnswering (#4908 )	2020-06-12 15:47:57 -04:00
Sylvain Gugger	538531cde5	Add AlbertForMultipleChoice (#4959 ) * Add AlbertForMultipleChoice * Make up to date and add all models to common tests	2020-06-12 14:20:19 -04:00
Suraj Patil	ef2dcdccaa	ElectraForQuestionAnswering (#4913 ) * ElectraForQuestionAnswering * udate __init__ * add test for electra qa model * add ElectraForQuestionAnswering in auto models * add ElectraForQuestionAnswering in all_model_classes * fix outputs, input_ids defaults to None * add ElectraForQuestionAnswering in docs * remove commented line	2020-06-10 15:17:52 -04:00
Sylvain Gugger	41a1d27cde	Add XLMRobertaForQuestionAnswering (#4855 ) * Add XLMRobertaForQuestionAnswering * Formatting * Make test happy	2020-06-08 21:22:37 -04:00
Sylvain Gugger	37be3786cf	Clean documentation (#4849 ) * Clean documentation	2020-06-08 11:28:19 -04:00
Sylvain Gugger	56d5d160cd	Add model and doc badges (#4811 ) * Add badges for models and docs	2020-06-05 18:45:42 -04:00
Sylvain Gugger	5c0cfc2cf0	Add link to community models (#4804 )	2020-06-05 15:29:20 -04:00
Sylvain Gugger	fa661ce749	Add model summary (#4789 ) * Add model summary * Add link to pretrained models	2020-06-05 12:22:50 -04:00
Julien Chaumond	99207bd112	Pipelines: miscellanea of QoL improvements and small features... (#4632 ) * [hf_api] Attach all unknown attributes for future-proof compatibility * [Pipeline] NerPipeline is really a TokenClassificationPipeline * modelcard.py: I don't think we need to force the download * Remove config, tokenizer from SUPPORTED_TASKS as we're moving to one model = one weight + one tokenizer * FillMaskPipeline: also output token in string form * TextClassificationPipeline: option to return all scores, not just the argmax * Update docs/source/main_classes/pipelines.rst	2020-06-03 03:51:31 -04:00
Julien Chaumond	b42586ea56	Fix CI after killing archive maps (#4724 ) * 🐛 Fix model ids for BART and Flaubert	2020-06-02 10:21:09 -04:00
Lysandre	b43c78e5d3	Release: v2.11.0	2020-06-02 09:49:09 -04:00
Julien Chaumond	d4c2cb402d	Kill model archive maps (#4636 ) * Kill model archive maps * Fixup * Also kill model_archive_map for MaskedBertPreTrainedModel * Unhook config_archive_map * Tokenizers: align with model id changes * make style && make quality * Fix CI	2020-06-02 09:39:33 -04:00
Patrick von Platen	56ee2560be	[Longformer] Better handling of global attention mask vs local attention mask (#4672 ) * better api * improve automatic setting of global attention mask * fix longformer bug * fix global attention mask in test * fix global attn mask flatten * fix slow tests * update docstring * update docs and make more robust * improve attention mask	2020-05-29 17:58:42 +02:00
Patrick von Platen	9c17256447	[Longformer] Multiple choice for longformer (#4645 ) * add multiple choice for longformer * add models to docs * adapt docstring * add test to longformer * add longformer for mc in init and modeling auto * fix tests	2020-05-29 13:46:08 +02:00
Lysandre Debut	6a17688021	per_device instead of per_gpu/error thrown when argument unknown (#4618 ) * per_device instead of per_gpu/error thrown when argument unknown * [docs] Restore examples.md symlink * Correct absolute links so that symlink to the doc works correctly * Update src/transformers/hf_argparser.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Warning + reorder * Docs * Style * not for squad Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-27 11:36:55 -04:00
Patrick von Platen	c589eae2b8	[Longformer For Question Answering] Conversion script, doc, small fixes (#4593 ) * add new longformer for question answering model * add new config as well * fix links * fix links part 2	2020-05-26 14:58:47 +02:00
Patrick von Platen	3e3e552125	[Reformer] fix reformer num buckets (#4564 ) * fix reformer num buckets * fix * adapt docs * set num buckets in config	2020-05-25 16:04:45 -04:00
Alexander Measure	95a26fcf2d	link to paper was broken (#4526 ) changed from https://https://arxiv.org/abs/2001.04451.pdf to https://arxiv.org/abs/2001.04451.pdf	2020-05-22 15:17:09 -04:00
Lysandre	e0db6bbd65	Release: v2.10.0	2020-05-22 10:37:44 -04:00
Patrick von Platen	48c3a70b4e	[Longformer] Docs and clean API (#4464 ) * add longformer docs * improve docs	2020-05-19 21:52:36 +02:00
Iz Beltagy	8f1d047148	Longformer (#4352 ) * first commit * bug fixes * better examples * undo padding * remove wrong VOCAB_FILES_NAMES * License * make style * make isort happy * unit tests * integration test * make `black` happy by undoing `isort` changes!! * lint * no need for the padding value * batch_size not bsz * remove unused type casting * seqlen not seq_len * staticmethod * `bert` selfattention instead of `n2` * uint8 instead of bool + lints * pad inputs_embeds using embeddings not a constant * black * unit test with padding * fix unit tests * remove redundant unit test * upload model weights * resolve todo * simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_ * increase unittest coverage	2020-05-19 16:04:43 +02:00
Soham Chatterjee	fa6113f9a0	Fixed spelling of training (#4416 )	2020-05-18 11:23:29 -04:00
Lysandre	7cb203fae4	Release: v2.9.1	2020-05-13 17:38:50 -04:00
Sam Shleifer	9a687ebb77	[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290 )	2020-05-13 17:29:41 -04:00
Patrick von Platen	839bfaedb2	[Docs, Notebook] Include generation pipeline (#4295 ) * add first text for generation * add generation pipeline to usage * Created using Colaboratory * correct docstring * finish	2020-05-13 14:24:08 -04:00
Guo, Quan	39994051e4	Add migrating from `pytorch-transformers` (#4273 ) "Migrating from pytorch-transformers to transformers" is missing in the main document. It is available in the main `readme` thought. Just move it to the document.	2020-05-11 13:35:13 -04:00
fgaim	41e8291217	Add ALBERT to the Tensorflow to Pytorch model conversion cli (#3933 ) * Add ALBERT to convert command of transformers-cli * Document ALBERT tf to pytorch model conversion	2020-05-11 13:10:00 -04:00
Stefan Schweter	3f42eb979f	Documentation: fix links to NER examples (#4279 ) * docs: fix link to token classification (NER) example * examples: fix links to NER scripts	2020-05-11 12:48:21 -04:00
Patrick von Platen	ac7d5f67a2	[Reformer] Add Enwiki8 Reformer Model - Adapt convert script (#4282 ) * adapt convert script * update convert script * finish * fix marian pretrained docs	2020-05-11 16:38:07 +02:00
Sam Shleifer	3487be75ef	[Marian] documentation and AutoModel support (#4152 ) - MarianSentencepieceTokenizer - > MarianTokenizer - Start using unk token. - add docs page - add better generation params to MarianConfig - more conversion utilities	2020-05-10 13:54:57 -04:00
Girishkumar	9d2f467bfb	[README] Corrected some grammatical mistakes (#4199 )	2020-05-10 09:02:36 -04:00
Julien Chaumond	c99fe0386b	[doc] Fix broken links + remove crazy big notebook	2020-05-07 18:44:18 -04:00
Julien Chaumond	612fa1b10b	Examples readme.md (#4215 ) * README * Update README.md	2020-05-07 15:00:06 -04:00
Lysandre	e7cfc1a313	Release: v2.9.0	2020-05-07 14:15:20 -04:00
Julien Chaumond	0ae96ff8a7	BIG Reorganize examples (#4213 ) * Created using Colaboratory * [examples] reorganize files * remove run_tpu_glue.py as superseded by TPU support in Trainer * Bugfix: int, not tuple * move files around	2020-05-07 13:48:44 -04:00
Patrick von Platen	dca34695d0	Reformer (#3351 ) * first copy & past commit from Bert and morgans LSH code * add easy way to compare to trax original code * translate most of function * make trax lsh self attention deterministic with numpy seed + copy paste code * add same config * add same config * make layer init work * implemented hash_vectors function for lsh attention * continue reformer translation * hf LSHSelfAttentionLayer gives same output as trax layer * refactor code * refactor code * refactor code * refactor * refactor + add reformer config * delete bogus file * split reformer attention layer into two layers * save intermediate step * save intermediate step * make test work * add complete reformer block layer * finish reformer layer * implement causal and self mask * clean reformer test and refactor code * fix merge conflicts * fix merge conflicts * update init * fix device for GPU * fix chunk length init for tests * include morgans optimization * improve memory a bit * improve comment * factorize num_buckets * better testing parameters * make whole model work * make lm model work * add t5 copy paste tokenizer * add chunking feed forward * clean config * add improved assert statements * make tokenizer work * improve test * correct typo * extend config * add complexer test * add new axial position embeddings * add local block attention layer * clean tests * refactor * better testing * save intermediate progress * clean test file * make shorter input length work for model * allow variable input length * refactor * make forward pass for pretrained model work * add generation possibility * finish dropout and init * make style * refactor * add first version of RevNet Layers * make forward pass work and add convert file * make uploaded model forward pass work * make uploaded model forward pass work * refactor code * add namedtuples and cache buckets * correct head masks * refactor * made reformer more flexible * make style * remove set max length * add attention masks * fix up tests * fix lsh attention mask * make random seed optional for the moment * improve memory in reformer * add tests * make style * make sure masks work correctly * detach gradients * save intermediate * correct backprob through gather * make style * change back num hashes * rename to labels * fix rotation shape * fix detach * update * fix trainer * fix backward dropout * make reformer more flexible * fix conflict * fix * fix * add tests for fixed seed in reformer layer * fix trainer typo * fix typo in activations * add fp16 tests * add fp16 training * support fp16 * correct gradient bug in reformer * add fast gelu * re-add dropout for embedding dropout * better naming * better naming * renaming * finalize test branch * finalize tests * add more tests * finish tests * fix * fix type trainer * fix fp16 tests * fix tests * fix tests * fix tests * fix issue with dropout * fix dropout seeds * correct random seed on gpu * finalize random seed for dropout * finalize random seed for dropout * remove duplicate line * correct half precision bug * make style * refactor * refactor * docstring * remove sinusoidal position encodings for reformer * move chunking to modeling_utils * make style * clean config * make style * fix tests * fix auto tests * pretrained models * fix docstring * update conversion file * Update pretrained_models.rst * fix rst * fix rst * update copyright * fix test path * fix test path * fix small issue in test * include reformer in generation tests * add docs for axial position encoding * finish docs * Update convert_reformer_trax_checkpoint_to_pytorch.py * remove isort * include sams comments * remove wrong comment in utils * correct typos * fix typo * Update reformer.rst * applied morgans optimization * make style * make gpu compatible * remove bogus file * big test refactor * add example for chunking * fix typo * add to README	2020-05-07 10:17:01 +02:00
Stefan Schweter	e80be7f1d0	docs: add xlm-roberta section to multi-lingual section (#4101 )	2020-05-01 11:06:58 -04:00
Patrick von Platen	fa49b9afea	Clean Encoder-Decoder models with Bart/T5-like API and add generate possibility (#3383 ) * change encoder decoder style to bart & t5 style * make encoder decoder generation dummy work for bert * make style * clean init config in encoder decoder * add tests for encoder decoder models * refactor and add last tests * refactor and add last tests * fix attn masks for bert encoder decoder * make style * refactor prepare inputs for Bert * refactor * finish encoder decoder * correct typo * add docstring to config * finish * add tests * better naming * make style * fix flake8 * clean docstring * make style * rename	2020-04-28 15:11:09 +02:00
Patrick von Platen	52679fbc2e	add dialogpt training tips (#3996 )	2020-04-28 14:32:31 +02:00
Lorenzo Ampil	12bb7fe770	Fix t5 doc typos (#3978 ) * Fix tpo in into and add line under * Add missing blank line under * Correct types under	2020-04-27 18:27:15 +02:00
Lorenzo Ampil	f16540fcba	Pipeline for Text Generation: GenerationPipeline (#3758 ) * Add GenerationPipeline * Fix parameter names * Correct parameter __call__ parameters * Add model type attribute and correct function calls for prepare_input * Take out trailing commas from init attributes * Remove unnecessary tokenization line * Implement support for multiple text inputs * Apply generation support for multiple input text prompts * Take out tensor coersion * Take out batch index * Add text prompt to return sequence * Squeeze token tensore before decoding * Return only a single list of sequences if only one prompt was used * Correct results variable name * Add GenerationPipeline to SUPPORTED_TASKS with the alias , initalized w GPT2 * Registedred AutoModelWithLMHead for both pt and t * Update docstring for GenerationPipeline * Add kwargs parameter to mode.generate * Take out kwargs parameter after all * Add generation pipeline example in pipeline docstring * Fix max length by squeezing tokens tensor * Apply ensure_tensor_on_device to pytorch tensor * Include generation step in torch.no_grad * Take out input from prepare_xlm_input and set 'en' as default xlm_language * Apply framework specific encoding during prepare_input * Format w make style * Move GenerationPipeline import to follow proper import sorting * Take out training comma from generation dict * Apply requested changes * Change name to TextGenerationPipeline * Apply TextGenerationPipeline rename to __init___ * Changing alias to * Set input mapping as input to ensure_tensor_on_device * Fix assertion placement * Add test_text_generation * Add TextGenerationPipeline to PipelineCommonTests * Take out whitespace * Format __init__ w black * Fix __init__ style * Forman __init___ * Add line to end of __init__ * Correct model tokenizer set for test_text_generation * Ensure to return list of list, not list of string (to pass test) * Limit test models to only 3 to limit runtime to address circleCI timeout error * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Remove argument docstring, __init__, add additional __call__ arguments, and reformat results to list of dict * Fix blank result list * Add TextGenerationPipeline to pipelines.rst * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Fix typos from adding PADDING_TEXT_TOKEN_LENGTH * Fix incorrectly moved result list * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py Co-Authored-By: Patrick von Platen <patrick.v.platen@gmail.com> * Add back generation line and make style * Take out blank whitespace * Apply new alis, text-generation, to test_pipelines * Fix text generation alias in test * Update src/transformers/pipelines.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-22 09:37:03 -04:00
Julien Chaumond	1dc9b3c784	Fixes #3877	2020-04-22 01:15:10 +00:00
Thomas Wolf	827d6d6ef0	Cleanup fast tokenizers integration (#3706 ) * First pass on utility classes and python tokenizers * finishing cleanup pass * style and quality * Fix tests * Updating following @mfuntowicz comment * style and quality * Fix Roberta * fix batch_size/seq_length inBatchEncoding * add alignement methods + tests * Fix OpenAI and Transfo-XL tokenizers * adding trim_offsets=True default for GPT2 et RoBERTa * style and quality * fix tests * add_prefix_space in roberta * bump up tokenizers to rc7 * style * unfortunately tensorfow does like these - removing shape/seq_len for now * Update src/transformers/tokenization_utils.py Co-Authored-By: Stefan Schweter <stefan@schweter.it> * Adding doc and docstrings * making flake8 happy Co-authored-by: Stefan Schweter <stefan@schweter.it>	2020-04-18 13:43:57 +02:00
Patrick von Platen	d22894dfd4	[Docs] Add DialoGPT (#3755 ) * add dialoGPT * update README.md * fix conflict * update readme * add code links to docs * Update README.md * Update dialo_gpt2.rst * Update pretrained_models.rst * Update docs/source/model_doc/dialo_gpt2.rst Co-Authored-By: Julien Chaumond <chaumond@gmail.com> * change filename of dialogpt Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-16 09:04:32 +02:00
Julien Chaumond	cbad305ce6	[docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738 )	2020-04-10 12:34:04 -04:00
Sam Shleifer	7a7fdf71f8	Multilingual BART - (#3602 ) - support mbart-en-ro weights - add MBartTokenizer	2020-04-10 11:25:39 -04:00
Lysandre Debut	261c4ff4e2	Update notebooks (#3620 ) * Update notebooks * From local to global link * from local links to actual global links	2020-04-06 14:32:39 -04:00
LysandreJik	36bffc81b3	Release: v2.8.0	2020-04-06 10:03:53 -04:00
Julien Chaumond	94eb68d742	weigths*weights	2020-04-04 15:03:26 -04:00
Lysandre Debut	d5d7d88612	ELECTRA (#3257 ) * Electra wip * helpers * Electra wip * Electra v1 * ELECTRA may be saved/loaded * Generator & Discriminator * Embedding size instead of halving the hidden size * ELECTRA Tokenizer * Revert BERT helpers * ELECTRA Conversion script * Archive maps * PyTorch tests * Start fixing tests * Tests pass * Same configuration for both models * Compatible with base + large * Simplification + weight tying * Archives * Auto + Renaming to standard names * ELECTRA is uncased * Tests * Slight API changes * Update tests * wip * ElectraForTokenClassification * temp * Simpler arch + tests Removed ElectraForPreTraining which will be in a script * Conversion script * Auto model * Update links to S3 * Split ElectraForPreTraining and ElectraForTokenClassification * Actually test PreTraining model * Remove num_labels from configuration * wip * wip * From discriminator and generator to electra * Slight API changes * Better naming * TensorFlow ELECTRA tests * Accurate conversion script * Added to conversion script * Fast ELECTRA tokenizer * Style * Add ELECTRA to README * Modeling Pytorch Doc + Real style * TF Docs * Docs * Correct links * Correct model intialized * random fixes * style * Addressing Patrick's and Sam's comments * Correct links in docs	2020-04-03 14:10:54 -04:00
Patrick von Platen	83d1fbcff6	[Docs] Add usage examples for translation and summarization (#3538 )	2020-03-31 09:36:03 -04:00
Patrick von Platen	42e1e3c67f	Update usage doc regarding generate fn (#3504 )	2020-03-31 09:31:46 -04:00
LysandreJik	6f5a12a583	Release: v2.7.0	2020-03-30 08:49:24 -04:00
Patrick von Platen	5b44e0a31b	[T5] Add training documenation (#3507 ) * Add clear description of how to train T5 * correct docstring in T5 * correct typo * correct docstring format * update t5 model docs * implement collins feedback * fix typo and add more explanation for sentinal tokens * delete unnecessary todos	2020-03-30 13:35:53 +02:00
Patrick von Platen	fa9af2468a	Add T5 to docs (#3461 ) * add t5 docs basis * improve docs * add t5 docs * improve t5 docstring * add t5 tokenizer docstring * finish docstring * make style * add pretrained models * correct typo * make examples work * finalize docs	2020-03-27 10:57:16 -04:00
LysandreJik	471cce24b3	Release: v2.6.0	2020-03-24 10:37:32 -04:00
Sam Shleifer	38a555a83c	Add Summarization to Pipelines (#3128 ) * passing * Undo stupid chg * docs * undo rename * delete-cruft * only import if you have torch * Dont rely on dict ordering * Fix dict ordering upstream * docstring link * docstring link * remove trailing comma for 3.5 compat * new name * delegate kwarging * Update kwargs	2020-03-17 18:04:21 -04:00
Thomas Wolf	2187c49f5c	CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186 ) * memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaning up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to python 3.6+ * fix quality	2020-03-17 10:17:11 -04:00
Julien Chaumond	d6de6423ba	[doc] --organization tweak Co-Authored-By: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-03-10 16:52:44 -04:00
Julien Chaumond	0e56dc3078	[doc] Document the new --organization flag of CLI	2020-03-10 16:42:01 -04:00
Sam Shleifer	857e0a0d3b	Rename BartForMaskedLM -> BartForConditionalGeneration (#3114 ) * improved documentation	2020-03-05 17:41:18 -05:00
Lysandre	07a79db505	Fix failing doc samples	2020-03-04 19:11:31 -05:00
Lysandre Debut	d3eb7d23a4	Pipeline doc (#3055 ) * Pipeline doc initial commit * pipeline abstraction * Remove modelcard argument from pipeline * Task-specific pipelines can be instantiated with no model or tokenizer * All pipelines doc	2020-03-02 14:07:10 -05:00
Sam Shleifer	b54ef78d0c	Bart-CNN (#3059 ) `generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.	2020-03-02 10:35:53 -05:00
Sam Shleifer	9df74b8bc4	Delete all mentions of Model2Model (#3019 )	2020-02-26 11:36:27 -05:00
Lysandre Debut	bb7c468520	Documentation (#2989 ) * All Tokenizers BertTokenizer + few fixes RobertaTokenizer OpenAIGPTTokenizer + Fixes GPT2Tokenizer + fixes TransfoXLTokenizer Correct rst for TransformerXL XLMTokenizer + fixes XLNet Tokenizer + Style DistilBERT + Fix XLNet RST CTRLTokenizer CamemBERT Tokenizer FlaubertTokenizer XLMRobertaTokenizer cleanup * cleanup	2020-02-25 18:43:36 -05:00
Lysandre Debut	65e7c90a77	Adding usage examples for common tasks (#2850 ) * Usage: Sequence Classification & Question Answering * Pipeline example * Language modeling * TensorFlow code for Sequence classification * Custom TF/PT toggler in docs * QA + LM for TensorFlow * Finish Usage for both PyTorch and TensorFlow * Addressing Julien's comments * More assertive * cleanup * Favicon - added favicon option in conf.py along with the favicon image - udpated 🤗 logo. slightly smaller and should appear more consistent across editing programs (no more tongue on the outside of the mouth) Co-authored-by: joshchagani <joshua@joshuachagani.com>	2020-02-25 13:48:24 -05:00
Lysandre	f9ec5ca90b	Release: v2.5.1	2020-02-24 18:22:54 -05:00
Sam Shleifer	53ce3854a1	New BartModel (#2745 ) * Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs	2020-02-20 18:11:13 -05:00
Lysandre	fb560dcb07	Release: v2.5.0 Welcome Rust Tokenizers	2020-02-19 11:46:19 -05:00
Lysandre	fd639e5be3	Correct quickstart example when using the past	2020-02-10 11:25:56 -05:00
Lysandre	dd28830327	Update RoBERTa tips	2020-02-07 16:42:35 -05:00
Lysandre	db97930122	Update XLM-R tips	2020-02-07 16:42:35 -05:00
VictorSanh	ee5a6856ca	distilbert-base-cased weights + Readmes + omissions	2020-02-07 15:28:13 -05:00
Julien Chaumond	42f08e596f	[examples] rename run_lm_finetuning to run_language_modeling	2020-02-07 09:15:28 -05:00
Julien Chaumond	7748cbbe7d	Oopsie	2020-02-06 15:30:02 -05:00
Julien Chaumond	432c12521e	[docs] Add menu w/ links to other pages on hf.co	2020-02-06 15:30:02 -05:00
Julien Chaumond	eae8ee0389	[doc] model sharing: mention README.md + tweaks cc @lysandrejik @thomwolf	2020-02-05 14:20:03 -05:00
Lysandre	9c67196b83	Update quickstart	2020-02-04 11:11:37 -05:00
Lysandre	d426b58b9e	Patch: v2.4.1	2020-01-31 14:55:33 -05:00
Lysandre	6664ea943d	Release: v2.4.0	2020-01-31 09:40:32 -05:00
Hang Le	b43cb09aaa	Add layerdrop	2020-01-30 12:05:01 -05:00
Lysandre	93dccf527b	Pretrained models	2020-01-30 10:04:18 -05:00
Lysandre	73306d028b	FlauBERT documentation	2020-01-30 10:04:18 -05:00
Lysandre	c69b082601	Update documentation	2020-01-29 12:06:13 -05:00
Lysandre	44a5b4bbe7	Update documentation	2020-01-29 11:47:49 -05:00
Wietse de Vries	f5a236c3ca	Add Dutch pre-trained BERT model	2020-01-27 21:00:34 -05:00
thomwolf	e0849a66ac	adding in the doc	2020-01-27 14:27:07 -05:00
Lysandre	983fef469c	AutoModels doc	2020-01-24 16:37:30 -05:00
Lysandre	24d5ad1dcc	Run the examples in slow	2020-01-23 09:38:45 -05:00
Lysandre	9ddf60b694	Tips + whitespaces	2020-01-23 09:38:45 -05:00
Lysandre	0e9899f451	Fixes	2020-01-23 09:38:45 -05:00
Lysandre	7511f3dd89	PyTorch CTRL + Style	2020-01-23 09:38:45 -05:00
Lysandre	980211a63a	XLM-RoBERTa	2020-01-23 09:38:45 -05:00
Lysandre	db1a7f27a1	PyTorch DistilBERT	2020-01-23 09:38:45 -05:00
Lysandre	b28020f590	TF RoBERTa	2020-01-23 09:38:45 -05:00
Lysandre	3e1bc27e1b	Pytorch RoBERTa	2020-01-23 09:38:45 -05:00
Lysandre	f44ff574d3	Camembert	2020-01-23 09:38:45 -05:00
Lysandre	ccebcae75f	PyTorch XLM	2020-01-23 09:38:45 -05:00
Lysandre	cd656fb21a	PyTorch XLNet	2020-01-23 09:38:45 -05:00
Lysandre	98edad418e	PyTorch Transformer-XL	2020-01-23 09:38:45 -05:00
Lysandre	850795c487	Pytorch GPT	2020-01-23 09:38:45 -05:00
Lysandre	1487b840d3	TF GPT2	2020-01-23 09:38:45 -05:00
Lysandre	bd0d3fd76e	GPT-2 PyTorch models + better tips for BERT	2020-01-23 09:38:45 -05:00
Lysandre	cd77c750c5	BERT PyTorch models	2020-01-23 09:38:45 -05:00
Lysandre	3922a2497e	TF ALBERT + TF Utilities + Fix warnings	2020-01-23 09:38:45 -05:00
Lysandre	00df3d4de0	ALBERT Modeling + required changes to utilities	2020-01-23 09:38:45 -05:00
Lysandre	632675ea88	Can test examples spread over multiple blocks	2020-01-23 09:38:45 -05:00
Lysandre	9bab9b83d2	Glossary	2020-01-23 09:38:45 -05:00
Julien Chaumond	119dc50e2a	Doc tweak on model sharing	2020-01-22 22:40:38 -05:00
Lysandre	387217bd3e	Added example usage	2020-01-14 14:09:09 +01:00
Lysandre	7d1bb7f256	Add missing XLNet and XLM models	2020-01-14 14:09:09 +01:00
Lysandre Debut	632682726f	Updated Configurations	2020-01-14 14:09:09 +01:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Morgan Funtowicz	80faf22b4a	Updating documentation for converting tensorflow model to reflect the new cli convert format. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-01-04 13:41:18 +01:00
Julien Chaumond	9b2badf3c9	[cli] Update doc	2019-12-27 22:54:29 -05:00
Aymeric Augustin	a8d34e534e	Remove [--editable] in install instructions. Use -e only in docs targeted at contributors. If a user copy-pastes command line with [--editable], they will hit an error. If they don't know the --editable option, we're giving them a choice to make before they can move forwards, but this isn't a choice they need to make right now.	2019-12-24 08:46:08 +01:00
Aymeric Augustin	70373a5f7c	Update contribution instructions. Also provide shortcuts in a Makefile.	2019-12-23 21:05:30 +01:00
Aymeric Augustin	d8e33dbd67	Fix path to source code in docs config. This should fix API docs, which went AWOL with yesterday's changes.	2019-12-23 16:49:35 +01:00
Aymeric Augustin	45841eaf7b	Remove references to Python 2 in documentation.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	ced0a94204	Switch test files to the standard test_*.py scheme.	2019-12-22 14:15:13 +01:00
Aymeric Augustin	067395d5c5	Move tests outside of library.	2019-12-22 13:47:17 +01:00
Julien Chaumond	ac1b449cc9	[doc] move distilroberta to more appropriate place cc @lysandrejik	2019-12-21 00:09:01 -05:00
Lysandre	a436574bfd	Release: v2.3.0	2019-12-20 16:22:20 -05:00
Rémi Louf	4e3f745ba4	add example for Model2Model in quickstart	2019-12-20 09:12:31 -05:00
Stefan Schweter	f09d999641	docs: fix numbering 😅	2019-12-18 19:49:33 +01:00
Stefan Schweter	dd7a958fd6	docs: add XLM-RoBERTa to pretrained model list (incl. all parameters)	2019-12-18 19:45:46 +01:00
Stefan Schweter	d35405b7a3	docs: add XLM-RoBERTa to index page	2019-12-18 19:45:10 +01:00
Antti Virtanen	abc43ffbff	Add pretrained model documentation for FinBERT.	2019-12-17 20:35:25 -05:00
Julien Chaumond	3f5ccb183e	[doc] Clarify uploads cf `855ff0e91d (commitcomment-36452545)`	2019-12-16 18:20:29 -05:00
Julien Chaumond	855ff0e91d	[doc] Model upload and sharing ping @lysandrejik @thomwolf Is this clear enough? Anything we should add?	2019-12-16 12:42:22 -05:00
Thomas Wolf	e92bcb7eb6	Merge pull request #1739 from huggingface/t5 [WIP] Adding Google T5 model	2019-12-14 09:40:43 +01:00
Lysandre	7bd11dda6f	Release: v2.2.2	2019-12-13 16:45:30 -05:00
thomwolf	5c00e344c1	update model doc - swith 3B/11B to 3b/11b	2019-12-13 16:33:29 +01:00
Thomas Wolf	110394b2ba	Merge branch 'master' into t5	2019-12-13 16:03:32 +01:00
Julien Chaumond	1748fdf657	[doc] Fix rst table	2019-12-11 18:32:27 -05:00
Masatoshi Suzuki	c03c0dfd23	Add support for Japanese BERT models by cl-tohoku	2019-12-11 18:32:27 -05:00
Stefan Schweter	030faccb8d	doc: fix pretrained models table	2019-12-11 12:19:21 -05:00
thomwolf	0558c9cb9b	Merge branch 'master' into t5	2019-12-10 12:58:48 +01:00
Thomas Wolf	e57d00ee10	Merge pull request #1984 from huggingface/squad-refactor [WIP] Squad refactor	2019-12-10 11:07:26 +01:00
Pierric Cistac	5c877fe94a	fix albert links	2019-12-09 18:53:00 -05:00
Lysandre Debut	00c4e39581	Merge branch 'master' into squad-refactor	2019-12-09 10:41:15 -05:00
Aymeric Augustin	35401fe50f	Remove dependency on pytest for running tests (#2055 ) * Switch to plain unittest for skipping slow tests. Add a RUN_SLOW environment variable for running them. * Switch to plain unittest for PyTorch dependency. * Switch to plain unittest for TensorFlow dependency. * Avoid leaking open files in the test suite. This prevents spurious warnings when running tests. * Fix unicode warning on Python 2 when running tests. The warning was: UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal * Support running PyTorch tests on a GPU. Reverts `27e015bd`. * Tests no longer require pytest. * Make tests pass on cuda	2019-12-06 13:57:38 -05:00
Thomas Wolf	5482822a2b	Merge pull request #2046 from jplu/tf2-ner-example Add NER TF2 example.	2019-12-06 12:12:22 +01:00
LysandreJik	9ecd83dace	Patch evaluation for impossible values + cleanup	2019-12-05 14:44:57 -05:00
VictorSanh	552c44a9b1	release distilm-bert	2019-12-05 10:14:58 -05:00
Julien Plu	9200a759d7	Add few tests on the TF optimization file with some info in the documentation. Complete the README.	2019-12-05 12:56:43 +01:00
Thomas Wolf	1f179f095f	Merge pull request #2011 from AdityaSoni19031997/patch-1 typo fix on the docs as per Pytorch v1.1+	2019-12-05 12:39:04 +01:00
LysandreJik	7a03519975	Documentation	2019-12-04 17:24:35 -05:00
LysandreJik	8101924a68	Patch: v2.2.1	2019-12-03 11:20:26 -05:00
Aditya Soni	c356290c8d	typo fix as per Pytorch v1.1+	2019-12-01 14:08:14 +05:30
Stefan Schweter	8c276b9c92	Merge branch 'master' into distilbert-german	2019-11-27 18:11:49 +01:00
VictorSanh	d75d49a51d	add XnliProcessor to doc	2019-11-27 11:07:22 -05:00
Lysandre	361620954a	Remove TFBertForPreTraining from ALBERT doc	2019-11-27 10:11:37 -05:00
Lysandre	ce02550d50	Fix pretrained models table	2019-11-26 15:47:02 -05:00
Lysandre	cf26a0c85e	Fix pretrained models table	2019-11-26 15:40:03 -05:00
Lysandre	ee4647bd5c	CamemBERT & ALBERT doc	2019-11-26 15:10:51 -05:00
Lysandre	668aac45d2	Pretrained models	2019-11-26 14:52:42 -05:00
Lysandre	ae98d45991	Release: v2.2.0	2019-11-26 14:12:44 -05:00
Julien Chaumond	afaa335851	[doc] Fix assets urls	2019-11-23 11:34:45 -05:00
Stefan Schweter	e631383d4f	docs: add new German distilbert model to pretrained models	2019-11-19 19:52:40 +01:00
Louis MARTIN	035fea5315	Add CamemBERT to auto files and docs	2019-11-16 00:11:07 -05:00
Thomas Wolf	df99f8c5a1	Merge pull request #1832 from huggingface/memory-leak-schedulers replace LambdaLR scheduler wrappers by function	2019-11-14 22:10:31 +01:00
Rémi Louf	2276bf69b7	update the examples, docs and template	2019-11-14 20:38:02 +01:00
Lysandre	e18f786cd5	Quickstart example showcasing past	2019-11-14 10:06:00 -05:00
thomwolf	f03c0c1423	adding models in readme and auto classes	2019-11-08 11:49:46 +01:00
Julien Chaumond	1c542df7e5	Add RoBERTa-based GPT-2 Output Detector from OpenAI converted from https://github.com/openai/gpt-2-output-dataset/tree/master/detector Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-Authored-By: Jong Wook Kim <jongwook@nyu.edu> Co-Authored-By: Jeff Wu <wuthefwasthat@gmail.com>	2019-11-06 16:26:31 -05:00
Julien Chaumond	30968d70af	misc doc	2019-11-05 19:06:12 -05:00
Lysandre	d7d36181fd	GPT-2 XL	2019-11-05 13:31:58 -05:00
Julien Chaumond	93d2fff071	Close #1654	2019-11-01 09:47:38 -04:00
VictorSanh	8ad5c591cd	[RELEASE] DistilRoBERTa	2019-10-23 10:29:47 -04:00
LysandreJik	82f6abd98a	Benchmark section added to the documentation	2019-10-18 17:27:10 -04:00
Lysandre	3ddce1d74c	Release: 2.1.1	2019-10-11 06:37:49 -04:00
Stefan Schweter	5f25a5f367	model: add support for new German BERT models (cased and uncased) from @dbmdz	2019-10-11 10:20:33 +02:00
LysandreJik	9c2e0a4acf	Release: 2.1.0	2019-10-09 12:14:03 -04:00
LysandreJik	7fe98d8c18	Update CTRL documentation	2019-10-09 12:12:36 -04:00
LysandreJik	89f86f9661	CTRL added to the documentation	2019-10-09 12:04:06 -04:00
thomwolf	d9e60f4f0d	Merge branch 'master' into pr/1383	2019-10-09 17:25:08 +02:00
thomwolf	48b438ff2a	doc and conversion	2019-10-09 17:06:30 +02:00
Julien Chaumond	d688af19e5	Update link to swift-coreml-transformers cc @lysandrejik	2019-10-08 16:37:52 -04:00
LysandreJik	8fcc6507ce	Multilingual	2019-10-07 15:02:42 -04:00
Thomas Wolf	b3cfd97946	Merge pull request #1373 from TimYagan/fix-css Fixed critical css font-family issues	2019-10-03 19:04:02 -04:00
VictorSanh	e2ae9c0b73	fix links in doc index	2019-10-03 11:42:21 -04:00
VictorSanh	c1689ac301	fix name	2019-10-03 10:56:39 -04:00
VictorSanh	4a790c40b1	update doc for distil*	2019-10-03 10:54:02 -04:00
LysandreJik	ebb32261b1	fix #1401	2019-10-02 17:52:56 -04:00
Tim Yagan	0a4ed7192e	Fixed critical css font-family issues Fixed critical css font-family issues to ensure compatibility with multiple webbrowsers	2019-09-29 13:51:01 +02:00
Julien Chaumond	d8b641c839	6 -> 8 models	2019-09-27 17:22:01 -04:00
pj	4f2b6579bf	Fix some typos	2019-09-27 22:55:43 +08:00
Gabriel Luiz Freitas Almeida	d2de5b9d8c	Just some typos	2019-09-27 07:08:36 -03:00
Julien Chaumond	fc9faa8a47	[docs] Doc tweaks Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2019-09-26 18:19:51 -04:00
LysandreJik	93f0c5fc72	Repository link in the documentation	2019-09-26 11:45:00 -04:00
thomwolf	6c3b131516	typo in readme/doc	2019-09-26 16:23:28 +02:00
LysandreJik	7e957237e4	[Doc] XLM + Torch in documentation	2019-09-26 10:08:56 -04:00
LysandreJik	927904bc91	[doc] pytorch_transformers -> transformers	2019-09-26 08:47:15 -04:00
LysandreJik	294edfd83d	Release version in documentation	2019-09-26 08:16:12 -04:00
LysandreJik	de5e4864cb	Documentation	2019-09-26 08:04:54 -04:00
LysandreJik	8349d75773	Various small doc fixes	2019-09-26 07:45:40 -04:00
LysandreJik	fb056494e5	Example usage	2019-09-26 07:45:40 -04:00
LysandreJik	36f592cc82	Updated doc for `InputExample` and `InputFeatures`	2019-09-26 07:45:40 -04:00
LysandreJik	ad4a393e2e	Changed processor documentation architecture. Added documentation for GLUE	2019-09-26 07:45:40 -04:00
LysandreJik	c4ac7a76db	GLUE processors	2019-09-26 07:45:40 -04:00
LysandreJik	4acd87ff4e	TF models added to documentation	2019-09-26 07:45:40 -04:00
LysandreJik	cf5c5c9e1c	Documentation	2019-09-26 07:43:13 -04:00
thomwolf	f47f7f4611	add logo	2019-09-26 11:28:44 +02:00
thomwolf	31c23bd5ee	[BIG] pytorch-transformers => transformers	2019-09-26 10:15:53 +02:00
thomwolf	c88f05163d	fix typo in XLM models	2019-09-16 13:42:20 +02:00
LysandreJik	593c070435	Better examples	2019-09-06 12:00:12 -04:00
LysandreJik	0b52642d37	1.2.0 in docs	2019-09-04 11:03:32 -04:00
LysandreJik	7f522437bc	Updated documentation for LM finetuning script	2019-09-02 13:40:25 -04:00
Julien Chaumond	2dcc5a1629	[doc] Add blurb about large-scale model downloads cc @n1t0 @lysandrejik @thomwolf	2019-09-02 12:27:11 -04:00
LysandreJik	09363f2a8b	Fix documentation index	2019-08-30 19:48:32 -04:00
LysandreJik	e0caab0cf0	fix link	2019-08-30 10:09:17 -04:00
LysandreJik	a600b30cc3	Fix index number in documentation	2019-08-30 10:08:14 -04:00
LysandreJik	20c06fa37d	Added DistilBERT to documentation index	2019-08-30 10:06:51 -04:00
LysandreJik	9ce42dc540	Pretrained models table fix	2019-08-28 13:56:28 -04:00
Thomas Wolf	0ecfd17f49	Merge pull request #987 from huggingface/generative-finetuning Generative finetuning	2019-08-28 16:51:50 +02:00
Thomas Wolf	50792dbdcc	Merge pull request #1127 from huggingface/dilbert DilBERT	2019-08-28 16:43:09 +02:00
LysandreJik	75bc2a03cc	Updated article link	2019-08-28 10:05:15 -04:00
LysandreJik	1dc43e56c9	Documentation additions	2019-08-28 09:37:27 -04:00
thomwolf	0d288727b8	fix #1106	2019-08-27 14:50:22 +02:00
Thomas Wolf	90dcd8c05d	Merge branch 'master' into generative-finetuning	2019-08-22 10:43:30 +02:00
Lysandre	2f9397139d	Added GPT-2 LARGE to Pre-trained Models documentation	2019-08-21 11:29:37 -04:00
Lysandre	d6bbcbc4cf	Added finetuning example to documentation	2019-08-21 11:22:05 -04:00
VictorSanh	6f877d9daf	Update dev results on GLUE (bert-base-uncased) w/ median on 5 runs	2019-08-21 03:43:29 +00:00
thomwolf	e239a4a20f	close #984	2019-08-20 11:02:00 +02:00
LysandreJik	572dcfd1db	Doc	2019-08-14 14:56:14 -04:00
Thomas Wolf	d43dc48b34	Merge branch 'master' into auto_models	2019-08-05 19:17:35 +02:00
thomwolf	0b524b0848	remove derived classes for now	2019-08-05 19:08:19 +02:00
thomwolf	13936a9621	update doc and tests	2019-08-05 18:48:16 +02:00
Lysandre Debut	6f05ad72b4	Merge pull request #791 from huggingface/doc RestructuredText table for pretrained models.	2019-08-05 10:18:00 -04:00
thomwolf	b90e29d52c	working on automodels	2019-08-05 16:06:34 +02:00
thomwolf	328afb7097	cleaning up tokenizer tests structure (at last) - last remaining ppb refs	2019-08-05 14:08:56 +02:00
thomwolf	00132b7a7a	updating docs - adding few tests to tokenizers	2019-08-04 22:42:55 +02:00
thomwolf	009273dbdd	big doc update [WIP]	2019-08-04 12:14:57 +02:00
Julien Chaumond	44dd941efb	link to `swift-coreml-transformers`	2019-08-01 09:50:30 -04:00
thomwolf	c717d38573	dictionnary => dictionary	2019-07-26 23:30:48 +02:00
Sukuya	35c52f2f3c	Update torchscript.rst Import fixed to pytorch_transformers else torchscript flag can't be used.	2019-07-25 16:51:11 +08:00
LysandreJik	9d381e7be9	Fixed incorrect links in the PretrainedModel	2019-07-17 09:25:38 -04:00
Stefan Schweter	e6cc6d237f	docs: fix link to various notebooks	2019-07-16 23:42:28 +02:00
Stefan Schweter	5b78400e21	docs: fix link to modeling example source (bert)	2019-07-16 23:41:57 +02:00
Stefan Schweter	61cc3ee350	docs: fix link to tf checkpoint to pytorch script	2019-07-16 23:41:04 +02:00
Stefan Schweter	dbbd94cb7a	docs: fix link to bertology example and update dataset description	2019-07-16 23:40:04 +02:00
LysandreJik	117ed92992	RestructuredText table for pretrained models.	2019-07-16 11:58:47 -04:00
thomwolf	5c82d3488f	indicate default evaluation in breaking changes	2019-07-16 15:45:58 +02:00
thomwolf	43e0e8fa04	updates to readme and doc	2019-07-16 13:56:47 +02:00
thomwolf	3b8b0e01bb	update readme	2019-07-16 00:12:55 +02:00
thomwolf	2397f958f9	updating examples and doc	2019-07-14 23:20:10 +02:00
LysandreJik	6491575fd5	Added TorchScript disclaimer. CSS modifications.	2019-07-11 12:38:21 -04:00
LysandreJik	c82b74b996	Fixed Sphinx errors and warnings	2019-07-10 15:30:19 -04:00
LysandreJik	f773faa258	Fixed all links. Removed TPU. Changed CLI to Converting TF models. Many minor formatting adjustments. Added "TODO Lysandre filled" where necessary.	2019-07-10 14:45:56 -04:00
LysandreJik	c4bab2dc85	Added footer with social links.	2019-07-09 18:03:01 -04:00
LysandreJik	331db8cc02	Added viewcode plugin for source code visualization within the static website.	2019-07-09 17:01:56 -04:00
LysandreJik	83fb311ef7	Patched warnings + Refactored XLNet's Docstrings	2019-07-09 16:38:30 -04:00
LysandreJik	8fe2c9d98e	Refactored Docstrings of BERT, GPT2, GPT, TransfoXL, XLM and XLNet.	2019-07-09 15:55:31 -04:00
LysandreJik	269e73b601	Adding example detailing how to add a new file to the documentation + adding fonts.	2019-07-09 10:11:29 -04:00
LysandreJik	6847e30e1c	New page detailing the use of TorchScript.	2019-07-08 17:34:24 -04:00
LysandreJik	ab30651802	Hugging Face theme.	2019-07-08 16:05:26 -04:00
LysandreJik	64fd986376	Tokenizers and Config classes are referenced.	2019-07-05 17:44:59 -04:00
LysandreJik	df759114c9	Single file documentation for each model, accompanied by the Documentation overview.	2019-07-05 17:35:26 -04:00
LysandreJik	03de9686a7	Initial folder structure for the documentation. A draft of documentation change has been made in the BertModel class.	2019-07-05 17:11:13 -04:00

3358 Commits