transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-21 21:49:06 +06:00

Author	SHA1	Message	Date
Kevin Canwen Xu	866a8ccabb	Add model cards for Microsoft's MiniLM (#5178 ) * Add model cards for Microsoft's MiniLM * XLMRobertaTokenizer * format * Add thumbnail * finishing up	2020-06-22 21:48:14 +08:00
RafaelWO	b99ad457f4	Added feature to move added tokens in vocabulary for Transformer-XL (#4953 ) * Fixed resize_token_embeddings for transfo_xl model * Fixed resize_token_embeddings for transfo_xl. Added custom methods to TransfoXLPreTrainedModel for resizing layers of the AdaptiveEmbedding. * Updated docstring * Fixed resizinhg cutoffs; added check for new size of embedding layer. * Added test for resize_token_embeddings * Fixed code quality * Fixed unchanged cutoffs in model.config * Added feature to move added tokens in tokenizer. * Fixed code quality * Added feature to move added tokens in tokenizer. * Fixed code quality * Fixed docstring, renamed sym to oken. Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>	2020-06-22 15:40:52 +02:00
Sylvain Gugger	eb0ca71ef6	Update glossary (#5148 ) * Update glossary * Update docs/source/glossary.rst Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-06-22 08:30:49 -04:00
Patrick von Platen	fa0be6d761	Benchmarks (#4912 ) * finish benchmark * fix isort * fix setup cfg * retab * fix time measuring of tf graph mode * fix tf cuda * clean code * better error message	2020-06-22 12:06:56 +02:00
Zihao Fu	18a0150bfa	fix bart doc (#5132 ) fix bart doc	2020-06-22 10:58:28 +02:00
Mikael Souza	3fe75c7f70	Fixing docs for Encoder Decoder Config (#5171 )	2020-06-22 10:51:17 +02:00
flozi00	59345cc87f	Typo (#5147 )	2020-06-22 10:49:23 +02:00
Ilya Boytsov	bc3a0c0607	[examples] fixes arguments for summarization finetune scripts (#5157 ) Authored-by: i.boytsov <i.boytsov@MAC867.local>	2020-06-21 11:51:21 -04:00
Tim Suchanek	68e19f1c22	Fix typo in root README (#5073 )	2020-06-20 23:00:04 +08:00
Kevin Canwen Xu	c0c577cf8f	Fix PABEE's result table (#5158 )	2020-06-20 22:56:39 +08:00
Julien Chaumond	aa6a29bc25	SummarizationPipeline: init required task name (#5086 ) * SummarizationPipeline: init required task name * Update src/transformers/pipelines.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Apply suggestions from code review Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-06-20 03:16:30 -04:00
Kevin Canwen Xu	2fd28d4363	Add BERT Loses Patience (Patience-based Early Exit) (#5078 ) * Add BERT Loses Patience (Patience-based Early Exit) * update model archive * update format * sort import * flake8 * Add results * full results * align the table * refactor to inherit * default per gpu eval = 1 * Formatting * Formatting * isort * modify readme * Add check * Fix format * Fix format * Doc strings * ALBERT & BERT for sequence classification don't inherit from the original anymore * Remove incorrect comments * Remove incorrect comments * Remove incorrect comments * Sync up with new code * Sync up with new code * Add a test * Add a test * Add a test * Add a test * Add a test * Add a test * Finishing up!	2020-06-20 13:41:46 +08:00
Zhu Baohe	f1679d7c48	Fix dropout in TFMobileBert (#5150 )	2020-06-20 13:21:19 +08:00
Kevin Canwen Xu	5ed94b2312	Update note to avoid confusion (#5131 )	2020-06-20 10:13:34 +08:00
Lysandre	d97b4176e5	Correct device assignment	2020-06-19 21:58:28 -04:00
Vasily Shamporov	9a3f91088c	Add MobileBert (#4901 ) * Add MobileBert * Quality + Conversion script * style * Update src/transformers/modeling_mobilebert.py * Links to S3 * Style * TFMobileBert Slight fixes to the pytorch MobileBert Style * MobileBertForMaskedLM (PT + TF) * MobileBertForNextSentencePrediction (PT + TF) * MobileFor{MultipleChoice, TokenClassification} (PT + TF) ss * Tests + Auto * Doc * Tests * Addressing @sgugger's comments * Adressing @patrickvonplaten's comments * Style * Style * Integration test * style * Model card Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-19 16:38:36 -04:00
Sam Shleifer	f45e873910	[bart-mnli] Fix class flipping bug (#5141 )	2020-06-19 13:33:24 -04:00
Erick Rocha Fonseca	e33929ef1e	Fix in Reformer Config documentation (#5138 )	2020-06-19 15:41:31 +02:00
Sam Shleifer	84be482f66	AutoTokenizer supports mbart-large-en-ro (#5121 )	2020-06-18 20:47:37 -04:00
Sam Shleifer	2db1e2f415	[cleanup] remove redundant code in SummarizationDataset (#5119 )	2020-06-18 20:34:48 -04:00
Sylvain Gugger	5f721ad6e4	Fix #5114 (#5122 )	2020-06-18 19:20:04 -04:00
Pri Oberoi	a258982af3	Add missing arg in 02-transformers notebook (#5085 ) * Add missing arg when creating model * Fix typos * Remove from_tf flag when creating model	2020-06-18 19:04:04 -04:00
Deniz	32e94cff64	tf add resize_token_embeddings method (#4351 ) * resize token embeddings * add tokens * add tokens * add tokens * add t5 token method * add t5 token method * add t5 token method * typo * debugging input * debugging input * debug * debug * debug * trying to set embedding tokens properly * set embeddings for generation head too * set embeddings for generation head too * debugging * debugging * enable generation * add base method * add base method * add base method * return logits in the main call * reverting to generation * revert back * set embeddings for the bert main layer * description * fix conflicts * logging * set base model as self * refactor * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * tf_bert add method * v0 * v0 * finalize * final * black * add tests * revert back the emb call * comments * comments * add the second test * add vocab size condig * add tf models * add tf models. add common tests * remove model specific embedding tests * stylish * remove files * stylez * Update src/transformers/modeling_tf_transfo_xl.py change the error. Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * adding unchanged weight test Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-18 18:41:26 -04:00
Lysandre Debut	973433260e	Pin `sphinx-rtd-theme` (#5128 )	2020-06-18 18:07:59 -04:00
Sam Shleifer	8a377c3d6e	[fix] Move _adjust_logits above postprocess to fix Marian.generate (#5126 )	2020-06-18 18:06:27 -04:00
Sam Shleifer	3d3e605aff	[cleanup] generate_beam_search comments (#5115 )	2020-06-18 16:30:24 -04:00
Suraj Patil	ca2d0f98c4	ElectraForMultipleChoice (#4954 ) * add ElectraForMultipleChoice * add test_for_multiple_choice * add ElectraForMultipleChoice in auto model * add ElectraForMultipleChoice in all_model_classes * add SequenceSummary related parameters * get rid pooler, use SequenceSummary instead * add electra multiple choice test Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-06-18 14:59:35 -04:00
Ori Garin	279d8e24f7	support local_files_only option for tf models (#5116 )	2020-06-18 13:47:05 -04:00
Julien Chaumond	355954ffca	Create distilbert-base-uncased-distilled-squad-README.md	2020-06-18 05:17:45 -04:00
Suraj Patil	18177a1a60	lm_labels => labels (#5080 )	2020-06-18 09:16:29 +02:00
Lysandre	efeb75b805	Remove misleading comment closes #4958	2020-06-17 18:24:35 -04:00
Saurabh Misra	bb154ac50c	Fixing TPU training by disabling wandb.watch gradients logging for TPU (#4926 )	2020-06-17 18:04:11 -04:00
Suraj Patil	fb6cccb863	fix qa example (#4929 )	2020-06-17 17:54:16 -04:00
Karthikeyan Singaravelan	38bba9cdd5	Fix deprecation warnings due to invalid escape sequences. (#4924 )	2020-06-17 17:46:58 -04:00
Sam Shleifer	f1a3d03741	add pandas to setup.cfg (#5093 )	2020-06-17 16:39:17 -04:00
Sam Shleifer	90c833870c	[MarianTokenizer] Switch to sacremoses for punc normalization (#5092 )	2020-06-17 16:31:05 -04:00
Pranav Dayanand Pawar	049e14f0e3	very minor spelling correction in script command (#5090 ) actual script name - counts_parameters.py	2020-06-17 16:08:43 -04:00
Sylvain Gugger	20fa828984	Make default_data_collator more flexible and deprecate old behavior (#5060 ) * Make default_data_collator more flexible * Accept tensors for all features * Document code * Refactor * Formatting	2020-06-17 15:24:51 -04:00
Yacine Jernite	5e06963394	Some changes to simplify the generation function (#5031 ) * moving logits post-processing out of beam search * moving logits post-processing out of beam search * first step cache * fix_Encoder_Decoder * patrick_version_postprocess * add_keyword_arg	2020-06-17 14:48:06 -04:00
Sylvain Gugger	204ebc25e6	Update installation page and add contributing to the doc (#5084 ) * Update installation page and add contributing to the doc * Remove mention of symlinks	2020-06-17 14:01:10 -04:00
Sam Shleifer	043f9f51f9	[examples] SummarizationModule improvements (#4951 )	2020-06-17 13:51:34 -04:00
Sylvain Gugger	cd40f6564e	Add header and fix command (#5082 )	2020-06-17 11:45:05 -04:00
Julien Chaumond	70bc3ead4f	[TextClassificationPipeline] Hotfix: make json serializable	2020-06-17 15:09:27 +00:00
Sylvain Gugger	7291ea0bff	Reorganize documentation (#5064 ) * Reorganize topics and add all models	2020-06-17 07:55:20 -04:00
Sylvain Gugger	e4aaa45805	Update pipeline examples to doctest syntax (#5030 )	2020-06-16 18:14:58 -04:00
Sylvain Gugger	011cc0be51	Fix all sphynx warnings (#5068 )	2020-06-16 16:50:02 -04:00
flozi00	af497b5672	Typo (#5069 )	2020-06-16 16:46:20 -04:00
Yacine Jernite	49c5202522	Eli5 examples (#4968 ) * add eli5 examples * add dense query script * query_di * merging * merging * add_utils * adds nearest neighbor wikipedia * batch queries * training_retriever * new notebooks * moved retriever traiing script * finished wiki40b * max_len_fix * train_s2s * retriever_batch_checkpointing * cleanup * merge * dim_fix * fix_indexer * fix_wiki40b_snippets * fix_embed_for_r * fp32 index * fix_sparse_q * joint_training * remove obsolete datasets * add_passage_nn_results * add_passage_nn_results * add_batch_nn * add_batch_nn * add_data_scripts * notebook * notebook * notebook * fix_multi_gpu * add_app * full_caching * full_caching * notebook * sparse_done * images * notebook * add_image_gif * with_Gif * add_contr_image * notebook * notebook * notebook * train_functions * notebook * min_retrieval_length * pandas_option * notebook * min_retrieval_length * notebook * notebook * eval_Retriever * notebook * images * notebook * add_example * add_example * notebook * fireworks * notebook * notebook * joe's notebook comments * app_update * notebook * notebook_link * captions * notebook * assing RetriBert model * add RetriBert to Auto * change AutoLMHead to AutoSeq2Seq * notebook downloads from hf models * style_black * style_black * app_update * app_update * fix_app_update * style * style * isort * Delete WikiELI5training.ipynb * Delete evaluate_eli5.py * Delete WikiELI5explore.ipynb * Delete ExploreWikiELI5Support.html * Delete explainlikeimfive.py * Delete wiki_snippets.py * children before parent * children before parent * style_black * style_black_only * isort * isort_new * Update src/transformers/modeling_retribert.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * typo fixes * app_without_asset * cleanup * Delete ELI5animation.gif * Delete ELI5contrastive.svg * Delete ELI5wiki_index.svg * Delete choco_bis.svg * Delete fireworks.gif * Delete huggingface_logo.jpg * Delete huggingface_logo.svg * Delete Long_Form_Question_Answering_with_ELI5_and_Wikipedia.ipynb * Delete eli5_app.py * Delete eli5_utils.py * readme * Update README.md * unused imports * moved_info * default_beam * ftuned model * disclaimer * Update src/transformers/modeling_retribert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * black * add_doc * names * isort_Examples * isort_Examples * Add doc to index Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-06-16 16:36:58 -04:00
Sam Shleifer	c3e607496c	[cleanup] examples test_run_squad uses tiny model (#5059 )	2020-06-16 14:06:45 -04:00
Sylvain Gugger	439aa1d6e9	Remove old section + caching in install (#5027 )	2020-06-16 13:03:41 -04:00

5759 Commits