Kevin Canwen Xu
866a8ccabb
Add model cards for Microsoft's MiniLM ( #5178 )
...
* Add model cards for Microsoft's MiniLM
* XLMRobertaTokenizer
* format
* Add thumbnail
* finishing up
2020-06-22 21:48:14 +08:00
RafaelWO
b99ad457f4
Added feature to move added tokens in vocabulary for Transformer-XL ( #4953 )
...
* Fixed resize_token_embeddings for transfo_xl model
* Fixed resize_token_embeddings for transfo_xl.
Added custom methods to TransfoXLPreTrainedModel for resizing layers of
the AdaptiveEmbedding.
* Updated docstring
* Fixed resizing cutoffs; added check for new size of embedding layer.
* Added test for resize_token_embeddings
* Fixed code quality
* Fixed unchanged cutoffs in model.config
* Added feature to move added tokens in tokenizer.
* Fixed code quality
* Added feature to move added tokens in tokenizer.
* Fixed code quality
* Fixed docstring, renamed sym to token.
Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>
2020-06-22 15:40:52 +02:00
Sylvain Gugger
eb0ca71ef6
Update glossary ( #5148 )
...
* Update glossary
* Update docs/source/glossary.rst
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-06-22 08:30:49 -04:00
Patrick von Platen
fa0be6d761
Benchmarks ( #4912 )
...
* finish benchmark
* fix isort
* fix setup cfg
* retab
* fix time measuring of tf graph mode
* fix tf cuda
* clean code
* better error message
2020-06-22 12:06:56 +02:00
Zihao Fu
18a0150bfa
fix bart doc ( #5132 )
...
fix bart doc
2020-06-22 10:58:28 +02:00
Mikael Souza
3fe75c7f70
Fixing docs for Encoder Decoder Config ( #5171 )
2020-06-22 10:51:17 +02:00
flozi00
59345cc87f
Typo ( #5147 )
2020-06-22 10:49:23 +02:00
Ilya Boytsov
bc3a0c0607
[examples] fixes arguments for summarization finetune scripts ( #5157 )
...
Authored-by: i.boytsov <i.boytsov@MAC867.local>
2020-06-21 11:51:21 -04:00
Tim Suchanek
68e19f1c22
Fix typo in root README ( #5073 )
2020-06-20 23:00:04 +08:00
Kevin Canwen Xu
c0c577cf8f
Fix PABEE's result table ( #5158 )
2020-06-20 22:56:39 +08:00
Julien Chaumond
aa6a29bc25
SummarizationPipeline: init required task name ( #5086 )
...
* SummarizationPipeline: init required task name
* Update src/transformers/pipelines.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* Apply suggestions from code review
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-06-20 03:16:30 -04:00
Kevin Canwen Xu
2fd28d4363
Add BERT Loses Patience (Patience-based Early Exit) ( #5078 )
...
* Add BERT Loses Patience (Patience-based Early Exit)
* update model archive
* update format
* sort import
* flake8
* Add results
* full results
* align the table
* refactor to inherit
* default per gpu eval = 1
* Formatting
* Formatting
* isort
* modify readme
* Add check
* Fix format
* Fix format
* Doc strings
* ALBERT & BERT for sequence classification don't inherit from the original anymore
* Remove incorrect comments
* Remove incorrect comments
* Remove incorrect comments
* Sync up with new code
* Sync up with new code
* Add a test
* Add a test
* Add a test
* Add a test
* Add a test
* Add a test
* Finishing up!
2020-06-20 13:41:46 +08:00
Zhu Baohe
f1679d7c48
Fix dropout in TFMobileBert ( #5150 )
2020-06-20 13:21:19 +08:00
Kevin Canwen Xu
5ed94b2312
Update note to avoid confusion ( #5131 )
2020-06-20 10:13:34 +08:00
Lysandre
d97b4176e5
Correct device assignment
2020-06-19 21:58:28 -04:00
Vasily Shamporov
9a3f91088c
Add MobileBert ( #4901 )
...
* Add MobileBert
* Quality + Conversion script
* style
* Update src/transformers/modeling_mobilebert.py
* Links to S3
* Style
* TFMobileBert
Slight fixes to the pytorch MobileBert
Style
* MobileBertForMaskedLM (PT + TF)
* MobileBertForNextSentencePrediction (PT + TF)
* MobileBertFor{MultipleChoice, TokenClassification} (PT + TF)
* Tests + Auto
* Doc
* Tests
* Addressing @sgugger's comments
* Addressing @patrickvonplaten's comments
* Style
* Style
* Integration test
* style
* Model card
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-06-19 16:38:36 -04:00
Sam Shleifer
f45e873910
[bart-mnli] Fix class flipping bug ( #5141 )
2020-06-19 13:33:24 -04:00
Erick Rocha Fonseca
e33929ef1e
Fix in Reformer Config documentation ( #5138 )
2020-06-19 15:41:31 +02:00
Sam Shleifer
84be482f66
AutoTokenizer supports mbart-large-en-ro ( #5121 )
2020-06-18 20:47:37 -04:00
Sam Shleifer
2db1e2f415
[cleanup] remove redundant code in SummarizationDataset ( #5119 )
2020-06-18 20:34:48 -04:00
Sylvain Gugger
5f721ad6e4
Fix #5114 ( #5122 )
2020-06-18 19:20:04 -04:00
Pri Oberoi
a258982af3
Add missing arg in 02-transformers notebook ( #5085 )
...
* Add missing arg when creating model
* Fix typos
* Remove from_tf flag when creating model
2020-06-18 19:04:04 -04:00
Deniz
32e94cff64
tf add resize_token_embeddings method ( #4351 )
...
* resize token embeddings
* add tokens
* add tokens
* add tokens
* add t5 token method
* add t5 token method
* add t5 token method
* typo
* debugging input
* debugging input
* debug
* debug
* debug
* trying to set embedding tokens properly
* set embeddings for generation head too
* set embeddings for generation head too
* debugging
* debugging
* enable generation
* add base method
* add base method
* add base method
* return logits in the main call
* reverting to generation
* revert back
* set embeddings for the bert main layer
* description
* fix conflicts
* logging
* set base model as self
* refactor
* tf_bert add method
* tf_bert add method
* tf_bert add method
* tf_bert add method
* tf_bert add method
* tf_bert add method
* tf_bert add method
* tf_bert add method
* v0
* v0
* finalize
* final
* black
* add tests
* revert back the emb call
* comments
* comments
* add the second test
* add vocab size config
* add tf models
* add tf models. add common tests
* remove model specific embedding tests
* stylish
* remove files
* stylez
* Update src/transformers/modeling_tf_transfo_xl.py
change the error.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* adding unchanged weight test
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-06-18 18:41:26 -04:00
Lysandre Debut
973433260e
Pin sphinx-rtd-theme ( #5128 )
2020-06-18 18:07:59 -04:00
Sam Shleifer
8a377c3d6e
[fix] Move _adjust_logits above postprocess to fix Marian.generate ( #5126 )
2020-06-18 18:06:27 -04:00
Sam Shleifer
3d3e605aff
[cleanup] generate_beam_search comments ( #5115 )
2020-06-18 16:30:24 -04:00
Suraj Patil
ca2d0f98c4
ElectraForMultipleChoice ( #4954 )
...
* add ElectraForMultipleChoice
* add test_for_multiple_choice
* add ElectraForMultipleChoice in auto model
* add ElectraForMultipleChoice in all_model_classes
* add SequenceSummary related parameters
* get rid pooler, use SequenceSummary instead
* add electra multiple choice test
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-06-18 14:59:35 -04:00
Ori Garin
279d8e24f7
support local_files_only option for tf models ( #5116 )
2020-06-18 13:47:05 -04:00
Julien Chaumond
355954ffca
Create distilbert-base-uncased-distilled-squad-README.md
2020-06-18 05:17:45 -04:00
Suraj Patil
18177a1a60
lm_labels => labels ( #5080 )
2020-06-18 09:16:29 +02:00
Lysandre
efeb75b805
Remove misleading comment
...
closes #4958
2020-06-17 18:24:35 -04:00
Saurabh Misra
bb154ac50c
Fixing TPU training by disabling wandb.watch gradients logging for TPU ( #4926 )
2020-06-17 18:04:11 -04:00
Suraj Patil
fb6cccb863
fix qa example ( #4929 )
2020-06-17 17:54:16 -04:00
Karthikeyan Singaravelan
38bba9cdd5
Fix deprecation warnings due to invalid escape sequences. ( #4924 )
2020-06-17 17:46:58 -04:00
Sam Shleifer
f1a3d03741
add pandas to setup.cfg ( #5093 )
2020-06-17 16:39:17 -04:00
Sam Shleifer
90c833870c
[MarianTokenizer] Switch to sacremoses for punc normalization ( #5092 )
2020-06-17 16:31:05 -04:00
Pranav Dayanand Pawar
049e14f0e3
very minor spelling correction in script command ( #5090 )
...
actual script name - counts_parameters.py
2020-06-17 16:08:43 -04:00
Sylvain Gugger
20fa828984
Make default_data_collator more flexible and deprecate old behavior ( #5060 )
...
* Make default_data_collator more flexible
* Accept tensors for all features
* Document code
* Refactor
* Formatting
2020-06-17 15:24:51 -04:00
Yacine Jernite
5e06963394
Some changes to simplify the generation function ( #5031 )
...
* moving logits post-processing out of beam search
* moving logits post-processing out of beam search
* first step cache
* fix_Encoder_Decoder
* patrick_version_postprocess
* add_keyword_arg
2020-06-17 14:48:06 -04:00
Sylvain Gugger
204ebc25e6
Update installation page and add contributing to the doc ( #5084 )
...
* Update installation page and add contributing to the doc
* Remove mention of symlinks
2020-06-17 14:01:10 -04:00
Sam Shleifer
043f9f51f9
[examples] SummarizationModule improvements ( #4951 )
2020-06-17 13:51:34 -04:00
Sylvain Gugger
cd40f6564e
Add header and fix command ( #5082 )
2020-06-17 11:45:05 -04:00
Julien Chaumond
70bc3ead4f
[TextClassificationPipeline] Hotfix: make json serializable
2020-06-17 15:09:27 +00:00
Sylvain Gugger
7291ea0bff
Reorganize documentation ( #5064 )
...
* Reorganize topics and add all models
2020-06-17 07:55:20 -04:00
Sylvain Gugger
e4aaa45805
Update pipeline examples to doctest syntax ( #5030 )
2020-06-16 18:14:58 -04:00
Sylvain Gugger
011cc0be51
Fix all Sphinx warnings ( #5068 )
2020-06-16 16:50:02 -04:00
flozi00
af497b5672
Typo ( #5069 )
2020-06-16 16:46:20 -04:00
Yacine Jernite
49c5202522
Eli5 examples ( #4968 )
...
* add eli5 examples
* add dense query script
* query_di
* merging
* merging
* add_utils
* adds nearest neighbor wikipedia
* batch queries
* training_retriever
* new notebooks
* moved retriever training script
* finished wiki40b
* max_len_fix
* train_s2s
* retriever_batch_checkpointing
* cleanup
* merge
* dim_fix
* fix_indexer
* fix_wiki40b_snippets
* fix_embed_for_r
* fp32 index
* fix_sparse_q
* joint_training
* remove obsolete datasets
* add_passage_nn_results
* add_passage_nn_results
* add_batch_nn
* add_batch_nn
* add_data_scripts
* notebook
* notebook
* notebook
* fix_multi_gpu
* add_app
* full_caching
* full_caching
* notebook
* sparse_done
* images
* notebook
* add_image_gif
* with_Gif
* add_contr_image
* notebook
* notebook
* notebook
* train_functions
* notebook
* min_retrieval_length
* pandas_option
* notebook
* min_retrieval_length
* notebook
* notebook
* eval_Retriever
* notebook
* images
* notebook
* add_example
* add_example
* notebook
* fireworks
* notebook
* notebook
* joe's notebook comments
* app_update
* notebook
* notebook_link
* captions
* notebook
* adding RetriBert model
* add RetriBert to Auto
* change AutoLMHead to AutoSeq2Seq
* notebook downloads from hf models
* style_black
* style_black
* app_update
* app_update
* fix_app_update
* style
* style
* isort
* Delete WikiELI5training.ipynb
* Delete evaluate_eli5.py
* Delete WikiELI5explore.ipynb
* Delete ExploreWikiELI5Support.html
* Delete explainlikeimfive.py
* Delete wiki_snippets.py
* children before parent
* children before parent
* style_black
* style_black_only
* isort
* isort_new
* Update src/transformers/modeling_retribert.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
* typo fixes
* app_without_asset
* cleanup
* Delete ELI5animation.gif
* Delete ELI5contrastive.svg
* Delete ELI5wiki_index.svg
* Delete choco_bis.svg
* Delete fireworks.gif
* Delete huggingface_logo.jpg
* Delete huggingface_logo.svg
* Delete Long_Form_Question_Answering_with_ELI5_and_Wikipedia.ipynb
* Delete eli5_app.py
* Delete eli5_utils.py
* readme
* Update README.md
* unused imports
* moved_info
* default_beam
* ftuned model
* disclaimer
* Update src/transformers/modeling_retribert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* black
* add_doc
* names
* isort_Examples
* isort_Examples
* Add doc to index
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2020-06-16 16:36:58 -04:00
Sam Shleifer
c3e607496c
[cleanup] examples test_run_squad uses tiny model ( #5059 )
2020-06-16 14:06:45 -04:00
Sylvain Gugger
439aa1d6e9
Remove old section + caching in install ( #5027 )
2020-06-16 13:03:41 -04:00