Jake Tae
fe78fe98ca
Enable tqdm toggling ( #15167 )
...
* feature: enable tqdm toggle
* test: add tqdm unit test
* style: run linter
* Update tests/test_tqdm_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* refactor: use tiny model, run linter
* docs: add tqdm to logging
* docs: add tqdm reference to `http_get`
* style: run linter
* Update docs/source/main_classes/logging.mdx
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* fix: use `AutoConfig` for framework agnostic testing
* chore: mv tqdm test to `test_logging.py`
* feature: implement enable/disable functions
* docs: mv docstring to comment
* chore: mv tqdm functions to `logging.py`
* docs: update docs to reference `enable/disable` funcs
* test: update test to use `enable/disable` func
* chore: update function reference in comment
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
2022-01-18 17:52:35 -05:00
NielsRogge
74bec9865c
Add MAE ( #15120 )
...
* First draft
* More improvements
* More improvements
* More improvements
* Fix embeddings
* Add conversion script
* Finish conversion script
* More improvements
* Fix forward pass
* Remove print statements
* Add weights initialization
* Add initialization of decoder weights
* Add support for other models in the conversion script
* Fix patch_size for huge model
* Fix most of the tests
* Fix integration test
* Fix docs
* Fix archive_list
* Apply suggestions from code review
* Improve documentation
* Apply more suggestions
* Skip some tests due to non-deterministic behaviour
* Fix test_initialization
* Remove unneccessary initialization of nn.Embedding
* Improve docs
* Fix dummies
* Remove ViTMAEFeatureExtractor from docs
* Add model to README and table of contents
* Delete inference file
2022-01-18 16:21:32 +01:00
Li-Huai (Allan) Lin
22454ae492
Add REALM ( #13292 )
...
* REALM initial commit
* Retriever OK (Update new_gelu).
* Encoder prediction score OK
* Encoder pretrained model OK
* Update retriever comments
* Update docs, tests, and imports
* Prune unused models
* Make embedder as a module `RealmEmbedder`
* Add RealmRetrieverOutput
* Update tokenization
* Pass all tests in test_modeling_realm.py
* Prune RealmModel
* Update docs
* Add training test.
* Remove completed TODO
* Style & Quality
* Prune `RealmModel`
* Fixup
* Changes:
1. Remove RealmTokenizerFast
2. Update docstrings
3. Add a method to RealmTokenizer to handle candidates tokenization.
* Fix up
* Style
* Add tokenization tests
* Update `from_pretrained` tests
* Apply suggestions
* Style & Quality
* Copy BERT model
* Fix comment to avoid docstring copying
* Make RealmBertModel private
* Fix bug
* Style
* Basic QA
* Save
* Complete reader logits
* Add searcher
* Complete searcher & reader
* Move block records init to constructor
* Fix training bug
* Add some outputs to RealmReader
* Add finetuned checkpoint variable names parsing
* Fix bug
* Update REALM config
* Add RealmForOpenQA
* Update convert_tfrecord logits
* Fix bugs
* Complete imports
* Update docs
* Update naming
* Add brute-force searcher
* Pass realm model tests
* Style
* Exclude RealmReader from common tests
* Fix
* Fix
* convert docs
* up
* up
* more make style
* up
* upload
* up
* Fix
* Update src/transformers/__init__.py
* adapt testing
* change modeling code
* fix test
* up
* up
* up
* correct more
* make retriever work
* update
* make style
* finish main structure
* Resolve merge conflict
* Make everything work
* Style
* Fixup
* Fixup
* Update training test
* fix retriever
* remove hardcoded path
* Fix
* Fix modeling test
* Update model links
* Initial retrieval test
* Fix modeling test
* Complete retrieval tests
* Fix
* style
* Fix tests
* Fix docstring example
* Minor fix of retrieval test
* Update license headers and docs
* Apply suggestions from code review
* Style
* Apply suggestions from code review
* Add an example to RealmEmbedder
* Fix
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-18 07:24:13 -05:00
Stas Bekman
edd3fce2f7
[doc] new MoE paper ( #15184 )
...
add new paper
2022-01-17 09:10:51 -08:00
Stas Bekman
669e3c50c9
[doc] performance: Efficient Software Prebuilds ( #15147 )
...
* Efficient Software Prebuilds
* improve
2022-01-14 18:25:20 -08:00
AK391
4663c609b9
Add "open in hf spaces" gradio button issue #73 ( #15106 )
...
* update XLMProphetNet link
* update DPR link
* change prophetnet link
* change link MBART
* change link GPT
* update gpt2 link
* ctrl update link
* update Transformer-XL link
* Update Reformer link
* update xlnet link
* bert update link
* udpate albert link
* roberta update link
* update distilbert link
* update convbert link
* update XLM link
* xlm roberta update link
* update Flaubert link
* update electra link
* update funnel transformer and longformer
* bart update link
* pegasus update link
* udpate marianmt link
* t5 update link
* mt5 update link
2022-01-14 10:12:30 -05:00
Carlos Aguayo
3fc221d077
Update model_sharing.mdx ( #15142 )
...
Fix typo
2022-01-13 12:26:02 -05:00
lewtun
021f2ea987
Add ONNX configuration classes to docs ( #15121 )
...
* Add ONNX classes to main package
* Remove permalinks from ONNX guide
* Fix ToC entry
* Revert "Add ONNX classes to main package"
This reverts commit eb794a5b00
.
* Add ONNX classes to main doc
* Fix syntax highlighting in doc
* Fix text
* Add FeaturesManager to doc
* Use paths to reference ONNX classes
* Add FeaturesManager to init
* Add missing ONNX paths
2022-01-12 16:33:32 +01:00
Sylvain Gugger
c425d60bb9
Fix link to deepspeed config
2022-01-12 09:32:53 -05:00
lewtun
16f0b7d72c
Update ONNX docs ( #14904 )
...
* Remove docs for deprecated ONNX export
* Tidy up the CLI help messages
* Revamp ONNX docs
* Update auto-config table
* Use DistilBERT as example for consistency
* Wrap up first pass at ONNX docs
* Fix table check
* Add tweaks and introduction
* Add cross-ref
* Fix missing import
* Fix style
* Add permalinks to ONNX configs
* Clarify role of OrderedDict
* Update docs/source/serialization.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Add doctest syntax to code blocks
* Remove permalinks
* Revert "Remove permalinks"
This reverts commit 099701daf0
.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-11 18:06:05 +01:00
AK391
68d925195e
Merge branch 'master' into master
2022-01-11 11:11:29 -05:00
novice
28e091430e
Add Nystromformer ( #14659 )
...
* Initial commit
* Config and modelling changes
Added Nystromformer-specific attributes to config and removed all decoder functionality from modelling.
* Modelling and test changes
Added Nystrom approximation and removed decoder tests.
* Code quality fixes
* Modeling changes and conversion script
Initial commits to conversion script, modeling changes.
* Minor modeling changes and conversion script
* Modeling changes
* Correct modeling, add tests and documentation
* Code refactor
* Remove tokenizers
* Code refactor
* Update __init__.py
* Fix bugs
* Update src/transformers/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/model_doc/nystromformer.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/configuration_nystromformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/configuration_nystromformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/configuration_nystromformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/configuration_nystromformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/convert_nystromformer_original_pytorch_checkpoint_to_pytorch.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/nystromformer/configuration_nystromformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update modeling and test_modeling
* Code refactor
* .rst to .mdx
* doc changes
* Doc changes
* Update modeling_nystromformer.py
* Doc changes
* Fix copies
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update configuration_nystromformer.py
* Fix copies
* Update tests/test_modeling_nystromformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update test_modeling_nystromformer.py
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Fix code style
* Update modeling_nystromformer.py
* Update modeling_nystromformer.py
* Fix code style
* Reformat modeling file
* Update modeling_nystromformer.py
* Modify NystromformerForMultipleChoice
* Fix code quality
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Code style changes and torch.no_grad()
* make style
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-11 14:25:49 +01:00
Virus
c4fa908fa9
Adds IBERT to models exportable with ONNX ( #14868 )
...
* Add IBertOnnxConfig and tests
* add all the supported features for IBERT and remove outputs in IbertOnnxConfig
* use OnnxConfig
* fix codestyle
* remove serialization.rst
* codestyle
2022-01-11 12:17:08 +01:00
AK391
5cd7086fdb
XLM-ProphetNet Spaces badge
2022-01-11 00:11:31 -05:00
AK391
4e3208662e
DPR Spaces badge
2022-01-10 13:50:40 -05:00
AK391
ac2c06d492
ProphetNet spaces badge
2022-01-10 13:43:34 -05:00
AK391
bf0201e184
MBART spaces badge
2022-01-10 13:37:17 -05:00
Yih-Dar
b67fd797be
Add TFVisionEncoderDecoderModel ( #14148 )
...
* Start the work on TFVisionEncoderDecoderModel
* Expose TFVisionEncoderDecoderModel
* fix import
* Add modeling_tf_vision_encoder_decoder to _ignore_modules in get_model_modules()
* reorder
* Apply the fix for checkpoint loading as in #14016
* remove attention_mask + fix VISION_DUMMY_INPUTS
* A minimal change to make TF generate() work for vision models as encoder in encoder-decoder setting
* fix wrong condition: shape_list(input_ids) == 2
* add tests
* use personal TFViTModel checkpoint (for now)
* Add equivalence tests + projection layer
* style
* make sure projection layer can run
* Add examples
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Clean comments (need to work on TODOs for PyTorch models)
* Remove TF -> PT in check_pt_tf_equivalence for TFVisionEncoderDecoderModel
* fixes
* Revert changes in PT code.
* Update tests/test_modeling_tf_vision_encoder_decoder.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Add test_inference_coco_en for TF test
* fix quality
* fix name
* build doc
* add main_input_name
* Fix ckpt name in test
* fix diff between master and this PR
* fix doc
* fix style and quality
* fix missing doc
* fix labels handling
* Delete auto.rst
* Add the changes done in #14016
* fix prefix
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-10 13:30:14 -05:00
AK391
c9504b2f50
MT5 Spaces badge
2022-01-10 12:57:08 -05:00
AK391
daec528ca9
T5 Spaces badge
2022-01-10 12:51:39 -05:00
AK391
0554e4d5c5
MarianMT Spaces badge
2022-01-10 12:47:12 -05:00
AK391
7ec6aad23d
Pegasus Spaces badge
2022-01-10 12:39:22 -05:00
AK391
03f8b9c9e0
BART Spaces badge
2022-01-10 12:33:59 -05:00
Stas Bekman
37bc0b4e53
[performance doc] Power and Cooling ( #14935 )
...
* [performance doc] Power and Cooling
* more docs
* Update docs/source/performance.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* reword
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-10 09:21:04 -08:00
AK391
20f169b523
Longformer Spaces badge
2022-01-10 12:14:18 -05:00
AK391
4fbc924d0a
Funnel Transformer spaces badge
2022-01-10 12:06:05 -05:00
AK391
222c09a635
ELECTRA Spaces badge
2022-01-10 11:53:23 -05:00
Stas Bekman
31838d3e11
[doc] normalize HF Transformers string ( #15023 )
2022-01-10 08:44:33 -08:00
AK391
84f360e862
FlauBERT spaces badge
2022-01-10 11:41:10 -05:00
AK391
9f33116898
XLM-Roberta Spaces badge
2022-01-10 10:54:18 -05:00
AK391
20fa9eb035
XLM Spaces badge
2022-01-10 10:48:06 -05:00
AK391
16b6df6fca
ConvBERT spaces badge
2022-01-10 10:33:03 -05:00
Santiago Castro
f21bc4215a
Use tqdm.auto in Pipeline docs ( #14920 )
...
It's better for e.g. notebook.
2022-01-10 10:28:34 -05:00
Mishig Davaadorj
f012c00ada
Model summary horizontal banners ( #15058 )
2022-01-10 10:06:14 -05:00
Minghao Li
b2c477fc6d
support the trocr small models ( #14893 )
...
* support the trocr small models
* resolve conflict
* Update docs/source/model_doc/trocr.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/model_doc/trocr.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/model_doc/trocr.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* fix unexpected indent in processing_trocr.py
* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* update the docstring of processing_trocr
* remove extra space
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
2022-01-10 09:28:03 -05:00
Yih-Dar
0a03a86813
fix model table cell text alignment ( #14999 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-01-10 06:44:11 -05:00
AK391
5be1242ac0
Merge branch 'huggingface:master' into master
2022-01-07 11:48:22 -05:00
AK391
484e7a441f
Distilbert spaces badge
2022-01-07 11:47:56 -05:00
K.C. Tung
f18c6fa94c
Resubmit changes after rebase to master ( #14982 )
2022-01-07 08:34:12 +01:00
AK391
1d71227295
Roberta spaces badge
2022-01-06 18:50:19 -05:00
AK391
cac877425c
ALBERT spaces badge
2022-01-06 13:01:23 -05:00
AK391
794441c379
BERT spaces badge
2022-01-06 12:22:09 -05:00
AK391
f872f18dca
XLNet spaces badge
2022-01-06 12:09:50 -05:00
AK391
8d187e7feb
Reformer Spaces badge
2022-01-06 11:59:21 -05:00
AK391
59fb636948
Transformer-XL badge
2022-01-06 11:47:41 -05:00
AK391
2380136722
add spaces badges
2022-01-04 16:13:57 -05:00
Kevin Ko
857ab55c01
[doc] Update parallelism.mdx ( #15018 )
...
* Update parallelism.mdx
* Update parallelism.mdx
2022-01-04 09:58:27 -08:00
Daniel Stancl
21aecc0971
Add Flax RoFormer ( #15005 )
...
* Add FlaxRoFormer
* Clean code + make quality
* Fix output pooling for FlaxRoFormerForMultipleChoiceModule
* Apply suggestions from code review
* add flax model to repos
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-04 13:23:10 +01:00
Kevin Ko
f2ab21833f
Update parallelism.mdx ( #15013 )
...
* Update parallelism.mdx
* Update parallelism.mdx
* Update parallelism.mdx
* Update parallelism.mdx
* Update parallelism.mdx
* Update parallelism.mdx
* Update parallelism.mdx
* Update parallelism.mdx
2022-01-03 11:49:27 -08:00
Sylvain Gugger
8f6373c61c
Map model_type and doc pages names ( #14944 )
...
* Map model_type and doc pages names
* Add script
* Fix typo
* Quality
* Manual check for Auto
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2022-01-03 05:08:55 -05:00