transformers/docs/source/model_doc
Francesco Saverio Zuppichini d83d22f578
Maskformer (#15682)
* maskformer

* conflicts

* conflicts

* minor fixes

* feature extractor test fix

refactor MaskFormerLoss following conversation

MaskFormer related types should not trigger a module time import error

missed one

removed all the types that are not used

update config mapping

minor updates in the doc

resolved conversation that doesn't need a discussion

minor changes

resolved conversations

fixed DetrDecoder

* minor changes

minor changes

fixed mdx file

test feature_extractor return types

functional losses -> classes

removed the return type test for the feature extractor

minor changes + style + quality

* conflicts?

* rebase master

* readme

* added missing files

* deleded poolformers test that where in the wrong palce

* CI

* minor changes

* Apply suggestions from code review

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* resolved conversations

* minor changes

* conversations

[Unispeech] Fix slow tests (#15818)

* remove soundfile old way of loading audio

* Adapt slow test

[Barthez Tokenizer] Fix saving (#15815)

[TFXLNet] Correct tf xlnet generate (#15822)

* [TFXLNet] Correct tf xlnet

* adapt test comment

Fix the push run (#15807)

Fix semantic segmentation pipeline test (#15826)

Fix dummy_inputs() to dummy_inputs in symbolic_trace doc (#15776)

Add model specific output classes to PoolFormer model docs (#15746)

* Added model specific output classes to poolformer docs

* Fixed Segformer typo in Poolformer docs

Adding the option to return_timestamps on pure CTC ASR models. (#15792)

* Adding the option to return_timestamps on pure CTC ASR models.

* Remove `math.prod` which was introduced in Python 3.8

* int are not floats.

* Reworking the PR to support "char" vs "word" output.

* Fixup!

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Quality.

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

HFTracer.trace should use/return self.graph to be compatible with torch.fx.Tracer (#15824)

Fix tf.concatenate + test past_key_values for TF models (#15774)

* fix wrong method name tf.concatenate

* add tests related to causal LM / decoder

* make style and quality

* clean-up

* Fix TFBertModel's extended_attention_mask when past_key_values is provided

* Fix tests

* fix copies

* More tf.int8 -> tf.int32 in TF test template

* clean-up

* Update TF test template

* revert the previous commit + update the TF test template

* Fix TF template extended_attention_mask when past_key_values is provided

* Fix some styles manually

* clean-up

* Fix ValueError: too many values to unpack in the test

* Fix more: too many values to unpack in the test

* Add a comment for extended_attention_mask when there is past_key_values

* Fix TFElectra extended_attention_mask when past_key_values is provided

* Add tests to other TF models

* Fix for TF Electra test: add prepare_config_and_inputs_for_decoder

* Fix not passing training arg to lm_head in TFRobertaForCausalLM

* Fix tests (with past) for TF Roberta

* add testing for pask_key_values for TFElectra model

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

[examples/summarization and translation] fix readme (#15833)

Add ONNX Runtime quantization for text classification notebook (#15817)

Re-enable doctests for the quicktour (#15828)

* Re-enable doctests for the quicktour

* Re-enable doctests for task_summary (#15830)

* Remove &

Framework split model report (#15825)

Add TFConvNextModel (#15750)

* feat: initial implementation of convnext in tensorflow.

* fix: sample code for the classification model.

* chore: added checked for  from the classification model.

* chore: set bias initializer in the classification head.

* chore: updated license terms.

* chore: removed ununsed imports

* feat: enabled  argument during using drop_path.

* chore: replaced tf.identity with layers.Activation(linear).

* chore: edited default checkpoint.

* fix: minor bugs in the initializations.

* partial-fix: tf model errors for loading pretrained pt weights.

* partial-fix: call method updated

* partial-fix: cross loading of weights (4x3 variables to be matched)

* chore: removed unneeded comment.

* removed playground.py

* rebasing

* rebasing and removing playground.py.

* fix: renaming TFConvNextStage conv and layer norm layers

* chore: added initializers and other minor additions.

* chore: added initializers and other minor additions.

* add: tests for convnext.

* fix: integration tester class.

* fix: issues mentioned in pr feedback (round 1).

* fix: how output_hidden_states arg is propoagated inside the network.

* feat: handling of  arg for pure cnn models.

* chore: added a note on equal contribution in model docs.

* rebasing

* rebasing and removing playground.py.

* feat: encapsulation for the convnext trunk.

* Fix variable naming; Test-related corrections; Run make fixup

* chore: added Joao as a contributor to convnext.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* chore: corrected copyright year and added comment on NHWC.

* chore: fixed the black version and ran formatting.

* chore: ran make style.

* chore: removed from_pt argument from test, ran make style.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* fix: tests in the convnext subclass, ran make style.

* rebasing

* rebasing and removing playground.py.

* rebasing

* rebasing and removing playground.py.

* chore: moved convnext test to the correct location

* fix: locations for the test file of convnext.

* fix: convnext tests.

* chore: applied  sgugger's suggestion for dealing w/ output_attentions.

* chore: added comments.

* chore: applied updated quality enviornment style.

* chore: applied formatting with quality enviornment.

* chore: revert to the previous tests/test_modeling_common.py.

* chore: revert to the original test_modeling_common.py

* chore: revert to previous states for test_modeling_tf_common.py and modeling_tf_utils.py

* fix: tests for convnext.

* chore: removed output_attentions argument from convnext config.

* chore: revert to the earlier tf utils.

* fix: output shapes of the hidden states

* chore: removed unnecessary comment

* chore: reverting to the right test_modeling_tf_common.py.

* Styling nits

Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

* minor changes

* doc fix in feature extractor

* doc

* typose

* removed detr logic from config

* removed detr logic from config

* removed num_labels

* small fix in the config

* auxilary -> auxiliary

* make style

* some test is failing

* fix a weird char in config prevending doc-builder

* retry to fix the doc-builder issue

* make style

* new try to fix the doc builder

* CI

* change weights to facebook

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
2022-03-02 15:48:20 +01:00
..
albert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
auto.mdx Add SimMIM (#15586) 2022-02-17 19:44:55 +01:00
bart.mdx [Bart] Fix implementation note doc (#15879) 2022-03-02 10:24:32 +01:00
barthez.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
bartpho.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
beit.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
bert-generation.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bert-japanese.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bert.mdx Convert rst to mdx bert (#14806) 2021-12-17 11:13:34 -05:00
bertweet.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
big_bird.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bigbird_pegasus.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
blenderbot-small.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
blenderbot.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bort.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
byt5.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
camembert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
canine.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
clip.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
convbert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
convnext.mdx Add TFConvNextModel (#15750) 2022-02-25 18:19:16 +01:00
cpm.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
ctrl.mdx 🖍 remove broken link (#15615) 2022-02-11 12:33:55 -06:00
data2vec.mdx Add Data2Vec (#15507) 2022-03-01 11:09:20 +01:00
deberta-v2.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
deberta.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
deit.mdx Add SimMIM (#15586) 2022-02-17 19:44:55 +01:00
detr.mdx Doc new front (#14590) 2021-12-01 14:13:02 -05:00
dialogpt.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
distilbert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
dpr.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
electra.mdx Add ElectraForCausalLM -> Enable Electra encoder-decoder model (#14729) 2021-12-27 12:37:52 +01:00
encoder-decoder.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
flaubert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
fnet.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
fsmt.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
funnel.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
gpt_neo.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
gpt2.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
gptj.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
herbert.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
hubert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
ibert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
imagegpt.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
layoutlm.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
layoutlmv2.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
layoutxlm.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
led.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
longformer.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
luke.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
lxmert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
m2m_100.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
marian.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
maskformer.mdx Maskformer (#15682) 2022-03-02 15:48:20 +01:00
mbart.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
megatron_gpt2.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
megatron-bert.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
mluke.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
mobilebert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
mpnet.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
mt5.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
nystromformer.mdx Add Nystromformer (#14659) 2022-01-11 14:25:49 +01:00
openai-gpt.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
pegasus.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
perceiver.mdx Fix Perceiver docs (#14879) 2021-12-22 14:18:03 +01:00
phobert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
plbart.mdx Add PLBart (#13269) 2022-02-18 14:17:09 +01:00
poolformer.mdx Add model specific output classes to PoolFormer model docs (#15746) 2022-02-25 13:43:56 +01:00
prophetnet.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
qdqbert.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
rag.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
realm.mdx Add FastTokenizer to REALM (#15211) 2022-01-19 15:19:36 +01:00
reformer.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
rembert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
retribert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
roberta.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
roformer.mdx Add Flax RoFormer (#15005) 2022-01-04 13:23:10 +01:00
segformer.mdx make table into valid Markdown table syntax (#15337) 2022-01-26 07:10:00 -05:00
sew-d.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
sew.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
speech_to_text_2.mdx Fix typo in speech2text2 doc (#15617) 2022-02-15 13:54:34 +01:00
speech_to_text.mdx Add TFSpeech2Text (#15113) 2022-02-08 16:27:23 +00:00
speech-encoder-decoder.mdx Flax Speech-Encoder-Decoder Model (#15613) 2022-02-28 12:22:36 +01:00
splinter.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
squeezebert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
swin.mdx Add SimMIM (#15586) 2022-02-17 19:44:55 +01:00
t5.mdx [t5/t0/mt5 models] faster/leaner custom layer norm (#14656) 2022-02-15 16:49:57 -08:00
t5v1.1.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
tapas.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
transfo-xl.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
trocr.mdx Fix doc examples (#15257) 2022-01-20 21:51:51 +01:00
unispeech-sat.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
unispeech.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
vilt.mdx Add link to notebooks (#15791) 2022-03-01 17:44:20 +01:00
vision-encoder-decoder.mdx Make Swin work with VisionEncoderDecoderModel (#15527) 2022-02-14 17:33:35 +01:00
vision-text-dual-encoder.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
visual_bert.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
vit_mae.mdx Add link (#15588) 2022-02-09 23:33:39 +01:00
vit.mdx Add SimMIM (#15586) 2022-02-17 19:44:55 +01:00
wav2vec2_phoneme.mdx Wav2Vec2 meets phonemes (#14353) 2021-12-17 19:56:44 +01:00
wav2vec2.mdx Time stamps for CTC models (#15687) 2022-02-22 19:26:44 +01:00
wavlm.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xglm.mdx Add XGLM models (#14876) 2022-01-28 18:55:23 +01:00
xlm-prophetnet.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
xlm-roberta-xl.mdx Add support for XLM-R XL and XXL models by modeling_xlm_roberta_xl.py (#13727) 2022-01-29 13:42:37 +01:00
xlm-roberta.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
xlm.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xlnet.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xls_r.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xlsr_wav2vec2.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
yoso.mdx Add YOSO (#15091) 2022-01-26 19:18:29 +01:00