transformers/docs/source/model_doc
Edward Beeching aff9bc405a
Decision transformer gym (#15845)
* Created the Decision Transformer Modle

* updating tests, copy to other machine

* Added last hidden size to Decision Transformer modelling outputs

* Removed copy of original DT file

* made a temporary change to gpt2 to have it conform with the Decision Transformer version

* Updated tests

* Ignoring a file used to test the DT model

* added comments to config file

* added comments and argument descriptions to decision transformer file

* Updated doc

* Ran "make style"

* Remove old model imports

* Removed unused imports, cleaned up init file

* Update docs/source/model_doc/decision_transformer.mdx

added my username

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Reverted changes made to gpt2

* Removed datasets submodule

* Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states

* Added support for return of hidden states, attentions and return dict of gpt2 model.

* Updated tests to include many of the ModelTesterMixin tests. 

The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes

* Added missing line to the end of gpt2 file

* Added an integration test for the Decision Transformer

Test performs and autoregressive evaluation for two time steps

* Set done and info to _ to fix failing test

* Updated integration test to be deterministic and check expected outputs

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unnecessary config options

* Cleaned up commented code and old comments.

* Cleaned up commented code.

* Changed DecisionTransformer to Decision Transformer

* Added Decision Transformer to the main README file

* Added copy of GTP2 called DecisionTranformerGPT2Model

* isorted imports

* isorted imports

* Added model to non-English README files

* Ran make fix-copies and corrected some cases.

* Updated index file to include Decision Transformer

* Added gpt2 model as copy inside the Decision Transformer model file

* Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS

* Deleted redundant checkpoint files (I don't know how these got committed)

* Removed testing files. (These should have never been committed)

* Removed accidentally committed files

* Moved the Decision Transformer test to its own directory

* Add type hints for Pegasus (#16324)

* Funnel type hints (#16323)

* add pt funnel type hints

* add tf funnel type hints

* Add type hints for ProphetNet PyTorch (#16272)

* [GLPN] Improve docs (#16331)

* Add link to notebook

* Add link

* Fix bug

Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

* Added type hints for Pytorch Marian calls (#16200)

* Added type hinting for forward functions in pytorch marian

* typo correction

* Removed type hints on functions from BART per Suraj Patil request

* fix import pb

* fix typo

* corrected tuple call

* ran black

* after fix-copies
Some optional tags on primitives were removed, past_key_values in MarianForCausalLM changed from Tuple of Tuple to List

* Fixing copies to roformer and pegasus

Co-authored-by: Clementine Fourrier <cfourrie@inria.fr>
Co-authored-by: matt <rocketknight1@gmail.com>

* Moved DecisionTransformOutput to modeling_decision_transformer

* Moved the example usage to research project and cleaned comments

* Made tests ignore the copy of gpt2 in Decision Transformer

* Added module output to modelling decision transformer

* removed copied gpt2 model from list of transformers models

* Updated tests and created __init__ file for new test location

* Update README.md

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unneeded summary type from config file

* Fixed copies

* Updated pretrained config map to refer to hopper-medium checkpoint

* done (#16340)

* Added Decision transformer to model docs

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add type annotations for Rembert/Splinter and copies (#16338)

* undo black autoformat

* minor fix to rembert forward with default

* make fix-copies, make quality

* Adding types to template model

* Removing List from the template types

* Remove `Optional` from a couple of types that don't accept `None`

Co-authored-by: matt <rocketknight1@gmail.com>

* [Bug template] Shift responsibilities for long-range (#16344)

* Fix code repetition in serialization guide (#16346)

* Adopt framework-specific blocks for content (#16342)

*  refactor code samples with framework-specific blocks

*  update training.mdx

* 🖍 apply feedback

* Updates the default branch from master to main (#16326)

* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Updated model with custom docstring example

* Created the Decision Transformer Modle

* updating tests, copy to other machine

* Added last hidden size to Decision Transformer modelling outputs

* Removed copy of original DT file

* made a temporary change to gpt2 to have it conform with the Decision Transformer version

* Updated tests

* Ignoring a file used to test the DT model

* added comments to config file

* added comments and argument descriptions to decision transformer file

* Updated doc

* Ran "make style"

* Remove old model imports

* Removed unused imports, cleaned up init file

* Update docs/source/model_doc/decision_transformer.mdx

added my username

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Reverted changes made to gpt2

* Removed datasets submodule

* Update the modeling outputs to include gpt2 attentions, hidden states and last hidden states

* Added support for return of hidden states, attentions and return dict of gpt2 model.

* Updated tests to include many of the ModelTesterMixin tests. 

The following tests are skipped: test_generate_without_input_ids, test_pruning, test_resize_embeddings, test_head_masking, test_attention_outputs, test_hidden_states_output, test_inputs_embeds, test_model_common_attributes

* Added missing line to the end of gpt2 file

* Added an integration test for the Decision Transformer

Test performs and autoregressive evaluation for two time steps

* Set done and info to _ to fix failing test

* Updated integration test to be deterministic and check expected outputs

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unnecessary config options

* Cleaned up commented code and old comments.

* Cleaned up commented code.

* Changed DecisionTransformer to Decision Transformer

* Added Decision Transformer to the main README file

* Added copy of GTP2 called DecisionTranformerGPT2Model

* isorted imports

* isorted imports

* Added model to non-English README files

* Ran make fix-copies and corrected some cases.

* Updated index file to include Decision Transformer

* Added gpt2 model as copy inside the Decision Transformer model file

* Added the unit test file to the list of TEST_FILES_WITH_NO_COMMON_TESTS

* Deleted redundant checkpoint files (I don't know how these got committed)

* Removed testing files. (These should have never been committed)

* Removed accidentally committed files

* Moved the Decision Transformer test to its own directory

* Moved DecisionTransformOutput to modeling_decision_transformer

* Moved the example usage to research project and cleaned comments

* Made tests ignore the copy of gpt2 in Decision Transformer

* Added module output to modelling decision transformer

* removed copied gpt2 model from list of transformers models

* Updated tests and created __init__ file for new test location

* Update README.md

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Removed unneeded summary type from config file

* Fixed copies

* Updated pretrained config map to refer to hopper-medium checkpoint

* Added Decision transformer to model docs

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/modeling_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/decision_transformer/configuration_decision_transformer.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Updated model with custom docstring example

* Updated copies, config auto, and readme files.

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Dan Tegzes <48134725+Tegzes@users.noreply.github.com>
Co-authored-by: Adam Montgomerie <adam@avanssion.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Clémentine Fourrier <22726840+clefourrier@users.noreply.github.com>
Co-authored-by: Clementine Fourrier <cfourrie@inria.fr>
Co-authored-by: matt <rocketknight1@gmail.com>
Co-authored-by: Francesco Saverio Zuppichini <francesco.zuppichini@gmail.com>
Co-authored-by: Jacob Dineen <54680234+jacobdineen@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2022-03-23 16:18:43 -04:00
..
albert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
auto.mdx Making MaskFormerForInstanceSegmentation. (#15934) 2022-03-04 13:56:15 +01:00
bart.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
barthez.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
bartpho.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
beit.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
bert-generation.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bert-japanese.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bert.mdx Convert rst to mdx bert (#14806) 2021-12-17 11:13:34 -05:00
bertweet.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
big_bird.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bigbird_pegasus.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
blenderbot-small.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
blenderbot.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
bort.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
byt5.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
camembert.mdx Add TFCamembertForCausalLM and ONNX integration test (#16073) 2022-03-14 08:40:42 +01:00
canine.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
clip.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
convbert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
convnext.mdx Add TFConvNextModel (#15750) 2022-02-25 18:19:16 +01:00
cpm.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
ctrl.mdx 🖍 remove broken link (#15615) 2022-02-11 12:33:55 -06:00
data2vec.mdx Add Data2Vec (#15507) 2022-03-01 11:09:20 +01:00
deberta-v2.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
deberta.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
decision_transformer.mdx Decision transformer gym (#15845) 2022-03-23 16:18:43 -04:00
deit.mdx Add SimMIM (#15586) 2022-02-17 19:44:55 +01:00
detr.mdx Doc new front (#14590) 2021-12-01 14:13:02 -05:00
dialogpt.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
distilbert.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
dit.mdx Add Document Image Transformer (DiT) (#15984) 2022-03-10 11:34:44 +01:00
dpr.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
electra.mdx Add ElectraForCausalLM -> Enable Electra encoder-decoder model (#14729) 2021-12-27 12:37:52 +01:00
encoder-decoder.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
flaubert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
fnet.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
fsmt.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
funnel.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
glpn.mdx [GLPN] Improve docs (#16331) 2022-03-22 15:45:29 +01:00
gpt_neo.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
gpt2.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
gptj.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
herbert.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
hubert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
ibert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
imagegpt.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
layoutlm.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
layoutlmv2.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
layoutxlm.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
led.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
longformer.mdx Remove disclaimer from Longformer docs (#16296) 2022-03-21 10:05:47 -04:00
luke.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
lxmert.mdx Convert model files from rst to mdx (#14865) 2021-12-22 03:27:30 -05:00
m2m_100.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
marian.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
maskformer.mdx Maskformer (#15682) 2022-03-02 15:48:20 +01:00
mbart.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
megatron_gpt2.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
megatron-bert.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
mluke.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
mobilebert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
mpnet.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
mt5.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
nystromformer.mdx Add Nystromformer (#14659) 2022-01-11 14:25:49 +01:00
openai-gpt.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
pegasus.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
perceiver.mdx Fix Perceiver docs (#14879) 2021-12-22 14:18:03 +01:00
phobert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
plbart.mdx Add PLBart (#13269) 2022-02-18 14:17:09 +01:00
poolformer.mdx Add model specific output classes to PoolFormer model docs (#15746) 2022-02-25 13:43:56 +01:00
prophetnet.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
qdqbert.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
rag.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
realm.mdx Fix and improve REALM fine-tuning (#15297) 2022-03-03 14:10:15 +01:00
reformer.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
rembert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
resnet.mdx [WIP] Resnet (#15770) 2022-03-14 19:57:55 +01:00
retribert.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
roberta.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
roformer.mdx Add Flax RoFormer (#15005) 2022-01-04 13:23:10 +01:00
segformer.mdx [SegFormer] Remove unused attributes (#16285) 2022-03-21 17:34:10 +01:00
sew-d.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
sew.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
speech_to_text_2.mdx Fix typo in speech2text2 doc (#15617) 2022-02-15 13:54:34 +01:00
speech_to_text.mdx Add TFSpeech2Text (#15113) 2022-02-08 16:27:23 +00:00
speech-encoder-decoder.mdx Flax Speech-Encoder-Decoder Model (#15613) 2022-02-28 12:22:36 +01:00
splinter.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
squeezebert.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
swin.mdx Swin support for any input size (#15986) 2022-03-16 18:38:25 +01:00
t5.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
t5v1.1.mdx Doc styler examples (#14953) 2021-12-27 19:07:46 -05:00
tapas.mdx Framework split (#16030) 2022-03-15 10:13:34 -04:00
transfo-xl.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
trocr.mdx Fix doc examples (#15257) 2022-01-20 21:51:51 +01:00
unispeech-sat.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
unispeech.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
van.mdx Visual Attention Network (VAN) (#16027) 2022-03-15 08:47:12 +01:00
vilt.mdx Add link to notebooks (#15791) 2022-03-01 17:44:20 +01:00
vision-encoder-decoder.mdx Make Swin work with VisionEncoderDecoderModel (#15527) 2022-02-14 17:33:35 +01:00
vision-text-dual-encoder.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
visual_bert.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
vit_mae.mdx Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
vit.mdx Add SimMIM (#15586) 2022-02-17 19:44:55 +01:00
wav2vec2_phoneme.mdx Wav2Vec2 meets phonemes (#14353) 2021-12-17 19:56:44 +01:00
wav2vec2.mdx Time stamps for CTC models (#15687) 2022-02-22 19:26:44 +01:00
wavlm.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xglm.mdx Add XGLM models (#14876) 2022-01-28 18:55:23 +01:00
xlm-prophetnet.mdx Map model_type and doc pages names (#14944) 2022-01-03 05:08:55 -05:00
xlm-roberta-xl.mdx Add support for XLM-R XL and XXL models by modeling_xlm_roberta_xl.py (#13727) 2022-01-29 13:42:37 +01:00
xlm-roberta.mdx Add missing support for Flax XLM-RoBERTa (#15900) 2022-03-04 14:36:28 +01:00
xlm.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xlnet.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xls_r.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
xlsr_wav2vec2.mdx Convert rst files (#14888) 2021-12-22 16:14:35 -05:00
yoso.mdx Add YOSO (#15091) 2022-01-26 19:18:29 +01:00