transformers/docs/source/en/tasks
Sylvain Gugger b4d4d6fe87
Add RWKV-4 (#22797)
* First draft of RWKV-4

* Add support for generate

* Style post-rebase

* Properly use state

* Write doc

* Fix doc

* More math

* Add model to README, dummies and clean config

* Fix init

* multiple fixes:

- fix common tests
- fix configuration default values
- add CI test for checking state computation
- fix some CI tests

* correct tokenizer

* some tweaks

- fix config docstring
- fix failing tests

* fix CI tests

- add output_attention / output_hidden_states
- override test_initialization
- fix failing CIs

* fix conversion script

- fix sharded case
- add new arguments

* add slow tests + more fixes on conversion script

* add another test

* final fixes

* change single name variable

* add mock attention mask for pipeline to work

* correct eos token id

* fix nits

* add checkpoints

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add `tie_word_embeddings` in docstring

* change tensor name

* fix final nits

* Trigger CI

---------

Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-05-09 13:04:10 -04:00
asr.mdx Added "Open in Colab" to task guides (#21729) 2023-02-22 08:32:35 -05:00
audio_classification.mdx [Whisper] Add model for audio classification (#21754) 2023-03-07 16:20:21 +01:00
document_question_answering.mdx Add: document question answering task guide (#21518) 2023-02-13 09:24:56 -05:00
image_captioning.mdx [Tasks] Adds image captioning (#21512) 2023-02-10 22:52:12 +05:30
image_classification.mdx Update feature selection in to_tf_dataset (#21935) 2023-04-24 17:34:30 +01:00
language_modeling.mdx Add RWKV-4 (#22797) 2023-05-09 13:04:10 -04:00
masked_language_modeling.mdx Add Mega: Moving Average Equipped Gated Attention (#21766) 2023-03-24 08:17:27 -04:00
monocular_depth_estimation.mdx Depth estimation task guide (#22205) 2023-03-17 08:36:23 -04:00
multiple_choice.mdx Add Mega: Moving Average Equipped Gated Attention (#21766) 2023-03-24 08:17:27 -04:00
object_detection.mdx Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
question_answering.mdx GPTNeoXForQuestionAnswering (#23059) 2023-05-04 10:15:15 -04:00
semantic_segmentation.mdx Fix doc links (#22274) 2023-03-20 17:07:31 +00:00
sequence_classification.mdx Add BioGPTForSequenceClassification (#22253) 2023-05-01 09:17:27 -04:00
summarization.mdx [WIP]NLLB-MoE Adds the moe model (#22024) 2023-03-27 19:42:00 +02:00
text-to-speech.mdx [docs] Text to speech task guide (#23107) 2023-05-04 13:17:13 -04:00
token_classification.mdx added GPTNeoForTokenClassification (#22908) 2023-04-27 12:10:03 -04:00
translation.mdx [WIP]NLLB-MoE Adds the moe model (#22024) 2023-03-27 19:42:00 +02:00
video_classification.mdx Automated compatible models list for task guides (#21338) 2023-01-27 13:19:28 -05:00
zero_shot_image_classification.mdx Zero-shot image classification task guide (#22132) 2023-03-13 10:57:17 -04:00
zero_shot_object_detection.mdx Add: task guide for zero shot object detection (#21829) 2023-02-28 10:23:08 -05:00