transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-13 09:40:06 +06:00

History

Sylvain Gugger b4d4d6fe87 Add RWKV-4 (#22797 ) * First draft of RWKV-4 * Add support for generate * Style post-rebase * Properly use state * Write doc * Fix doc * More math * Add model to README, dummies and clean config * Fix init * multiple fixes: - fix common tests - fix configuraion default values - add CI test for checking state computation - fix some CI tests * correct tokenizer * some tweaks - fix config docstring - fix failing tests * fix CI tests - add output_attention / output_hidden_states - override test_initialization - fix failing CIs * fix conversion script - fix sharded case - add new arguments * add slow tests + more fixes on conversion script * add another test * final fixes * change single name variable * add mock attention mask for pipeline to work * correct eos token id * fix nits * add checkpoints * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add `tie_word_embeddings` in docstring * change tensor name * fix final nits * Trigger CI --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>		2023-05-09 13:04:10 -04:00
..
asr.mdx	Added "Open in Colab" to task guides (#21729 )	2023-02-22 08:32:35 -05:00
audio_classification.mdx	[Whisper] Add model for audio classification (#21754 )	2023-03-07 16:20:21 +01:00
document_question_answering.mdx	Add: document question answering task guide (#21518 )	2023-02-13 09:24:56 -05:00
image_captioning.mdx	[Tasks] Adds image captioning (#21512 )	2023-02-10 22:52:12 +05:30
image_classification.mdx	Update feature selection in to_tf_dataset (#21935 )	2023-04-24 17:34:30 +01:00
language_modeling.mdx	Add RWKV-4 (#22797 )	2023-05-09 13:04:10 -04:00
masked_language_modeling.mdx	Add Mega: Moving Average Equipped Gated Attention (#21766 )	2023-03-24 08:17:27 -04:00
monocular_depth_estimation.mdx	Depth estimation task guide (#22205 )	2023-03-17 08:36:23 -04:00
multiple_choice.mdx	Add Mega: Moving Average Equipped Gated Attention (#21766 )	2023-03-24 08:17:27 -04:00
object_detection.mdx	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
question_answering.mdx	GPTNeoXForQuestionAnswering (#23059 )	2023-05-04 10:15:15 -04:00
semantic_segmentation.mdx	Fix doc links (#22274 )	2023-03-20 17:07:31 +00:00
sequence_classification.mdx	Add `BioGPTForSequenceClassification` (#22253 )	2023-05-01 09:17:27 -04:00
summarization.mdx	[WIP]`NLLB-MoE` Adds the moe model (#22024 )	2023-03-27 19:42:00 +02:00
text-to-speech.mdx	[docs] Text to speech task guide (#23107 )	2023-05-04 13:17:13 -04:00
token_classification.mdx	added GPTNeoForTokenClassification (#22908 )	2023-04-27 12:10:03 -04:00
translation.mdx	[WIP]`NLLB-MoE` Adds the moe model (#22024 )	2023-03-27 19:42:00 +02:00
video_classification.mdx	Automated compatible models list for task guides (#21338 )	2023-01-27 13:19:28 -05:00
zero_shot_image_classification.mdx	Zero-shot image classification task guide (#22132 )	2023-03-13 10:57:17 -04:00
zero_shot_object_detection.mdx	Add: task guide for zero shot object detection (#21829 )	2023-02-28 10:23:08 -05:00