mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-13 09:40:06 +06:00
![]() * First draft of RWKV-4 * Add support for generate * Style post-rebase * Properly use state * Write doc * Fix doc * More math * Add model to README, dummies and clean config * Fix init * multiple fixes: - fix common tests - fix configuraion default values - add CI test for checking state computation - fix some CI tests * correct tokenizer * some tweaks - fix config docstring - fix failing tests * fix CI tests - add output_attention / output_hidden_states - override test_initialization - fix failing CIs * fix conversion script - fix sharded case - add new arguments * add slow tests + more fixes on conversion script * add another test * final fixes * change single name variable * add mock attention mask for pipeline to work * correct eos token id * fix nits * add checkpoints * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add `tie_word_embeddings` in docstring * change tensor name * fix final nits * Trigger CI --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> |
||
---|---|---|
.. | ||
asr.mdx | ||
audio_classification.mdx | ||
document_question_answering.mdx | ||
image_captioning.mdx | ||
image_classification.mdx | ||
language_modeling.mdx | ||
masked_language_modeling.mdx | ||
monocular_depth_estimation.mdx | ||
multiple_choice.mdx | ||
object_detection.mdx | ||
question_answering.mdx | ||
semantic_segmentation.mdx | ||
sequence_classification.mdx | ||
summarization.mdx | ||
text-to-speech.mdx | ||
token_classification.mdx | ||
translation.mdx | ||
video_classification.mdx | ||
zero_shot_image_classification.mdx | ||
zero_shot_object_detection.mdx |