mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-13 17:48:22 +06:00
![]() * Add model with cli tool * Remove unwanted stuff * Add new code * Remove inference runner * Style * Fix checks * Test updates * make fixup * fix docs * fix doc * fix test * hopefully fix pipeline tests * refactor * fix CIs * add comment * rename to `GPTBigCodeForCausalLM` * correct readme * make fixup + docs * make fixup * fixes * fixes * Remove pruning * Remove import * Doc updates * More pruning removal * Combine copies * Single MQA implementation, remove kv cache pre-allocation and padding * Update doc * Revert refactor to match gpt2 style * Merge back key and value caches, fix some type hints * Update doc * Fix position ids pith padding (PR 21080) * Add conversion script temporarily * Update conversion script * Remove checkpoint conversion * New model * Fix MQA test * Fix copies * try fix tests * FIX TEST!! * remove `DoubleHeadsModel` * add MQA tests * add slow tests * clean up * add CPU checker * final fixes * fixes - fix GPU issue - fixed slow tests - skip disk offload * fix final issue * Simplify and comment baddbmm fix * Remove unnecessary code * Transpose tweaks * Use beta=1 on cpu, improve tests --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> |
||
---|---|---|
.. | ||
asr.mdx | ||
audio_classification.mdx | ||
document_question_answering.mdx | ||
image_captioning.mdx | ||
image_classification.mdx | ||
language_modeling.mdx | ||
masked_language_modeling.mdx | ||
monocular_depth_estimation.mdx | ||
multiple_choice.mdx | ||
object_detection.mdx | ||
question_answering.mdx | ||
semantic_segmentation.mdx | ||
sequence_classification.mdx | ||
summarization.mdx | ||
token_classification.mdx | ||
translation.mdx | ||
video_classification.mdx | ||
zero_shot_image_classification.mdx | ||
zero_shot_object_detection.mdx |