transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 06:10:04 +06:00


Shane A e4ea19b958 Add OLMo model family (#29890)		2024-04-17 17:59:07 +02:00
* Add OLMo using add-new-model-like with Llama
* Fix incorrect tokenizer for OLMo
* Copy-paste relevant OLMo methods and their imports
* Add OLMo config
* Modify OLMo config to follow HF conventions
* Remove unneeded Llama code from OLMo model
* Add ability for OLMo model to output attentions
* Add OLMoPreTrainedModel and OLMoModel
* Add OLMoForCausalLM
* Minor fixes to OLMo model for style and missing functions
* Implement OLMo tokenizer
* Implement OLMo to HF conversion script
* Add tests for OLMo model
* Add tests for OLMo fast tokenizer
* Add auto-generated dummy objects
* Remove unimplemented OLMo classes from auto and init classes and re-format
* Add README and associated auto-generated files
* Use OLMo names for common properties
* Run make fixup
* Remove `|` from OLMo typing
* Remove unneeded tokenization_olmo.py
* Revert model, config and converter to add-new-model-like Llama
* Move logic for adding bos/eos token into GPTNeoxTokenizerFast
* Change OLMoConfig defaults to match OLMo-7B
* Use GPTNeoXToknizerFast in OLMo tokenizer tests
* Modify auto-generated OLMoModelTests to work for OLMo
* Add non-parametric layer norm OLMoLayerNorm
* Update weight conversion script for OLMo
* Fix __init__ and auto structure for OLMo
* Fix errors from make fixup
* Remove OLMoTokenizerFast from documentation
* Add missing 'Copied from' for OLMoModel._update_causal_mask
* Run make fix-copies
* Rearrange string replacements in OLMoForCausalLM Copied from
* Move OLMo and Llama CausalLM.forward example into global constants
* Fix OLMO_GENERATION_EXAMPLE doc string typo
* Add option for qkv clipping to OLMo
* Rearrange OLMoConfig kwargs in convert_olmo_weights_to_hf
* Add clip_qkv to OLMoConfig in convert_olmo_weights_to_hf
* Fix OLMo tokenization bug using conversion script
* Keep model in full precision after conversion
* Do not add eos token automatically
* Update references to OLMo model in HF Hub
* Do not add eos token during encoding by default
* Fix Llama generation example
* Run make fixup
* OLMo 7B integration test fix
* Remove unneeded special case for OLMoConfig
* OLMo 7B Twin 2T integration test fix
* Fix test_model_7b_greedy_generation
* Remove test_compile_static_cache
* Fix OLMo and Llama generation example
* Run make fixup
* Revert "OLMo 7B integration test fix" This reverts commit `4df56a4b15`.
* Revert "OLMo 7B Twin 2T integration test fix" This reverts commit `9ff65a4a29`.
* Ungate 7B integration tests and fix greedy generation test
* Add retries for flaky test_eager_matches_sdpa_generate
* Fix output of doc example for OLMoForCausalLM.forward
* Downsize OLMo doc test for OLMoForCausalLM.forward to 1B model
* Try fix incorrect characters in OLMoForCausalLM.forward doct test
* Try fix incorrect characters in OLMoForCausalLM.forward doc test using end quotes
* Remove pretraining_tp from OLMo config and model
* Add missing 'Copied from' instances
* Remove unneeded causal_mask from OLMoModel
* Revert Llama changes
* Ignore copy for OLMoForCausalLM.forward
* Change 'OLMo' to 'Olmo' in classes
* Move minimal OLMo tokenization tests to model tests
* Add missed 'Copied from' for repeat_kv
asr.md	Add new meta w2v2-conformer BERT-like model (#28165)	2024-01-18 13:37:34 +00:00
audio_classification.md	Add new meta w2v2-conformer BERT-like model (#28165)	2024-01-18 13:37:34 +00:00
document_question_answering.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
idefics.md	[Docs] Fix broken links and syntax issues (#28918)	2024-02-08 14:13:35 -08:00
image_captioning.md	[Docs] Fix backticks in inline code and documentation links (#28875)	2024-02-06 11:15:44 -08:00
image_classification.md	[Trainer] Undo #29896 (#30129)	2024-04-09 12:55:42 +02:00
image_feature_extraction.md	Fix header in IFE task guide (#29859)	2024-03-26 12:32:37 +01:00
image_to_image.md	Image-to-Image Task Guide (#26595)	2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md	fixed typos (issue 27919) (#27920)	2023-12-11 18:44:23 -05:00
language_modeling.md	Add OLMo model family (#29890)	2024-04-17 17:59:07 +02:00
mask_generation.md	Mask Generation Task Guide (#28897)	2024-02-14 18:29:49 +00:00
masked_language_modeling.md	Update all references to canonical models (#29001)	2024-02-16 08:16:58 +01:00
monocular_depth_estimation.md	Add Depth Anything (#28654)	2024-01-25 09:34:50 +01:00
multiple_choice.md	Update all references to canonical models (#29001)	2024-02-16 08:16:58 +01:00
object_detection.md	[Trainer] Undo #29896 (#30129)	2024-04-09 12:55:42 +02:00
prompting.md	Fix doctest more (for `docs/source/en`) (#30247)	2024-04-15 14:10:59 +02:00
question_answering.md	fix the post-processing link (#29091)	2024-02-19 10:15:58 +00:00
semantic_segmentation.md	[docs] Fix image segmentation guide (#30132)	2024-04-09 09:08:37 -07:00
sequence_classification.md	Add Qwen2MoE (#29377)	2024-03-27 02:11:55 +01:00
summarization.md	Update all references to canonical models (#29001)	2024-02-16 08:16:58 +01:00
text-to-speech.md	Add FastSpeech2Conformer (#23439)	2024-01-03 18:01:06 +00:00
token_classification.md	Update all references to canonical models (#29001)	2024-02-16 08:16:58 +01:00
translation.md	Configuring Translation Pipelines documents update #27753 (#29986)	2024-04-17 11:27:49 +02:00
video_classification.md	[Trainer] Undo #29896 (#30129)	2024-04-09 12:55:42 +02:00
visual_question_answering.md	VQA task guide (#25244)	2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md	[docs] Fix model reference in zero shot image classification example (#26206)	2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md	[Docs] Update README and default pipelines (#28864)	2024-02-12 10:21:36 +01:00