transformers/docs/source/en/tasks
Shane A e4ea19b958
Add OLMo model family (#29890)
* Add OLMo using add-new-model-like with Llama

* Fix incorrect tokenizer for OLMo

* Copy-paste relevant OLMo methods and their imports

* Add OLMo config

* Modify OLMo config to follow HF conventions

* Remove unneeded Llama code from OLMo model

* Add ability for OLMo model to output attentions

* Add OLMoPreTrainedModel and OLMoModel

* Add OLMoForCausalLM

* Minor fixes to OLMo model for style and missing functions

* Implement OLMo tokenizer

* Implement OLMo to HF conversion script

* Add tests for OLMo model

* Add tests for OLMo fast tokenizer

* Add auto-generated dummy objects

* Remove unimplemented OLMo classes from auto and init classes and re-format

* Add README and associated auto-generated files

* Use OLMo names for common properties

* Run make fixup

* Remove `|` from OLMo typing

* Remove unneeded tokenization_olmo.py

* Revert model, config and converter to add-new-model-like Llama

* Move logic for adding bos/eos token into GPTNeoXTokenizerFast

* Change OLMoConfig defaults to match OLMo-7B

* Use GPTNeoXTokenizerFast in OLMo tokenizer tests

* Modify auto-generated OLMoModelTests to work for OLMo

* Add non-parametric layer norm OLMoLayerNorm

* Update weight conversion script for OLMo

* Fix __init__ and auto structure for OLMo

* Fix errors from make fixup

* Remove OLMoTokenizerFast from documentation

* Add missing 'Copied from' for OLMoModel._update_causal_mask

* Run make fix-copies

* Rearrange string replacements in OLMoForCausalLM Copied from

* Move OLMo and Llama CausalLM.forward example into global constants

* Fix OLMO_GENERATION_EXAMPLE doc string typo

* Add option for qkv clipping to OLMo

* Rearrange OLMoConfig kwargs in convert_olmo_weights_to_hf

* Add clip_qkv to OLMoConfig in convert_olmo_weights_to_hf

* Fix OLMo tokenization bug using conversion script

* Keep model in full precision after conversion

* Do not add eos token automatically

* Update references to OLMo model in HF Hub

* Do not add eos token during encoding by default

* Fix Llama generation example

* Run make fixup

* OLMo 7B integration test fix

* Remove unneeded special case for OLMoConfig

* OLMo 7B Twin 2T integration test fix

* Fix test_model_7b_greedy_generation

* Remove test_compile_static_cache

* Fix OLMo and Llama generation example

* Run make fixup

* Revert "OLMo 7B integration test fix"

This reverts commit 4df56a4b15.

* Revert "OLMo 7B Twin 2T integration test fix"

This reverts commit 9ff65a4a29.

* Ungate 7B integration tests and fix greedy generation test

* Add retries for flaky test_eager_matches_sdpa_generate

* Fix output of doc example for OLMoForCausalLM.forward

* Downsize OLMo doc test for OLMoForCausalLM.forward to 1B model

* Try fix incorrect characters in OLMoForCausalLM.forward doc test

* Try fix incorrect characters in OLMoForCausalLM.forward doc test using end quotes

* Remove pretraining_tp from OLMo config and model

* Add missing 'Copied from' instances

* Remove unneeded causal_mask from OLMoModel

* Revert Llama changes

* Ignore copy for OLMoForCausalLM.forward

* Change 'OLMo' to 'Olmo' in classes

* Move minimal OLMo tokenization tests to model tests

* Add missed 'Copied from' for repeat_kv
2024-04-17 17:59:07 +02:00
asr.md Add new meta w2v2-conformer BERT-like model (#28165) 2024-01-18 13:37:34 +00:00
audio_classification.md Add new meta w2v2-conformer BERT-like model (#28165) 2024-01-18 13:37:34 +00:00
document_question_answering.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
idefics.md [Docs] Fix broken links and syntax issues (#28918) 2024-02-08 14:13:35 -08:00
image_captioning.md [Docs] Fix backticks in inline code and documentation links (#28875) 2024-02-06 11:15:44 -08:00
image_classification.md [Trainer] Undo #29896 (#30129) 2024-04-09 12:55:42 +02:00
image_feature_extraction.md Fix header in IFE task guide (#29859) 2024-03-26 12:32:37 +01:00
image_to_image.md Image-to-Image Task Guide (#26595) 2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md fixed typos (issue 27919) (#27920) 2023-12-11 18:44:23 -05:00
language_modeling.md Add OLMo model family (#29890) 2024-04-17 17:59:07 +02:00
mask_generation.md Mask Generation Task Guide (#28897) 2024-02-14 18:29:49 +00:00
masked_language_modeling.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
monocular_depth_estimation.md Add Depth Anything (#28654) 2024-01-25 09:34:50 +01:00
multiple_choice.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
object_detection.md [Trainer] Undo #29896 (#30129) 2024-04-09 12:55:42 +02:00
prompting.md Fix doctest more (for docs/source/en) (#30247) 2024-04-15 14:10:59 +02:00
question_answering.md fix the post-processing link (#29091) 2024-02-19 10:15:58 +00:00
semantic_segmentation.md [docs] Fix image segmentation guide (#30132) 2024-04-09 09:08:37 -07:00
sequence_classification.md Add Qwen2MoE (#29377) 2024-03-27 02:11:55 +01:00
summarization.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
text-to-speech.md Add FastSpeech2Conformer (#23439) 2024-01-03 18:01:06 +00:00
token_classification.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
translation.md Configuring Translation Pipelines documents update #27753 (#29986) 2024-04-17 11:27:49 +02:00
video_classification.md [Trainer] Undo #29896 (#30129) 2024-04-09 12:55:42 +02:00
visual_question_answering.md VQA task guide (#25244) 2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md [docs] Fix model reference in zero shot image classification example (#26206) 2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md [Docs] Update README and default pipelines (#28864) 2024-02-12 10:21:36 +01:00