Quentin Gallouédec
|
de24fb63ed
|
Use HF papers (#38184)
* Use hf papers
* Hugging Face papers
* doi to hf papers
* style
|
2025-06-13 11:07:09 +00:00 |
|
Steven Liu
|
c0f8d055ce
|
[docs] Redesign (#31757)
* toctree
* not-doctested.txt
* collapse sections
* feedback
* update
* rewrite get started sections
* fixes
* fix
* loading models
* fix
* customize models
* share
* fix link
* contribute part 1
* contribute pt 2
* fix toctree
* tokenization pt 1
* Add new model (#32615)
* v1 - working version
* fix
* fix
* fix
* fix
* rename to correct name
* fix title
* fixup
* rename files
* fix
* add copied from on tests
* rename to `FalconMamba` everywhere and fix bugs
* fix quantization + accelerate
* fix copies
* add `torch.compile` support
* fix tests
* fix tests and add slow tests
* copies on config
* merge the latest changes
* fix tests
* add few lines about instruct
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* fix tests
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* "to be not" -> "not to be" (#32636)
* "to be not" -> "not to be"
* Update sam.md
* Update trainer.py
* Update modeling_utils.py
* Update test_modeling_utils.py
* Update test_modeling_utils.py
* fix hfoption tag
* tokenization pt. 2
* image processor
* fix toctree
* backbones
* feature extractor
* fix file name
* processor
* update not-doctested
* update
* make style
* fix toctree
* revision
* make fixup
* fix toctree
* fix
* make style
* fix hfoption tag
* pipeline
* pipeline gradio
* pipeline web server
* add pipeline
* fix toctree
* not-doctested
* prompting
* llm optims
* fix toctree
* fixes
* cache
* text generation
* fix
* chat pipeline
* chat stuff
* xla
* torch.compile
* cpu inference
* toctree
* gpu inference
* agents and tools
* gguf/tiktoken
* finetune
* toctree
* trainer
* trainer pt 2
* optims
* optimizers
* accelerate
* parallelism
* fsdp
* update
* distributed cpu
* hardware training
* gpu training
* gpu training 2
* peft
* distrib debug
* deepspeed 1
* deepspeed 2
* chat toctree
* quant pt 1
* quant pt 2
* fix toctree
* fix
* fix
* quant pt 3
* quant pt 4
* serialization
* torchscript
* scripts
* tpu
* review
* model addition timeline
* modular
* more reviews
* reviews
* fix toctree
* reviews reviews
* continue reviews
* more reviews
* modular transformers
* more review
* zamba2
* fix
* all frameworks
* pytorch
* supported model frameworks
* flashattention
* rm check_table
* not-doctested.txt
* rm check_support_list.py
* feedback
* updates/feedback
* review
* feedback
* fix
* update
* feedback
* updates
* update
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
|
2025-03-03 10:33:46 -08:00 |
|
NielsRogge
|
8211c59b9a
|
[KOSMOS-2] Update docs (#27157)
Update docs
|
2023-10-30 21:42:19 +01:00 |
|
Yih-Dar
|
691fd8fdde
|
Add Kosmos-2 model (#24709)
* Add KOSMOS-2 model
* update
* update
* update
* address review comment - 001
* address review comment - 002
* address review comment - 003
* style
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* fix
* address review comment - 004
* address review comment - 005
* address review comment - 006
* address review comment - 007
* address review comment - 008
* address review comment - 009
* address review comment - 010
* address review comment - 011
* update readme
* fix
* fix
* fix
* [skip ci] fix
* revert the change in _decode
* fix docstring
* fix docstring
* Update docs/source/en/model_doc/kosmos-2.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* no more Kosmos2Tokenizer
* style
* remove "returned when being computed by the model"
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* UTM5 Atten
* fix attn mask
* use present_key_value_states instead of next_decoder_cache
* style
* conversion scripts
* conversion scripts
* conversion scripts
* Add _reorder_cache
* fix doctest and copies
* rename 1
* rename 2
* rename 3
* make fixup
* fix table
* fix docstring
* rename 4
* change repo_id
* remove tip
* update md file
* make style
* update md file
* put docs/source/en/model_doc/kosmos-2.md to slow
* update conversion script
* Use CLIPImageProcessor in Kosmos2Processor
* Remove Kosmos2ImageProcessor
* Remove to_dict in Kosmos2Config
* Remove files
* fix import
* Update conversion
* normalized=False
* Not using hardcoded values like <image>
* elt --> element
* Apply suggestion
* Not using hardcoded values like </image>
* No assert
* No nested functions
* Fix md file
* copy
* update doc
* fix docstring
* fix name
* Remove _add_remove_spaces_around_tag_tokens
* Remove dummy docstring of _preprocess_single_example
* Use `BatchEncoding`
* temp
* temp
* temp
* Update
* Update
* Make Kosmos2ProcessorTest a bit pretty
* Update gradient checkpointing
* Fix gradient checkpointing test
* Remove one liner remove_special_fields
* Simplify conversion script
* fix add_eos_token
* update readme
* update tests
* Change to microsoft/kosmos-2-patch14-224
* style
* Fix doc
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
|
2023-10-30 13:32:17 +01:00 |
|