Pavel Iakubovskii
|
66291778dd
|
Refactor Attention implementation for ViT-based models (#36545)
* Refactor vit attention
* Refactor ViT-based models
* 🚨🚨🚨 Fix prefix for DPT
* Update params order
* trigger tests
* Fix Dinov2 attention
* Fix DPT attention impl propagation for backbone config
* Common test fix: config is modified in place - avoid it
* view->reshape
* Fixup
* Fixup
* Enable IJepa FA2
* Add FA2 in corresponding model docs
|
2025-03-20 15:15:01 +00:00 |
|
Steven Liu
|
c0f8d055ce
|
[docs] Redesign (#31757)
* toctree
* not-doctested.txt
* collapse sections
* feedback
* update
* rewrite get started sections
* fixes
* fix
* loading models
* fix
* customize models
* share
* fix link
* contribute part 1
* contribute pt 2
* fix toctree
* tokenization pt 1
* Add new model (#32615)
* v1 - working version
* fix
* fix
* fix
* fix
* rename to correct name
* fix title
* fixup
* rename files
* fix
* add copied from on tests
* rename to `FalconMamba` everywhere and fix bugs
* fix quantization + accelerate
* fix copies
* add `torch.compile` support
* fix tests
* fix tests and add slow tests
* copies on config
* merge the latest changes
* fix tests
* add few lines about instruct
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* fix tests
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* "to be not" -> "not to be" (#32636)
* "to be not" -> "not to be"
* Update sam.md
* Update trainer.py
* Update modeling_utils.py
* Update test_modeling_utils.py
* Update test_modeling_utils.py
* fix hfoption tag
* tokenization pt. 2
* image processor
* fix toctree
* backbones
* feature extractor
* fix file name
* processor
* update not-doctested
* update
* make style
* fix toctree
* revision
* make fixup
* fix toctree
* fix
* make style
* fix hfoption tag
* pipeline
* pipeline gradio
* pipeline web server
* add pipeline
* fix toctree
* not-doctested
* prompting
* llm optims
* fix toctree
* fixes
* cache
* text generation
* fix
* chat pipeline
* chat stuff
* xla
* torch.compile
* cpu inference
* toctree
* gpu inference
* agents and tools
* gguf/tiktoken
* finetune
* toctree
* trainer
* trainer pt 2
* optims
* optimizers
* accelerate
* parallelism
* fsdp
* update
* distributed cpu
* hardware training
* gpu training
* gpu training 2
* peft
* distrib debug
* deepspeed 1
* deepspeed 2
* chat toctree
* quant pt 1
* quant pt 2
* fix toctree
* fix
* fix
* quant pt 3
* quant pt 4
* serialization
* torchscript
* scripts
* tpu
* review
* model addition timeline
* modular
* more reviews
* reviews
* fix toctree
* reviews reviews
* continue reviews
* more reviews
* modular transformers
* more review
* zamba2
* fix
* all frameworks
* pytorch
* supported model frameworks
* flashattention
* rm check_table
* not-doctested.txt
* rm check_support_list.py
* feedback
* updates/feedback
* review
* feedback
* fix
* update
* feedback
* updates
* update
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
|
2025-03-03 10:33:46 -08:00 |
|
NielsRogge
|
9e420e0269
|
[I-JEPA] Update docs (#35148)
Update docs
|
2024-12-09 10:01:31 +01:00 |
|
Pavel Iakubovskii
|
c8c8dffbe4
|
Update I-JEPA checkpoints path (#35120)
Update checkpoints path
|
2024-12-06 13:42:51 +00:00 |
|
João Marcelo
|
50189e36a6
|
Add I-JEPA (#33125)
* first draft
* add IJepaEmbeddings class
* fix copy-from for IJepa model
* add weight conversion script
* update attention class names in IJepa model
* style changes
* Add push_to_hub option to convert_ijepa_checkpoint function
* add initial tests for I-JEPA
* minor style changes to conversion script
* make fixup related
* rename conversion script
* Add I-JEPA to sdpa docs
* minor fixes
* adjust conversion script
* update conversion script
* adjust sdpa docs
* [run_slow] ijepa
* [run-slow] ijepa
* [run-slow] ijepa
* [run-slow] ijepa
* [run-slow] ijepa
* [run-slow] ijepa
* formatting issues
* adjust modeling to modular code
* add IJepaModel to objects to ignore in docstring checks
* [run-slow] ijepa
* fix formatting issues
* add usage instruction snippet to docs
* change pos encoding, add checkpoint for doc
* add verify logits for all models
* [run-slow] ijepa
* update docs to include image feature extraction instructions
* remove pooling layer from IJepaModel in image classification class
* [run-slow] ijepa
* remove pooling layer from IJepaModel constructor
* update docs
* [run-slow] ijepa
* [run-slow] ijepa
* small changes
* [run-slow] ijepa
* style adjustments
* update copyright in init file
* adjust modular ijepa
* [run-slow] ijepa
|
2024-12-05 16:14:46 +01:00 |
|