Steven Liu
|
a52478253b
|
[docs] Tensor parallelism (#38241)
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* updates
* feedback
* badges
* fix?
* fix?
* fix?
* fix?
|
2025-06-26 14:40:45 -07:00 |
|
Matej Sirovatka
|
7a611f0afd
|
Fix: make docs work better with doc builder (#38213)
|
2025-05-20 08:23:03 +00:00 |
|
Matej Sirovatka
|
6e738411e1
|
Tensor parallel docs (#38178)
* Feat: initial docs
* Feat: update doc
* Final typos/changes
* Refactor: reorder top to bottom.
|
2025-05-19 17:05:01 +00:00 |
|
jiqing-feng
|
286393fbb1
|
enable tp on CPU (#36299)
* enable tp on CPU
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* get rank from cpu
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable TP tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix comment
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* em print
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix model id
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix conflict
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix index and add doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
|
2025-03-31 10:55:47 +02:00 |
|
Steven Liu
|
c0f8d055ce
|
[docs] Redesign (#31757)
* toctree
* not-doctested.txt
* collapse sections
* feedback
* update
* rewrite get started sections
* fixes
* fix
* loading models
* fix
* customize models
* share
* fix link
* contribute part 1
* contribute pt 2
* fix toctree
* tokenization pt 1
* Add new model (#32615)
* v1 - working version
* fix
* fix
* fix
* fix
* rename to correct name
* fix title
* fixup
* rename files
* fix
* add copied from on tests
* rename to `FalconMamba` everywhere and fix bugs
* fix quantization + accelerate
* fix copies
* add `torch.compile` support
* fix tests
* fix tests and add slow tests
* copies on config
* merge the latest changes
* fix tests
* add few lines about instruct
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* fix tests
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* "to be not" -> "not to be" (#32636)
* "to be not" -> "not to be"
* Update sam.md
* Update trainer.py
* Update modeling_utils.py
* Update test_modeling_utils.py
* Update test_modeling_utils.py
* fix hfoption tag
* tokenization pt. 2
* image processor
* fix toctree
* backbones
* feature extractor
* fix file name
* processor
* update not-doctested
* update
* make style
* fix toctree
* revision
* make fixup
* fix toctree
* fix
* make style
* fix hfoption tag
* pipeline
* pipeline gradio
* pipeline web server
* add pipeline
* fix toctree
* not-doctested
* prompting
* llm optims
* fix toctree
* fixes
* cache
* text generation
* fix
* chat pipeline
* chat stuff
* xla
* torch.compile
* cpu inference
* toctree
* gpu inference
* agents and tools
* gguf/tiktoken
* finetune
* toctree
* trainer
* trainer pt 2
* optims
* optimizers
* accelerate
* parallelism
* fsdp
* update
* distributed cpu
* hardware training
* gpu training
* gpu training 2
* peft
* distrib debug
* deepspeed 1
* deepspeed 2
* chat toctree
* quant pt 1
* quant pt 2
* fix toctree
* fix
* fix
* quant pt 3
* quant pt 4
* serialization
* torchscript
* scripts
* tpu
* review
* model addition timeline
* modular
* more reviews
* reviews
* fix toctree
* reviews reviews
* continue reviews
* more reviews
* modular transformers
* more review
* zamba2
* fix
* all frameworks
* pytorch
* supported model frameworks
* flashattention
* rm check_table
* not-doctested.txt
* rm check_support_list.py
* feedback
* updates/feedback
* review
* feedback
* fix
* update
* feedback
* updates
* update
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
|
2025-03-03 10:33:46 -08:00 |
|
Nicolas Patry
|
b4965cecc5
|
Fixing the docs corresponding to the breaking change in torch 2.6. (#36420)
|
2025-02-26 14:11:52 +01:00 |
|
Ke Wen
|
f869d486d3
|
Update doc re list of models supporting TP (#35864)
Update doc about models' TP support
|
2025-02-12 15:53:27 +01:00 |
|
Jacky Lee
|
927c3e39ec
|
Fix image preview in multi-GPU inference docs (#35303)
fix: link for img
|
2024-12-17 09:33:50 -08:00 |
|
Ke Wen
|
20142ab542
|
Simplify Tensor Parallel implementation with PyTorch TP (#34184)
* Simplify Tensor Parallel implementation with PyTorch TP
* Move tp_plan to config
* Lint
* Format and warning
* Disable copy-from check
* Conditionally get attr from config
* make fix-copies
* Move base_model_tp_plan to PretrainedConfig
* Move TP into from_pretrained
* Add device context for load
* Do not serialize
* Move _tp_plan setting to post_init
* Add has_tp_plan
* Add test_tp
* Add 'Multi-gpu inference' doc
* Add backward support for device type identification
* Auto-detect accelerator
* supports_tp_plan
* copyright year
* Fix copy
|
2024-11-18 19:51:49 +01:00 |
|