Jake Tae
|
180c6de6a6
|
docs: fix minor typo (#13289)
`at` should be `a1`
|
2021-08-31 06:49:05 -04:00 |
|
Stas Bekman
|
066fd047cc
|
correct TP implementation resources (#13248)
fix a few implementation links
|
2021-08-31 06:47:23 -04:00 |
|
Stas Bekman
|
27a8c9e4f1
|
[parallelism doc] document Deepspeed-Inference and parallelformers (#12836)
* document Deepspeed-Inference and parallelformers
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-07-21 15:11:02 -07:00 |
|
Stas Bekman
|
68605e9db1
|
[doc] parallelism: Which Strategy To Use When (#12712)
|
2021-07-15 09:38:51 -07:00 |
|
Stas Bekman
|
9ee66adadb
|
fix anchor (#12620)
|
2021-07-09 18:48:28 -07:00 |
|
Stas Bekman
|
0dcc3c86e4
|
[doc] DP/PP/TP/etc parallelism (#12524)
* wip
* complete the doc
* missing img
* improve
* correction
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-07-09 17:39:09 -07:00 |
|