mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-31 18:22:34 +06:00
Rename torch.run to torchrun (#30405)
torch.run does not exist anywhere as far as I can tell.
This commit is contained in:
parent
696ededd2b
commit
b8b1e442e3
@ -659,7 +659,7 @@ You could also use the [`Trainer`]'s `--save_on_each_node` argument to automatic
|
|||||||
For [torchrun](https://pytorch.org/docs/stable/elastic/run.html), you have to ssh to each node and run the following command on both of them. The launcher waits until both nodes are synchronized before launching the training.
|
For [torchrun](https://pytorch.org/docs/stable/elastic/run.html), you have to ssh to each node and run the following command on both of them. The launcher waits until both nodes are synchronized before launching the training.
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python -m torch.run --nproc_per_node=8 --nnode=2 --node_rank=0 --master_addr=hostname1 \
|
torchrun --nproc_per_node=8 --nnode=2 --node_rank=0 --master_addr=hostname1 \
|
||||||
--master_port=9901 your_program.py <normal cl args> --deepspeed ds_config.json
|
--master_port=9901 your_program.py <normal cl args> --deepspeed ds_config.json
|
||||||
```
|
```
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user