pytorch/docs/source/elastic
Kiuk Chung 3900509b7d (torchelastic) make --max_restarts explicit in the quickstart and runner docs (#65838)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65838

closes https://github.com/pytorch/pytorch/pull/65675

The default `--max_restarts` for `torch.distributed.run` was changed to `0` from `3` to make things backwards compatible with `torch.distributed.launch`. Since the default `--max_restarts` used to be greater than `0` we never documented passing `--max_restarts` explicitly in any of our example code.

Test Plan: N/A doc change only

Reviewed By: d4l3k

Differential Revision: D31279544

fbshipit-source-id: 98b31e6a158371bc56907552c5c13958446716f9
2021-09-29 19:29:01 -07:00
..
agent_diagram.jpg
agent.rst
customization.rst
errors.rst [torch] Various improvements to torch.distributed.launch and torch.distributed.run (#61294) 2021-07-08 16:28:06 -07:00
etcd_rdzv_diagram.png
events.rst
examples.rst
kubernetes.rst
metrics.rst
multiprocessing.rst
quickstart.rst (torchelastic) make --max_restarts explicit in the quickstart and runner docs (#65838) 2021-09-29 19:29:01 -07:00
rendezvous.rst [torch/elastic] Update the rendezvous docs (#58160) 2021-05-12 16:54:28 -07:00
run.rst Introduce the torchrun entrypoint (#64049) 2021-08-26 20:17:48 -07:00
timer.rst
train_script.rst Introduce the torchrun entrypoint (#64049) 2021-08-26 20:17:48 -07:00