pytorch/test/distributed
Ke Wen 5c827a4133 [SymmMem] Multi-root tile reduction (#164757)
Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0) (oldest at bottom):

Perform multiple tile reductions concurrently, with each tile reduced to a separate root.

- The number of concurrent reductions can be smaller than world size, i.e. roots can be a subset of all ranks. But all ranks are still required to call into this API.

- Currently supports NVLink SHARP scope only.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/164757
Approved by: https://github.com/weifengpy, https://github.com/fegin
ghstack dependencies: #162243
2025-10-08 17:28:00 +00:00
..
_composable [FSDP2] check storage equal and consider data_ptr() == 0 (#164595) 2025-10-06 08:44:38 +00:00
_pycute [CuTe] Change the logic of pycute manipulation ops like coalesce, complement from co-lex to lex (#162690) 2025-09-16 19:53:45 +00:00
_shard Revert "Enable all SIM rules except disabled ones (#164645)" 2025-10-05 19:32:21 +00:00
_tools [1/N] Apply UP035 rule in tests (#163947) 2025-09-29 01:42:01 +00:00
algorithms [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
bin
checkpoint Revert "list_stored_sd_metadata API. (#160610)" 2025-10-08 15:10:38 +00:00
elastic [1/N] Apply UP035 rule in tests (#163947) 2025-09-29 01:42:01 +00:00
flight_recorder [fr] Skip the dtype check for some one to all or all to one collective (#163839) 2025-09-25 16:02:06 +00:00
fsdp Revert "Enable all SIM rules except disabled ones (#164645)" 2025-10-05 19:32:21 +00:00
launcher Support XPU in --nproc-per-node option to torchrun (#159474) 2025-09-12 08:32:04 +00:00
nn/jit PEP585 update - test (#145176) 2025-01-22 04:48:28 +00:00
optim Use torch.testing.test_close instead of torch.testing.test_allclose (#164539) 2025-10-03 14:39:10 +00:00
pipelining [PP] Migrate other schedules to use PipelineScheduleRuntime (#164777) 2025-10-08 01:45:57 +00:00
rpc [BE][6/6] fix typos in test/ (test/distributed/) (#157640) 2025-07-11 14:09:37 +00:00
tensor [ContextParallel] add _LoadBalancer classes, and load-balance interface to Context Parallel APIs (#161062) 2025-10-08 01:09:14 +00:00
_test_template.py [C10D] Fix spelling of MultiProcContinuousTest (#160892) 2025-08-19 20:17:19 +00:00
argparse_util_test.py
test_aten_comm_compute_reordering.py respect aten planned overlap in inductor (#164569) 2025-10-06 15:47:55 +00:00
test_backends.py API to retrieve default distributed backend from device (#140536) 2024-11-22 11:01:53 +00:00
test_c10d_common.py Fix invalid f-strings (#164112) 2025-09-30 04:17:13 +00:00
test_c10d_functional_native.py remove more (#164753) 2025-10-08 14:23:38 +00:00
test_c10d_gloo.py [C10d][Gloo] Enable complex datatype support in ProcessGroupGloo (#156633) 2025-09-05 21:24:36 +00:00
test_c10d_logger.py add device generalisation support for distributed tests (#152471) 2025-06-20 07:35:42 +00:00
test_c10d_nccl.py Revert "Enable all SIM rules except disabled ones (#164645)" 2025-10-05 19:32:21 +00:00
test_c10d_object_collectives.py Update test_c10d_object_collectives.py with DistributedTestBase class (#145056) 2025-02-13 03:57:59 +00:00
test_c10d_ops_nccl.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_c10d_pypg.py [c10d] block_current_stream: correctness fixes (#158757) 2025-07-21 22:23:44 +00:00
test_c10d_spawn_gloo.py PEP585 update - test (#145176) 2025-01-22 04:48:28 +00:00
test_c10d_spawn_nccl.py PEP585 update - test (#145176) 2025-01-22 04:48:28 +00:00
test_c10d_spawn_ucc.py PEP585 update - test (#145176) 2025-01-22 04:48:28 +00:00
test_c10d_spawn.py Add __main__ guards to distributed tests (#154628) 2025-06-04 14:39:57 +00:00
test_c10d_ucc.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_collective_utils.py [C10D] add _summarize_ranks util (#160284) 2025-08-28 00:17:53 +00:00
test_composability.py [PP] Fix FSDP unshard/reshard (#164775) 2025-10-08 01:45:57 +00:00
test_compute_comm_reordering.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_control_collectives.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_cupy_as_tensor.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_data_parallel.py [ROCm] Enable several distributed UTs (#164390) 2025-10-03 19:52:51 +00:00
test_device_mesh.py Add regression test for get_root_mesh with multiple independent meshes (#164731) 2025-10-06 18:52:25 +00:00
test_dist2.py [c10d] Fix setGroupName and setGroupDesc in group_split and merge_remote_group (#159429) 2025-07-30 19:55:55 +00:00
test_distributed_spawn.py Remove NO_MULTIPROCESSING_SPAWN checks (#146705) 2025-02-28 05:53:19 +00:00
test_dynamo_distributed.py [Reland][2/N]Port several test files under test/distributed to Intel GPU (#159473) 2025-09-17 06:42:27 +00:00
test_fake_pg.py Fake process group Direct construction error (#163665) 2025-10-02 22:19:26 +00:00
test_functional_api.py Revert "Fix decorators skipping NCCL tests (#158846)" 2025-09-10 20:51:31 +00:00
test_inductor_collectives.py remove more (#164753) 2025-10-08 14:23:38 +00:00
test_launcher.py Fix unused Python variables in test/[a-d]* (#134665) 2024-12-13 22:13:12 +00:00
test_multi_threaded_pg.py add device generalization support for distributed tests (#156796) 2025-07-16 09:37:03 +00:00
test_nccl.py Remove old ROCm skip conditions in tests (#164058) 2025-09-29 03:00:58 +00:00
test_nvshmem_triton.py [SymmMem] Add get_nbi the nonblocking version (#163540) 2025-10-01 17:50:24 +00:00
test_nvshmem.py [SymmMem] Multi-root tile reduction (#164757) 2025-10-08 17:28:00 +00:00
test_p2p_ipc.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_pg_wrapper.py [BE][6/6] fix typos in test/ (test/distributed/) (#157640) 2025-07-11 14:09:37 +00:00
test_run.py 154849 Add support to handle IGUSR1 and SIGUSR2 in multiprocessing (#160690) 2025-09-09 22:23:06 +00:00
test_serialization.py distributed/serialization: support zero sized tensors (#164198) 2025-09-30 08:11:29 +00:00
test_store.py [Fix] Adding missing f prefixes to formatted strings [4/N] (#164068) 2025-09-29 04:07:07 +00:00
test_symmetric_memory.py multimem reduce (#164517) 2025-10-08 05:25:16 +00:00