pytorch/test/distributed
Ke Wen 19bf67be32 multimem reduce (#164517)
Modified `multimem_one_shot_all_reduce_out` function to accept a `root` argument, making it a `multimem_reduce` op.

The original `multimem_one_shot_all_reduce` op becomes a caller of the `multimem_reduce`, with each rank providing its own rank id as root.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/164517
Approved by: https://github.com/ngimel
2025-10-08 05:25:16 +00:00
..
_composable [FSDP2] check storage equal and consider data_ptr() == 0 (#164595) 2025-10-06 08:44:38 +00:00
_pycute [CuTe] Change the logic of pycute manipulation ops like coalesce, complement from co-lex to lex (#162690) 2025-09-16 19:53:45 +00:00
_shard Revert "Enable all SIM rules except disabled ones (#164645)" 2025-10-05 19:32:21 +00:00
_tools [1/N] Apply UP035 rule in tests (#163947) 2025-09-29 01:42:01 +00:00
algorithms [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
bin
checkpoint list_stored_sd_metadata API. (#160610) 2025-10-08 04:33:51 +00:00
elastic [1/N] Apply UP035 rule in tests (#163947) 2025-09-29 01:42:01 +00:00
flight_recorder [fr] Skip the dtype check for some one to all or all to one collective (#163839) 2025-09-25 16:02:06 +00:00
fsdp Revert "Enable all SIM rules except disabled ones (#164645)" 2025-10-05 19:32:21 +00:00
launcher Support XPU in --nproc-per-node option to torchrun (#159474) 2025-09-12 08:32:04 +00:00
nn/jit
optim Use torch.testing.test_close instead of torch.testing.test_allclose (#164539) 2025-10-03 14:39:10 +00:00
pipelining [PP] Migrate other schedules to use PipelineScheduleRuntime (#164777) 2025-10-08 01:45:57 +00:00
rpc [BE][6/6] fix typos in test/ (test/distributed/) (#157640) 2025-07-11 14:09:37 +00:00
tensor [ContextParallel] add _LoadBalancer classes, and load-balance interface to Context Parallel APIs (#161062) 2025-10-08 01:09:14 +00:00
_test_template.py [C10D] Fix spelling of MultiProcContinuousTest (#160892) 2025-08-19 20:17:19 +00:00
argparse_util_test.py
test_aten_comm_compute_reordering.py respect aten planned overlap in inductor (#164569) 2025-10-06 15:47:55 +00:00
test_backends.py
test_c10d_common.py Fix invalid f-strings (#164112) 2025-09-30 04:17:13 +00:00
test_c10d_functional_native.py remove more no longer needed torch._check_is_size calls 1 (#164630) 2025-10-04 22:06:04 +00:00
test_c10d_gloo.py [C10d][Gloo] Enable complex datatype support in ProcessGroupGloo (#156633) 2025-09-05 21:24:36 +00:00
test_c10d_logger.py add device generalisation support for distributed tests (#152471) 2025-06-20 07:35:42 +00:00
test_c10d_nccl.py Revert "Enable all SIM rules except disabled ones (#164645)" 2025-10-05 19:32:21 +00:00
test_c10d_object_collectives.py
test_c10d_ops_nccl.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_c10d_pypg.py [c10d] block_current_stream: correctness fixes (#158757) 2025-07-21 22:23:44 +00:00
test_c10d_spawn_gloo.py
test_c10d_spawn_nccl.py
test_c10d_spawn_ucc.py
test_c10d_spawn.py Add __main__ guards to distributed tests (#154628) 2025-06-04 14:39:57 +00:00
test_c10d_ucc.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_collective_utils.py [C10D] add _summarize_ranks util (#160284) 2025-08-28 00:17:53 +00:00
test_composability.py [PP] Fix FSDP unshard/reshard (#164775) 2025-10-08 01:45:57 +00:00
test_compute_comm_reordering.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_control_collectives.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_cupy_as_tensor.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_data_parallel.py [ROCm] Enable several distributed UTs (#164390) 2025-10-03 19:52:51 +00:00
test_device_mesh.py Add regression test for get_root_mesh with multiple independent meshes (#164731) 2025-10-06 18:52:25 +00:00
test_dist2.py [c10d] Fix setGroupName and setGroupDesc in group_split and merge_remote_group (#159429) 2025-07-30 19:55:55 +00:00
test_distributed_spawn.py
test_dynamo_distributed.py [Reland][2/N]Port several test files under test/distributed to Intel GPU (#159473) 2025-09-17 06:42:27 +00:00
test_fake_pg.py Fake process group Direct construction error (#163665) 2025-10-02 22:19:26 +00:00
test_functional_api.py Revert "Fix decorators skipping NCCL tests (#158846)" 2025-09-10 20:51:31 +00:00
test_inductor_collectives.py remove more no longer needed torch._check_is_size calls 1 (#164630) 2025-10-04 22:06:04 +00:00
test_launcher.py
test_multi_threaded_pg.py add device generalization support for distributed tests (#156796) 2025-07-16 09:37:03 +00:00
test_nccl.py Remove old ROCm skip conditions in tests (#164058) 2025-09-29 03:00:58 +00:00
test_nvshmem_triton.py [SymmMem] Add get_nbi the nonblocking version (#163540) 2025-10-01 17:50:24 +00:00
test_nvshmem.py [SymmMem] Tiled reduce (#162243) 2025-10-08 02:03:04 +00:00
test_p2p_ipc.py [ROCm] Enabling several UTs (#161715) 2025-09-09 15:49:21 +00:00
test_pg_wrapper.py [BE][6/6] fix typos in test/ (test/distributed/) (#157640) 2025-07-11 14:09:37 +00:00
test_run.py 154849 Add support to handle IGUSR1 and SIGUSR2 in multiprocessing (#160690) 2025-09-09 22:23:06 +00:00
test_serialization.py distributed/serialization: support zero sized tensors (#164198) 2025-09-30 08:11:29 +00:00
test_store.py [Fix] Adding missing f prefixes to formatted strings [4/N] (#164068) 2025-09-29 04:07:07 +00:00
test_symmetric_memory.py multimem reduce (#164517) 2025-10-08 05:25:16 +00:00