pytorch/test/distributed
PyTorch MergeBot c85323c5e8 Revert "Tests Generelization for multiple accelerator devices (#139184)"
This reverts commit b576a8c318.

Reverted https://github.com/pytorch/pytorch/pull/139184 on behalf of https://github.com/clee2000 due to Failing internally when trying to pickle distributed test files D67098795 ([comment](https://github.com/pytorch/pytorch/pull/139184#issuecomment-2539610187))
2024-12-12 17:48:30 +00:00
_composable E2E composability testing (#141398) 2024-12-12 04:19:29 +00:00
_shard Flip default on weights_only (#137602) 2024-11-04 18:30:29 +00:00
_tensor [DTensor][random] add HSDP+TP model init test (#143077) 2024-12-12 06:46:16 +00:00
_tools [FSDP2] Move to public torch.distributed.fsdp (#141868) 2024-12-07 01:24:28 +00:00
algorithms Fix ROCm skip decorator for test_ddp_tp and multiprocess UTs (#136161) 2024-09-18 11:01:23 +00:00
bin
checkpoint [checkpointing][oss] Throw an error when loading a different size than saved tensor (#141571) 2024-12-11 15:35:48 +00:00
elastic Fix rendezvous error due to EtcdStore get method not waiting in some cases (#137056) 2024-10-02 01:45:00 +00:00
flight_recorder [FR] Polish the log message for dtype mismatch and don't exit when too many mismatch (#140451) 2024-11-13 07:24:53 +00:00
fsdp Revert "Tests Generelization for multiple accelerator devices (#139184)" 2024-12-12 17:48:30 +00:00
launcher
nn/jit
optim Fix ROCm skip decorator for test_ddp_tp and multiprocess UTs (#136161) 2024-09-18 11:01:23 +00:00
pipelining [pipelining] Add ZBV schedule (#142084) 2024-12-11 02:00:57 +00:00
rpc
tensor/parallel [Inductor] improve the stride preservation logic of user-visible outputs (#136732) 2024-10-26 18:49:14 +00:00
argparse_util_test.py
test_backends.py API to retrieve default distributed backend from device (#140536) 2024-11-22 11:01:53 +00:00
test_c10d_common.py Register Intel distributed Backend (XCCL) in PyTorch distributed package (#141856) 2024-12-10 01:58:06 +00:00
test_c10d_functional_native.py [c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763) 2024-10-29 03:31:19 +00:00
test_c10d_gloo.py Use task submitter TLS in gloo working threads (#142184) 2024-12-06 17:03:17 +00:00
test_c10d_logger.py [c10d] Switch all timer logging in c10d to wait_counter (#141154) 2024-11-21 01:10:11 +00:00
test_c10d_nccl.py [C10D] Add better profiling title for NCCL barrier, nccl:all_reduce to nccl:all_reduce_barrier (#140785) 2024-12-10 23:08:15 +00:00
test_c10d_object_collectives.py [c10d][CI] Improve world size setting in some tests (#138846) 2024-10-25 23:02:17 +00:00
test_c10d_ops_nccl.py [c10d][CI] Improve world size setting in some tests (#138846) 2024-10-25 23:02:17 +00:00
test_c10d_pypg.py DistributedDataParallel: add init_sync option to control collectives during initialization (#142824) 2024-12-11 20:28:38 +00:00
test_c10d_spawn_gloo.py
test_c10d_spawn_nccl.py
test_c10d_spawn_ucc.py
test_c10d_spawn.py
test_c10d_ucc.py [c10d][UCC] Add _reduce_scatter_base to c10d::ProcessGroupUCC (#138021) 2024-12-09 16:02:24 +00:00
test_collective_utils.py
test_compute_comm_reordering.py [CI] Add Compiled DDP / Compiled FSDP2 / compute-comm reordering tests to test_inductor_distributed (#138178) 2024-10-20 19:38:18 +00:00
test_control_collectives.py
test_data_parallel.py Replace clone.detach with detach.clone (#140264) 2024-11-13 07:01:02 +00:00
test_device_mesh.py [DeviceMesh] Call no_dispatch before doing tensor slicing in DeviceMesh (#142287) 2024-12-10 06:33:01 +00:00
test_distributed_spawn.py
test_dynamo_distributed.py [Dynamo] Replace torch._dynamo.optimize() with torch.compile() [7/N] (#140922) 2024-12-06 07:07:29 +00:00
test_fake_pg.py
test_functional_api.py Generalization of distributed test cases for non-CUDA devices (#138216) 2024-11-18 09:38:00 +00:00
test_inductor_collectives.py Move Sympy printers to torch/utils/_sympy/printers.py (#140597) 2024-11-26 18:11:00 +00:00
test_launcher.py
test_multi_threaded_pg.py
test_nccl.py [Pytorch][ATEN] Enable FP8 NCCL in Pytorch ATEN (#138776) 2024-10-25 21:56:47 +00:00
test_pg_wrapper.py
test_store.py [TCPStore] Throw value error if passing world_size=0 to TCPStore (#137792) 2024-10-11 23:42:57 +00:00
test_symmetric_memory.py [SymmetricMemory] use the python version of empty() and rendezvous() for tests and library ops (#142154) 2024-12-05 22:09:36 +00:00