pytorch/test/distributed
PyTorch MergeBot abe172e268 Revert "Cleanup error reporting for ProcessGroupNCCL (#111979)"
This reverts commit b29c658265.

Reverted https://github.com/pytorch/pytorch/pull/111979 on behalf of https://github.com/huydhn due to Sorry for reverting your change but it is failing multigpu test in trunk b29c658265 ([comment](https://github.com/pytorch/pytorch/pull/111979#issuecomment-1781919184))
2023-10-26 21:29:40 +00:00
_composable [DDP] Make _ReplicateState inherit from _State and make replicate eagerly initialized (#109647) 2023-10-12 07:58:39 +00:00
_shard [Test][ShardedTensor] Add test for corner case for chunk sharding spec (#109626) 2023-09-20 14:40:07 +00:00
_spmd Move has_triton to top level triton utils so that dynamo can also access (#109832) 2023-09-22 19:33:41 +00:00
_tensor Add tensor parallel sharding APIs for torch export (#111236) 2023-10-24 10:07:14 +00:00
_tools
algorithms
bin
checkpoint [DCP][Test] Add use_dtensor subtests for test_state_dict FSDP test (#111615) 2023-10-21 00:44:41 +00:00
elastic Enable flake8-bugbear B020 lint (#110823) 2023-10-24 22:43:47 +00:00
fsdp Enable flake8-bugbear B020 lint (#110823) 2023-10-24 22:43:47 +00:00
launcher removing some redundant str splits (#106089) 2023-09-01 00:22:58 +00:00
nn/jit
optim Back out "Reland "Make adding buffers more like adding parameters (#104069)" (#106224)" (#106743) 2023-08-08 15:27:34 +00:00
pipeline/sync [BE] Enable ruff's UP rules and autoformat distributed/ (#105433) 2023-07-19 14:27:11 +00:00
rpc
tensor/parallel Revert "[2D] Enable 2D optimizer set_state_dict() (#111778)" 2023-10-26 00:18:30 +00:00
argparse_util_test.py
test_c10d_common.py Add "cuda" to MPI backend capabilities (#109614) 2023-09-21 13:34:58 +00:00
test_c10d_functional_native.py Native c10d_functional ops (#110570) 2023-10-25 22:56:06 +00:00
test_c10d_gloo.py fix gloo cuda sparse_allreduce dispatch (#111485) 2023-10-19 21:15:45 +00:00
test_c10d_logger.py [c10d] add nccl version to c10d logger (#111215) 2023-10-16 18:47:49 +00:00
test_c10d_nccl.py Revert "Cleanup error reporting for ProcessGroupNCCL (#111979)" 2023-10-26 21:29:40 +00:00
test_c10d_object_collectives.py [c10d] Remove test for init barrier (#103223) 2023-06-08 16:56:40 +00:00
test_c10d_pypg.py
test_c10d_spawn_gloo.py
test_c10d_spawn_nccl.py
test_c10d_spawn_ucc.py
test_c10d_spawn.py [BE] f-stringify torch/ and scripts (#105538) 2023-07-21 19:35:24 +00:00
test_c10d_ucc.py [BE] Enable ruff's UP rules and autoformat distributed/ (#105433) 2023-07-19 14:27:11 +00:00
test_collective_utils.py Initial commit of collective_utils (#101037) 2023-06-27 02:15:16 +00:00
test_compute_comm_reordering.py Fix unit tests and add logging for Inductor intra-graph reordering (#111981) 2023-10-25 18:19:43 +00:00
test_data_parallel.py [reland2] Update custom Function preserve torch function when inputs … (#110895) 2023-10-11 21:37:19 +00:00
test_distributed_spawn.py Back out "Revert "[DDP] multiple forward support for static graph (#103487)" (#103873)" (#103938) 2023-06-22 21:55:58 +00:00
test_dynamo_distributed.py Move has_triton to top level triton utils so that dynamo can also access (#109832) 2023-09-22 19:33:41 +00:00
test_fake_pg.py Enhance fakepg: add fsdp+tp tests (#107626) 2023-08-25 06:17:54 +00:00
test_functional_api.py [collective] Add a torch.compile + functional_collectives test (#110688) 2023-10-10 17:14:50 +00:00
test_inductor_collectives.py Add regex matching to Inductor all2all collective unit tests (#112077) 2023-10-26 08:29:30 +00:00
test_launcher.py
test_multi_threaded_pg.py [C10D] Improve MTPG autograd test. Fixes #105106 (#105356) 2023-07-20 13:51:21 +00:00
test_nccl.py
test_pg_wrapper.py [Dist] Fix coalescing manager + DETAIL debug mode (#111878) 2023-10-24 07:47:39 +00:00
test_store.py [c10d] Add tests for using libuv through init_process_group. (#108661) 2023-09-20 16:02:20 +00:00