| .. |
|
_composable
|
E2E composability testing (#141398)
|
2024-12-12 04:19:29 +00:00 |
|
_shard
|
Flip default on weights_only (#137602)
|
2024-11-04 18:30:29 +00:00 |
|
_tensor
|
[DTensor][random] add HSDP+TP model init test (#143077)
|
2024-12-12 06:46:16 +00:00 |
|
_tools
|
[FSDP2] Move to public torch.distributed.fsdp (#141868)
|
2024-12-07 01:24:28 +00:00 |
|
algorithms
|
Fix ROCm skip decorator for test_ddp_tp and multiprocess UTs (#136161)
|
2024-09-18 11:01:23 +00:00 |
|
bin
|
|
|
|
checkpoint
|
[checkpointing][oss] Throw an error when loading a different size than saved tensor (#141571)
|
2024-12-11 15:35:48 +00:00 |
|
elastic
|
Fix rendezvous error due to EtcdStore get method not waiting in some cases (#137056)
|
2024-10-02 01:45:00 +00:00 |
|
flight_recorder
|
[FR] Polish the log message for dtype mismatch and don't exit when too many mismatch (#140451)
|
2024-11-13 07:24:53 +00:00 |
|
fsdp
|
Revert "Tests Generelization for multiple accelerator devices (#139184)"
|
2024-12-12 17:48:30 +00:00 |
|
launcher
|
|
|
|
nn/jit
|
|
|
|
optim
|
Fix ROCm skip decorator for test_ddp_tp and multiprocess UTs (#136161)
|
2024-09-18 11:01:23 +00:00 |
|
pipelining
|
[pipelining] Add ZBV schedule (#142084)
|
2024-12-11 02:00:57 +00:00 |
|
rpc
|
|
|
|
tensor/parallel
|
[Inductor] improve the stride preservation logic of user-visible outputs (#136732)
|
2024-10-26 18:49:14 +00:00 |
|
argparse_util_test.py
|
|
|
|
test_backends.py
|
API to retrieve default distributed backend from device (#140536)
|
2024-11-22 11:01:53 +00:00 |
|
test_c10d_common.py
|
Register Intel distributed Backend (XCCL) in PyTorch distributed package (#141856)
|
2024-12-10 01:58:06 +00:00 |
|
test_c10d_functional_native.py
|
[c10d][Partial-Graph Overlap] Support calling .wait_tensor() on output tensor of eager async_op=True collective if under allow_inflight_collective_as_graph_input_ctx() context manager (#137763)
|
2024-10-29 03:31:19 +00:00 |
|
test_c10d_gloo.py
|
Use task submitter TLS in gloo working threads (#142184)
|
2024-12-06 17:03:17 +00:00 |
|
test_c10d_logger.py
|
[c10d] Switch all timer logging in c10d to wait_counter (#141154)
|
2024-11-21 01:10:11 +00:00 |
|
test_c10d_nccl.py
|
[C10D] Add better profiling title for NCCL barrier, nccl:all_reduce to nccl:all_reduce_barrier (#140785)
|
2024-12-10 23:08:15 +00:00 |
|
test_c10d_object_collectives.py
|
[c10d][CI] Improve world size setting in some tests (#138846)
|
2024-10-25 23:02:17 +00:00 |
|
test_c10d_ops_nccl.py
|
[c10d][CI] Improve world size setting in some tests (#138846)
|
2024-10-25 23:02:17 +00:00 |
|
test_c10d_pypg.py
|
DistributedDataParallel: add init_sync option to control collectives during initialization (#142824)
|
2024-12-11 20:28:38 +00:00 |
|
test_c10d_spawn_gloo.py
|
|
|
|
test_c10d_spawn_nccl.py
|
|
|
|
test_c10d_spawn_ucc.py
|
|
|
|
test_c10d_spawn.py
|
|
|
|
test_c10d_ucc.py
|
[c10d][UCC] Add _reduce_scatter_base to c10d::ProcessGroupUCC (#138021)
|
2024-12-09 16:02:24 +00:00 |
|
test_collective_utils.py
|
|
|
|
test_compute_comm_reordering.py
|
[CI] Add Compiled DDP / Compiled FSDP2 / compute-comm reordering tests to test_inductor_distributed (#138178)
|
2024-10-20 19:38:18 +00:00 |
|
test_control_collectives.py
|
|
|
|
test_data_parallel.py
|
Replace clone.detach with detach.clone (#140264)
|
2024-11-13 07:01:02 +00:00 |
|
test_device_mesh.py
|
[DeviceMesh] Call no_dispatch before doing tensor slicing in DeviceMesh (#142287)
|
2024-12-10 06:33:01 +00:00 |
|
test_distributed_spawn.py
|
|
|
|
test_dynamo_distributed.py
|
[Dynamo] Replace torch._dynamo.optimize() with torch.compile() [7/N] (#140922)
|
2024-12-06 07:07:29 +00:00 |
|
test_fake_pg.py
|
|
|
|
test_functional_api.py
|
Generalization of distributed test cases for non-CUDA devices (#138216)
|
2024-11-18 09:38:00 +00:00 |
|
test_inductor_collectives.py
|
Move Sympy printers to torch/utils/_sympy/printers.py (#140597)
|
2024-11-26 18:11:00 +00:00 |
|
test_launcher.py
|
|
|
|
test_multi_threaded_pg.py
|
|
|
|
test_nccl.py
|
[Pytorch][ATEN] Enable FP8 NCCL in Pytorch ATEN (#138776)
|
2024-10-25 21:56:47 +00:00 |
|
test_pg_wrapper.py
|
|
|
|
test_store.py
|
[TCPStore] Throw value error if passing world_size=0 to TCPStore (#137792)
|
2024-10-11 23:42:57 +00:00 |
|
test_symmetric_memory.py
|
[SymmetricMemory] use the python version of empty() and rendezvous() for tests and library ops (#142154)
|
2024-12-05 22:09:36 +00:00 |