pytorch/torch/distributed
Dzmitry Huba 2cd5fd1588 Enable local tensor mode on DTensor view ops test (#165596)
While enabling this test discovered lack of support for sub meshes. Added limited support
for sub meshes by properly computing rank coordinates for a given sub mesh. The implementation
follows similar approach to collectives. We infer all sub meshes for the given dimensions and
compute each rank's coordinates with respect to is sub mesh.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/165596
Approved by: https://github.com/ezyang
2025-10-16 20:52:06 +00:00
..
_composable [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
_local_tensor Enable local tensor mode on DTensor view ops test (#165596) 2025-10-16 20:52:06 +00:00
_pycute Revert "[DeviceMesh] Simplify unflatten method (#165556)" 2025-10-16 20:41:37 +00:00
_shard [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
_sharded_tensor [BE][PYFMT] migrate PYFMT for torch.{distributed,distributions} to ruff format (#144547) 2025-02-28 07:35:56 +00:00
_sharding_spec
_symmetric_memory [async_tp] Support ag+mm with gather_dim lastdim of mat_A (#163068) 2025-10-16 20:14:39 +00:00
_tensor [BE][PYFMT] migrate PYFMT for torch.{distributed,distributions} to ruff format (#144547) 2025-02-28 07:35:56 +00:00
_tools [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
algorithms [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
autograd [remove untyped defs] batch 1 (#157011) 2025-06-30 23:54:40 +00:00
benchmarks Add pyrefly suppressions to torch/distributed (7/n) (#165002) 2025-10-09 04:08:25 +00:00
checkpoint [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
elastic [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
examples Support XPU in memory tracker (#150703) 2025-06-12 21:33:52 +00:00
fsdp [distributed] Replace 164 assert statements in fsdp directory (#165235) 2025-10-14 18:04:57 +00:00
launcher [3/N] Import Callable from collections.abc in torch/distributed (#164104) 2025-09-30 00:28:53 +00:00
nn Add pyrefly suppressions to torch/distributed (7/n) (#165002) 2025-10-09 04:08:25 +00:00
optim [2/N] More ruff SIM fixes (#165031) 2025-10-14 14:22:54 +00:00
pipelining [PP] Update backward_counter and fsdp util to schedule class (#165513) 2025-10-15 21:58:16 +00:00
rpc Add pyrefly suppressions to torch/distributed (7/n) (#165002) 2025-10-09 04:08:25 +00:00
tensor Enable local tensor mode on DTensor view ops test (#165596) 2025-10-16 20:52:06 +00:00
__init__.py [RFC] Add pyrefly to lintrunner (#165179) 2025-10-16 20:07:09 +00:00
_checkpointable.py [BE]: Backport runtime_checkable perf improvements/behavior from 3.12 (#155130) 2025-06-06 13:28:05 +00:00
_composable_state.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
_dist2.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
_functional_collectives_impl.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
_functional_collectives.py [RFC] Add pyrefly to lintrunner (#165179) 2025-10-16 20:07:09 +00:00
_mesh_layout.py Revert "[DeviceMesh] Prefer using _layout over _mesh for all sorts of things (#165554)" 2025-10-16 20:41:37 +00:00
_serialization.py distributed/serialization: support zero sized tensors (#164198) 2025-09-30 08:11:29 +00:00
_state_dict_utils.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
argparse_util.py
c10d_logger.py Add pyrefly suppressions to torch/distributed (7/n) (#165002) 2025-10-09 04:08:25 +00:00
collective_utils.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
constants.py Revert "[RELAND] Always build USE_DISTRIBUTED (#160449) and Make distributed modules importable even when backend not built (#159889) (#162594)" 2025-09-25 13:47:46 +00:00
CONTRIBUTING.md fix torch/distributed contributing doc (#158934) 2025-07-28 17:01:05 +00:00
device_mesh.py Revert "[DeviceMesh] Prefer using _layout over _mesh for all sorts of things (#165554)" 2025-10-16 20:41:37 +00:00
distributed_c10d.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
launch.py [BE][PYFMT] migrate PYFMT for torch.{distributed,distributions} to ruff format (#144547) 2025-02-28 07:35:56 +00:00
logging_handlers.py PEP585 update - torch/distributed (#145164) 2025-01-21 04:23:29 +00:00
remote_device.py Add pyrefly suppressions to torch/distributed (7/n) (#165002) 2025-10-09 04:08:25 +00:00
rendezvous.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
run.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00
utils.py Revert "[distributed] Replace assert statements with AssertionError exceptions (#165216)" 2025-10-14 17:05:16 +00:00