pytorch/torch/distributed
2024-09-30 19:07:01 +00:00
_composable [FSDP2] Fixed test_all_gather_extensions_monkey_patch (#136130) 2024-09-23 15:12:44 +00:00
_shard [BE]: Update mypy to 1.11.2 (#133816) 2024-09-16 19:44:11 +00:00
_sharded_tensor
_sharding_spec
_symmetric_memory [micro_pipeline_tp] support all _scaled_mm args (#131984) 2024-08-05 21:44:37 +00:00
_tensor [reland][dtensor] move DTensor to public namespace (#134203) 2024-09-08 17:08:40 +00:00
_tools Runtime Estimator for estimating GPU compute time (#134243) 2024-08-28 20:06:54 +00:00
algorithms [BE]: Update mypy to 1.11.2 (#133816) 2024-09-16 19:44:11 +00:00
autograd
benchmarks [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
checkpoint [Distributed] fix FileSystemWriter __init__ (#136135) 2024-09-16 19:11:08 +00:00
elastic [reland][Elastic] Skip store barrier and store get in host assign (#136865) 2024-09-27 23:40:42 +00:00
examples
fsdp Update real device in FSDP state_dict_utils (#134994) 2024-09-17 04:39:08 +00:00
launcher
nn Revert "added persistent option to buffers and namedbuffers (#132994)" 2024-08-09 18:14:53 +00:00
optim [BE]: Update mypy to 1.11.2 (#133816) 2024-09-16 19:44:11 +00:00
pipelining [PP] Remove modifications to autograd nodes in ZB (#136678) 2024-09-27 07:07:58 +00:00
rpc [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
tensor Revert "Allow parallelize_module to get device_mesh from ambient context (#134247)" 2024-09-30 19:07:01 +00:00
__init__.py Remove ProcessGroupRoundRobin (#132888) 2024-08-08 01:07:40 +00:00
_checkpointable.py
_composable_state.py
_functional_collectives_impl.py
_functional_collectives.py [BE]: Update mypy to 1.11.2 (#133816) 2024-09-16 19:44:11 +00:00
_state_dict_utils.py [DSD] Fix loading uneven full tensor into sharded state dict (#136365) 2024-09-23 16:35:58 +00:00
argparse_util.py
c10d_logger.py
collective_utils.py
constants.py
CONTRIBUTING.md
device_mesh.py [DeviceMesh][EZ] Add group description to new group (#136558) 2024-09-28 03:09:41 +00:00
distributed_c10d.py Add more distributed examples (#130427) 2024-09-20 18:27:27 +00:00
launch.py
logging_handlers.py
remote_device.py [BE][Easy] fix ruff rule needless-bool (SIM103) (#130206) 2024-07-14 08:17:52 +00:00
rendezvous.py [reland] [torchelastic][c10d] Fix store prefix race in rendezvous (#136768) 2024-09-26 17:37:07 +00:00
run.py fix torchrun log message (#131652) 2024-07-25 14:50:10 +00:00
utils.py [BE] fix circular import in torch/distributed/utils.py (#136286) 2024-09-22 20:54:12 +00:00