pytorch/torch/distributed
Wanchao Liang 4cc474dec4 [dtensor] support torch.save/load with DTensor (#103106)
This PR enables DTensor to be picklable and adds tests to verify that
torch.save/load works correctly for DTensor.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103106
Approved by: https://github.com/kumpera
2023-06-09 04:11:15 +00:00
_composable [Replicate] Add unit test with replicate param names (#102401) 2023-05-31 18:41:03 +00:00
_shard enforce dtype (reland) (#102996) 2023-06-06 00:35:04 +00:00
_sharded_tensor
_sharding_spec
_spmd [distributed] implemented find_all_descendants (#102138) 2023-05-24 21:47:59 +00:00
_tensor [dtensor] support torch.save/load with DTensor (#103106) 2023-06-09 04:11:15 +00:00
_tools
algorithms [CheckpointWrapper] Warn on reentrant use (#102890) 2023-06-04 18:31:22 +00:00
autograd
benchmarks
checkpoint [DCP] Rewrite read slicing to use a wrapper. (#99167) 2023-06-08 13:52:13 +00:00
elastic [BE] Fix flake8 B027 errors - missing abstractmethod decorator (#100715) 2023-05-09 17:28:48 +00:00
examples
fsdp [FSDP] Reshard frozen params in backward (#101982) 2023-06-08 21:12:45 +00:00
launcher Convert logging f-strings to use % format, part four (#98705) 2023-04-11 13:17:59 +00:00
nn [PTD] Make input contiguous for _ReduceScatter (#101373) 2023-05-15 22:08:21 +00:00
optim [BE]: Bugfix functorch and some generic typing improvements (#101337) 2023-05-14 14:20:56 +00:00
pipeline Enable ruff in lintrunner (#99785) 2023-04-24 16:18:44 +00:00
rpc [BE]: enable PLE error codes in ruff and fix bugs (#101079) 2023-05-11 23:57:25 +00:00
tensor Get rid of dim_groups attribute from DeviceMesh (#103105) 2023-06-09 04:11:15 +00:00
__init__.py [c10d] Record time spent for init_process_group, new_group, _store_based_barrier (#101912) 2023-05-24 09:36:34 +00:00
_composable_state.py
_functional_collectives.py Get rid of dim_groups attribute from DeviceMesh (#103105) 2023-06-09 04:11:15 +00:00
argparse_util.py
c10d_logger.py [c10d] Record time spent for init_process_group, new_group, _store_based_barrier (#101912) 2023-05-24 09:36:34 +00:00
constants.py
CONTRIBUTING.md
distributed_c10d.py [c10d] Make it default that PG do not perform barrier after init (#103033) 2023-06-07 06:11:14 +00:00
launch.py
logging_handlers.py
remote_device.py
rendezvous.py
run.py Convert logging f-strings to use % format, part four (#98705) 2023-04-11 13:17:59 +00:00
utils.py [FSDP/DDP] Unify _cast_forward_inputs (#102680) 2023-06-04 18:31:21 +00:00