| Name | Last commit | Date |
|---|---|---|
| _composable | [FSDP2] Fixed test_all_gather_extensions_monkey_patch (#136130) | 2024-09-23 15:12:44 +00:00 |
| _shard | [BE]: Update mypy to 1.11.2 (#133816) | 2024-09-16 19:44:11 +00:00 |
| _sharded_tensor | | |
| _sharding_spec | | |
| _symmetric_memory | [micro_pipeline_tp] support all _scaled_mm args (#131984) | 2024-08-05 21:44:37 +00:00 |
| _tensor | [reland][dtensor] move DTensor to public namespace (#134203) | 2024-09-08 17:08:40 +00:00 |
| _tools | Runtime Estimator for estimating GPU compute time (#134243) | 2024-08-28 20:06:54 +00:00 |
| algorithms | [BE]: Update mypy to 1.11.2 (#133816) | 2024-09-16 19:44:11 +00:00 |
| autograd | | |
| benchmarks | [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) | 2024-08-15 15:50:19 +00:00 |
| checkpoint | [Distributed] fix FileSystemWriter __init__ (#136135) | 2024-09-16 19:11:08 +00:00 |
| elastic | [reland][Elastic] Skip store barrier and store get in host assign (#136865) | 2024-09-27 23:40:42 +00:00 |
| examples | | |
| fsdp | Update real device in FSDP state_dict_utils (#134994) | 2024-09-17 04:39:08 +00:00 |
| launcher | | |
| nn | Revert "added persistent option to buffers and namedbuffers (#132994)" | 2024-08-09 18:14:53 +00:00 |
| optim | [BE]: Update mypy to 1.11.2 (#133816) | 2024-09-16 19:44:11 +00:00 |
| pipelining | [PP] Remove modifications to autograd nodes in ZB (#136678) | 2024-09-27 07:07:58 +00:00 |
| rpc | [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) | 2024-08-15 15:50:19 +00:00 |
| tensor | Revert "Allow parallelize_module to get device_mesh from ambient context (#134247)" | 2024-09-30 19:07:01 +00:00 |
| __init__.py | Remove ProcessGroupRoundRobin (#132888) | 2024-08-08 01:07:40 +00:00 |
| _checkpointable.py | | |
| _composable_state.py | | |
| _functional_collectives_impl.py | | |
| _functional_collectives.py | [BE]: Update mypy to 1.11.2 (#133816) | 2024-09-16 19:44:11 +00:00 |
| _state_dict_utils.py | [DSD] Fix loading uneven full tensor into sharded state dict (#136365) | 2024-09-23 16:35:58 +00:00 |
| argparse_util.py | | |
| c10d_logger.py | | |
| collective_utils.py | | |
| constants.py | | |
| CONTRIBUTING.md | | |
| device_mesh.py | [DeviceMesh][EZ] Add group description to new group (#136558) | 2024-09-28 03:09:41 +00:00 |
| distributed_c10d.py | Add more distributed examples (#130427) | 2024-09-20 18:27:27 +00:00 |
| launch.py | | |
| logging_handlers.py | | |
| remote_device.py | [BE][Easy] fix ruff rule needless-bool (SIM103) (#130206) | 2024-07-14 08:17:52 +00:00 |
| rendezvous.py | [reland] [torchelastic][c10d] Fix store prefix race in rendezvous (#136768) | 2024-09-26 17:37:07 +00:00 |
| run.py | fix torchrun log message (#131652) | 2024-07-25 14:50:10 +00:00 |
| utils.py | [BE] fix circular import in torch/distributed/utils.py (#136286) | 2024-09-22 20:54:12 +00:00 |