pytorch/torch/distributed/_tensor/ops
Wanchao Liang fbb744fd49 [dtensor] enable radam foreach optimizer (#115566)
As titled, test both non-foreach and foreach optim

Pull Request resolved: https://github.com/pytorch/pytorch/pull/115566
Approved by: https://github.com/XilunWu
ghstack dependencies: #115297, #115564, #115565
2023-12-12 03:57:00 +00:00
__init__.py [dtensor] support convolution ops (#113123) 2023-11-20 21:01:28 +00:00
basic_strategy.py [DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099) (#115193) 2023-12-08 08:44:32 +00:00
common_rules.py fix: docstring error in torch/distributed module (#113241) 2023-11-09 19:10:20 +00:00
conv_ops.py [dtensor] support convolution ops (#113123) 2023-11-20 21:01:28 +00:00
embedding_ops.py [dtensor][8/N] Introduce cost model for sharding (#109145) 2023-10-15 16:00:06 +00:00
experimental_ops.py [DTensor] Add DTensor experimental op for LayerNorm backward sharding rule propogation (#115398) 2023-12-09 09:38:56 +00:00
math_ops.py [DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099) (#115193) 2023-12-08 08:44:32 +00:00
matrix_ops.py [DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099) (#115193) 2023-12-08 08:44:32 +00:00
pointwise_ops.py [dtensor] enable radam foreach optimizer (#115566) 2023-12-12 03:57:00 +00:00
random_ops.py [DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099) (#115193) 2023-12-08 08:44:32 +00:00
tensor_ops.py [DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099) (#115193) 2023-12-08 08:44:32 +00:00
utils.py [dtensor] refactor op dispatch and fix is_same_size/equal (#112927) 2023-11-13 22:46:31 +00:00
view_ops.py fix: docstring error in torch/distributed module (#113241) 2023-11-09 19:10:20 +00:00