pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Wanchao Liang fbb744fd49 [dtensor] enable radam foreach optimizer (#115566 ) As titled, test both non-foreach and foreach optim Pull Request resolved: https://github.com/pytorch/pytorch/pull/115566 Approved by: https://github.com/XilunWu ghstack dependencies: #115297, #115564, #115565		2023-12-12 03:57:00 +00:00
..
__init__.py	[dtensor] support convolution ops (#113123 )	2023-11-20 21:01:28 +00:00
basic_strategy.py	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) (#115193 )	2023-12-08 08:44:32 +00:00
common_rules.py	fix: docstring error in torch/distributed module (#113241 )	2023-11-09 19:10:20 +00:00
conv_ops.py	[dtensor] support convolution ops (#113123 )	2023-11-20 21:01:28 +00:00
embedding_ops.py	[dtensor][8/N] Introduce cost model for sharding (#109145 )	2023-10-15 16:00:06 +00:00
experimental_ops.py	[DTensor] Add DTensor experimental op for LayerNorm backward sharding rule propogation (#115398 )	2023-12-09 09:38:56 +00:00
math_ops.py	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) (#115193 )	2023-12-08 08:44:32 +00:00
matrix_ops.py	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) (#115193 )	2023-12-08 08:44:32 +00:00
pointwise_ops.py	[dtensor] enable radam foreach optimizer (#115566 )	2023-12-12 03:57:00 +00:00
random_ops.py	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) (#115193 )	2023-12-08 08:44:32 +00:00
tensor_ops.py	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) (#115193 )	2023-12-08 08:44:32 +00:00
utils.py	[dtensor] refactor op dispatch and fix is_same_size/equal (#112927 )	2023-11-13 22:46:31 +00:00
view_ops.py	fix: docstring error in torch/distributed module (#113241 )	2023-11-09 19:10:20 +00:00