This PR adds the foreach impl for Adafactor, knowing that there are many ways to improve its runtime perf today (by adding more foreach support).

After this PR:
- we have a `foreach` flag for Adafactor
- it is NOT the default. Why not? It is only slightly faster and uses O(n) more memory, where n is the number of params in your largest param group. People tend to use Adafactor for memory efficiency.

Next steps:
- make torch.compile possible on it
- make it faster (by adding more foreach APIs)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/132336
Approved by: https://github.com/albanD
ghstack dependencies: #133360
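A minimal usage sketch of the flag described above, assuming a PyTorch build that includes `torch.optim.Adafactor` and this PR; the toy model and tensor shapes are purely illustrative:

```python
import torch

# Toy model purely for illustration; any nn.Module works the same way.
model = torch.nn.Linear(16, 4)

# foreach is NOT the default: omitting it keeps the more memory-frugal
# single-tensor path. Passing foreach=True opts in to the multi-tensor impl,
# which (per this PR) is only slightly faster and uses O(n) extra memory,
# where n is the number of params in the largest param group.
optimizer = torch.optim.Adafactor(model.parameters(), foreach=True)

# Ordinary training step.
loss = model(torch.randn(8, 16)).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```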
| Name |
|---|
| _internal |
| __init__.py |
| _comparison.py |
| _creation.py |
| _utils.py |