pytorch/torch/optim
zeshengzong d7a83ab67b Fix lr_scheduler unexpectedly calling step() when the init argument last_epoch is larger than -1 (#149312)
Fixes #102261

## Changes

- Use an `_is_initial` flag in place of the `self.last_epoch == 0` check to decide whether `lr` should be set to its initial value
- Add a test for the `ExponentialLR` checkpoint use case (see the sketch below)
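
For context, here is a minimal sketch of the checkpoint-resume scenario this fix targets. The model, `lr=0.1`, and `gamma=0.9` are illustrative values only, and a real resume would also restore the optimizer and scheduler state dicts:

```python
import torch
from torch.optim import SGD
from torch.optim.lr_scheduler import ExponentialLR

model = torch.nn.Linear(2, 2)
optimizer = SGD(model.parameters(), lr=0.1)
scheduler = ExponentialLR(optimizer, gamma=0.9)

# Simulate three training epochs; each scheduler.step() decays lr by gamma.
for _ in range(3):
    optimizer.step()
    scheduler.step()
# The group lr is now 0.1 * 0.9**3 and scheduler.last_epoch == 3.

# "Resume" by rebuilding the scheduler at the checkpointed epoch.
# Before this fix, the implicit step() in __init__ applied one extra
# decay (0.1 * 0.9**4); with _is_initial the lr should keep its
# epoch-3 value.
resumed = ExponentialLR(optimizer, gamma=0.9, last_epoch=scheduler.last_epoch)
print(resumed.get_last_lr())  # ~[0.0729], not ~[0.06561]
```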

## Test Result

```shell
pytest -s test/optim/test_lrscheduler.py -vv
```

![Test output](https://github.com/user-attachments/assets/6fd32bcc-b4fb-4421-b891-620bd4900dc1)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/149312
Approved by: https://github.com/janeyx99

Co-authored-by: Jane (Yuan) Xu <31798555+janeyx99@users.noreply.github.com>
2025-05-22 08:42:37 +00:00
| Name | Last commit message | Last commit date |
| --- | --- | --- |
| `_multi_tensor` | | |
| `__init__.py` | | |
| `_adafactor.py` | Fix incorrect citation of authors in documentation (#145209) | 2025-05-05 17:45:05 +00:00 |
| `_functional.py` | | |
| `adadelta.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `adagrad.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `adam.py` | [MPS] grad scaler (#150255) | 2025-04-06 17:06:55 +00:00 |
| `adamax.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `adamw.py` | | |
| `asgd.py` | Add scripts to check xrefs and urls (#151844) | 2025-04-28 09:30:07 +00:00 |
| `lbfgs.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `lr_scheduler.py` | Fix lr_scheduler unexpectedly calling step() when the init argument last_epoch is larger than -1 (#149312) | 2025-05-22 08:42:37 +00:00 |
| `nadam.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `optimizer.py` | Add load_state_dict hint doc about invoke order work with lr_scheduler (#149942) | 2025-05-15 01:07:36 +00:00 |
| `radam.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `rmsprop.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `rprop.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `sgd.py` | Document that dampening is skipped in SGD momentum first step (#152833) | 2025-05-05 20:07:23 +00:00 |
| `sparse_adam.py` | Convert Tensor lr to 0-dim as needed for the optimizer to normally work (#145674) | 2025-03-17 23:07:05 +00:00 |
| `swa_utils.py` | | |