pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Yuanyuan Chen	fc8ac1216c	[4/N] Remove unused loop variables in tests (#166690 ) This PR removes unused loop variables in tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/166690 Approved by: https://github.com/justinchuby, https://github.com/mlazos	2025-10-31 10:20:48 +00:00
Yuanyuan Chen	0e083942cc	Enable PLW0127 in ruff (#165851 ) This PR enables `PLW0127` in ruff, which checks self-assignment of variables with the form `var=var`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/165851 Approved by: https://github.com/Lucaskabela	2025-10-21 03:30:57 +00:00
Yuanyuan Chen	fdab48a7c1	Enable all PIE rules on ruff (#165814 ) This PR enables all PIE rules on ruff, there are already some enabled rules from this family, the new added rules are ``` PIE796 Enum contains duplicate value: {value} PIE808 Unnecessary start argument in range ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/165814 Approved by: https://github.com/ezyang	2025-10-18 07:36:18 +00:00
PyTorch MergeBot	24520b8386	Revert "Enable all PIE rules on ruff (#165814 )" This reverts commit `c79dfdc655`. Reverted https://github.com/pytorch/pytorch/pull/165814 on behalf of https://github.com/cyyever due to Need to cover more files ([comment](https://github.com/pytorch/pytorch/pull/165814#issuecomment-3417931863))	2025-10-18 07:21:08 +00:00
Yuanyuan Chen	c79dfdc655	Enable all PIE rules on ruff (#165814 ) This PR enables all PIE rules on ruff, there are already some enabled rules from this family, the new added rules are ``` PIE796 Enum contains duplicate value: {value} PIE808 Unnecessary start argument in range ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/165814 Approved by: https://github.com/ezyang	2025-10-18 06:40:12 +00:00
Filip	2b6a74abf1	[optim] prevent unintended aliasing in lr_scheduler; update type annotations/docs (#163120 ) 1. Prevents unintended aliasing of `self._last_lr`/`get_last_lr(...)` with `group["lr"]` when `group["lr"]` is a tensor. 2. Prevents unintended aliasing of `LRScheduler.base_lrs` with the `group["initial_lr"]`s. 3. Updates `test/optim/test_lrscheduler.py` to test tensor LRs. 4. Changes type annotations for `_last_lr`, `get_last_lr()`, `base_lrs`, `get_lr()`, and `_get_closed_form_lr()` from `list[float]` to `list[float \| Tensor]`; adds documentation. Fixes #163103 LR schedulers can behave in unexpected ways when using a tensor LR due to patterns like this: ```python self._last_lr: list[float] = [group["lr"] for group in self.optimizer.param_groups] ``` This PR adds a helper to address this: ```python def _param_groups_val_list(optimizer: Optimizer, key: str) -> list[Any]: """Create a list containing group[key] for each optimizer param_group. Prevents aliasing when group[key] could be a Tensor. Raises a KeyError when group[key] does not exist. """ return [ group[key].clone() if isinstance(group[key], Tensor) else group[key] for group in optimizer.param_groups ] ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/163120 Approved by: https://github.com/janeyx99	2025-09-25 06:58:58 +00:00
Filip	167ad09be5	[optim] override SWALR.state_dict and load_state_dict (#163122 ) Fixes #163105 Note that the new `SWALR.load_state_dict` is not backwards compatible: ```python @override def load_state_dict(self, state_dict: dict[str, Any]) -> None: """Load the scheduler's state. Args: state_dict (dict): scheduler state. Should be an object returned from a call to :meth:`state_dict`. """ self.__dict__.update(state_dict) self._set_anneal_func(self._anneal_strategy) ``` If we'd like to maintain compatibility with old state_dicts (loaded with `weights_only=False`), we could use something along these lines: ```python @override def load_state_dict(self, state_dict: dict[str, Any]) -> None: """Load the scheduler's state. Args: state_dict (dict): scheduler state. Should be an object returned from a call to :meth:`state_dict`. """ anneal_func = state_dict.pop("anneal_func", None) strategy = state_dict.get("_anneal_strategy") self.__dict__.update(state_dict) if anneal_func is not None: state_dict["anneal_func"] = anneal_func if strategy is None: if anneal_func == self._linear_anneal: strategy = "linear" elif anneal_func == self._cosine_anneal: strategy = "cos" if strategy is None: strategy = getattr(self, "_anneal_strategy", "cos") self._set_anneal_func(strategy) ``` But given the fact that loading an `SWALR` state_dict before this PR would have caused an error, this seems okay. A GitHub/Google search for `SWALR.load_state_dict` had no results. Happy to change if not, or add a warning just in case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/163122 Approved by: https://github.com/janeyx99	2025-09-17 18:17:26 +00:00
Filip	bc38c5baa1	[optim] prevent problematic tensor aliasing in lr_scheduler (#163098 ) Prevents edge cases in SequentialLR and ReduceLROnPlateau which could corrupt learning rates or trigger recompilation. Supersedes #162360 Fixes #162359 Fixes #163093 While putting #162360 together, I noticed the class of issue I was fixing (i.e. unintended aliasing in lr_schedulers when using Tensor lrs) appeared in several other places. @janeyx99 suggested I put together a follow-up PR. There are several bugs resembling the one fixed in #162360. I added a helper to fix these: ```python def _update_param_group_val(param_group: dict[str, Any], key: str, val: float \| Tensor): """Set param_group[key] to val without aliasing or assignment when they're both tensors. Raises a KeyError if param_group[key] does not exist. """ if isinstance(param_group[key], Tensor): param_group[key].fill_(_to_scalar(val)) else: param_group[key] = val ``` And applied it to fix bugs in `SequentialLR.__init__` and `LRScheduler._update_lr`. I also added it to `CyclicLR.__init__` which was using an equivalent pattern, and `CosineAnnealingWarmRestarts.step` which should have had a similar issue: ```python for param_group, lr in zip(self.optimizer.param_groups, self.get_lr()): param_group["lr"] = lr ``` But did not, because `get_lr()` actually returns tensors when using a tensor lr (despite its `list[float]` return type annotation). Relying on this propagation seems fragile, so I conservatively added the method here as well. I'll be fixing the type annotations and several related issues in followup PRs built off of this one. Pull Request resolved: https://github.com/pytorch/pytorch/pull/163098 Approved by: https://github.com/janeyx99	2025-09-17 13:40:23 +00:00
Kushagra Rastogi	cfc539fe15	Improved error lr last epoch (#162368 ) Fixes #160626 Pull Request resolved: https://github.com/pytorch/pytorch/pull/162368 Approved by: https://github.com/janeyx99	2025-09-15 23:33:14 +00:00
zeshengzong	448a7e7e31	Fix `SequentialLR` deprecate warning about invoke `step(epoch)` (#149392 ) Fixes #116776 #76113 #113222 #67958 ## Changes - Refactor `LRScheduler.step` method, leave `epoch` check logic in public method `step` - Move update `lr` logic to `_update_lr` method - Make `SequentialLR` use `_update_lr` to avoid unnecessary warning message ## Test Result ```bash pytest test/optim/test_lrscheduler.py -vv ``` ![image](https://github.com/user-attachments/assets/e1c5527e-193e-4328-bf95-023139ea0416) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149392 Approved by: https://github.com/janeyx99	2025-08-29 11:45:11 +00:00
Xuehai Pan	596b418391	[BE][PYFMT] migrate PYFMT for `{torch,test}/{nn,optim}/**` to `ruff format` (#144548 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144548 Approved by: https://github.com/ezyang	2025-06-14 11:27:04 +00:00
zeshengzong	d7a83ab67b	Fix `lr_scheduler` unexpectedly calls `step()` when init argument last_epoch is larger than -1 (#149312 ) Fixes #102261 ## Changes - Use flag `_is_initial` to replace `self.last_epoch == 0` condition to judge whether `lr` should be initial value - Add test for `ExponentialLR` checkpoint usecase ## Test Result ```python pytest -s test/optim/test_lrscheduler.py -vv ``` ![image](https://github.com/user-attachments/assets/6fd32bcc-b4fb-4421-b891-620bd4900dc1) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149312 Approved by: https://github.com/janeyx99 Co-authored-by: Jane (Yuan) Xu <31798555+janeyx99@users.noreply.github.com>	2025-05-22 08:42:37 +00:00
zeshengzong	eb69f4e609	Add lr_lambda type check in MultiplicativeLR (#151973 ) Fixes #81554 ## TestResult ### Before ```python In [3]: import torch ...: class SimpleLinearModel(torch.nn.Module): ...: def __init__(self): ...: super(SimpleLinearModel, self).__init__() ...: self.linear = torch.nn.Linear(10, 1) ...: ...: def forward(self, x): ...: return self.linear(x) ...: ...: net = SimpleLinearModel() ...: optimizer = torch.optim.Adam(net.parameters(), lr=0.01) ...: scheduler = torch.optim.lr_scheduler.MultiplicativeLR(optimizer, 0.95) ...: for i in range(10): ...: print(i, scheduler.get_last_lr()) ...: scheduler.step() TypeError: 'float' object is not callable ### After ```python ...: scheduler = torch.optim.lr_scheduler.MultiplicativeLR(optimizer, 0.95) TypeError: lr_lambda should be a function, but got float ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/151973 Approved by: https://github.com/janeyx99	2025-04-29 08:21:41 +00:00
zeshengzong	c81d8c231c	Fix CosineAnnealingWarmRestarts reset T_cur (#151289 ) Fixes #88791 ## Test Result ```python pytest test/optim/test_lrscheduler.py -k test_CosineAnnealingWarmRestarts ``` ![image](https://github.com/user-attachments/assets/75ad238c-f319-47dc-bf2d-da05b0879b84) Pull Request resolved: https://github.com/pytorch/pytorch/pull/151289 Approved by: https://github.com/janeyx99	2025-04-28 23:02:55 +00:00
zeshengzong	fb1b7ec173	Remove deprecate method and attirbute in `LRScheduler` (#147301 ) Following [#99270 suggestion](https://github.com/pytorch/pytorch/issues/99270#issuecomment-1511656408), remove deprecate method `LRScheduler.print_lr` Pull Request resolved: https://github.com/pytorch/pytorch/pull/147301 Approved by: https://github.com/janeyx99	2025-03-05 05:30:19 +00:00
Tom Ritchford	d8c8ba2440	Fix unused Python variables in test/[e-z]* (#136964 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/136964 Approved by: https://github.com/justinchuby, https://github.com/albanD	2024-12-18 23:02:30 +00:00
Matt Pitkin	8a5dd7f59b	Allow SequentialLR to include ChainedScheduler (#133450 ) This fixes #132745 and allows a `SequentialLR` to include schedulers that are compound scheduler types (i.e., a `ChainedScheduler`), which contain a list of schedulers in a `_schedulers` attribute. Pull Request resolved: https://github.com/pytorch/pytorch/pull/133450 Approved by: https://github.com/janeyx99	2024-10-18 02:29:38 +00:00
Jane Xu	f9ed39c989	Autoupdate min_lrs for ReduceLROnPlateau if possible, fixes #104361 (#137637 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/137637 Approved by: https://github.com/albanD	2024-10-10 01:23:30 +00:00
Oguz Ulgen	221350e3a4	Add None return type to init -- tests (#132352 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/132352 Approved by: https://github.com/ezyang ghstack dependencies: #132335, #132351	2024-08-01 15:44:51 +00:00
Xuehai Pan	fbe6f42dcf	[BE][Easy][8/19] enforce style for empty lines in import segments in `test/[k-p]*/` (#129759 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129759 Approved by: https://github.com/justinchuby, https://github.com/ezyang	2024-07-31 02:09:20 +00:00
GdoongMathew	3437177e2b	Quick Fix on #126854 , deepcopy `lr` and other possible `base_parameters` (#127190 ) * Apply `deepcopy` to every base parameters (`initial_lr`, `max_lr`) when instantiating `LRScheduler`. Fixes #126854 Pull Request resolved: https://github.com/pytorch/pytorch/pull/127190 Approved by: https://github.com/janeyx99	2024-06-03 18:06:31 +00:00
Mikayla Gawarecki	383d2d1f6c	Add testing and fix issues for weights_only load for LRScheduler (#123775 ) Fixes https://github.com/pytorch/pytorch/issues/98921 There were two issues detected: - `MultiStepLR`: issue is described in https://github.com/pytorch/pytorch/issues/98921, this is resolved by allowlisting `collections.Counter` - `OneCycleLR`: `state_dict['anneal_func']` is either `<function OneCycleLR._annealing_cos at 0x7f364186f5b0>` or `<function OneCycleLR._annealing_linear at 0x7f39aa483640>` depending on the `anneal_func` kwarg. This leads to `WeightsUnpickler error: Unsupported class __builtin__.getattr` from the `weights_only` Unpickler. Fixed the above in a BC-compatible manner by adding `OneCyclicLR._anneal_func_type` as a string attribute and removing `OneCyclicLR.anneal_func` Pull Request resolved: https://github.com/pytorch/pytorch/pull/123775 Approved by: https://github.com/albanD, https://github.com/malfet	2024-04-16 20:29:27 +00:00
Yuanhao Ji	8ce29f1416	Enable UFMT on `test/onnx_caffe2`, `test/optim`, `test/package` and `test/profiler` (#123901 ) Part of: #123062 Ran lintrunner on: - `test/onnx_caffe2` - `test/optim` - `test/package` - `test/profiler` Pull Request resolved: https://github.com/pytorch/pytorch/pull/123901 Approved by: https://github.com/ezyang	2024-04-15 17:46:59 +00:00
Alexander Kurakin	9a1df7cfd7	ReduceLROnPlateau init _last_lr (#119366 ) (#119556 ) Fixes #119366 Pull Request resolved: https://github.com/pytorch/pytorch/pull/119556 Approved by: https://github.com/janeyx99	2024-02-09 19:35:02 +00:00
rockerBOO	d810b10232	Add beta1 support to CyclicLR momentum (#113548 ) Fixes #73910 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113548 Approved by: https://github.com/janeyx99	2024-01-23 01:16:58 +00:00
ancestor-mithril	2b72543f36	Solving pickle error when saving CyclicLR state_dict (#110931 ) ## How to reproduce: ```py import os import tempfile import torch from torch import nn from torch.optim import SGD from torch.optim.lr_scheduler import CyclicLR model = nn.Linear(100, 100) opt = SGD(model.parameters(), lr=1.) scheduler = CyclicLR(opt, base_lr=0.1, max_lr=0.2, scale_fn=lambda x: 0.99) tmp = tempfile.NamedTemporaryFile(delete=False) try: torch.save(scheduler.state_dict(), tmp.name) scheduler.load_state_dict(torch.load(tmp.name)) finally: tmp.close() os.unlink(tmp.name) ``` Error: ``` _pickle.PicklingError: Can't pickle <function <lambda> at 0x000001A51DF67600>: attribute lookup <lambda> on __main__ failed ``` ## Fix: Saving `scale_fn` to the state dict only if it is a callable object and not if it is a function or lambda. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110931 Approved by: https://github.com/janeyx99	2023-11-22 11:38:35 +00:00
Jane Xu	5e30741754	Clean up optimizer imports in test_optim (#113971 ) This is purely a cosmetic change to set up for my optimizer infos, which will benefit from not needing to type optim.SparseAdam or whatever. The next step is actually adding the OptimizerInfos, similar to my attempt in https://github.com/pytorch/pytorch/pull/102774/files Pull Request resolved: https://github.com/pytorch/pytorch/pull/113971 Approved by: https://github.com/cpuhrsch	2023-11-18 03:52:01 +00:00
Thomas J. Fan	a4dc3716c0	Deprecated verbose parameter in LR schedulers (#111302 ) Fixes https://github.com/pytorch/pytorch/issues/100847 This PR follows the comment in https://github.com/pytorch/pytorch/issues/100847#issuecomment-1546247239 by deprecating the `verbose` parameter and removing the print statements. Removing the print statements is technically BC breaking, so I would be okay with putting them back in. To be less annoying, this PR raises a warning only when `verbose` is explicitly passed in. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111302 Approved by: https://github.com/albanD	2023-11-10 23:17:27 +00:00
Jane Xu	4dc66a4b5c	[BE] fix type iteration typo in test_lrscheduler.py (#106908 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106908 Approved by: https://github.com/clee2000, https://github.com/soulitzer	2023-08-09 23:56:06 +00:00
Justin Chu	4cc1745b13	[BE] f-stringify torch/ and scripts (#105538 ) This PR is a follow up on the pyupgrade series to convert more strings to use f-strings using `flynt`. - https://docs.python.org/3/reference/lexical_analysis.html#f-strings - https://pypi.org/project/flynt/ Command used: ``` flynt torch/ -ll 120 flynt scripts/ -ll 120 flynt tools/ -ll 120 ``` and excluded `collect_env.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105538 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-07-21 19:35:24 +00:00
Ali Moezzi	8c3958eddc	Fix lr_scheduler serialization contains bound methods issue (#102627 ) Fixes #42376 `torch.save` serializes bound methods inside LR scheduler resulting in large serialized file. Test cases include checking file size, checking if the `anneal_func` is bounded and file is loaded correctly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102627 Approved by: https://github.com/albanD	2023-06-23 03:53:15 +00:00
Catherine Lee	08c4a442fd	Dont run test files that are already run in test_optim (#103017 ) they get run twice on accident Pull Request resolved: https://github.com/pytorch/pytorch/pull/103017 Approved by: https://github.com/janeyx99	2023-06-06 17:31:21 +00:00
Jane Xu	a53cda1ddc	[optim][BE] split test file into logical parts: SWA, LR, optim (#101100 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101100 Approved by: https://github.com/albanD	2023-05-12 16:41:44 +00:00

33 Commits