Summary:
This is a faster and more idiomatic way of using `itertools.chain`: instead of computing all the items in the iterable up front and storing them in memory, they are computed lazily, one at a time, and never stored as one huge list. This can save both runtime and memory.
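For illustration, a minimal sketch of the lazy pattern (the variable names here are placeholders, not the exact diff):

```python
from itertools import chain

param_groups = [{'params': [1, 2, 3]}, {'params': [4, 5]}]

# Eager: flattens everything into one big intermediate list in memory.
eager = [p for g in param_groups for p in g['params']]

# Lazy: items are produced one at a time and never stored together.
lazy = chain.from_iterable(g['params'] for g in param_groups)

assert eager == list(lazy) == [1, 2, 3, 4, 5]
```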
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40156
Reviewed By: ezyang
Differential Revision: D22189038
Pulled By: vincentqb
fbshipit-source-id: 160b2c27f442686821a6ea541e1f48f4a846c186
Summary:
Fixes https://github.com/pytorch/pytorch/issues/36831.
Instead of `id()`, an arbitrary yet consistent order-based index is used. This makes the output deterministic between runs.
I am not the biggest fan of using `nonlocal` (it appears to be used sparingly in the codebase) to carry `start_index` between calls to `pack_group()` (see the sketch after the list below), but the alternatives had larger issues:
- Using the last value added to `param_mappings` would be ideal, but that only works if `dict` iteration order is consistent, and PyTorch currently supports Python <3.7.
- Using the maximum value added to `param_mappings` wouldn't have that issue but would not be constant time.
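For reference, roughly the shape the packing code takes (a sketch from memory, assuming it lives as an `Optimizer` method with `import torch` available; not the verbatim diff):

```python
def state_dict(self):
    # Save order-based indices instead of Tensor ids.
    param_mappings = {}
    start_index = 0

    def pack_group(group):
        nonlocal start_index
        packed = {k: v for k, v in group.items() if k != 'params'}
        param_mappings.update({id(p): i
                               for i, p in enumerate(group['params'], start_index)
                               if id(p) not in param_mappings})
        packed['params'] = [param_mappings[id(p)] for p in group['params']]
        start_index += len(packed['params'])
        return packed

    param_groups = [pack_group(g) for g in self.param_groups]
    # Remap state keys from Tensors to their order-based indices.
    packed_state = {(param_mappings[id(k)] if isinstance(k, torch.Tensor) else k): v
                    for k, v in self.state.items()}
    return {'state': packed_state, 'param_groups': param_groups}
```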
For testing, I confirmed that `test_optim.py` works before and after these changes. Randomizing the indices in `param_mappings` causes the tests to fail, which is further evidence that these changes work. I'm not 100% sure these tests are sufficient, but they're a start.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37347
Differential Revision: D21353820
Pulled By: vincentqb
fbshipit-source-id: e549f1f154833a461b1f4df6d07ad509aab34ea1
Summary:
Resubmit #20698 which got messed up.
Idea is that when PyTorch is used in a custom build environment (e.g. Facebook), it's useful to track usage of various APIs centrally. This PR introduces a simple, very lightweight mechanism to do so - only the first invocation of a trigger point is logged. This is significantly more lightweight than #18235, so we can afford to put logging in e.g. TensorImpl.
Also adds an initial list of trigger points. Trigger points are added in such a way that no static initialization triggers them, i.e. just linking with libtorch.so will not cause any logging. Further suggestions of what to log are welcome.
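To illustrate the log-once idea only (the actual mechanism in this PR is implemented in C++ inside the core library; everything below is a made-up Python sketch, not the real API):

```python
import threading

_seen_events = set()
_seen_lock = threading.Lock()

def log_api_usage_once(event: str) -> None:
    """Report `event` the first time it is seen in this process.

    Later calls are cheap no-ops, which is what makes it acceptable to
    place trigger points on hot paths such as tensor construction.
    """
    with _seen_lock:
        if event in _seen_events:
            return
        _seen_events.add(event)
    print(f"API usage: {event}")  # stand-in for the host environment's logger
```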
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20745
Differential Revision: D15429196
Pulled By: dzhulgakov
fbshipit-source-id: a5e41a709a65b7ebccc6b95f93854e583cf20aca
Summary:
Add the defaults field to the copied object.
Prior to this patch, optimizer.__getstate__ excluded the defaults
attribute of the source optimizer object, which is required by some LR schedulers (e.g. CyclicLR with momentum).
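In other words, the fix amounts to including `defaults` in the state used when the optimizer is copied, roughly (a sketch, not the exact diff):

```python
class Optimizer(object):
    def __getstate__(self):
        return {
            'defaults': self.defaults,      # previously missing from copies
            'state': self.state,
            'param_groups': self.param_groups,
        }
```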
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19308
Differential Revision: D15012801
Pulled By: soumith
fbshipit-source-id: 95801b269f6f9d78d531d4fed95c973b280cc96f
Summary:
Small change -- the benefit is that the docs will show
``<required parameter>`` instead of ``<object object>``
for these required parameters.
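A sketch of the sentinel pattern in question (`make_optimizer` is a hypothetical stand-in for an optimizer constructor):

```python
class _RequiredParameter(object):
    """Sentinel whose repr is what the generated docs render."""
    def __repr__(self):
        return "<required parameter>"

required = _RequiredParameter()

def make_optimizer(params, lr=required):
    # The docs now show lr=<required parameter> instead of lr=<object object>.
    if lr is required:
        raise ValueError("lr is a required argument")
    return {'params': list(params), 'lr': lr}
```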
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13202
Reviewed By: SsnL
Differential Revision: D12826252
Pulled By: jma127
fbshipit-source-id: 5f2c8495e5c56920377e4e012b8711e8f2a6e30e
* Codemod to update our codebase to 0.4 standard
* Update some of the test scripts
* remove Variable in test_clip_grad_value
* fix _symbolic_override_wrapper_maker
* Block set from param_group['params']
A set might cause `list(params)` to come out in a random order, in which case `id_map` in `load_state_dict()` would not be matched correctly (see the sketch after this list).
* Update Error Message
* Add Warning on Optimizer Docs
* Update optimizer.py
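A rough sketch of the kind of guard this adds (`_check_param_group` is a hypothetical helper and the error message is paraphrased):

```python
def _check_param_group(param_group):
    # Sets iterate in an order that can change between runs, which would
    # break the id_map matching done in load_state_dict().
    if isinstance(param_group['params'], set):
        raise TypeError("optimizer parameters need to be organized in "
                        "ordered collections; please use a list instead "
                        "of a set")
    return dict(param_group, params=list(param_group['params']))
```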
This replaces the torch.Tensor constructors with factories that produce
Variables. Similarly, functions on the torch module (e.g. torch.randn)
now return Variables.
To keep the PR to a reasonable size, I've left most of the unused tensor
code. Subsequent PRs will remove the dead code, clean up calls to
torch.autograd.Variable, and rename Variable to Tensor everywhere.
There are some breaking changes because Variables and Tensors had
slightly different semantics. There's a list of those changes here:
https://github.com/pytorch/pytorch/wiki/Breaking-Changes-from-Variable-and-Tensor-merge
This removes volatile from Variable. The functionality is mostly
replaced by a global (thread-local) flag, which is controlled by
torch.set_grad_enabled() and the context manager torch.no_grad().
In C++, the flag is exposed through GradMode::is_enabled() and GradMode::set_enabled()
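A small usage sketch of the Python side:

```python
import torch

x = torch.ones(3, requires_grad=True)

# Context manager: gradient tracking is disabled inside the block.
with torch.no_grad():
    y = x * 2
assert not y.requires_grad

# Thread-local global switch, which replaces `volatile`.
torch.set_grad_enabled(False)
z = x * 2
assert not z.requires_grad
torch.set_grad_enabled(True)
```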
Fixes #3627
When I use `named_parameters` to modify the lr and weight decay, I run into a bug, because the values returned by `named_parameters` are `torch.nn.parameter.Parameter` objects, not a generator of Parameters.
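For reference, one way to build per-parameter options from `named_parameters()` (an illustrative sketch, not the documented fix):

```python
import torch
import torch.nn as nn

model = nn.Linear(4, 2)

# named_parameters() yields (name, Parameter) pairs, so the Parameter
# itself has to be unpacked when building per-parameter groups.
param_groups = [
    {'params': [param], 'lr': 0.01 if 'bias' in name else 0.1}
    for name, param in model.named_parameters()
]
optimizer = torch.optim.SGD(param_groups, lr=0.1, weight_decay=1e-4)
```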
Here's the command I used to invoke autopep8 (in parallel!):
git ls-files | grep '\.py$' | xargs -n1 -P`nproc` autopep8 -i
Several rules are ignored in setup.cfg. The goal is to let autopep8
handle everything which it can handle safely, and to disable any rules
which are tricky or controversial to address. We may want to come back
and re-enable some of these rules later, but I'm trying to make this
patch as safe as possible.
Also configures flake8 to match pep8's behavior.
Also configures TravisCI to check the whole project for lint.