Commit Graph

27 Commits

Horace He
bb41e62e3b Updated SGD docs with subscripts (#23985)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/23982

An obvious improvement, IMO.

Also changed `rho` to `mu`, since `rho` and `p` look very similar.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23985

Differential Revision: D16733037

Pulled By: Chillee

fbshipit-source-id: 5431615d1983f24d6582da6fc8103ac0093b5832
2019-08-09 10:32:40 -07:00
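
A sketch of the subscripted momentum update the PR describes, with `mu` as the momentum coefficient (an approximation of the resulting docstring, not a verbatim copy):

```latex
v_{t+1} = \mu \, v_t + g_{t+1}, \qquad p_{t+1} = p_t - \mathrm{lr} \cdot v_{t+1}
```
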
Neta Zmora
1c76746f61 SGD: remove unneeded multiply-add initialization operations (#18114)
Summary:
The momentum buffer is initialized to the value of
d_p, but the current code takes the long way to do this:
1. Create a buffer of zeros
2. Multiply the buffer by the momentum coefficient
3. Add d_p to the buffer

All of these can be collapsed into a single step:
1. Create a clone of d_p
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18114

Differential Revision: D14509122

Pulled By: ezyang

fbshipit-source-id: 4a79b896201d5ff20770b7ae790c244ba744edb8
2019-03-19 10:34:17 -07:00
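
A minimal sketch of the simplification described in the commit above; the variable names (`p`, `d_p`, `buf`, `momentum`) follow the `torch.optim.SGD` step, but the snippet is illustrative:

```python
import torch

p = torch.randn(3)          # parameter
d_p = torch.randn(3)        # its gradient
momentum = 0.9

# Before: three operations to initialize the momentum buffer
buf = torch.zeros_like(p)   # 1. create a buffer of zeros
buf.mul_(momentum)          # 2. multiply by momentum (still all zeros)
buf.add_(d_p)               # 3. add d_p; the buffer now equals d_p

# After: a single operation with the same result
buf = torch.clone(d_p).detach()
```
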
Tongzhou Wang
a2880531ea fix SGD lr check (#6244) 2018-04-03 21:29:18 -04:00
lazypanda1
063946d2b3 Added parameter range checks for all optimizers (#6000) 2018-03-28 11:22:23 +02:00
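
A hedged sketch of the constructor validation pattern these two commits introduced; `check_sgd_args` is a hypothetical helper and the exact messages are assumptions:

```python
def check_sgd_args(lr, momentum=0.0, weight_decay=0.0):
    # Reject out-of-range hyperparameters before any state is created.
    if lr < 0.0:
        raise ValueError("Invalid learning rate: {}".format(lr))
    if momentum < 0.0:
        raise ValueError("Invalid momentum value: {}".format(momentum))
    if weight_decay < 0.0:
        raise ValueError("Invalid weight_decay value: {}".format(weight_decay))
```
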
SsnL
f76d6c029c Sparse Adam optimizer for sparse gradients (#3137)
* sparse adam

* Favor dense addition over sparse_mask
2017-11-06 14:20:51 -05:00
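
A usage sketch for the optimizer this PR added: `SparseAdam` is meant for parameters that receive sparse gradients, e.g. an `Embedding` built with `sparse=True` (sizes below are arbitrary):

```python
import torch

emb = torch.nn.Embedding(1000, 64, sparse=True)
opt = torch.optim.SparseAdam(emb.parameters(), lr=1e-3)

loss = emb(torch.tensor([1, 2, 3])).sum()
loss.backward()   # emb.weight.grad is a sparse tensor
opt.step()        # updates only the rows touched by the gradient
```
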
SsnL
ba05dc5549 dense buffer (#3139) 2017-10-17 00:51:37 +02:00
Taehoon Lee
61e4723132 Fix typos (#2472) 2017-08-25 14:13:38 -04:00
Leonid Vlasenkov
46a868dab7 [Ready] Limit docs line length (#1900)
* some docs are ready

* docs

* docs

* fix some more

* fix some more
2017-07-10 10:24:54 -04:00
Soumith Chintala
85954032d9 fix doc formatting 2017-04-05 22:02:29 -04:00
Nitish Shirish Keskar
1a04b92226 add note regarding SGD momentum 2017-04-05 20:45:41 -04:00
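
The note added here points out that PyTorch's momentum update differs subtly from Sutskever et al.; roughly:

```latex
% PyTorch:
v_{t+1} = \mu \, v_t + g_{t+1}, \qquad p_{t+1} = p_t - \mathrm{lr} \cdot v_{t+1}
% Sutskever et al.:
v_{t+1} = \mu \, v_t + \mathrm{lr} \cdot g_{t+1}, \qquad p_{t+1} = p_t - v_{t+1}
```
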
Martin Raison
f17cfe4293 sparse tensor operations (#735) 2017-03-03 18:37:03 +01:00
Adam Paszke
3277d83648 Add Nesterov Momentum (#887) 2017-03-01 20:49:59 +01:00
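
A hedged sketch of the Nesterov branch this commit introduced, written against the modern SGD step and assuming dampening is 0:

```python
import torch

p = torch.randn(3)          # parameter
d_p = torch.randn(3)        # its gradient
buf = torch.zeros_like(p)   # momentum buffer
momentum, lr, nesterov = 0.9, 0.1, True

buf.mul_(momentum).add_(d_p)                 # v <- mu * v + g
if nesterov:
    step_dir = d_p.add(buf, alpha=momentum)  # look-ahead: g + mu * v
else:
    step_dir = buf                           # classic momentum
p.add_(step_dir, alpha=-lr)                  # p <- p - lr * step_dir
```
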
Adam Paszke
ecfcf39f30 Improve optimizer serialization
Also, add optimizer.load_state_dict
2017-01-24 17:30:50 -05:00
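
A usage sketch of the round-trip this commit enabled:

```python
import torch

model = torch.nn.Linear(4, 2)
opt = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

state = opt.state_dict()     # plain dict, suitable for torch.save()
opt2 = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)
opt2.load_state_dict(state)  # restores momentum buffers and hyperparameters
```
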
Adam Paszke
95f0fa8a92 Change .grad attribute of Variables to be a Variable 2017-01-16 12:59:47 -05:00
Adam Paszke
604e13775f Add optim docs 2017-01-16 12:59:47 -05:00
Adam Paszke
75d850cfd2 Fix optim docs 2016-12-30 00:15:06 -05:00
Sam Gross
126a1cc398 Add Sphinx docs 2016-12-28 00:03:39 +01:00
Sam Gross
162170fd7b Add optional weight decay to optim.SGD (#269) 2016-11-29 20:35:40 -05:00
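
Usage sketch; SGD applies `weight_decay` as an L2 term folded into the gradient before the update (standard for this optimizer, though the exact placement here is an assumption):

```python
import torch

model = torch.nn.Linear(4, 2)
# Effectively adds weight_decay * p to each parameter's gradient:
opt = torch.optim.SGD(model.parameters(), lr=0.1, weight_decay=1e-4)
```
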
Adam Paszke
09493603f6 Change optimizer API 2016-11-08 18:12:56 +01:00
Adam Paszke
df59b89fbb Add more optimizers 2016-11-07 22:50:56 +01:00
Adam Paszke
4db6667923 Allow specifying per-parameter optimization parameters 2016-10-04 18:21:50 -07:00
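
A sketch of the per-parameter-group interface this commit introduced, shown in its modern form; the `base`/`classifier` split is hypothetical:

```python
import torch

model = torch.nn.ModuleDict({
    "base": torch.nn.Linear(8, 4),
    "classifier": torch.nn.Linear(4, 2),
})

# Each dict may override the defaults that follow the list:
opt = torch.optim.SGD(
    [
        {"params": model["base"].parameters()},                    # uses lr=1e-2
        {"params": model["classifier"].parameters(), "lr": 1e-3},  # overrides lr
    ],
    lr=1e-2,
    momentum=0.9,
)
```
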
Adam Paszke
58b134b793 Allow exporting optimizer state as a dict 2016-10-04 17:33:49 -07:00
Soumith Chintala
9842be4b15 setting default dampening value to 0 2016-09-13 10:28:33 -07:00
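
With dampening in the picture, the buffer update generalizes as below; a default of 0 recovers the plain momentum update shown earlier (sketch):

```latex
v_{t+1} = \mu \, v_t + (1 - \mathrm{dampening}) \, g_{t+1}
```
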
Adam Paszke
ff785e5f17 Make optimizers accept a closure 2016-08-25 09:23:39 -07:00
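
A usage sketch of the closure interface: `step()` accepts an optional callable that re-evaluates the model and returns the loss (`model`, `inputs`, `targets` below are placeholders):

```python
import torch

model = torch.nn.Linear(4, 2)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
inputs, targets = torch.randn(8, 4), torch.randn(8, 2)

def closure():
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(model(inputs), targets)
    loss.backward()
    return loss

opt.step(closure)  # SGD evaluates the closure once and applies the update
```
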
Adam Paszke
7bcb2a4081 Initial optim version 2016-08-23 19:03:30 -07:00
Adam Paszke
2f342af22f Move optim to legacy 2016-08-01 12:01:46 -04:00
Adam Paszke
554a1d8336 Add optim 2016-07-21 16:42:06 -04:00