Summary:
Hi,
I noticed that after v1.2.0 the implementation of the LBFGS optimizer changed. In the new implementation, the convergence check changed from the sum of the absolute gradient values to the maximum absolute gradient value (see: b15d91490a/torch/optim/lbfgs.py (L313)). But the default tolerance_grad parameter was not changed (it is too large for a max-of-gradients check), so a lot of my old code either does not optimize at all or optimizes for only one or two steps.
So I opened this pull request to suggest changing this tolerance_grad to a smaller value.
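For context, a rough sketch of the difference between the two checks (illustrative values; the threshold shown is the old default, which was tuned for the sum-based check):

    import torch

    flat_grad = torch.tensor([3e-6, -3e-6, 3e-6, 3e-6])
    tolerance_grad = 1e-5  # old default, calibrated for a sum-based check

    # pre-v1.2.0 style check: sum of absolute gradient entries
    converged_sum = (flat_grad.abs().sum() <= tolerance_grad).item()  # False

    # post-v1.2.0 style check: largest absolute gradient entry
    converged_max = (flat_grad.abs().max() <= tolerance_grad).item()  # True

Since the max is always at most the sum, keeping the old threshold with the new check makes the optimizer report convergence much earlier.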
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25240
Differential Revision: D17102713
Pulled By: vincentqb
fbshipit-source-id: d46acacdca1c319c1db669f75da3405a7db4a7cb
* Make return uniform in lbfgs step
This ensures that we return results of the same type
in the LBFGS step.
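As a toy illustration (not the actual optimizer code), every exit path hands back the same kind of object:

    import torch

    def toy_step(flat_grad: torch.Tensor, loss: torch.Tensor,
                 tolerance_grad: float = 1e-7) -> torch.Tensor:
        # Both exits return the tensor `loss`, so callers never have to
        # handle a mix of tensors and plain Python floats.
        if flat_grad.abs().max() <= tolerance_grad:
            return loss  # early exit: gradient already small enough
        # ... a real step would update the parameters here ...
        return loss  # normal exit: same type as the early exit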
* Adding test case to exercise different exit points
Sets tolerance_grad to negative infinity and positive
infinity to deterministically exercise the early-exit branch.
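A sketch of what such a test can look like (hypothetical test body; the actual test lives in the PyTorch test suite):

    import torch
    from torch import optim

    def run_lbfgs(tolerance_grad):
        x = torch.randn(2, requires_grad=True)
        opt = optim.LBFGS([x], tolerance_grad=tolerance_grad)

        def closure():
            opt.zero_grad()
            loss = (x ** 2).sum()
            loss.backward()
            return loss

        return opt.step(closure)

    # +inf: the max-gradient check passes immediately, so step() takes
    # the early-exit branch after the first closure evaluation.
    loss_early = run_lbfgs(float('inf'))

    # -inf: the check can never pass, so the full path is exercised.
    loss_full = run_lbfgs(float('-inf'))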
* Fixing lint error
Switches the step/direction variable names (steps and directions are flipped
in the current implementation of the two-loop recursion). This change does
not alter the numerical output of the program, but should make the code
easier to follow.
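For reference, a minimal sketch of the two-loop recursion with the history buffers named for what they actually hold (assuming 1-D flattened tensors, rho[i] = 1 / (y_i . s_i), and gamma as the initial Hessian scaling; the variable names here are illustrative, not the ones in lbfgs.py):

    import torch

    def two_loop_recursion(flat_grad, old_stps, old_dirs, rho, gamma):
        # old_stps[i] holds the step s_i = x_{i+1} - x_i and old_dirs[i]
        # holds the gradient change y_i = g_{i+1} - g_i.
        q = flat_grad.neg()
        alphas = []
        for s, y, r in zip(reversed(old_stps), reversed(old_dirs),
                           reversed(rho)):
            alpha = (r * s.dot(q)).item()
            q.add_(y, alpha=-alpha)
            alphas.append(alpha)
        d = q.mul(gamma)  # apply the initial Hessian approximation H0
        for (s, y, r), alpha in zip(zip(old_stps, old_dirs, rho),
                                    reversed(alphas)):
            beta = (r * y.dot(d)).item()
            d.add_(s, alpha=alpha - beta)
        return d  # the search direction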
* Codemod to update our codebase to 0.4 standard
* Update some of the test scripts
* remove Variable in test_clip_grad_value
* fix _symbolic_override_wrapper_maker
This replaces the torch.Tensor constructors with factories that produce
Variables. Similarly, functions on the torch module (e.g. torch.randn)
now return Variables.
To keep the PR to a reasonable size, I've left most of the unused tensor
code. Subsequent PRs will remove the dead code, clean-up calls to
torch.autograd.Variable, and rename Variable to Tensor everywhere.
There are some breaking changes because Variables and Tensors had
slightly different semantics. There's a list of those changes here:
https://github.com/pytorch/pytorch/wiki/Breaking-Changes-from-Variable-and-Tensor-merge
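As a concrete example of the new style (standard 0.4-era usage):

    import torch

    # Old style (pre-0.4): wrap a tensor to make it autograd-capable.
    #   x = torch.autograd.Variable(torch.randn(3), requires_grad=True)

    # New style: factory functions return autograd-capable tensors.
    x = torch.randn(3, requires_grad=True)
    y = (x ** 2).sum()
    y.backward()
    print(x.grad)  # gradients are stored on the tensor itself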
Here's the command I used to invoke autopep8 (in parallel!):
    git ls-files | grep '\.py$' | xargs -n1 -P`nproc` autopep8 -i
Several rules are ignored in setup.cfg. The goal is to let autopep8
handle everything it can handle safely, and to disable any rules
that are tricky or controversial to address. We may want to come back
and re-enable some of these rules later, but I'm trying to make this
patch as safe as possible.
Also configures flake8 to match pep8's behavior.
Also configures TravisCI to check the whole project for lint.
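For illustration, the shape of such a configuration (the rule codes below are hypothetical placeholders; the authoritative ignore list is the one in the repository's setup.cfg):

    # setup.cfg (sketch)
    [pep8]
    max-line-length = 120
    ignore = E501, W503

    [flake8]
    max-line-length = 120
    ignore = E501, W503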