Summary:
load_state_dict includes a recursive inner function `load` that captures
Tensors through the closed-over variable `state_dict`. Because it is
recursive, it also captures itself, creating a reference cycle.
This change breaks the reference cycle so that any Tensors in `state_dict`
can be collected immediately instead of waiting until the next GC cycle.
Alternatively, we could have passed `state_dict` and `metadata` as
arguments to `load` to prevent the capture of Tensors. (That would still
result in cyclic garbage, just not cyclic garbage containing Tensors.)
See:
https://github.com/pytorch/pytorch/issues/20199#issuecomment-491089004
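For illustration, a minimal sketch of the pattern and the fix (the per-parameter copy loop is a stand-in for the real per-module loading logic, not the actual implementation):
```
import torch.nn as nn

# The recursive inner function closes over `state_dict` and over itself, so the
# closure forms a reference cycle that keeps the Tensors alive until the next GC pass.
def load_state_dict_sketch(module, state_dict):
    def load(m, prefix=''):
        for name, param in m._parameters.items():
            key = prefix + name
            if param is not None and key in state_dict:
                param.data.copy_(state_dict[key])
        for name, child in m._modules.items():
            if child is not None:
                load(child, prefix + name + '.')   # `load` captures itself

    load(module)
    # Break the cycle so the closure (and the captured Tensors) is freed by
    # reference counting instead of waiting for the garbage collector.
    del load


load_state_dict_sketch(nn.Linear(3, 4), nn.Linear(3, 4).state_dict())
```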
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20397
Differential Revision: D15414834
Pulled By: colesbury
fbshipit-source-id: 4c2275a08b2d8043deb3779db28be03bda15872d
Summary:
Added the ">>>" Python interpreter prompt (three greater-than symbols) so that the edited lines appear as code, not comments/output, in the documentation. Normally the interpreter would display "..." when expecting a block, but I'm not sure how that would render on the PyTorch docs website. Other code examples use the ">>>" prompt as well, so I used it here too.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19347
Differential Revision: D14986154
Pulled By: soumith
fbshipit-source-id: 8f4d07d71ff7777b46c459837f350eb0a1f17e84
Summary:
Return `missing_keys` and `unexpected_keys` from `load_state_dict` so that the user can handle them when strict mode is off; also removed an unused variable.
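A brief usage sketch of the returned keys (field names follow the summary above; shown with strict mode off):
```
import torch.nn as nn

model = nn.Linear(3, 4)
state = model.state_dict()
state['extra'] = state['bias'].clone()   # simulate a stale key in a checkpoint
del state['bias']                        # simulate a missing key

# With strict=False the mismatch no longer has to be ignored silently;
# the caller can inspect what did not match.
result = model.load_state_dict(state, strict=False)
print(result.missing_keys)      # ['bias']
print(result.unexpected_keys)   # ['extra']
```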
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18668
Differential Revision: D14782073
Pulled By: ezyang
fbshipit-source-id: ab3b855eb77bb7422594d971988067e86eef20f2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17618
Based on the code, we only add keys to `missing_keys` and `unexpected_keys` if `strict` is `True`. The documentation is confusing.
This diff also fixes one flake8 warning.
Reviewed By: ailzhang
Differential Revision: D14280593
fbshipit-source-id: d368f5596bdf74ff62ee4d28d79120f5af91e0a3
Summary:
Previously this would fail with the error message:
```
ValueError: Auto nesting doesn't know how to process an input object of type dict. Accepted types: Tensors, or lists/tuples of them
```
It turns out we're not using the line that causes this error (or any side effect of that line), so removing it fixes the issue. Also cleaned up some related dead code (cc apaszke to make sure the code isn't useful in some way).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16616
Differential Revision: D13908352
Pulled By: suo
fbshipit-source-id: 27094f1f4ea0af215b901f7ed3520e94fbc587b3
Summary:
without this "if", code below will throw error " Linear' object has no attribute '_buffers' "
And with this if, error would be "cannot assign buffer before Module.\_\_init\_\_() call", which I think it's more accurate, just like register_parameter.
```
import math
import torch
from torch.nn.parameter import Parameter
from torch.nn import functional as F
from torch.nn import Module


class Linear(Module):
    def __init__(self, in_features, out_features, bias=True):
        self.in_features = in_features
        self.out_features = out_features
        self.register_buffer('test', torch.Tensor(out_features, in_features))
        self.weight = Parameter(torch.Tensor(out_features, in_features))
        if bias:
            self.bias = Parameter(torch.Tensor(out_features))
        else:
            self.register_parameter('bias', None)
        super(Linear, self).__init__()
        self.reset_parameters()

    def reset_parameters(self):
        stdv = 1. / math.sqrt(self.weight.size(1))
        self.weight.data.uniform_(-stdv, stdv)
        if self.bias is not None:
            self.bias.data.uniform_(-stdv, stdv)

    def forward(self, input):
        return F.linear(input, self.weight, self.bias)

    def extra_repr(self):
        return 'in_features={}, out_features={}, bias={}'.format(
            self.in_features, self.out_features, self.bias is not None
        )


linear = Linear(3, 4)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16110
Differential Revision: D13715839
Pulled By: soumith
fbshipit-source-id: c300eff0a8655aade448354cf489a592f7db722a
Summary:
Fixes an issue that arose from https://github.com/pytorch/pytorch/pull/13481 where `.share_memory()` couldn't be called. Effectively undoes all changes to `nn.Module` from that PR and solves the relevant problem in a different way (the goal was to be able to call `._apply()` on the Python wrapper for a C++ module).
soumith
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15305
Differential Revision: D13493937
Pulled By: goldsborough
fbshipit-source-id: 4cb8687f90fc8709a536c5e7eacd0dc8edf6f750
Summary:
This PR enables C++ frontend modules to be bound into Python and added as submodules of Python modules. For this, I added lots of pybind11 bindings for the `torch::nn::Module` class, and modified the `torch.nn.Module` class in Python to have a new metaclass that makes `isinstance(m, torch.nn.Module)` return true when `m` is a C++ frontend module. The methods and fields of C++ modules are bound in such a way that they work seamlessly as submodules of Python modules for most operations (one exception I know of: calling `.to()` ends up calling `.apply()` on each submodule with a Python lambda, which cannot be used in C++ -- this may require small changes on the Python side).
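A rough sketch of the metaclass idea (`CppModuleBase` is a placeholder for the pybind11-bound base class, not the real name):
```
# `CppModuleBase` stands in for the pybind11 binding of torch::nn::Module.
class CppModuleBase(object):
    pass


class ModuleMeta(type):
    def __instancecheck__(cls, instance):
        # isinstance(m, Module) should also hold for bound C++ frontend modules.
        return isinstance(instance, CppModuleBase) or super().__instancecheck__(instance)


class Module(metaclass=ModuleMeta):
    pass


assert isinstance(CppModuleBase(), Module)   # True without Python-level inheritance
```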
I've added quite a few tests to verify the bindings and equality with Python. I think I should also try adding a C++ module as part of some large PyTorch module, like a WLM or something, and see if everything works smoothly.
The next step for inter-op across our system is ScriptModule <-> C++ Frontend Module inter-op. I think this will then also allow using C++ frontend modules from TorchScript.
apaszke zdevito
CC dzhulgakov
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13481
Differential Revision: D12981996
Pulled By: goldsborough
fbshipit-source-id: 147370d3596ebb0e94c82cec92993a148fee50a7
Summary:
Add to the Tensor doc info about `.device`, `.is_cuda`, `.requires_grad`, `.is_leaf` and `.grad`.
Update the `register_backward_hook` doc with a warning stating that it does not work in all cases.
Add support in the `_add_docstr` function to add docstring to attributes.
There is an explicit cast here, but I am not sure how to handle it properly. The doc field for getsetdescr is documented as a `const char *` (like all other doc fields in descriptor objects) in the CPython online documentation, but in the code it is the only one that is not const.
I assumed this is a bug in the code, since it does not follow the documentation or the convention of the other descriptors, so I cast away the const.
EDIT: the online doc I was looking at is for 3.7, and in that version both the code and the doc are const. For older versions, both are non-const.
Please let me know if this should not be done, and, if it should, whether there is a cleaner way to do it!
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14339
Differential Revision: D13243266
Pulled By: ezyang
fbshipit-source-id: 75b7838f7cd6c8dc72b0c61950e7a971baefaeeb
Summary:
Problems with SN and DP after #12671:
1. In eval mode, `weight_orig` does not get the correct gradient (#12737).
Fix: keep the `v` vector around as a buffer and always calculate `W = W_orig / (u @ W_orig @ v)`, even in eval.
2. In training mode, the `weight` buffer of the parallelized module is never updated if someone touches `weight_orig` and/or `weight` and makes them stop sharing storage, so in `eval` the weight used is wrong.
Fix: Make `weight` not a buffer anymore and always calculate it as above.
3. #12671 changed SN to update `u` in-place to make DP work correctly, but that breaks backward through two forwards (e.g., the common GAN loss `D(real) - D(fake)`) because the vectors needed to backprop through the 1st forward are changed in the 2nd forward.
Fix: This PR clones `u` and `v` before using them (see the sketch below).
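A minimal sketch of the weight computation described in these fixes, assuming `weight_orig`, `u` and `v` name the stored tensors as in the description (not the actual spectral_norm internals):
```
import torch

def compute_weight(weight_orig, u, v):
    # Clone so that backward through two forwards (e.g. D(real) - D(fake))
    # does not see in-place updates of the power-iteration vectors.
    u, v = u.clone(), v.clone()
    w_mat = weight_orig.flatten(1)              # view the weight as a 2-D matrix
    sigma = torch.dot(u, torch.mv(w_mat, v))    # sigma = u @ W_orig @ v
    return weight_orig / sigma                  # W = W_orig / sigma, even in eval
```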
To maintain BC, I added a hook interface for producing and loading state_dict. This is ugly and we should really have a better interface for spectral_norm, but for the purpose of fixing this issue I am making this patch. Even with a better interface, a BC mechanism for loading legacy state_dicts would still be needed.
cc crcrpar
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13350
Differential Revision: D12931044
Pulled By: SsnL
fbshipit-source-id: 8be6f934eaa62414d76d2c644dedd7e1b7eb31ef
Summary:
I spent a couple of minutes trying to understand which shape corresponds to the checkpoint and which one to the model.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12870
Differential Revision: D10466600
Pulled By: SsnL
fbshipit-source-id: 3b68530b1b756462a2acd59e3a033ff633567a6b
Summary:
In the state dict loading code, the error message referring to the shape of the loaded parameters and the parameters in the initialised model had the format arguments in the wrong order. Swapped them round to fix.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11200
Differential Revision: D9631160
Pulled By: SsnL
fbshipit-source-id: 03d9446303bd417fef67027b10d7a27de06486be
Summary:
This commit adds the ``buffers()`` and ``named_buffers()`` methods as
analogues of ``parameters()`` and ``named_parameters()``.
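A brief usage sketch of the new methods (mirroring ``parameters()`` / ``named_parameters()``):
```
import torch.nn as nn

bn = nn.BatchNorm1d(4)

# named_buffers() yields (name, tensor) pairs; buffers() yields only the tensors.
for name, buf in bn.named_buffers():
    print(name, tuple(buf.shape))       # e.g. running_mean (4,), running_var (4,)
n_buffer_elements = sum(buf.numel() for buf in bn.buffers())
```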
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10554
Reviewed By: SsnL
Differential Revision: D9367762
Pulled By: jma127
fbshipit-source-id: f2042e46a7e833dce40cb41681dbd80d7885c74e
Summary:
This PR fixes #9743.
Adds backward-compatibility support for loading a checkpoint from 0.3.* with 1-dim tensors; they are now 0-dim tensors in 0.4+.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9781
Differential Revision: D8988196
Pulled By: ailzhang
fbshipit-source-id: a7a1bc771d597394208430575d5a4d23b9653fef
Summary:
As in the title. Lets us simplify a lot of code.
Depends on #9363, so please review only the last commit.
zdevito
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9414
Reviewed By: zdevito
Differential Revision: D8836496
Pulled By: apaszke
fbshipit-source-id: 9b3c3d1f001a9dc522f8478abc005b6b86cfa3e3
* Add non_blocking to Tensor/Module.to
* flake8
* Add argparse tests
* cpp parse
* Use C++ parser
* use a common parse function with Tensor.to
* fix test_jit
* use THPObjectPtr
* increase refcount for None, True, and False
* address comments
* address comments
* Add version counter to module, change load_state_dict to use load_local_state_dict, which does class-specific loading
* Clarifies version number in docs
* fix jit tests
* fix state_dict tests
* typo
* fix ddp
* exclude version numbers from state dict entries
* Fix jit test and empty modules
* address comments
* test for "."
* revert the private version change in state_dict
* make IN case a hard error
* fix not reporting an error on an unexpected submodule
* address comments
* disallow empty string in name and remove trailing dot
* Codemod to update our codebase to 0.4 standard
* Update some of the test scripts
* remove Variable in test_clip_grad_value
* fix _symbolic_override_wrapper_maker
This PR enables users to print extra information for their subclassed nn.Module.
For now I simply insert the user-defined string at the end of the module name, which should be discussed in this PR.
Before this PR, users had to redefine __repr__ and copy-paste the source code from Module. A usage sketch follows the change list below.
* Add support for extra information on Module
* Rewrite the repr method of Module
* Fix flake8
* Change the __repr__ to get_extra_repr in Linear
* Fix extra new-line for empty line
* Add test for __repr__ method
* Fix bug of block string indent
* Add indent for multi-line repr test.
* Address review comments
* Update tutorial for creating nn.Module
* Fix flake8, add extra_repr of bilinear
* Refactor DropoutNd
* Change to extra_repr in some Modules
* Fix flake8
* Refactor padding modules
* Refactor pooling module
* Fix typo
* Change to extra_repr
* Fix bug for GroupNorm
* Fix bug for LayerNorm
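A brief usage sketch of the resulting API, assuming the final method name is extra_repr (as in the later items above):
```
import torch.nn as nn

class Scale(nn.Module):
    def __init__(self, factor):
        super(Scale, self).__init__()
        self.factor = factor

    def forward(self, x):
        return x * self.factor

    def extra_repr(self):
        # Extra information shown inside the parentheses of repr(module),
        # instead of overriding __repr__ and copying Module's source.
        return 'factor={}'.format(self.factor)

print(Scale(2.0))   # Scale(factor=2.0)
```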
Previously, it was being implicitly imported via the import of
torch.onnx.
This is no longer the case, and it is a hacky thing to depend on anyway,
so import it explicitly.
* Improve documentation
1. Add formula for erf, erfinv
2. Make exp, expm1 similar to log, log1p
3. Symbol change in ge, le, ne, isnan
* Fix minor nit in the docstring
* More doc improvements
1. Added some formulae
2. Complete scanning till "Other Operations" in Tensor docs
* Add more changes
1. Modify all torch.Tensor wherever required
* Fix Conv docs
1. Fix minor nits in the references for LAPACK routines
* Improve Pooling docs
1. Fix lint error
* Improve docs for RNN, Normalization and Padding
1. Fix flake8 error for pooling
* Final fixes for torch.nn.* docs.
1. Improve Loss Function documentation
2. Improve Vision Layers documentation
* Fix lint error
* Improve docstrings in torch.nn.init
* Fix lint error
* Fix minor error in torch.nn.init.sparse
* Fix Activation and Utils Docs
1. Fix Math Errors
2. Add explicit clean to Makefile in docs to prevent running graph generation script
while cleaning
3. Fix utils docs
* Make PYCMD a Makefile argument, clear up prints in the build_activation_images.py
* Fix batch norm doc error
The nn.* counterpart of #5443. Mostly removed the Variable wrapper. Also added docs for nn.RReLU.
Note that torch.randn(*, requires_grad=True) isn't documented until #5462 is done.
This replaces the torch.Tensor constructors with factories that produce
Variables. Similarly, functions on the torch module (e.g. torch.randn)
now return Variables.
To keep the PR at a reasonable size, I've left most of the unused tensor
code. Subsequent PRs will remove the dead code, clean up calls to
torch.autograd.Variable, and rename Variable to Tensor everywhere.
There are some breaking changes because Variable and Tensors had
slightly different semantics. There's a list of those changes here:
https://github.com/pytorch/pytorch/wiki/Breaking-Changes-from-Variable-and-Tensor-merge
This removes volatile from Variable. The functionality is mostly
replaced by a global (thread-local) flag, which is controlled by
torch.set_grad_enabled() and the context manager torch.no_grad().
In C++, the flag is exposed through GradMode::is_enabled() and GradMode::set_enabled()
Fixes #3627
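A brief usage sketch of the replacement flag, using the interfaces named above:
```
import torch

x = torch.ones(2, 2, requires_grad=True)

with torch.no_grad():            # context manager replaces volatile=True
    y = x * 2
print(y.requires_grad)           # False

torch.set_grad_enabled(False)    # thread-local global switch
z = x * 2
print(z.requires_grad)           # False
torch.set_grad_enabled(True)
```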
* Avoid casting integer params and buffers to float(), double() and half()
* Add test for immune integer buffers
* Fix documentation for float(), double() and half()
* Fix test
* made it explicit in the docstring of Module.register_forward_hook() that the hook(s) will be called AFTER calling forward().
* added "every time" in docstring of Module.register_forward_pre_hook()
* Add weight normalization implementation
This adds forward "pre-hooks", which get called before the module's
forward() method. Weight norm is implemented as a hook that recalculates
the weight variable from weight_g and weight_v on every iteration (see the
sketch after this list).
Based on @rtqichen's implementation.
* Specify return type
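A rough sketch of the pre-hook mechanism described above; this is not the torch.nn.utils.weight_norm code, and the manual weight_g / weight_v setup is purely illustrative:
```
import torch
import torch.nn as nn

def recompute_weight(module, inputs):
    # Runs before every forward(): rebuild `weight` from magnitude and direction.
    g, v = module.weight_g, module.weight_v
    module.weight = g * v / v.norm()

linear = nn.Linear(3, 4)
del linear._parameters['weight']       # hand ownership of `weight` to the hook
linear.weight_g = torch.ones(())       # magnitude (a single scalar here)
linear.weight_v = torch.randn(4, 3)    # direction
linear.register_forward_pre_hook(recompute_weight)

out = linear(torch.randn(2, 3))        # pre-hook fires first, then forward()
```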
A module that returns a non-standard data structure currently breaks
due to checks for backward hooks. This refactors the code slightly so
that it will only break when backward hooks are actually present.
We were keying hooks by RemovableHandle id. However, we don't hold onto
handles, and ids of dead objects can be reused. This replaces id(handle)
with a global counter.
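A minimal sketch of the global-counter idea, using a simplified handle class rather than the actual torch.utils.hooks implementation:
```
import itertools

class RemovableHandle(object):
    _next_id = itertools.count()   # monotonically increasing, never reused

    def __init__(self, hooks_dict):
        self.hooks_dict = hooks_dict
        self.id = next(RemovableHandle._next_id)

    def remove(self):
        # Safe even if the hook was already removed.
        self.hooks_dict.pop(self.id, None)

hooks = {}
handle = RemovableHandle(hooks)
hooks[handle.id] = lambda *args: None   # register keyed by the counter value
handle.remove()                         # unregister without relying on id(handle)
```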
The core autograd Variable, Function, and Engine no longer depend on the
Python API. This lets us implement functions in C++. In the future, we
can also multithread the engine and release the GIL for most of the
non-Python backwards.
Here's the command I used to invoke autopep8 (in parallel!):
git ls-files | grep '\.py$' | xargs -n1 -P`nproc` autopep8 -i
Several rules are ignored in setup.cfg. The goal is to let autopep8
handle everything which it can handle safely, and to disable any rules
which are tricky or controversial to address. We may want to come back
and re-enable some of these rules later, but I'm trying to make this
patch as safe as possible.
Also configures flake8 to match pep8's behavior.
Also configures TravisCI to check the whole project for lint.
The load_state_dict() function now raises an error if the argument
state_dict has extra keys or is missing keys.
Previously, load_state_dict() ignored extra and missing keys, which made
it hard to notice when you load an invalid state_dict. This could
happen, for example, if you save the state_dict for a DataParallel, but
load it into a single model.
The state_dict() function now only includes the Tensor data from the
parameters, which reduces checkpoint size by not saving gradients.
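A short sketch of the DataParallel mismatch mentioned above (the exact exception type may differ between versions):
```
import torch.nn as nn

wrapped = nn.DataParallel(nn.Linear(3, 4))
state = wrapped.state_dict()     # keys are prefixed: "module.weight", "module.bias"

try:
    # Loading into a bare model now raises instead of silently ignoring
    # the missing and unexpected keys.
    nn.Linear(3, 4).load_state_dict(state)
except (RuntimeError, KeyError) as e:
    print(e)
```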
The register hook calls now return an object that can be used to remove
the hook. For example,
>>> h = module.register_forward_hook(callback)
>>> h.remove() # removes hook
Or as a context manager:
>>> with module.register_forward_hook(callback):
... pass
This makes it easier for libraries to use hooks without worrying about
name collisions.
This hooks into the (internal) ForkingPickler class in multiprocessing
to reduce tensors, storages, and CUDA events instead of our queue from
joblib. This makes it easier to use the standard multiprocessing classes
in later versions of Python.
This also exposes:
- Tensor/Storage.share_memory_()
- Module.share_memory()
These methods move the CPU tensors and storages to shared memory. If
you're using the "fork" method of multiprocessing, these objects can be
directly inherited instead of serialized through a queue.
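A brief usage sketch of the exposed methods listed above:
```
import torch
import torch.nn as nn

t = torch.zeros(3)
t.share_memory_()            # move the underlying storage to shared memory
print(t.is_shared())         # True

net = nn.Linear(3, 1)
net.share_memory()           # same, for all of the module's parameters and buffers
# With the "fork" start method, child processes inherit these objects directly
# instead of receiving them through a serialized queue.
```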
Uses the assignment syntax to get deterministic ordering of parameters.
The ordering of parameters using the constructor syntax is
non-deterministic because kwargs use dict() in Python 3.5 and earlier.
modules(): returns an iterator over all modules in the network
children(): returns an iterator over immediate children
Also fix __getitem__ in Sequential
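A brief usage sketch of the two iterators and of Sequential indexing:
```
import torch.nn as nn

net = nn.Sequential(nn.Linear(3, 4), nn.Sequential(nn.ReLU(), nn.Linear(4, 2)))

print(len(list(net.modules())))    # 5: net itself, Linear, inner Sequential, ReLU, Linear
print(len(list(net.children())))   # 2: only the immediate children
print(net[0])                      # first child: the Linear(3, 4) module
```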
* _forward is renamed forward since users should override it
* some __call__ overrides are changed to forward
* functions which return a single variable are changed to return that
variable instead of a one-element tuple