Summary:
This PR addresses #5823.
* Fix docstring: upsample doesn't support LongTensor.
* Enable float-scale up- and downsampling for linear/bilinear/trilinear modes (following SsnL's commit).
* Enable float-scale up- and downsampling for nearest mode. Note that our implementation differs slightly from TF in that there is no "align_corners" concept in this mode.
* Add a new interpolate function API to replace upsample, and add a deprecation warning for upsample (see the usage sketch after this list).
* Add an "area" mode, which is essentially adaptive average pooling, to resize_image.
* Add test cases for interpolate in test_nn.py.
* Add a few comments to help understand the *linear interpolation code.
* The only mode still missing from the resize_images API is "*cubic", which is pretty useful in practice; it's labeled as a hackamonth task in #1552. I discussed with SsnL and we probably want to implement all new ops in ATen instead of THNN/THCUNN. Depending on the priority, I could either put it in my queue or leave it for a HAMer.
* After this change, the files named *Upsampling*.c work for both up- and downsampling. I could rename them if needed.
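For illustration, a minimal usage sketch of the new API (not part of the diff; the exact output-size rounding for fractional scales is assumed here to be floor-based):

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 3, 10, 10)  # NCHW input

# Non-integer scale factors now work for both up- and downsampling.
up = F.interpolate(x, scale_factor=2.5, mode='bilinear', align_corners=False)
down = F.interpolate(x, scale_factor=0.6, mode='nearest')

# The new "area" mode is essentially adaptive average pooling.
pooled = F.interpolate(x, size=(4, 4), mode='area')

print(up.shape, down.shape, pooled.shape)
# assuming floor-based sizing: (1, 3, 25, 25), (1, 3, 6, 6), (1, 3, 4, 4)
```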
Differential Revision: D8729635
Pulled By: ailzhang
fbshipit-source-id: a98dc5e1f587fce17606b5764db695366a6bb56b
Summary:
1. Let `ModuleTest` raise when it fails on non-contiguous inputs. Fix legacy modules.
2. Fix BN (both THNN and cuDNN) not working on non-contiguous inputs.
3. Fix CUDA EmbeddingBag not working on non-contiguous inputs. To avoid calling `.contiguous()` in both `forward` and `backward` (see the sketch after this list):
a. prefix all current `embedding_bag*` functions with `_`, indicating that they require contiguous input (there is a check in each function).
b. create `embedding_bag`, which makes the input arguments `.contiguous()` and calls `_embedding_bag`.
4. Make many ATen `embedding*` functions work on non-contiguous inputs so we don't need to call `input = input.contiguous()` in Python `nn.functional.embedding`.
5. Fix dense-sparse addition when the sparse input is not coalesced and its indices or values tensor is not contiguous. This came up in the test cases of Embedding modules with `sparse=True`. Added tests.
6. Update `TensorUtils.cpp` to use `AT_*` macros.
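For reference, a minimal Python-level sketch of the wrapper pattern from item 3 (the names `_embedding_bag_sum`/`embedding_bag_sum` are hypothetical; the real functions live in ATen, take more arguments, and support more modes):

```python
import torch

def _embedding_bag_sum(weight, input, offsets):
    # "Internal" kernel analogue: requires contiguous inputs, mirroring the
    # underscore-prefixed ATen functions (each of which checks contiguity).
    assert input.is_contiguous() and offsets.is_contiguous()
    ends = offsets.tolist()[1:] + [input.numel()]
    return torch.stack([weight[input[s:e]].sum(0)
                        for s, e in zip(offsets.tolist(), ends)])

def embedding_bag_sum(weight, input, offsets):
    # Public entry point: call .contiguous() exactly once here, so neither
    # forward nor backward has to do it again downstream.
    return _embedding_bag_sum(weight, input.contiguous(), offsets.contiguous())

weight = torch.randn(10, 4)
input = torch.LongTensor([1, 2, 4, 5, 4, 3])   # flattened bags
offsets = torch.LongTensor([0, 3])             # two bags: [1, 2, 4] and [5, 4, 3]
print(embedding_bag_sum(weight, input, offsets).size())  # (2, 4)
```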
Request:
Review from cpuhrsch on the `Embedding*` changes.
Review from ezyang on the ATen sparse & BN changes.
Closes https://github.com/pytorch/pytorch/pull/9114
Differential Revision: D8717299
Pulled By: SsnL
fbshipit-source-id: 0acc6f1c9522b5b605361e75112c16bbe1e98527
Summary:
The tests were using the old args, which caused them to emit a lot of deprecation warnings.
Closes #9103.
Reviewed By: ezyang
Differential Revision: D8720581
Pulled By: li-roy
fbshipit-source-id: 3b79527f6fe862fb48b99a6394e8d7b89fc7a8c8
* move softmax/logsoftmax to ATen
* specify cpu and gpu accum types
* use accreal for CPU
* expose softmax backward to python, fix legacy interface
* fix Distributions.cu to use common AccumulateType
* fix cuda 8 build
* delete commented out lines
* rebase on master, fix breakages
* Codemod to update our codebase to 0.4 standard
* Update some of the test scripts
* remove Variable in test_clip_grad_value
* fix _symbolic_override_wrapper_maker
* add reduce=True arg to MarginRankingLoss
* make default margin arg match for legacy
* remove accidentally added test
* fix test
* fix native_functions.yaml alphabetical order
This replaces the torch.Tensor constructors with factories that produce
Variables. Similarly, functions on the torch module (e.g. torch.randn)
now return Variables.
To keep the PR to a reasonable size, I've left most of the unused tensor
code. Subsequent PRs will remove the dead code, clean up calls to
torch.autograd.Variable, and rename Variable to Tensor everywhere.
There are some breaking changes because Variables and Tensors had
slightly different semantics. There's a list of those changes here:
https://github.com/pytorch/pytorch/wiki/Breaking-Changes-from-Variable-and-Tensor-merge
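A small sketch of what this means for user code (assuming the usual autograd workflow; not taken from the PR):

```python
import torch
from torch.autograd import Variable

# Factory functions such as torch.randn now return Variables directly,
# so explicit wrapping is no longer required before using autograd.
x = torch.randn(3, 4)                                 # already a Variable
w = Variable(torch.randn(4, 2), requires_grad=True)   # Variable() still works during the transition

y = (x @ w).sum()
y.backward()
print(type(x), w.grad.size())  # x is a Variable; w.grad has shape (4, 2)
```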
* fix output_nr not incremented correctly
* update test_conv_double_backward to cover this case; in parse_nn.py, call accGradParameters if any param (not just weight) requires grad
* update Spatial/VolumetricFull(Dilated)Convolution to support accGradParameters with only bias requiring grad
* Spatial/VolumetricConvolutionMM
* Spatial/VolumetricDilatedConvolution
* address @fmassa's comments
* Convolution derivatives in ATen
This PR introduces ATen implementation of convolution, which dispatches to
THNN/CuDNN/nnpack based on input parameters. The general strategy is to compose
this function out of the various forward-backward pairs of specific
implementations, rather than write a monolithic function with backwards (which
is what we did before because the boilerplate of doing it otherwise would have
been very high). The new API provides the following functions:
- _convolution, which is a fully generic, native convolution implementation
that dispatches to various other convolution implementations depending on
input characteristics. This is prefixed with an underscore because it
explicitly takes benchmark, deterministic and cudnn_enabled which are
implementation details for CuDNN. The intent is to eventually provide a
convolution that reads these parameters out of the context using #4104.
- _convolution_nogroup is a convolution implementation for non-CuDNN
algorithms which don't support group convolution natively.
- _convolution_double_backward is the generic double-backwards implementation
for convolution.
In more detail:
- Most functionality from torch/csrc/autograd/functions/convolution.cpp has been
moved into aten/src/ATen/native/Convolution.cpp
- We continue to make use of ConvParams, but we now construct the parameters
upon entry to a function from the function signature (which does not use
ConvParams; having convolution take ConvParams directly would require teaching
the code generator how to accept these as parameters, complicating ATen's API
model) and destruct them when making subprocedure calls.
- I introduce a new idiom, input_r, which represents a const Tensor& reference,
which will subsequently be assigned to a local Tensor input. This is helpful
because a lot of the existing algorithms relied on being able to assign to
locals, which is not permitted with a const reference.
- The native argument parser now supports std::array<bool,2> inputs (NB: there
MUST NOT be a space; this is the same hack as is applied to derivatives.yaml)
- Native parser now supports Tensor? arguments, which indicate a nullable
tensor. Previously this functionality was only used by NN methods.
- Documentation updates on THNN library
- I added an extra fgradInput argument to VolumetricConvolutionMM_updateOutput
and VolumetricConvolutionMM_accGradParameters so that its buffer list lines up
with the backward argument list. This makes it possible to write derivative
for conv3d which previously was not supported (commented out in
derivatives.yaml)
- Extra double_backward declarations were added for all convolution backward
functions (see the autograd sketch after this list).
- You can now use the syntax Tensor? in native_functions.yaml to indicate that a
tensor argument is nullable. There are adjustments to propagate this to the
Python argument parser.
- NNPACK was ported to ATen, and ATen now builds and links against NNPACK if
possible. New AT_NNPACK_ENABLED macro. The nnpack functions are named
nnpack_spatial_convolution.
- Some modest CuDNN convolution refactoring to remove _forward from names.
- There's a new cudnn_convolution_backward function to deal with the fact that
CuDNN convolution double backward requires you to have computed all gradients
in one go.
- Variable set_flags now checks if the tensor is undefined, fixing a silent memory
corruption.
- checkSameType updated to not raise an exception if called with Variable arguments
- "no ATen declaration found for" error message is improved to say what available declarations are
- make_variable now accepts undefined tensors, and returns an undefined tensor in this case.
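As a quick illustration of what the double-backward support enables from Python (a sketch using public autograd APIs, not the internal _convolution_double_backward signature):

```python
import torch
import torch.nn.functional as F
from torch.autograd import Variable, grad

x = Variable(torch.randn(2, 3, 8, 8), requires_grad=True)
w = Variable(torch.randn(4, 3, 3, 3), requires_grad=True)

out = F.conv2d(x, w, padding=1)

# First backward pass, kept in the graph so it can be differentiated again.
gx, = grad(out.sum(), x, create_graph=True)

# Differentiating a function of that gradient exercises conv double backward.
penalty = gx.pow(2).sum()
gw, = grad(penalty, w)
print(gw.size())  # (4, 3, 3, 3)
```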
* Fix elu double-backwards when applied in-place
Removed unused "input" argument to elu_backwards. Also removed 'inplace'
argument from backwards functions, since we don't ever want to use it.
* Fix up additional calls to ELU_updateGradInput
* API changes
* Implement reduce for THNN ClassNLLCriterion
* Implement reduce keyword for THCUNN ClassNLLCriterion
* Implement reduce for THNN SpatialClassNLLCriterion
* Implement reduce for THCUNN SpatialClassNLLCriterion
* Make legacy NLLLoss work
* Docs for NLLLoss reduce (see the usage sketch after this list)
* reduce keyword for double backwards NLLLoss
* reduce=False tests
* Addressed comments
* Fix trailing whitespace
* Fix test failures in legacy nn
* Rebase: add reduce keyword to aten declarations of NLLLoss
* Add reference functions for all NLLLoss and NLLLoss2d test cases
* Replaced slow get/set fns. Don't use int64_t in kernels.
* Use TH_INDEX_BASE in NLLLoss for consistency
* Fix legacy ClassNLLCriterion tests
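A short usage sketch of the new keyword (this predates the later `reduction=` API; `reduce=False` returns per-sample losses):

```python
import torch
import torch.nn.functional as F
from torch.autograd import Variable

log_probs = F.log_softmax(Variable(torch.randn(5, 10), requires_grad=True), dim=1)
target = Variable(torch.LongTensor([1, 0, 4, 2, 9]))

loss_mean = F.nll_loss(log_probs, target)                 # default: averaged to a single value
loss_each = F.nll_loss(log_probs, target, reduce=False)   # per-sample losses, shape (5,)
print(loss_mean.size(), loss_each.size())
```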
- Cleaned up THNN and THCUNN code and kernels
- Improved THCUNN kernel performance 5x, making it match cuDNN performance
- Added support for computing softmax over arbitrary dims (see the example after this list)
NOTE: The default dim for 3D inputs is now 1 (used to be 0)
- Both functions now accept inputs with arbitrarily many dimensions
- Autograd functions no longer save the input (it's unnecessary)
- Added cuDNN bindings for softmax, but they are unused as THCUNN
matches or even exceeds cuDNN performance
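For example, with the explicit dim argument (note the changed default for 3D inputs mentioned above):

```python
import torch
import torch.nn.functional as F

x = torch.randn(4, 5, 6)

# Pass dim explicitly; for 3D inputs the default is now dim=1 (it used to be 0).
p = F.softmax(x, dim=1)
print(p.sum(dim=1))  # ~all ones: softmax normalizes along the chosen dim
```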
* Add reduce keyword to MSECriterion API
* Move gradOutput usage from py to backend
* Implement reduce keyword for THNN MSECriterion
* Implement reduce keyword for THCUNN MSECriterion
* Implement reduce keyword for MSE double backwards
* Tests for MSECriterion with reduce keyword
* Documentation for reduce for MSELoss (see the usage sketch after this list)
* Make legacy nn work with reduce keyword by ignoring it
* Apply linter suggestions
* Address comments (small changes)
* Revert "Tests for MSECriterion with reduce keyword"
This reverts commit 1c0be0defa49d336d023d7d9795db4037c92b6fe.
* Undo changes to legacy nn tests
* Reuse module test for MSELoss by creating a wrapper class for MSELoss
* Address comments: refactor MSECriterion.cu to be nicer
* Fix lint & build errors
* Added support for nInputDim parameter in Padding class
* moved nInputDim to the end so as to not break backwards compatibility
* use hasattr to check if nInputDim is actually set
* check if nInputDim is positive before checking against input dim