pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Lu Fang	c111cdfd1d	Add onnx support for InstanceNorm (#4626 ) * Add ONNX symbolic for instancenorm * Fix some bugs	2018-02-07 10:54:30 -05:00
gchanan	7af433deeb	Add scalar criterion tests (#5087 ) * Add criterion scalar tests. This exposed an issue in MarginRankingLoss with scalars, but the cleanest way to fix is to wait until forward runs on Variables (so we don't have to wait for the backward to check if something is a scalar). * Fix flake8. * Add error message for margin_ranking_loss with scalars.	2018-02-06 18:40:37 -05:00
gchanan	fcccd07cc0	Implement hinge_embedding_loss as a native function. (#5080 )	2018-02-06 14:43:36 -05:00
li-roy	28f056fed2	add reduce=True argument to MultiLabelMarginLoss (#4924 ) * add reduce=True argument to MultiLabelMarginLoss * Fix lint * Addressed comments * Remove unneeded syncthreads calls	2018-02-05 12:28:51 -05:00
Richard Zou	e4ddbeb554	Fix typo (#4846 )	2018-01-25 10:33:45 -05:00
Richard Zou	b997474a4f	Adds Im2Col and Col2Im (#4729 )	2018-01-19 09:37:53 -05:00
Sam Gross	57549b7e44	Bind functions with out= arguments in VariableType (#4565 ) This adds overrides in VariableType for the xxx_out ATen functions and implements Python bindings. There is no support for automatic differentiation. If any of the inputs (or outputs) requires grad, then the function will throw an exception unless it's running in "no-grad" mode. The bindings for calling torch.xxx functions on Variables are moved to a different object. Previously, they were static method on VariableBase. This change prevents users from accidentally calling static methods as if they were instance methods.	2018-01-17 18:27:42 -05:00
Sam Gross	cb83474a57	Fix embedding with sparse=True (#4686 ) Fixes #4666	2018-01-16 16:19:20 -05:00
Kai Arulkumaran	2260649fb6	Local Response Normalization (#4667 ) * Local Response Normalization * Add 1D and 3D LRN * Generalise LRN to higher dims * Use mean instead of sum Specify 'across-channels'	2018-01-15 22:23:51 -05:00
David Pollack	05908e8243	current code works with dim = 3, so I added it to dim checks	2018-01-13 12:58:08 +01:00
Riddhiman Dasgupta	f99c7d9429	Padding_idx in Embedding supports negative indexing (#4496 )	2018-01-09 12:04:11 +01:00
Neeraj Pradhan	408c84de7c	Supporting logits as parameters in Bernoulli and Categorical (#4448 ) * Supporting logits as parameters in Bernoulli and Categorical * address comments * fix lint * modify binary_cross_entropy_with_logits * address comments * add descriptor for lazy attributes * address comments	2018-01-05 03:45:05 -05:00
Richard Zou	35c4d73bdb	Deprecate nn.NLLLoss2d (#4238 ) * Deprecate nn.NLLLoss2d * Fix legacy tests * Fix tests * Remove NLLLoss2d from docs, add deprecation warning instead of error * fix lint * Add more to docs	2018-01-04 12:38:04 -05:00
Hugh Perkins	fc0d940c5e	add gumbel_softmax, based on Eric Jang's implementation (#3341 ) * add gumbel_softmax, based on Eric Jang's implementation * Make gumbel_softmax CUDA friendly * gumbel_softmax tweaks	2018-01-04 12:23:21 -05:00
Sam Gross	20b5e82155	Implement embedding in ATen (#4322 ) Implements nn.Embedding (lookup table) in ATen. Breaking change: new optional argument padding_idx in F.embedding to match nn.Embedding. Note that there are a few bugs in Embedding that are inherited from the previous code: - CUDA renorm has race conditions if index contains duplicate entries - sparse gradient doesn't work with scale_grad_by_freq	2018-01-02 15:44:46 -05:00
Sam Gross	98f71912b0	Fix type signature of in-place NN functions (#4389 ) This is a step towards removing the special casing of NN functions in gen_variable_type.py. It fixes the signature of in-place NN functions so that they return Tensor & instead of Tensor.	2017-12-28 16:50:09 -05:00
Sam Gross	4dba674324	Move factional max pooling to ATen (#4290 )	2017-12-21 17:07:46 -05:00
Edward Z. Yang	5f7c5502b8	Further improvements to ATen convolution (#4287 ) - Rename THNN convolution to have thnn_ prefix. - Propagate CuDNN benchmark and deterministic to at::Context - Add 'convolution', 'convNd' and 'conv_transposeNd' native wrappers, with defaults The conv_transposeNd wrappers are updated to have the same argument order as Python. - torch.nn.functional directly dispatches to the native wrappers - Make it possible to turn off tracing for some native wrappers, so I don't have to write symbolics for all the functions above - Spectral ops can now make use of CuDNN convolution if possible - Better commentary on cudnn_batch_norm - Turn on DCE for all JIT tests. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-12-21 13:03:43 -05:00
Edward Z. Yang	5b8fe5cbb5	Batchnorm in ATen (#4285 ) * Batchnorm in ATen This commit moves BatchNorm derivatives into ATen, eliminating torch/csrc/autograd/functions/batch_normalization.cpp Some refactoring along the way: - Functions got renamed to remove _forward from their names - CuDNN batchnorm forward was modified to return save_mean/save_std instead of take it as parameters. To avoid returning undefined Variables, these return (small) uninitialized tensors when they are not used. - THNN batch normalization takes care of resizing save_mean and save_std on forward. - There are some shenanigans re batchnorm backwards in eval mode. I'm tracking that in #4284 - I decided not to introduce buffers as a proper concept in ATen, which means that tensors like running_mean/running_var are variables in ATen. This meant there needed to be some adjustments to how we trace such variables; the new strategy is if we can't find a Value for a variable, we look and see if we have a Value for the buffer pointed to by the variable, before finally falling back on constant. - This PR finally reliably triggered OOM on Travis builds; I fixed this by reducing the number of parallel jobs. - Stop using std::string when it's not necessary. - Remove training parameter from cudnn_batch_norm_backward, because it doesn't make sense; cuDNN doesn't implement the math for evaluation mode batchnorm backwards. - batchnorm_double_backward is now in an anonymous namespace, as it no longer needs to be called from torch/csrc Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-12-21 11:38:31 -05:00
Sam Gross	b6a30f7ede	Move SELU to ATen (#4269 ) Fuse scale multiplication into ELU	2017-12-20 16:32:21 -05:00
Sam Gross	dad4b2d6cc	Move adaptive avg/max pool1d to ATen (#4266 )	2017-12-20 15:50:17 -05:00
Sam Gross	689ef9cba3	Move upsampling to ATen (#4264 )	2017-12-20 15:12:07 -05:00
Edward Z. Yang	a88a8ec827	Convolution derivatives in ATen (#4116 ) * Convolution derivatives in ATen This PR introduces ATen implementation of convolution, which dispatches to THNN/CuDNN/nnpack based on input parameters. The general strategy is to compose this function out of the various forward-backward pairs of specific implementations, rather than write a monolithic function with backwards (which is what we did before because the boilerplate of doing it otherwise would have been very high.) The new API provides the following functions: - _convolution, which is a fully generic, native convolution implementation that dispatches to various other convolution implementations depending on input characteristics. This is prefixed with an underscore because it explicitly takes benchmark, deterministic and cudnn_enabled which are implementation details for CuDNN. The intent is to eventually provide a convolution that reads these parameters out of the context using #4104. - _convolution_nogroup is a convolution implementation for non-CuDNN algorithms which don't support group convolution natively. - _convolution_double_backward is the generic double-backwards implementation for convolution. In more detail: - Most functionality from torch/csrc/autograd/functions/convolution.cpp has been moved into aten/src/ATen/native/Convolution.cpp - We continue to make use of ConvParams, but we now construct the parameters upon entry to a function from the function signature (which does not use ConvParams; having convolution take ConvParams directly would require teaching the code generator how to accept these as parameters, complicating ATen's API model) and destruct them when making subprocedure calls. - I introduce a new idiom, input_r, which represents a const Tensor& reference, which will subsequently be assigned to a local Tensor input. This is helpful because a lot of the existing algorithms relied on being able to assign to locals, which is not permitted with a const reference. - The native argument parser now supports std::array<bool,2> inputs (NB: there MUST NOT be a space; this is the same hack as is applied to derivatives.yaml) - Native parser now supports Tensor? arguments, which indicates a nullable tensor. Previously this function was only used by NN methods. - Documentation updates on THNN library - I added an extra fgradInput argument to VolumetricConvolutionMM_updateOutput and VolumetricConvolutionMM_accGradParameters so that its buffer list lines up with the backward argument list. This makes it possible to write derivative for conv3d which previously was not supported (commented out in derivatives.yaml) - Extra double_backward declarations for all convolution backwards functions was added. - You can now use the syntax Tensor? in native_functions.yaml to indicate that a tensor argument is nullable. There are adjustments to propagate this to the Python argument parser. - NNPACK was ported to ATen, and ATen now builds and links against ATen if possible. New AT_NNPACK_ENABLED macro. The nnpack functions are nnpack_spatial_convolution. - Some modest CuDNN convolution refactoring to remove _forward from names. - There's a new cudnn_convolution_backward function to deal with the fact that CuDNN convolution double backward requires you to have computed all gradients in one go. - Variable set_flags now checks if the tensor is undefined, fixing a silent memory corruption. - checkSameType updated to not raise an exception if called with Variable arguments - "no ATen declaration found for" error message is improved to say what available declarations are - make_variable now accepts undefined tensors, and returns an undefined tensor in this case.	2017-12-20 14:19:27 -05:00
Sam Gross	b476d10c64	Move max_pool1d to ATen (#4257 )	2017-12-19 20:10:11 -05:00
Sam Gross	9495595520	Move reflection/replication padding to ATen (#4258 )	2017-12-19 18:57:14 -05:00
Sam Gross	227ef1fb60	Move adaptive avg pooling 2d/3d to ATen (#4254 ) Move adaptive avg pooling 2d/3d to ATen Also use ATen for softshrink	2017-12-19 15:45:33 -05:00
James Reed	cb4f6c3148	conv_tbc (#3730 ) attempt to rebase skip conv_tbc in preprocess_nn_functions Add conv_tbc symbolic Fix backward issue with dBias ConvTBC nn wrapper and unit test	2017-12-18 23:52:36 -05:00
Richard Zou	ccf4dc1525	Add reduce arg to BCELoss (#4231 ) * Add reduce arg to BCELoss * Fix test precision * reduce keyword for BCELoss in derivatives.yaml	2017-12-18 12:28:53 -05:00
Soumith Chintala	54d689253e	Revert "Add reduce arg to BCELoss" (#4221 ) * Revert "Add reduce arg to BCELoss (#3532)" This reverts commit `847c56aeb5`.	2017-12-18 03:13:09 -05:00
Richard Zou	847c56aeb5	Add reduce arg to BCELoss (#3532 ) * Add reduce arg to BCELoss * Fix test precision	2017-12-18 02:39:49 -05:00
Kevin Zakka	b86dc0c8ba	add reduce arg to PoissonNLLLoss (#3770 ) * add reduce arg to PoissonNLLLoss * fixed comments except reference function * fixed unit test * small indentation fix * fixing last comments by richard * lint check * another linting issue	2017-12-18 02:32:05 -05:00
Richard Zou	30e6898808	Implement NLLLossNd (#4035 ) * Implement NLLLossNd * Fix tests and typos * Fix tests	2017-12-18 02:16:16 -05:00
Emanuel Jöbstl	be1ef5e4a4	Added explicit tuple element-count to doc for Conv1d. (#4136 ) * Added explicit tuple element-count to doc for Conv1d.	2017-12-14 22:17:46 -05:00
Soumith Chintala	638b10d39b	fix softmax default dim for 1D Tensor	2017-12-01 19:20:04 -05:00
Edward Z. Yang	1c0fbd27a1	CuDNN bindings rewrite (into ATen) (#3666 ) * Comprehensive rewrite of Torch CuDNN bindings / a bit of ATen infra The executive summary is that this moves the torch/csrc/cudnn library into ATen, adding a number of new cudnn_ methods to ATen for batchnorm, convolution, affine grid generator and grid sampler. ATen infra changes: - TensorGeometry was moved to ATen - TensorGeometry was modified to make its interface resemble that of Tensor; in particular, sizes is no longer a field, it's a method. - AT_CUDA_ENABLED macro is set via ATen/Config.h header which is generated at cmake configure time. Fixes https://github.com/zdevito/ATen/issues/168 - Change AT_CUDA_ENABLED macro to be a function macro, so that we error if it is not defined - Introduce a new TensorArg class, which is a Tensor plus a little metadata. This helps us give good error messages when checking dimensions/shapes of tensors. Fixes https://github.com/zdevito/ATen/issues/169 - Also introduce a TensorGeometryArg class, for when you don't need the actual tensor data (which is most of the time.) - Add ATen/Check.h, which contains a number of utility functions for testing shapes, types and devices of input tensors. This will be particulary useful for native methods, which don't get code generated input testing code. These functions take a 'CheckedFrom' argument, at the moment just a string, which specifies some extra information about what function was doing the actual checking; this greatly improves error messages. - Many check functions take initializer lists, which let you test that all tensors have some property. This API is peculiar, in that we IGNORE undefined tensors in this case. This is handled by filterDefined. - Add AT_CUDNN_ENABLED macro - CuDNN linking from ATen was improved; for example, we now actually add the CuDNN headers to our include path. - Add some missing override specifiers to some methods - We now actually build tests with CUDA functionality accessible (previously, AT_CUDA_ENABLED was not defined, meaning that the headers were missing all CUDA-only functionality.) - Native functions now support giving explicit names to return outputs in yaml. This makes it possible to hook into the NN autogenerated derivatives codepath using native functions. CuDNN rewrite changes: - torch/csrc/cudnn now uses ATen (rather than passing around THVoidTensor) and lives in ATen. This lets us remove tensorPointer shenanigans. The functions are exposed to ATen as native functions described in aten/src/ATen/cudnn/cuDNN.yaml - ATen now builds and links against CuDNN when enabled. The cmake package script was taken from Caffe2. - Some header reorganization was done to help reduce dependencies on headers (this reorg is no longer used but I've kept it) - Rename CHECK to CUDNN_CHECK - Rip out old shape/type testing code in favor of modern ATen/Check.h interface using TensorArg. In many cases, increase the robustness of the checking code. - Change the inputs of the public facing functions, so that they can be bound by ATen - Delete THCState; this is retrieved from the global ATen context - Delete cudnnHandle_t, this is retrieved from the global Handles.h - Delete cudnnDataType_t, this is retrieved from the Tensor type - Delete Convolution class, instead its constituent arguments are passed individually - Change functions to return tensors, rather than take an appropriately sized output tensor as an input. - Redo how transposed convolution / backward convolution is implemented (knock on effect of returning tensors). Previously it was assumed that you would always pass an appropriately sized output tensor, but we don't want to do this anymore. For backwards, we instead give the desired output tensor (input, really) size, because that is readily available. For transposed* convolution, however, we take output_padding, and otherwise do the shape calculation. - Redo how legacy group convolution is implemented (knock on effect from porting cudnn to ATen.) Previously, group convolution was implemented by manually constructing sizes and strides and then outputting appropriate, with macros switching between individual groups and all-at-once based on CuDNN version. Now, the code looks exactly what you'd expect: there's a top-level wrapping function that supports group convolution no matter the version of CuDNN, and a low-level wrapper which supports only what CuDNN supports. The top-level function conditions on CuDNN version, and invokes the low-level interface 1 or n times. - There is now a debugging printer for tensor descriptors. - Convolution struct is replaced with ConvolutionArgs, which is not part of the public API but is used internally to conveniently pass around all of the arguments needed for Convolution. - Add some constexprs for well-known dimensions, reduce amount of magic numbers in code. - Put 'deterministic' in to ConvParams. Fixes #3659 - Lots more comments. - Some pessimizations, in the name of code clarity: - The descriptors are initialized on every invocation of convolution forward/backward. Previously, the descriptors were cached, so that you didn't have to initialize them again on backwards. This is difficult to support in the ATen interface so I didn't support it. - Legacy group convolution initializes its workspace for every group it performs. I did not feel motivated to fix this because the legacy codepath is already quite slow. - Affine grid generator and grid sampler automatically call contiguous on their arguments as necessary. - Batchnorm input checking is greatly beefed up, it now checks for the following input characteristics: - Definedness - GPU location - Type - Contiguity - Size PyTorch binding code changes - batchnorm now uses consistent var/data naming - batchnorm and convolution make use of new ATen bindings - Affine grid generator and grid sampler make use of ATen CuDNN bindings via derivatives.yaml. This means I had to restructure the code a little, since the THNN bindings still go through a legacy Python class. - I fixed some warnings: - s/friend class/friend struct/ on InterpreterStateImpl - Removed pessimizing move 'detached' in torch/csrc/autograd/variable.cpp - Removed unused pack_list on Scalar Signed-off-by: Edward Z. Yang <ezyang@fb.com> GCC 4.8 buildfix Signed-off-by: Edward Z. Yang <ezyang@fb.com> Add TensorGeometry to ATen.h Signed-off-by: Edward Z. Yang <ezyang@fb.com> CUDNN_CHECK Signed-off-by: Edward Z. Yang <ezyang@fb.com> Update TODO comment Signed-off-by: Edward Z. Yang <ezyang@fb.com> Delete return in cudnn_grid_sampler Signed-off-by: Edward Z. Yang <ezyang@fb.com> s/cudnnSetStreamToCurrent/setCuDNNStreamToCurrent/g Signed-off-by: Edward Z. Yang <ezyang@fb.com> Don't allocate a new vector when filtering defined. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Remove Check overloads, convert to pass references. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Some more microbenchmarking. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-11-30 23:06:58 -05:00
Sergey Zagoruyko	11c9bd6c98	Allow target.requires_grad in l1_loss and mse_loss (#3876 )	2017-11-27 10:59:16 -05:00
Richard Zou	5215640a41	Fix cosine_similarity's output shape (#3811 )	2017-11-21 18:33:41 -05:00
Sam Gross	9cb8b43778	Split off in-place NN functions (#3683 ) For example, this splits threshold into threshold(), which is now never in-place, and threshold_() which is always in-place. This simplifies the in-place vs. non-in-place logic in gen_variable_type.py, which was bug-prone.	2017-11-14 12:59:06 -05:00
josecabjim	e33df2b88a	Add border-padding for grid_sampler (#3599 ) * adds border padding to spatial grid sampler * fixes flake8 * adds docs	2017-11-12 18:46:49 -05:00
Edward Z. Yang	19515520bb	Make prelu an ATen op. This operator is a warmup I was doing before tackling convolution, as it has many properties that make it a "first" for implementing things. In particular, it is the first operator whose backwards have multiple returns; this means its double backwards is the first backwards for a function with multiple differentiable outputs. This exercises new code for output_mask and set_flags. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-11-10 09:58:40 +08:00
Ozan Çağlayan	dd6d04ddf2	doc: Normalize all true/false in docstrings to ``True\|False`` (#3593 ) * doc: Normalize all true/false in docstrings to ``True\|False`` This makes them more apparent in the documentation. * doc: fix flake8	2017-11-09 08:12:29 -05:00
Richard Zou	77ddd5130b	Add reduce keyword for KLDivLoss (#3330 )	2017-11-07 08:57:11 -05:00
Hugh Perkins	b043a74919	fix softmax doc (#3337 )	2017-11-01 08:47:51 -04:00
Gökçen Eraslan	638f0b5d78	Prevent numerical issues with poisson_nll_loss when log_input=False (#3336 ) * Prevent numerical issues with poisson_nll_loss when log_input=False Evaluation of the logarithm of the input variable in poisson negative log likelihood leads to NaN loss if variable being evaluated is zero. Small epsilon is added to prevent this. See equivalent Keras epsilon here: https://github.com/fchollet/keras/blob/master/keras/losses.py#L68 * PEP8 fix * Add epsilon support to PoissonNLLLoss in nn.modules.loss	2017-11-01 08:47:19 -04:00
Richard Zou	6214487fa7	Add reduce keyword to L1Loss (#3366 ) * Add reduce keyword to L1Loss * Fix legacy test for abscriterion * Address comments	2017-11-01 06:33:18 -04:00
Richard Zou	eac0942f6d	Add more nn docs (#3374 )	2017-10-30 18:37:36 -04:00
Ozan Caglayan	28f3d50f9d	doc: Replace nclasses with C	2017-10-30 12:06:20 -04:00
John Chiotellis	a0ce84e476	fix triplet margin loss documentation (#3339 )	2017-10-28 17:15:58 +02:00
SsnL	de1f4e69dd	raw text (#3327 )	2017-10-28 01:24:02 +05:30
Richard Zou	d8f3c601e4	Add reduce keyword to CrossEntropyLoss	2017-10-27 19:19:52 +02:00
Richard Zou	3853d5da97	Add reduce keyword to NLLLoss and NLLLoss2d (#3080 ) * API changes * Implement reduce for THNN ClassNLLCriterion * Implement reduce keyword for THCUNN ClassNLLCriterion * Implement reduce for THNN SpatialClassNLLCriterion * Implement reduce for THCUNN SpatialClassNLLCriterion * Make legacy NLLLoss work * Docs for NLLLoss reduce * reduce keyword for double backwards NLLLoss * reduce=False tests * Addressed comments * Fix trailing whitespace * Fix test failures in legacy nn * Rebase: add reduce keyword to aten declarations of NLLLoss * Add reference functions for all NLLLoss and NLLLoss2d test cases * Replaced slow get/set fns. Don't use int64_t in kernels. * Use TH_INDEX_BASE in NLLLoss for consistency * Fix legacy ClassNLLCriterion tests	2017-10-26 13:54:19 -04:00
Sam Gross	67839ce7bc	Delete unused Softmax code (#3220 ) Softmax and LogSoftmax are automatically bound and dispatched through VariableType.	2017-10-21 20:51:27 +02:00
Sam Gross	5989b05ecc	Enable ATen implementation of some NN functions and Variable methods	2017-10-20 15:38:01 -04:00
Adam Paszke	98e67448fa	Large Softmax and LogSoftmax refactor - Cleaned up THNN and THCUNN code and kernels - Improved THCUNN kernel performance 5x, making it match cuDNN performance - Added support for computing softmax over arbitrary dims NOTE: The default dim for 3D inputs is now 1 (used to be 0) - Both functions now accept inputs with arbitrarily many dimensions - Autograd functions no longer save the input (it's unnecessary) - Added cuDNN bindings for softmax, but they are unused as THCUNN matches or even exceeds cuDNN performance	2017-10-19 19:51:10 +02:00
Marcin Elantkowski	57ffe64cbe	Embedding related fixes (#3128 ) * Fix docs for nn.Embedding and F.embedding. - add description of 'sparse' argument (#3104) - fix F.embedding example (resulted in RuntimeError) * Make EmbeddingBag a New Style Function. * Add a functional interface for EmbeddingBag * Fix failing tests: add max_norm and norm_type to context, and fix typo in backend call. * Docfix: remove torch.manual_seed from example code. * Add a note about using sparse keyword in Embedding function.	2017-10-18 23:38:07 +02:00
Arthur Crippa Búrigo	17d68f824d	Fix typo. (#3140 )	2017-10-17 00:50:33 +02:00
SsnL	6dc67aef17	doc (#3110 )	2017-10-14 10:44:35 +02:00
Sam Gross	9437644f66	Replace softmin and softsign with simple differentiable expressions	2017-10-10 16:57:47 -04:00
Priya Goyal	2443fcac0b	Deterministic cudnn algorithms	2017-10-10 10:53:34 -04:00
SsnL	0eec332e14	assert reflection padding in range (#3008 )	2017-10-06 17:59:01 -04:00
Richard Zou	898c732293	Introduce a `reduce` keyword argument for MSELoss (#2878 ) * Add reduce keyword to MSECriterion API * Move gradOutput usage from py to backend * Implement reduce keyword for THNN MSECriterion * Implement reduce keyword for THCUNN MSECriterion * Implement reduce keyword for MSE double backwards * Tests for MSECriterion with reduce keyword * Documentation for reduce for MSELoss * Make legacy nn work with reduce keyword by ignoring it * Apply linter suggestions * Address comments (small changes) * Revert "Tests for MSECriterion with reduce keyword" This reverts commit 1c0be0defa49d336d023d7d9795db4037c92b6fe. * Undo changes to legacy nn tests * Reuse module test for MSELoss by creating a wrapper class for MSELoss * Address comments: refactor MSECriterion.cu to be nicer * Fix lint & build errors	2017-10-06 10:57:22 -04:00
SsnL	ba766ef39a	Fix BN size check in eval mode (#2977 )	2017-10-04 16:03:20 -04:00
SsnL	faa6fdfa18	Raise error when each channel only has 1 value in batch norm (#2961 ) * add error when each channel only has 1 value	2017-10-03 17:56:15 -04:00
SsnL	d5a7e304fa	added volumetric adaptive max pooling	2017-09-30 16:57:51 -04:00
Edward Z. Yang	9be8d0a9d2	Add a docstring for functional.linear. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-26 12:29:07 -04:00
SsnL	6a4ec4f9a8	VolumetricAdaptiveAveragePool	2017-09-25 15:12:44 -04:00
Luca Antiga	c580352aee	Adding 1d upsampling (#2846 )	2017-09-24 16:50:24 -04:00
Emanuel Jöbstl	39434ee2e4	Added LPPool1d. (#2783 )	2017-09-20 09:19:29 -04:00
David Pollack	c6ea6ed8ff	Add Nd Padding, Pad1d functions and ConstantPad3d (#2657 )	2017-09-18 14:48:49 -04:00
Gregory Chanan	d910a94b2b	Support AdaptiveMaxPool1d/2d double backwards.	2017-09-13 12:28:43 -04:00
Lu Fang	5294017d9f	Adding implicit padding for 3d average pooling	2017-08-26 14:45:19 -04:00
yunjey	153c9b0714	Add examples in functional.py and loss.py (#2371 ) * Add examples in functional.py Added examples for F.cross_entropy, F.binary_cross_entropy and F.binary_cross_entropy_with_logits. * Add ` for PyTorch docs Added ` for PyTorch docs. * Add examples in loss.py Added examples for nn.BCELoss and nn.BCEWithLogitLoss.	2017-08-25 09:44:36 -04:00
Alykhan Tejani	30baba7d15	fix typo in docstring	2017-08-16 17:55:39 -04:00
Gregory Chanan	c92f229aa2	CosineEmbeddingLoss as a new style function.	2017-08-14 16:19:10 -04:00
Gregory Chanan	9bcb9658d5	MarginRankingLoss as new style function.	2017-08-14 16:19:10 -04:00
Gregory Chanan	7aeb837895	Implement HingeEmbeddingLoss double backwards.	2017-08-14 16:19:10 -04:00
Gregory Chanan	9a243abe5c	Implement Softmin double backwards.	2017-08-14 16:19:10 -04:00
Gregory Chanan	a6cccc8701	Implement RReLU double backwards.	2017-08-14 16:19:10 -04:00
Luca Antiga	cd5275e79f	Convert upsampling Functions to new style (#2372 )	2017-08-11 21:03:58 -04:00
Soumith Chintala	42328b70f7	fix another is_same_size call	2017-08-02 19:53:39 -04:00
Soumith Chintala	b3ca3da4b6	fix type mismatch	2017-08-02 10:18:03 -04:00
yunjey	e1ca722988	Add comments for default value (#2248 ) Added comments for default value in nn.functional	2017-08-01 14:27:46 +05:30
Alykhan Tejani	643f8d12ff	[bugfix] in bce_with_logits logsumexp calculation (#2221 ) * fix bug in bce_with_logits logsumexp calculation * flake8 fix	2017-07-27 05:58:56 +05:30
Gregory Chanan	bcea678e7b	Update rebased functions to call apply.	2017-07-25 07:37:25 +05:30
Gregory Chanan	1a52ca02ef	Always return indices from MaxPool autograd functions to simplify implementation; The callers (in functional.py) will filter out the return instead.	2017-07-25 07:37:25 +05:30
Gregory Chanan	291369ff1b	Convert pooling functions to new-style, once_differentiable functions.	2017-07-25 07:37:25 +05:30
Gregory Chanan	9608e37969	Implement double backwards for PReLU.	2017-07-25 07:37:25 +05:30
Gregory Chanan	ec7c510557	Implement Softsign double backwards.	2017-07-25 07:37:25 +05:30
Gregory Chanan	852dd5f011	Convert _WeightedLoss functions to new style autograd functions.	2017-07-25 07:37:25 +05:30
Gregory Chanan	085abee444	Rebase kl_div changes.	2017-07-25 07:37:25 +05:30
Gregory Chanan	45ce4df74c	Convert auto nn Functions (non-criterion) to new style.	2017-07-25 07:37:25 +05:30
Alykhan Tejani	112728cbe9	reformulate bce_with_logits to not use abs (#2195 ) * reformulate bce_with_logits to not use abs * flake8 fixes	2017-07-25 03:46:27 +05:30
Alykhan Tejani	35757af6f7	Add broadcasting of weights to bce/bce_with_logits (#2161 ) * added tests + removed explicit expand of weight in bce with logits * add auto broadcasting of weight to BCELoss * remove the need for _BCELoss * formatting of warning * remove TODO * move across assert from _functions/thnn/loss.py * flake8 fixes	2017-07-21 16:02:07 -04:00
yunjey	ea607afd06	Add comments in nn.Upsample (#2175 )	2017-07-21 14:34:58 -04:00
Edward Z. Yang	f3f478960e	Convert Embedding to new style. (#1916 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-07-20 02:35:21 -04:00
Hugh Perkins	e537023147	add functional embedding (#1987 )	2017-07-20 01:53:37 -04:00
Aron Barreira Bordin	11f3ccf98f	Add missing Modules to nn.functional (#1801 ) * add dropout2d and dropout3d to functional added some loss functions to functional added tests using dropout from backend added docs fixes * edited loss modules to call functional	2017-07-19 15:55:21 -04:00
Fisher Yu	d6bc2642e7	Add ignore_index to NLLLoss2d	2017-07-13 23:22:48 -04:00
Soumith Chintala	58e4caf80f	add missing docs	2017-07-13 01:01:04 -04:00
Soumith Chintala	169ca67a4e	Adding Spatial Transformers w/CuDNN support	2017-07-12 14:32:06 -04:00
yunjey	1ef1dd9cad	Add comments for readability (#2005 )	2017-07-10 23:02:56 -07:00
Leonid Vlasenkov	46a868dab7	[Ready] Limit docs line length (#1900 ) * some docs are ready * docs * docs * fix some more * fix some more	2017-07-10 10:24:54 -04:00
Gregory Chanan	f6578c1b24	Implement double backwards for Dropout and FeatureDropout.	2017-07-03 18:51:22 -04:00
Gregory Chanan	daa84e7663	Implement bilinear double backward.	2017-07-03 18:51:22 -04:00
Gregory Chanan	1aa145dbac	Implement ConstantPad2d double backwards.	2017-07-03 18:51:22 -04:00
Alykhan Tejani	457587088a	Fix broadcasting issues in binary_cross_entropy_with_logits (#1944 ) * done re-seed cuda device if in bad fork * avoid broadcasting in binary_cross_entropy_with_logits * assert input sizes for BCEWithLogitLoss * added check that BCEWithLogitsLoss == Sigmoid + BCELoss * fix flake8 issues * rename test_bce_with_logits_gives_same_result_as_bce_and_sigmoid -> test_bce_with_logits_gives_same_result_as_sigmooid_and_bce_loss * add warning in BCELoss about input shapes * fix lint	2017-07-01 23:06:36 -04:00
Sam Gross	da0fad8a7a	Use torch.matmul in nn.Linear (#1935 ) This takes advantage of the broadcasting behavior of torch.matmul to support inputs with more than two dimensions. The extra dimensions are treated like part of the batch dimension, much like nn.Bottle in Lua Torch. There are a few related small performance changes: * Addmm computes the gradient in column-major for inputs in column-major format * Variable.mm calls Addmm in-place with the desired output buffer	2017-06-30 16:53:26 -04:00
Sam Gross	4d5075add2	Add ignore_index to nnl_loss and cross_entropy (#1937 )	2017-06-29 00:10:13 -04:00
Leonid Vlasenkov	ae61f3ff42	adds poisson NLL loss (#1779 )	2017-06-27 10:04:54 -04:00
Alykhan Tejani	67968cb60b	Add numerically stable BCELoss which takes logits as input (#1792 )	2017-06-19 22:05:51 -04:00
Francisco Massa	76ee014d10	Add documentation to SELU and AlphaDropout	2017-06-19 18:18:01 -04:00
Francisco Massa	f619ac6ac9	Quickfix for AlphaDropout on CUDA	2017-06-19 18:18:01 -04:00
Sam Gross	38b9598685	Added GLU (gated linear unit) From https://arxiv.org/abs/1612.08083	2017-06-13 20:48:19 -04:00
Francisco Massa	6626881e7a	Add Alpha Dropout (#1775 )	2017-06-13 00:39:49 +02:00
Francisco Massa	a24db91a38	Add SELU activation function (#1769 ) * Add SELU activation function * Remove unnecessary case * Add Function for SELU + tests and fix RReLU inplace * Fix extra line in doc * Fix tests Remove in-place tests for RReLU. For some reason they fail on legacy nn, but passes on nn * SELU in new-style Function It also supports double backprop, verifyed with gradgradcheck * Fix flake8	2017-06-11 10:07:48 +03:00
Luca Antiga	b9ab26765e	Add 3D upsampling (nearest and trilinear) with tests	2017-06-07 11:29:27 -04:00
Soumith Chintala	df7c47142d	fix for THNN NLLLoss signature change	2017-06-07 00:18:11 -04:00
Aron Barreira Bordin	d7db75c10f	added CosineSimilarity to nn.distance and updated docs (#1672 ) * added CosineSimilarity to nn.distance and updated docs	2017-06-06 22:53:21 -04:00
Marvin Cao	174c3cc399	Add support for double backward of LeakyReLU (#1714 )	2017-06-05 11:53:27 -04:00
Alykhan Tejani	f1c57ace1b	added input dim checks to convxD and conv_transposedxd (#1695 ) * add input dim check for conv2d * add None check to conv2d * added input dim checks to convxD and conv_transposedxd * flake8 fixes	2017-06-02 11:58:19 -04:00
Thomas Viehmann	6107d15d14	Twice differentiability of pointwise functions (#1531 )	2017-05-15 12:00:59 -06:00
Adam Paszke	6b84dc26f0	Add F.cosine_similarity (#1502 )	2017-05-15 11:12:54 -06:00
Marvin Cao	0ba20435ce	Add high order grad support for Some operator (#1507 )	2017-05-14 23:02:04 +02:00
Gregory Chanan	171638a451	Fix test_normalize NN test.	2017-05-09 14:25:06 -07:00
Gregory Chanan	ae2b2cbbec	Make keepdim work with autograd.	2017-05-09 14:15:59 -07:00
Sergey Zagoruyko	6d693fe413	Add F.normalize (#1467 )	2017-05-07 13:54:16 +02:00
Marvin CAO	e3f41a4962	Add high order gradient support for Sigmoid (#1496 )	2017-05-07 13:00:20 +02:00
Ankit Vani	4e18d89791	added twice differentiation for a bunch of ops (#1426 )	2017-05-04 06:47:14 -04:00
andrew giessel	2e7635b929	Add flexible bilinear upsampling aspect ratio redux (#1317 )	2017-05-03 08:46:28 -04:00
Soumith Chintala	ecd51f8510	docs fixes	2017-05-02 15:42:33 -04:00
Soumith Chintala	7dd8571bc6	fix avg_pool docs in nn.functional	2017-04-30 08:44:43 -04:00
Adam Paszke	457d78a7d9	Use THCUNN backward kernels for Tanh and Sigmoid in Autograd (#1399 )	2017-04-29 09:07:03 -04:00
Uridah Sami Ahmed	75f1989bec	Add nn.Bilinear and tests	2017-04-28 10:11:30 -04:00
Shubham Jain	a35f507532	Update functional.py (#1298 )	2017-04-19 11:07:12 -04:00
Edward Z. Yang	34546f022a	Expose dilated convolutions. Fixes #1225. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-04-18 17:13:02 -04:00
Edward Z. Yang	ab77742f6e	Add some missing documentation for arguments. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-04-18 17:13:02 -04:00
Christian Sarofeen	e9ff57176b	Fused pointwise kernels for GRU/LSTM	2017-04-11 13:42:06 -07:00
Christian Sarofeen	0b50f794e9	Use thnn version of Tanh/Sigmoid instead of autograd. (#1234 )	2017-04-11 12:49:57 -07:00
Edgar Riba	9504246c32	add triplet margin loss (#1165 )	2017-04-05 22:17:58 -04:00
Soumith Chintala	2979f4b989	add more functions to docs	2017-03-29 01:29:17 -04:00
Jason Kuen	f2c1071c33	Adaptive max and average pooling (1D & 2D) (#1084 )	2017-03-26 17:09:28 +02:00
Edgar Riba	63f6c0d692	add Pairwise distance (#835 )	2017-03-24 11:29:40 -04:00
ngimel	b3ab4b1094	Check torch.backends.cudnn.enabled, padding, and output_padding (#996 ) * Check torch.backends.cudnn.enabled * Don't allow negative padding and output_padding values	2017-03-22 19:42:11 -04:00
Kentaro Wada	7654b3f49e	Add function to compute cross_entropy for 2D image (#802 )	2017-03-16 17:34:04 +01:00
Soumith Chintala	13b1580613	add F.pad to docs	2017-03-15 00:09:14 -04:00
Sam Gross	34ce58c909	Parallelize backwards	2017-03-03 11:26:00 -08:00
Sergey Zagoruyko	12efd53dba	ConstantPad2d and F.pad (#856 )	2017-03-01 19:39:44 +01:00
Ofir Press	5e1d6a3691	Update functional.py (#862 ) Fixed documentation error in conv3d	2017-02-27 10:42:02 -05:00
陈云	838842d4b2	fix documentation error. [issue #790 ](https://github.com/pytorch/pytorch/issues/790 ) (#831 )	2017-02-23 08:59:29 +01:00
Joo-Kyung Kim	336eeee895	kernel_size as the default stride for avg_pool1d (#744 ) Following the documentation, let stride to be kernel_size if stride is not provided.	2017-02-15 13:12:18 +05:30
Soumith Chintala	d4c9a3782b	billinear -> bilinear, docs for upsampling, improved docs for Unpooling, pep8 tests fix (#617 ) * billinear -> bilinear, docs for upsampling, improved docs for Unpooling, pep8 tests fix	2017-01-30 05:08:48 +05:30
Luke Yeager	3ed720079e	[pep8] Fix most remaining lint manually	2017-01-28 01:15:51 +01:00
Luke Yeager	e7c1e6a8e3	[pep8] Fix most lint automatically with autopep8 Here's the command I used to invoke autopep8 (in parallel!): git ls-files \| grep '\.py$' \| xargs -n1 -P`nproc` autopep8 -i Several rules are ignored in setup.cfg. The goal is to let autopep8 handle everything which it can handle safely, and to disable any rules which are tricky or controversial to address. We may want to come back and re-enable some of these rules later, but I'm trying to make this patch as safe as possible. Also configures flake8 to match pep8's behavior. Also configures TravisCI to check the whole project for lint.	2017-01-28 01:15:51 +01:00
Adam Paszke	f8d4f980b3	Add upsampling modules and functions	2017-01-24 17:30:50 -05:00
Alykhan Tejani	f8e89fbe11	fix docs for torch.nn.functional.conv1d (#536 )	2017-01-21 10:41:52 -05:00
Adam Paszke	ee4c77c59f	Docs improvements (#512 ) * Always compile .numpy() for all types * Add torch.nn.functional docs and hidden headers * Use sphinx to generate torchvision docs * Remove unused import in ffi utils	2017-01-19 17:28:49 -05:00
Sergey Zagoruyko	9c218b419f	kl_div and docs (#429 )	2017-01-17 19:24:01 -05:00
Adam Paszke	1dbf44c00d	Add SmoothL1Loss to functional	2017-01-16 12:59:47 -05:00
Sam Gross	3a07228509	Add ConvTranspose1d module (#449 )	2017-01-13 15:22:57 -05:00
Sam Gross	24a2f2e3a0	Add MaxUnpool1d module (#447 )	2017-01-13 14:36:25 -05:00
Sam Gross	d5e45b2278	Add AvgPool1d which just uses AvgPool2d implementation (#439 )	2017-01-12 15:07:11 -05:00
Sam Gross	fd92470e23	Add cuDNN bindings for BatchNorm (#421 )	2017-01-07 15:35:24 -05:00
Adam Paszke	483490cc25	Move PixelShuffle implementation to functional	2016-12-30 23:02:57 +01:00
Adam Paszke	8d60e39fdc	Rename torch.nn.functions to torch.nn._functions	2016-12-30 23:02:57 +01:00
Sam Gross	c367e0b64e	Support dilated 1d and 3d convolutions (#372 ) Fixes #367	2016-12-29 18:20:32 -05:00
Sergey Zagoruyko	62af45d99f	Basic functional interface (#354 )	2016-12-29 22:53:57 +01:00

... 2 3 4 5 6 ...

316 Commits