pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Erik Brinkman	611a608517	Add ATen pdist CPU kernel (#10782 ) Summary: Also add single grad whitelist to the jit test Pull Request resolved: https://github.com/pytorch/pytorch/pull/10782 Reviewed By: ezyang Differential Revision: D9583378 Pulled By: erikbrinkman fbshipit-source-id: 069e5ae68ea7f3524dec39cf1d5fe9cd53941944	2018-08-30 11:55:27 -07:00
Roy Li	f2bb9f0bb5	speed up kl div loss (#10336 ) Summary: Moved kl div loss to aten. benchmarks for 5000 iterations on input size (1000,100) New ``` cuda: forward [0.9736350309103727, 0.9922929517924786, 0.9694818360731006] input requires_grad=True: backward [0.5595634011551738, 0.558339926879853, 0.5546616851352155] double backward [1.2445648494176567, 1.2245905152522027, 1.2349751549772918] target requires_grad=True: backward (new C++) [0.9489959231577814, 0.9553070571273565, 0.9556351029314101] double backward (new C++) [1.8184774098917842, 1.8164670099504292, 1.845708406995982] cpu: forward (new C++) [7.892430987209082, 8.3068826389499, 7.985283812973648] input requires_grad=True: backward (new C++) [4.328460982069373, 4.45323242014274, 4.27946363389492] double backward (new C++) [5.153504415880889, 4.629372010007501, 4.712803596165031] target requires_grad=True: backward (new C++) [3.4181493939831853, 3.3771288259886205, 3.7086612950079143] double backward (new C++) [0.21922698011621833, 0.1858532396145165, 0.19477044604718685] ``` Old ``` cuda: forward [3.101281268056482, 3.068499860819429, 3.0527669726870954] input requires_grad=True: backward [0.5650290949270129, 0.5730433077551425, 0.5588279226794839] double backward [1.1287697306834161, 1.13834543293342, 1.1298578432761133] target requires_grad=True: backward [0.9470391101203859, 0.9560198178514838, 0.9750375030562282] double backward [1.85760727385059, 1.7989214668050408, 1.788982989732176] cpu: forward (new C++) [12.474591840058565, 12.511441555805504, 12.666544185951352] input requires_grad=True: backward (new C++) [7.660991386976093, 7.449987292289734, 7.513917901087552] double backward (new C++) [4.073225498665124, 4.264980792999268, 4.429787891916931] target requires_grad=True: backward (new C++) [3.448499082121998, 3.9072313378565013, 3.2433970272541046] double backward (new C++) [2.126378359273076, 1.9045450473204255, 1.7932004742324352] ``` Pull Request resolved: 
https://github.com/pytorch/pytorch/pull/10336 Differential Revision: D9213636 Pulled By: li-roy fbshipit-source-id: 27cc530f6276f58d35dc7a1d56dfc758a0fc4a7b	2018-08-27 16:10:59 -07:00
Tongzhou Wang	d043f83019	Add tests for Tensor.* nn.* F.* docs (#10311 ) Summary: Test only for existence for now. I had to skip a lot of them, so there is a FIXME in the test. Also, I'm not testing torch.* because of a namespace issue. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10311 Differential Revision: D9196341 Pulled By: SsnL fbshipit-source-id: 9c2ca1ffe660bc1cc664474993f8a21198525ccc	2018-08-14 11:39:46 -07:00
Adam Paszke	adbcb3c1dc	Move dropout and alpha dropout to ATen (#10384 ) Summary: zdevito ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/10384 Reviewed By: ezyang Differential Revision: D9272583 Pulled By: apaszke fbshipit-source-id: ed5d37b28ce9ff25800bbaa0daf066cfbf1f9921	2018-08-10 14:55:28 -07:00
Tongzhou Wang	6a55238a3f	Grid sampler: nearest interpolation & reflection padding (#10051 ) Summary: closes #9702 . cc jph00 Commit structure: 1. Change the index calculation logic. I will explain using 1-D for simplicity. Previously we had (in pseudo code): ``` // 1. get the float locations from grid scalar_t x = from_grid() // 2. find the integral surrounding indices int x_left = floor(x) int x_right = x_left + 1 // 3. calculate the linear interpolate weights scalar_t w_left = x_right - x scalar_t w_right = x - x_left // 4. manipulate the integral surrounding indices if needed // (e.g., clip for border padding_mode) x_left = manipulate(x_left, padding_mode) x_right = manipulate(x_right, padding_mode) // 5. interpolate output_val = interpolate(w_left, w_right, x_left, x_right) ``` This is actually incorrect (and also unintuitive) because it calculates the weights before manipulating out-of-boundary indices. Fortunately, this isn't manifested in either of the currently supported modes, `'zeros'` and `'border'` padding: + `'zeros'`: doesn't clip + `'border'`: clips, but for out-of-bound `x` both `x_left` and `x_right` are clipped to the same value, so weights don't matter But this is a problem with reflection padding, since after each time we reflect, the values of `w_left` and `w_right` should be swapped. So in this commit I change the algorithm to (numbers corresponding to the ordering in the above pseudo-code) ``` 1. get float location 4. clip the float location 2. find the integral surrounding indices 3. calculate the linear interpolate weights ``` In the backward, because of this change, I need to add new variables to track `d manipulate_output / d manipulate_input`, which is basically a multiplier on the gradient calculated for `grid`. From benchmarking this addition doesn't cause obvious slowdowns. 2. Implement reflection padding. The indices will keep being reflected until they fall within the boundary. 
Added variants of `clip_coordinates` and `reflect_coordinates` to be used in backward. E.g., ```cpp // clip_coordinates_set_grad works similarly to clip_coordinates except that // it also returns the `d output / d input` via pointer argument `grad_in`. // This is useful in the backward pass of grid_sampler. scalar_t clip_coordinates_set_grad(scalar_t in, int64_t clip_limit, scalar_t *grad_in) ``` For example, if `in` is clipped in `'border'` mode, `grad_in` is set to `0`. If `in` is reflected an odd number of times in `'reflection'` mode, `grad_in` is set to `-1`. 3. Implement nearest interpolation. 4. Add test cases 5. Add better input checking Discussed with goldsborough for moving `operator<<` of `at::Device`, `at::DeviceType` and `at::Layout` into `at` namespace. (Otherwise `AT_CHECK` can't find them.) 6. Support empty tensors. cc gchanan + Make empty tensors not acceptable by cudnn. + Add `AT_ASSERT(kernel block size > 0)` if using `GET_BLOCKS` + Cache `numel` in `TensorGeometry` I was going to use `numel` to test if cudnn descriptor should accept a tensor, but it isn't used eventually. I can revert this if needed. 7. Add more test cases, including on input checking and empty tensors 8. Remove an obsolete comment 9. Update docs. Manually tested by generating docs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10051 Differential Revision: D9123950 Pulled By: SsnL fbshipit-source-id: ac3b4a0a36b39b5d02e83666cc6730111ce216f6	2018-08-10 12:43:27 -07:00
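The reordering described in the grid sampler commit above (manipulate the float location first, then find the neighbours and weights) can be illustrated in 1-D. This is a minimal pure-Python sketch, not the ATen kernel: `reflect_coordinate` and `sample_1d_reflect` are hypothetical names, and reflecting about the pixel centres `0` and `size - 1` is an assumption made for the example.

```python
import math


def reflect_coordinate(x, size):
    """Reflect x into [0, size - 1] and return (reflected, d_out/d_in).

    The derivative is +1 after an even number of reflections and -1
    after an odd number, mirroring the role of `grad_in` in the commit.
    Assumption: reflection happens about pixel centres 0 and size - 1.
    """
    if size == 1:
        return 0.0, 0.0
    span = size - 1
    period = 2 * span
    grad = -1.0 if x < 0 else 1.0        # taking abs() is one reflection at 0
    x_mod = math.fmod(abs(x), period)    # the pattern repeats every 2 * span
    if x_mod > span:
        return period - x_mod, -grad     # one more reflection at size - 1
    return x_mod, grad


def sample_1d_reflect(values, x):
    """Linear interpolation that reflects the location BEFORE computing weights."""
    x, _ = reflect_coordinate(x, len(values))
    left = int(math.floor(x))
    right = min(left + 1, len(values) - 1)
    w_right = x - left
    w_left = 1.0 - w_right
    return w_left * values[left] + w_right * values[right]


values = [0.0, 10.0, 20.0, 30.0]
inside = sample_1d_reflect(values, 1.25)
below = sample_1d_reflect(values, -0.5)   # reflects to 0.5
above = sample_1d_reflect(values, 3.5)    # reflects to 2.5
```

Because the weights are computed from the already reflected location, no swap of `w_left` and `w_right` is needed after a reflection.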
Wei Yang	149d4f776b	use logsigmoid at multilabel_soft_margin_loss, and change output from shape=(N, C) to (N,) (#9965 ) Summary: - fixes #9141, #9301 - use logsigmoid at multilabel_soft_margin_loss to make it more stable (NOT fixing legacy MultiLabelSoftMarginCriterion) - return (N,) instead of (N, C) to match the behavior of MultiMarginLoss - Note that with this PR, the following behavior is expected: ``` loss = F.multilabel_soft_margin_loss(outputs, labels, reduction='none') loss_mean = F.multilabel_soft_margin_loss(outputs, labels, reduction='elementwise_mean') loss_sum = F.multilabel_soft_margin_loss(outputs, labels, reduction='sum') loss.sum() == loss_sum # True loss.mean() == loss_mean # True ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/9965 Differential Revision: D9038402 Pulled By: weiyangfb fbshipit-source-id: 0fa94c7b3cd370ea62bd6333f1a0e9bd0b8ccbb9	2018-08-03 17:54:19 -07:00
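The shape change in the commit above, one loss value per sample instead of one per sample and class, can be sketched without PyTorch. This is an illustrative pure-Python version, assuming the per-sample loss is the mean over classes of the logsigmoid terms; the function names are placeholders, not the library implementation.

```python
import math


def logsigmoid(x):
    # log(sigmoid(x)) in a form that does not overflow for large |x|.
    return min(x, 0.0) - math.log1p(math.exp(-abs(x)))


def multilabel_soft_margin_loss(outputs, labels, reduction="elementwise_mean"):
    """outputs, labels: lists of N rows with C entries each.

    Assumption: the per-sample loss averages the C per-class terms,
    so reduction='none' yields N values rather than N x C.
    """
    per_sample = []
    for row, targets in zip(outputs, labels):
        total = sum(
            y * logsigmoid(x) + (1 - y) * logsigmoid(-x)
            for x, y in zip(row, targets)
        )
        per_sample.append(-total / len(row))
    if reduction == "none":
        return per_sample
    if reduction == "sum":
        return sum(per_sample)
    if reduction == "elementwise_mean":
        return sum(per_sample) / len(per_sample)
    raise ValueError("unknown reduction: " + reduction)


outputs = [[0.0, 0.0], [1.0, -1.0]]
labels = [[1, 0], [1, 0]]
loss = multilabel_soft_margin_loss(outputs, labels, reduction="none")
loss_sum = multilabel_soft_margin_loss(outputs, labels, reduction="sum")
loss_mean = multilabel_soft_margin_loss(outputs, labels)
```

With these definitions, summing or averaging the `'none'` result reproduces the `'sum'` and `'elementwise_mean'` results, which is the relationship the commit message states.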
Rob Kunkle	6e85112f12	Adding katex rendering of equations, and required edits to equations. (#8848 ) Summary: This fixes issue #8529. - Adds Katex extension to conf.py and requirements.txt - Fixes syntax differences in docs - Should allow documentation pages to render faster Pull Request resolved: https://github.com/pytorch/pytorch/pull/8848 Reviewed By: soumith Differential Revision: D8677702 Pulled By: goodlux fbshipit-source-id: c4a832c5879e0eebcb14763b35a41663331ba23f	2018-08-02 12:25:17 -07:00
Xiang Gao	6fc75eadf0	Add CELU activation to pytorch (#8551 ) Summary: Also fuse input scale multiplication into ELU Paper: https://arxiv.org/pdf/1704.07483.pdf Pull Request resolved: https://github.com/pytorch/pytorch/pull/8551 Differential Revision: D9088477 Pulled By: SsnL fbshipit-source-id: 877771bee251b27154058f2b67d747c9812c696b	2018-08-01 07:54:44 -07:00
Kyle M. Tarplee	aae37324cc	fixed a newly introduced regression in softmax (#10066 ) Summary: There is a regression in softmin in 0.4.1 that was not present in 0.4.0. The behavior of softmin(x) should match softmax(-x); however, it is instead implemented (in v0.4.1) as -softmax(x). These are not the same. The fix is trivial because the bug is due to operator precedence. This is a major regression that broke my training. I'm not sure how a unit test did not catch this. ``` x = torch.tensor([1, 2, 3.5, 4]) print(F.softmin(x, dim=0)) # this has the wrong output in 0.4.1 but correct in 0.4.0 print(F.softmax(-x, dim=0)) # this is what softmin should be print(F.softmax(x, dim=0)) print(-F.softmax(x, dim=0)) # this is how softmin is implemented incorrectly ``` In 0.4.1 this produces tensor([-0.0278, -0.0755, -0.3385, -0.5581]) tensor([0.6668, 0.2453, 0.0547, 0.0332]) tensor([0.0278, 0.0755, 0.3385, 0.5581]) tensor([-0.0278, -0.0755, -0.3385, -0.5581]) In 0.4.0 this produces the correct values tensor([ 0.6668, 0.2453, 0.0547, 0.0332]) tensor([ 0.6668, 0.2453, 0.0547, 0.0332]) tensor([ 0.0278, 0.0755, 0.3385, 0.5581]) tensor([-0.0278, -0.0755, -0.3385, -0.5581]) Pull Request resolved: https://github.com/pytorch/pytorch/pull/10066 Differential Revision: D9106995 Pulled By: soumith fbshipit-source-id: 7332503c6077e8461ad6cd72422c749cf6ca595b	2018-07-31 19:28:30 -07:00
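The difference between the intended and the regressed behavior in the commit above can be reproduced with the standard library alone. This is a small sketch using plain lists instead of tensors; `softmax` and `softmin` here are local helper functions, not the PyTorch ones.

```python
import math


def softmax(xs):
    # Subtract the maximum first so exp() cannot overflow.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def softmin(xs):
    # Intended definition: negate the input, not the output.
    return softmax([-x for x in xs])


x = [1.0, 2.0, 3.5, 4.0]
correct = softmin(x)                  # softmax(-x)
regressed = [-p for p in softmax(x)]  # -softmax(x), the 0.4.1 behavior
```

The correct result is a probability distribution that favors the smallest input, while the regressed result is a vector of negative numbers that sums to -1.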
Roy Li	2422801625	fix _pointwise_loss for target gradients (#10018 ) Summary: _pointwise loss has some python special casing, we converted reduction to aten enums too early. fixes #10009 Pull Request resolved: https://github.com/pytorch/pytorch/pull/10018 Differential Revision: D9075489 Pulled By: li-roy fbshipit-source-id: 4bf2f5e2911e757602c699ee1ec58223c61d0162	2018-07-31 13:39:58 -07:00
Thomas Viehmann	685224aa14	Add CTC loss (#9628 ) Summary: The CPU and CUDA variants are a direct transposition of Graves et al.'s description of the algorithm with the modification that it is in log space. There is also a binding for the (much faster) CuDNN implementation. This could eventually fix #3420. I still need to add tests (TestNN seems much more elaborate than the other testing) and fix the bugs that invariably turn up during the testing. Also, I want to add some more code comments. I could use feedback on all sorts of things, including: - Type handling (cuda vs. cpu for the int tensors, dtype for the int tensors) - Input convention. I use log probs because that is what the gradients are for. - Launch parameters for the kernels - Errors and omissions and anything else I'm not even aware of. Thank you for looking! In terms of performance it looks like it is superficially comparable to WarpCTC (and thus, but I have not systematically investigated this). I have read CuDNN is much faster than implementations because it does not use log-space, but also the gathering step is much much faster (but I avoided trying tricky things, it seems to contribute to warpctc's fragility). I might think some more which existing torch function (scatter or index..) I could learn from for that step. Average timings for the kernels from nvprof for some size: ``` CuDNN: 60.464us compute_alphas_and_betas 16.755us compute_grads_deterministic Cuda: 121.06us ctc_loss_backward_collect_gpu_kernel (= grads) 109.88us ctc_loss_gpu_kernel (= alphas) 98.517us ctc_loss_backward_betas_gpu_kernel (= betas) WarpCTC: 299.74us compute_betas_and_grad_kernel 66.977us compute_alpha_kernel ``` Of course, I still have the (silly) outer blocks loop rather than computing consecutive `s` in each thread which I might change, and there are a few other things where one could look for better implementations. 
Finally, it might not be unreasonable to start with these implementations, as the performance of the loss has to be seen in the context of the entire training computation, so this would likely dilute the relative speedup considerably. My performance measurement script: ``` import timeit import sys import torch num_labels = 10 target_length = 30 input_length = 50 eps = 1e-5 BLANK = 0#num_labels batch_size = 16 torch.manual_seed(5) activations = torch.randn(input_length, batch_size, num_labels + 1) log_probs = torch.log_softmax(activations, 2) probs = torch.exp(log_probs) targets = torch.randint(1, num_labels+1, (batch_size * target_length,), dtype=torch.long) targets_2d = targets.view(batch_size, target_length) target_lengths = torch.tensor(batch_size * [target_length]) input_lengths = torch.tensor(batch_size * [input_length]) activations = log_probs.detach() def time_cuda_ctc_loss(grout, *args): torch.cuda.synchronize() culo, culog_alpha = torch._ctc_loss(*args) g, = torch.autograd.grad(culo, args[0], grout) torch.cuda.synchronize() def time_cudnn_ctc_loss(grout, *args): torch.cuda.synchronize() culo, cugra= torch._cudnn_ctc_loss(*args) g, = torch.autograd.grad(culo, args[0], grout) torch.cuda.synchronize() def time_warp_ctc_loss(grout, *args): torch.cuda.synchronize() culo = warpctc.ctc_loss(*args, blank_label=BLANK, size_average=False, length_average=False, reduce=False) g, = torch.autograd.grad(culo, args[0], grout) torch.cuda.synchronize() if sys.argv[1] == 'cuda': lpcu = log_probs.float().cuda().detach().requires_grad_() args = [lpcu, targets_2d.cuda(), input_lengths.cuda(), target_lengths.cuda(), BLANK] grout = lpcu.new_ones((batch_size,)) torch.cuda.synchronize() print(timeit.repeat("time_cuda_ctc_loss(grout, *args)", number=1000, globals=globals())) elif sys.argv[1] == 'cudnn': lpcu = log_probs.float().cuda().detach().requires_grad_() args = [lpcu, targets.int(), input_lengths.int(), target_lengths.int(), BLANK, True] grout = lpcu.new_ones((batch_size,)) 
torch.cuda.synchronize() print(timeit.repeat("time_cudnn_ctc_loss(grout, *args)", number=1000, globals=globals())) elif sys.argv[1] == 'warpctc': import warpctc activations = activations.cuda().detach().requires_grad_() args = [activations, input_lengths.int(), targets.int(), target_lengths.int()] grout = activations.new_ones((batch_size,), device='cpu') torch.cuda.synchronize() print(timeit.repeat("time_warp_ctc_loss(grout, *args)", number=1000, globals=globals())) ``` I'll also link to a notebook that I used for writing up the algorithm in simple form and then test the implementations against it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/9628 Differential Revision: D8952453 Pulled By: ezyang fbshipit-source-id: 18e073f40c2d01a7c96c1cdd41f6c70a06e35860	2018-07-31 11:09:48 -07:00
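The CTC commit above says its CPU and CUDA kernels follow Graves et al.'s forward recursion carried out in log space. The following is a compact pure-Python sketch of that forward (alpha) pass for a single sequence. It is an illustration under the usual CTC definitions, not the ATen code, and `ctc_neg_log_likelihood` is a hypothetical name.

```python
import math

NEG_INF = float("-inf")


def logsumexp(*values):
    # log(sum(exp(v))) that tolerates -inf entries.
    m = max(values)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(v - m) for v in values))


def ctc_neg_log_likelihood(log_probs, target, blank=0):
    """log_probs[t][c]: log-probability of class c at time step t."""
    # Interleave blanks: [a, b] -> [blank, a, blank, b, blank].
    ext = [blank]
    for label in target:
        ext += [label, blank]
    n = len(ext)

    # Time step 0: a path may start with the leading blank or the first label.
    alpha = [NEG_INF] * n
    alpha[0] = log_probs[0][ext[0]]
    if n > 1:
        alpha[1] = log_probs[0][ext[1]]

    for t in range(1, len(log_probs)):
        prev, alpha = alpha, [NEG_INF] * n
        for s in range(n):
            terms = [prev[s]]                      # stay on the same symbol
            if s >= 1:
                terms.append(prev[s - 1])          # advance by one
            # Skipping a blank is allowed only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                terms.append(prev[s - 2])
            alpha[s] = logsumexp(*terms) + log_probs[t][ext[s]]

    # Valid paths end on the last label or on the trailing blank.
    tail = [alpha[n - 1]]
    if n > 1:
        tail.append(alpha[n - 2])
    return -logsumexp(*tail)


# Two time steps, classes {blank, 1}, uniform probabilities, target [1].
uniform = [[math.log(0.5), math.log(0.5)] for _ in range(2)]
nll = ctc_neg_log_likelihood(uniform, [1])
```

In this toy case three of the four equally likely paths ("blank 1", "1 blank", "1 1") collapse to the target, so the likelihood is 0.75.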
Adam Paszke	aa7af94656	Make JIT tracing a thread-local property (#9414 ) Summary: As in the title. Lets us simplify a lot of code. Depends on #9363, so please review only the last commit. zdevito Pull Request resolved: https://github.com/pytorch/pytorch/pull/9414 Reviewed By: zdevito Differential Revision: D8836496 Pulled By: apaszke fbshipit-source-id: 9b3c3d1f001a9dc522f8478abc005b6b86cfa3e3	2018-07-19 19:09:39 -07:00
tippisum	5c695e3a60	Implement 2D and 3D alpha_dropout (#9073 ) Summary: It implements per-channel alpha_dropout. It also creates corresponding function classes and unifies the process of dropout and alpha_dropout. Pull Request resolved: https://github.com/pytorch/pytorch/pull/9073 Differential Revision: D8727008 Pulled By: ezyang fbshipit-source-id: 9d509f9c5db4e98f7b698cdfc4443505a4d2b331	2018-07-17 17:10:16 -07:00
Roy Li	a47a30b9ce	Implement grid_sampler in aten (#8929 ) Summary: Partially addresses #8928. Maybe #7273? Pull Request resolved: https://github.com/pytorch/pytorch/pull/8929 Reviewed By: ezyang Differential Revision: D8668919 Pulled By: li-roy fbshipit-source-id: 8ad07b224d2ab211c274c4c10f042501efaae32c	2018-07-10 15:10:24 -07:00
Tongzhou Wang	e8536c08a1	Update extension docs, fix Fold/Unfold docs (#9239 ) Summary: Commits: 1. In extension doc, get rid of all references of `Variable` s (Closes #6947 ) + also add minor improvements + also added a section with links to cpp extension :) goldsborough + removed mentions of `autograd.Function.requires_grad` as it's not used anywhere and hardcoded to `return_Py_True`. 2. Fix several sphinx warnings 3. Change `*` in equations in `module/conv.py` to `\times` 4. Fix docs for `Fold` and `Unfold`. + Added better shape check for `Fold` (it previously may give bogus result when there are not enough blocks). Added test for the checks. 5. Fix doc saying `trtrs` not available for CUDA (#9247 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/9239 Reviewed By: soumith Differential Revision: D8762492 Pulled By: SsnL fbshipit-source-id: 13cd91128981a94493d5efdf250c40465f84346a	2018-07-08 19:09:39 -07:00
Ailing Zhang	227c8f2654	Implement nn.functional.interpolate based on upsample. (#8591 ) Summary: This PR addresses #5823. * fix docstring: upsample doesn't support LongTensor * Enable float scale up & down sampling for linear/bilinear/trilinear modes. (following SsnL 's commit) * Enable float scale up & down sampling for nearest mode. Note that our implementation is slightly different from TF that there's actually no "align_corners" concept in this mode. * Add a new interpolate function API to replace upsample. Add deprecate warning for upsample. * Add an area mode which is essentially Adaptive_average_pooling into resize_image. * Add test cases for interpolate in test_nn.py * Add a few comments to help understand linear interpolation code. There is only "cubic" mode missing in resize_images API which is pretty useful in practice. And it's labeled as hackamonth here #1552. I discussed with SsnL that we probably want to implement all new ops in ATen instead of THNN/THCUNN. Depending on the priority, I could either put it in my queue or leave it for a HAMer. After the change, the files named as Upsampling.c works for both up/down sampling. I could rename the files if needed. Differential Revision: D8729635 Pulled By: ailzhang fbshipit-source-id: a98dc5e1f587fce17606b5764db695366a6bb56b	2018-07-06 15:28:11 -07:00
Tongzhou Wang	7b25cbbef9	Test nn.Module on non-contiguous inputs (#9114 ) Summary: 1. Let `ModuleTest` raise when they fail on non-contiguous inputs. Fix legacy modules. 2. Fix BN (both THNN and cuDNN) not working on non-contiguous inputs. 3. Fix CUDA EmbeddingBag not working on non-contiguous inputs. To prevent calling `.contiguous()` in both `forward` and `backward`, a. prefix all current `embedding_bag` functions with `_`, indicating that they require input to be contiguous (there is a check in each function). b. create `embedding_bag`, which makes input arguments `.contiguous()`, and calls `_embedding_bag` 4. Make many ATen `embedding` functions work on non-contiguous inputs so we don't need to call `input = input.contiguous()` in Python `nn.functional.embedding`. 5. Fix dense-sparse addition when the sparse input is not coalesced and indices or values tensor is not contiguous. This came up in the test cases of Embedding modules with `sparse=True`. Added tests. 6. Update `TensorUtils.cpp` to use `AT_` macros. Request: review from cpuhrsch on the `Embedding` changes. review from ezyang on ATen sparse & BN changes. Closes https://github.com/pytorch/pytorch/pull/9114 Differential Revision: D8717299 Pulled By: SsnL fbshipit-source-id: 0acc6f1c9522b5b605361e75112c16bbe1e98527	2018-07-05 21:09:34 -07:00
Roy Li	21c786071b	update nn loss tests to use new reduction arg (#9118 ) Summary: The tests were using the old args, which caused them to emit a lot of deprecation warnings. closes #9103. Reviewed By: ezyang Differential Revision: D8720581 Pulled By: li-roy fbshipit-source-id: 3b79527f6fe862fb48b99a6394e8d7b89fc7a8c8	2018-07-02 19:41:57 -07:00
Wei Yang	cb1bfe91af	Deprecated several functions at torch.nn.functional (#8748 ) Summary: 1. fixes #6245 2. deprecated tanh, sigmoid Closes https://github.com/pytorch/pytorch/pull/8748 Differential Revision: D8697975 Pulled By: weiyangfb fbshipit-source-id: f30714aa0611a1fe870040692f3dbcc8238aece9	2018-07-02 15:54:46 -07:00
Roy Li	c61f0217a5	combine size_average and reduce args in loss functions (#8018 ) Summary: closes #7929 Closes https://github.com/pytorch/pytorch/pull/8018 Differential Revision: D8682540 Pulled By: li-roy fbshipit-source-id: 649170dd1a7f373151c1d4e949838bd1c5651936	2018-07-01 05:39:00 -07:00
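The commit above folds the two boolean loss arguments into a single `reduction` string. A minimal sketch of such a mapping follows, using the string values that appear elsewhere in this log (`'none'`, `'elementwise_mean'`, `'sum'`). The helper name `legacy_to_reduction` is hypothetical, and treating unset flags as `True` is an assumption about the legacy defaults.

```python
def legacy_to_reduction(size_average=None, reduce=None):
    """Translate the legacy (size_average, reduce) pair into one string."""
    # Assumption: both legacy flags behave as True when left unset.
    size_average = True if size_average is None else size_average
    reduce = True if reduce is None else reduce
    if not reduce:
        # Without reduction, size_average has no effect.
        return "none"
    return "elementwise_mean" if size_average else "sum"


default_mode = legacy_to_reduction()
summed_mode = legacy_to_reduction(size_average=False)
unreduced_mode = legacy_to_reduction(size_average=True, reduce=False)
```

A single string removes the combination in which `size_average` is set but ignored because `reduce=False`.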
Peter Goldsborough	f0772c0ab2	Replace max_pool with max_pool_with_indices (#8946 ) Summary: Re-push from https://github.com/pytorch/pytorch/pull/8892 Closes https://github.com/pytorch/pytorch/pull/8946 Differential Revision: D8666862 Pulled By: goldsborough fbshipit-source-id: 44cd3d63d347316818a7b0f5f89fce8ff7486736	2018-06-28 16:10:08 -07:00
Orion Reblitz-Richardson	9ec0a2aef4	fbshipit-source-id: ba600fcd2b5cefc7621357bdeb05e24cea02e5af	2018-06-27 04:50:56 -07:00
Peter Goldsborough	290d20b094	Replace max_pool with max_pool_with_indices (#8892 ) * Create max_poolXd_with_indices * Match ATen names in ONNX symbolic	2018-06-26 17:09:30 -07:00
Vadim Velikodniy	6e28d4d364	Add pos_weight argument to nn.BCEWithLogitsLoss (#5660 ) (#6856 ) * Add pos_weight argument to nn.BCEWithLogitsLoss and F.binary_cross_entropy_with_logits (#5660) - Add an option to control precision/recall in imbalanced datasets - Add tests (but new_criterion_tests) * Move pos_weight to the end of args list in the documentation. `pos_weight` was moved to the end because it is the last argument in both `nn.BCEWithLogitsLoss` and `binary_cross_entropy_with_logits`	2018-06-26 12:31:07 -04:00
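The effect of `pos_weight` from the commit above can be shown for a single logit. This is a pure-Python sketch assuming the commonly documented form, in which only the positive-target term is scaled; `bce_with_logits` is a local illustrative function, not the library call.

```python
import math


def log_sigmoid(x):
    # log(sigmoid(x)) without overflow for large |x|.
    return min(x, 0.0) - math.log1p(math.exp(-abs(x)))


def bce_with_logits(logit, target, pos_weight=1.0):
    """Binary cross entropy on a raw logit.

    Assumption: pos_weight multiplies only the positive-target term,
    so values above 1 penalize missed positives more heavily.
    """
    positive_term = pos_weight * target * log_sigmoid(logit)
    negative_term = (1.0 - target) * log_sigmoid(-logit)  # log(1 - sigmoid(x))
    return -(positive_term + negative_term)


plain = bce_with_logits(0.0, 1.0)
weighted = bce_with_logits(0.0, 1.0, pos_weight=3.0)
negative = bce_with_logits(0.0, 0.0, pos_weight=3.0)
```

Raising `pos_weight` increases the cost of false negatives relative to false positives, which is how it trades precision for recall on imbalanced data.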
Peter Goldsborough	8e98a1a84d	Create avg_pool1d in ATen (#8880 ) * Create avg_pool1d in ATen * Put function name into check1d method	2018-06-25 20:31:32 -07:00
li-roy	85f4d2b55a	throw error when grid_sample is passed unsupported mode (#8884 )	2018-06-25 22:37:41 -04:00
Tongzhou Wang	731273b8d6	Improve convT output_padding docs (#8825 ) * improve output_padding doc for convT modules * Update functional.py * Update conv.py * lint	2018-06-23 14:33:18 -04:00
Ailing	ddda7cfea5	allow output_size to contain None in adaptive pooling methods (#8596 ) * allow output_size to contain None in adaptive pooling methods * fix lint * address comments	2018-06-22 13:29:15 -04:00
Thomas Viehmann	0ae8b6c027	add fold example and add nn.Fold/nn.Unfold and F.fold/F.unfold to doc (#8600 ) * add fold example and add nn.Fold/nn.Unfold and F.fold/F.unfold to doc and a few drive-by doc fixes * typo	2018-06-18 09:36:42 -04:00
Wei Yang	ae55865a3b	Migrated hardshrink() to ATen and deprecated nn.Hardshrink() (#8117 ) * 1. added hardshrink() to ATen (CPU + GPU); 2. removed nn.Hardshrink(); 3. reusing previous tests for nn.Hardshrink() and included CUDA tests at test_nn; 4. default parameter lambda=0.5 is not working yet * optimized memory read/write * 1. pass in lambd as scalar for CPU/CUDA_apply; 2. removed tests for hardshrink at test_legacy_nn fixes test_utils * 1. replace zeros_like with empty_like; 2. use scalar_cast in cuda * 1. printing lambd value; 2. default lambd=0.5 is still failing * getting around Scalar bug by removing default value of lambd from native_functions.yaml, and declaring it at nn/functional.py * cleaned up debug printf	2018-06-14 16:42:20 -04:00
Tongzhou Wang	a77b391de7	[SpectralNorm] don't register original weight as buffer (#8170 ) * don't register original weight as buffer; fixes for buffers that require grad * add test	2018-06-12 14:42:05 -04:00
Tongzhou Wang	f9926e4ce5	Fix EmbeddingBag max_norm option (#7959 ) * fix EmbeddingBag max_norm option * flake8 * add warning to the embedding bag arg change	2018-05-31 09:42:56 -04:00
Vedaanta Agarwalla	215fe057ea	No Default argument to max_unpool functions (Fixes #7327 ) (#7388 ) * Fix for Issue #7327 * Added testcase for max_unpool	2018-05-29 15:02:23 -04:00
ngimel	a015d579dd	move softmax/logsoftmax to ATen (#6786 ) * move softmax/logsoftmax to ATen * specify cpu and gpu accum types * use accreal for CPU * expose softmax backward to python, fix legacy interface * fix Distributions.cu to use common AccumulateType * fix cuda 8 build * delete commented out lines * rebase on master, fix breakages	2018-05-04 14:23:35 -04:00
Ethan Steinberg	ee00a8049a	Add max pooling support to EmbeddingBag (#5725 ) * Add max mode support to EmbeddingBag * Lint fix * Fix compilation issue on other platforms * Rebase + don't waste memory when not in max mode * Oops, missed a spot * Fix whitespace from merge * less precision * Lower precision to avoid spurious failures * Minor typo * Switch to size()	2018-04-29 16:48:11 -04:00
Emanuel Jöbstl	645ad7ad0c	Fixing LP-Pooling stability issues (#6766 ) * Added ReLU unit to LP pooling, so the gradient does not become NAN if all inputs are zero. * Added workaround for odd p. Added a bit of doc. * Make the linter happy.	2018-04-25 22:13:15 -04:00
li-roy	d564ecb4a5	Update docs with new tensor repr (#6454 ) * Update docs with new tensor repr * remove cuda in dtype * remove changes to gloo submodule * [docs] document tensor.new_* ctor * [docs] Add docs for tensor.to(), tensor.float(), etc * [docs] Moar examples for docs. * [docs] Warning for tensor ctor copy behavior * Quick fix * [docs] Document requires_grad_() * [docs] Add example for requires_grad_() * update slogdet and fft update tensor rst * small fixes * update some docs * additional doc changes * update torch and tensor docs * finish changing tensor docs * fix flake8 * slogdet with negative det * Update functional.py tensor ctors * Fix nll_loss docs * reorder to move device up * torch.LongTensor -> torch.tensor or torch.empty in docs * update tensor constructors in docs * change tensor constructors * change constructors * change more Tensor() to tensor() * Show requires_grads_ docs * Fix set_default_dtype docs * Link to torch.no_grad, etc, from torch doc * Add dtype aliases to table * regen docs again * Tensor attributes stub page * link to inplace sampling * Link torch.dtype, device, and layout * fix dots after nonfinite floats * better layout docs	2018-04-21 07:35:37 -04:00
Thomas Viehmann	533beab5bb	Fix doc for torch.nn.functional.relu (fixes #6742 ) (#6749 ) Thank you Shengyi Qian (JasonQSY) for spotting and reporting.	2018-04-19 11:25:43 +02:00
Tongzhou Wang	1c01eabd3c	Codemod to update our codebase to 0.4 standard (#6641 ) * Codemod to update our codebase to 0.4 standard * Update some of the test scripts * remove Variable in test_clip_grad_value * fix _symbolic_override_wrapper_maker	2018-04-17 22:06:54 -04:00
Mike Vella	d5f041aa8b	Updated documentation for cross entropy loss to include multi-dimensional input shapes (#6638 )	2018-04-17 09:56:43 -04:00
Yannick Soom	fd6d11ae66	Fixed text of error message in case of unexpected target size (#6617 )	2018-04-16 11:27:02 -04:00
Tongzhou Wang	59bda9a8c4	Fix reflection padding boundary checks (#6438 ) * Fix Reflection padding boundary checks * Improve padding docs * fix lint	2018-04-10 10:37:01 -04:00
Kento NOZAWA	3b58b859b2	Fix typos in docs (#6389 )	2018-04-07 12:41:15 -04:00
Tongzhou Wang	48ad4546d2	Move LayerNorm to ATen; remove tracking_running_stats functionality (#5983 ) * move LN to aten; remove tracking_stats functionality * Address comments about error message and respect cudnn flag for LayerNorm and GroupNorm	2018-03-30 09:44:11 -07:00
Richard Zou	371e14b807	NLLLoss: error message for mismatched input/target batch sizes (#6072 ) Fixes #5554 Adds an error message for when NLLLoss is passed an input and target whose batch sizes don't match. Ideally this check should live in ATen but since there is NLLLoss logic in python the check is there right now.	2018-03-28 14:21:38 -07:00
sundw2014	8964aab260	fix docs error in torch.nn.functional.nll_loss (#6060 ) According to the code in _torch/nn/functional.py:1399_ (```if target.size()[1:] != input.size()[2:]:```), if the size of input is (N, C, d_1, d_2, ..., d_K), the size of target should be (N, d_1, d_2, ..., d_K).	2018-03-28 10:05:14 +02:00
Tongzhou Wang	39829c1670	Improve docs (#5999 ) * Clarify det and svd doc on when backward is not stable * Fix some links in nn.functional doc; improve upsampling doc	2018-03-26 14:09:11 -04:00
Tongzhou Wang	5d77709485	Linearly interpolating upsampling fix (#5927 ) * Changes in bilinear upsampling * Add align_corners option to upsampling module & functional when using linearly interpolating modes When align_corners=True, it uses the old original upsampling scheme, which gives visually better results, but doesn't properly align input and output pixels, and thus causes the output to vary based on the input. This PR adds this align_corners option, and changes the default behavior to align_corners=False, with a proper warning if this option is not specified when using nn.Upsample or nn.functional.upsample, to make users aware of this change. Adds tests in test_nn.py for spatial invariance when align_corners=False, and the usual module tests for align_corners=False. * remove redundant checks and unnecessary variables; fix the cast * fix negative indices	2018-03-24 12:21:13 -04:00
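The two alignment schemes contrasted in the commit above differ in how an output index is mapped back to a source coordinate. The sketch below shows one common formulation in 1-D. The helper name `source_index`, the half-pixel formula for `align_corners=False`, and the clamp of negative coordinates to zero are all assumptions for illustration, not a transcription of the kernel.

```python
def source_index(dst, in_size, out_size, align_corners):
    """Map an output index to a (fractional) input coordinate in 1-D."""
    if align_corners:
        # Corner pixels of input and output coincide exactly.
        if out_size == 1:
            return 0.0
        return dst * (in_size - 1) / (out_size - 1)
    # Assumption: pixels are treated as unit-width cells whose centres
    # are aligned; coordinates left of the first centre are clamped to 0.
    src = (dst + 0.5) * in_size / out_size - 0.5
    return max(src, 0.0)


# Upsampling a 2-pixel row to 4 pixels.
aligned = [source_index(i, 2, 4, True) for i in range(4)]
centred = [source_index(i, 2, 4, False) for i in range(4)]
```

Under the first mapping the step between source coordinates is `(in_size - 1) / (out_size - 1)`, while under the second it is the plain ratio `in_size / out_size`.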
Tongzhou Wang	08891b0a4e	Group Normalization (#5968 ) * Group Normalization * move to ATen	2018-03-24 12:16:18 -04:00
Vedanuj Goswami	f3e16cc737	Expose gradients w.r.t. input & weight for conv1d, conv2d, conv3d in Python (#5408 ) This PR addresses issue #5024 * Expose Conv2dBackward in python * Separate interface for exposing gradients of operators * Revert old changes * Add tests * Add conv1d gradients. Refactor tests for grad convolutions * Refactor names and change examples * Remove Variable from tests for conv backward	2018-03-23 17:49:32 -04:00
li-roy	e4eee7c2cf	Implement MarginRankingLoss as native function and add reduce=True arg to it (#5346 ) * add reduce=True arg to MarginRankingLoss * make default margin arg match for legacy * remove accidentally added test * fix test * fix native_functions.yaml alphabetical order	2018-03-21 15:40:58 -04:00
li-roy	1dcad08537	Support N-D tensors in Bilinear (#5764 ) * support n-d inputs in bilinear and move to aten * support n-d inputs in bilinear and move to aten * add asserts to bilinear inputs * address comments * cast int64_t in asserts	2018-03-17 11:57:43 -04:00
li-roy	e876b5d9d0	implement TripletMarginLoss as a native function (#5680 ) * implement TripletMarginLoss as a native function * implement TripletMarginLoss as native function * fix compile error * address comments * address comments * Add keepdim arg to pairwise distance	2018-03-17 11:10:48 -04:00
Peter Goldsborough	effc568cee	Add ReLU to ATen (#5626 )	2018-03-13 19:23:24 +01:00
Vishwak Srinivasan	76a283db40	[ready] General Documentation Improvements - 2 (#5685 ) * Fix some minor errors in existing docs. * Fix Convolution and Pooling docs in torch.nn.functional * Cleaned up torch.nn.functional docs * Address @SsnL 's comments * Add multiplication sign missing in docs * Fix more typos, and clear some warnings * Change infinity symbol in LPPool2d * Revert some changes in torch.nn.functional * Few more minor changes	2018-03-13 09:47:43 -04:00
li-roy	4c4a42b3f9	implement CosineEmbeddingLoss as a native function and add reduce arg (#5646 ) * implement CosineEmbeddingLoss as a native function and add reduce=True arg to it * fix flake8 * address comments * add reference function to tests * fix flake8	2018-03-08 17:54:24 -05:00
Edward Z. Yang	9de922991c	Revert "implement CosineEmbeddingLoss as a native function and add reduce arg" (#5640 ) * Revert "implement CosineEmbeddingLoss as a native function and add reduce arg (#5447)" This reverts commit `c16478fe3f`.	2018-03-08 14:07:17 -05:00
li-roy	c16478fe3f	implement CosineEmbeddingLoss as a native function and add reduce arg (#5447 ) forward (new) [1.1905965859768912, 1.160144692985341, 1.1558120870031416] backward (new) [1.9150976981036365, 1.9792822760064155, 1.8779143309220672] double backward (new) [3.6898688060464337, 3.5784677929477766, 3.569505032035522] forward (old) [3.2359962839400396, 3.275224728975445, 3.3409753759624436] backward (old) [5.668679727939889, 5.722980880062096, 5.585088661056943] double backward (old) N/A * implement CosineEmbeddingLoss as a native function and add reduce=True arg to it * fix flake8 * address comments * add reference function to tests * fix flake8	2018-03-08 13:15:12 -05:00
Francisco Massa	0f50ca0b48	Add reduce to functional smooth_l1 documentation (#5610 ) This has been present in master since https://github.com/pytorch/pytorch/pull/3382 but the doc for the functional interface was not taken into account.	2018-03-07 10:16:40 -05:00
cjsg	15eae9543e	Fixed dimensions in docs of conv and conv_transpose (#5543 )	2018-03-03 05:49:01 -05:00
Edward Z. Yang	f064c5aa33	Expunge all occurrences of torch._C._VariableFunctions (#5525 ) Some of the call-sites now look a little hokey with this removed, saving that for another patch. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2018-03-02 12:19:44 -05:00
Tongzhou Wang	27265503ad	nn.* doc update after Variable/Tensor merge (#5459 ) The nn.* counterpart of #5443 . Mostly removed Variable wrapper. Also added doc for nn.RReLU. Notice that torch.randn(*, requires_grad=True) isn't documented until #5462 is done.	2018-03-01 18:11:39 -05:00
Soumith Chintala	36abf023bd	Added 3d grid sampler (for volumetric transformer networks) (#5453 ) * add 3d grid_sample * add cuda implementation, more testing	2018-02-28 19:32:15 -05:00
li-roy	5bbeb55f22	add reduce=True arg to MultiMarginLoss (#5150 ) * add reduce=True arg to MultiMarginLoss * Change tests to support legacy * fix flake8 * address comments * formatting change * remove free of unallocated tensor * fix after variable/tensor merge	2018-02-27 18:35:50 -05:00
Sam Gross	30ec06c140	Merge Variable and Tensor classes (#5225 ) This replaces the torch.Tensor constructors with factories that produce Variables. Similarly, functions on the torch module (e.g. torch.randn) now return Variables. To keep the PR to a reasonable size, I've left most of the unused tensor code. Subsequent PRs will remove the dead code, clean-up calls to torch.autograd.Variable, and rename Variable to Tensor everywhere. There are some breaking changes because Variable and Tensors had slightly different semantics. There's a list of those changes here: https://github.com/pytorch/pytorch/wiki/Breaking-Changes-from-Variable-and-Tensor-merge	2018-02-23 18:03:31 -05:00
Tongzhou Wang	1848cad108	[ready] Layer Normalization (#4922 ) * at::maybe_data_ptr and Check.h => TensorUtils.h * THNN support for optional BN running_* * ATen support for optional BN running_* * Python nn.* support for optional BN running_*; Improve IN and BN doc Add tests for IN and BN new option * Layer Norm * Fix LRN doc * functional interface for LN and IN * Layer norm tests * fix BN double backward returning undefined tensors * fix jit test using wrong dim inputs for BN * add/improve BN, IN and LN GPU tests with half type * Update docs to be consistent with Conv notation Fix onnx Clarified onnx symbolic wrapper * fix typo * Address comments	2018-02-22 11:56:41 -05:00
li-roy	68aed0779d	add reduce=True arg to MultiLabelSoftMarginLoss (#5097 ) * add reduce=True arg to MultiLabelSoftMarginLoss * Move some tests to new_criterion_tests * fix flake8 * fix multilabelsoftmarginloss weights test	2018-02-15 15:29:44 -05:00
Richard Zou	ab18aaeba7	Clarify output shapes of reduce=False losses (#5082 )	2018-02-13 10:11:14 -08:00
li-roy	147612e64a	add reduce=True arg to SoftMarginLoss (#5071 ) * add reduce=True arg to SoftMarginLoss * add reference function for SoftMarginLoss * Rebase onto master * Address comments * Fix flake8 * Fix rebase error	2018-02-13 10:51:57 -05:00
cpuhrsch	07be53b57f	Move EmbeddingBag into ATen (#4856 ) This diff creates code related to EmbeddingBag in ATen. It also allows sparse gradients.	2018-02-12 14:20:32 -05:00
li-roy	ce5702fa80	add reduce=True arg to HingeEmbeddingLoss (#5130 ) * add reduce=True arg to HingeEmbeddingLoss * pass arg to super constructor in HingeEmbeddingLoss * make HingeEmbeddingLoss reference fn work on legacy	2018-02-09 11:38:36 -05:00
gchanan	affe742d31	Add scalar module tests for test_nn. (#5116 ) * Add scalar module tests for test_nn. * Properly return from glu. * Guard scalar test with skipIf.	2018-02-08 13:53:24 -05:00
Lu Fang	c111cdfd1d	Add onnx support for InstanceNorm (#4626 ) * Add ONNX symbolic for instancenorm * Fix some bugs	2018-02-07 10:54:30 -05:00
gchanan	7af433deeb	Add scalar criterion tests (#5087 ) * Add criterion scalar tests. This exposed an issue in MarginRankingLoss with scalars, but the cleanest way to fix is to wait until forward runs on Variables (so we don't have to wait for the backward to check if something is a scalar). * Fix flake8. * Add error message for margin_ranking_loss with scalars.	2018-02-06 18:40:37 -05:00
gchanan	fcccd07cc0	Implement hinge_embedding_loss as a native function. (#5080 )	2018-02-06 14:43:36 -05:00
li-roy	28f056fed2	add reduce=True argument to MultiLabelMarginLoss (#4924 ) * add reduce=True argument to MultiLabelMarginLoss * Fix lint * Addressed comments * Remove unneeded syncthreads calls	2018-02-05 12:28:51 -05:00
Richard Zou	e4ddbeb554	Fix typo (#4846 )	2018-01-25 10:33:45 -05:00
Richard Zou	b997474a4f	Adds Im2Col and Col2Im (#4729 )	2018-01-19 09:37:53 -05:00
Sam Gross	57549b7e44	Bind functions with out= arguments in VariableType (#4565 ) This adds overrides in VariableType for the xxx_out ATen functions and implements Python bindings. There is no support for automatic differentiation. If any of the inputs (or outputs) requires grad, then the function will throw an exception unless it's running in "no-grad" mode. The bindings for calling torch.xxx functions on Variables are moved to a different object. Previously, they were static method on VariableBase. This change prevents users from accidentally calling static methods as if they were instance methods.	2018-01-17 18:27:42 -05:00
Sam Gross	cb83474a57	Fix embedding with sparse=True (#4686 ) Fixes #4666	2018-01-16 16:19:20 -05:00
Kai Arulkumaran	2260649fb6	Local Response Normalization (#4667 ) * Local Response Normalization * Add 1D and 3D LRN * Generalise LRN to higher dims * Use mean instead of sum Specify 'across-channels'	2018-01-15 22:23:51 -05:00
David Pollack	05908e8243	current code works with dim = 3, so I added it to dim checks	2018-01-13 12:58:08 +01:00
Riddhiman Dasgupta	f99c7d9429	Padding_idx in Embedding supports negative indexing (#4496 )	2018-01-09 12:04:11 +01:00
Neeraj Pradhan	408c84de7c	Supporting logits as parameters in Bernoulli and Categorical (#4448 ) * Supporting logits as parameters in Bernoulli and Categorical * address comments * fix lint * modify binary_cross_entropy_with_logits * address comments * add descriptor for lazy attributes * address comments	2018-01-05 03:45:05 -05:00
Richard Zou	35c4d73bdb	Deprecate nn.NLLLoss2d (#4238 ) * Deprecate nn.NLLLoss2d * Fix legacy tests * Fix tests * Remove NLLLoss2d from docs, add deprecation warning instead of error * fix lint * Add more to docs	2018-01-04 12:38:04 -05:00
Hugh Perkins	fc0d940c5e	add gumbel_softmax, based on Eric Jang's implementation (#3341 ) * add gumbel_softmax, based on Eric Jang's implementation * Make gumbel_softmax CUDA friendly * gumbel_softmax tweaks	2018-01-04 12:23:21 -05:00
Sam Gross	20b5e82155	Implement embedding in ATen (#4322 ) Implements nn.Embedding (lookup table) in ATen. Breaking change: new optional argument padding_idx in F.embedding to match nn.Embedding. Note that there are a few bugs in Embedding that are inherited from the previous code: - CUDA renorm has race conditions if index contains duplicate entries - sparse gradient doesn't work with scale_grad_by_freq	2018-01-02 15:44:46 -05:00
Sam Gross	98f71912b0	Fix type signature of in-place NN functions (#4389 ) This is a step towards removing the special casing of NN functions in gen_variable_type.py. It fixes the signature of in-place NN functions so that they return Tensor & instead of Tensor.	2017-12-28 16:50:09 -05:00
Sam Gross	4dba674324	Move fractional max pooling to ATen (#4290 )	2017-12-21 17:07:46 -05:00
Edward Z. Yang	5f7c5502b8	Further improvements to ATen convolution (#4287 ) - Rename THNN convolution to have thnn_ prefix. - Propagate CuDNN benchmark and deterministic to at::Context - Add 'convolution', 'convNd' and 'conv_transposeNd' native wrappers, with defaults The conv_transposeNd wrappers are updated to have the same argument order as Python. - torch.nn.functional directly dispatches to the native wrappers - Make it possible to turn off tracing for some native wrappers, so I don't have to write symbolics for all the functions above - Spectral ops can now make use of CuDNN convolution if possible - Better commentary on cudnn_batch_norm - Turn on DCE for all JIT tests. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-12-21 13:03:43 -05:00
Edward Z. Yang	5b8fe5cbb5	Batchnorm in ATen (#4285 ) * Batchnorm in ATen This commit moves BatchNorm derivatives into ATen, eliminating torch/csrc/autograd/functions/batch_normalization.cpp Some refactoring along the way: - Functions got renamed to remove _forward from their names - CuDNN batchnorm forward was modified to return save_mean/save_std instead of take it as parameters. To avoid returning undefined Variables, these return (small) uninitialized tensors when they are not used. - THNN batch normalization takes care of resizing save_mean and save_std on forward. - There are some shenanigans re batchnorm backwards in eval mode. I'm tracking that in #4284 - I decided not to introduce buffers as a proper concept in ATen, which means that tensors like running_mean/running_var are variables in ATen. This meant there needed to be some adjustments to how we trace such variables; the new strategy is if we can't find a Value for a variable, we look and see if we have a Value for the buffer pointed to by the variable, before finally falling back on constant. - This PR finally reliably triggered OOM on Travis builds; I fixed this by reducing the number of parallel jobs. - Stop using std::string when it's not necessary. - Remove training parameter from cudnn_batch_norm_backward, because it doesn't make sense; cuDNN doesn't implement the math for evaluation mode batchnorm backwards. - batchnorm_double_backward is now in an anonymous namespace, as it no longer needs to be called from torch/csrc Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-12-21 11:38:31 -05:00
Sam Gross	b6a30f7ede	Move SELU to ATen (#4269 ) Fuse scale multiplication into ELU	2017-12-20 16:32:21 -05:00
Sam Gross	dad4b2d6cc	Move adaptive avg/max pool1d to ATen (#4266 )	2017-12-20 15:50:17 -05:00
Sam Gross	689ef9cba3	Move upsampling to ATen (#4264 )	2017-12-20 15:12:07 -05:00
Edward Z. Yang	a88a8ec827	Convolution derivatives in ATen (#4116 ) * Convolution derivatives in ATen This PR introduces ATen implementation of convolution, which dispatches to THNN/CuDNN/nnpack based on input parameters. The general strategy is to compose this function out of the various forward-backward pairs of specific implementations, rather than write a monolithic function with backwards (which is what we did before because the boilerplate of doing it otherwise would have been very high.) The new API provides the following functions: - _convolution, which is a fully generic, native convolution implementation that dispatches to various other convolution implementations depending on input characteristics. This is prefixed with an underscore because it explicitly takes benchmark, deterministic and cudnn_enabled which are implementation details for CuDNN. The intent is to eventually provide a convolution that reads these parameters out of the context using #4104. - _convolution_nogroup is a convolution implementation for non-CuDNN algorithms which don't support group convolution natively. - _convolution_double_backward is the generic double-backwards implementation for convolution. In more detail: - Most functionality from torch/csrc/autograd/functions/convolution.cpp has been moved into aten/src/ATen/native/Convolution.cpp - We continue to make use of ConvParams, but we now construct the parameters upon entry to a function from the function signature (which does not use ConvParams; having convolution take ConvParams directly would require teaching the code generator how to accept these as parameters, complicating ATen's API model) and destruct them when making subprocedure calls. - I introduce a new idiom, input_r, which represents a const Tensor& reference, which will subsequently be assigned to a local Tensor input. This is helpful because a lot of the existing algorithms relied on being able to assign to locals, which is not permitted with a const reference. 
- The native argument parser now supports std::array<bool,2> inputs (NB: there MUST NOT be a space; this is the same hack as is applied to derivatives.yaml) - Native parser now supports Tensor? arguments, which indicates a nullable tensor. Previously this function was only used by NN methods. - Documentation updates on THNN library - I added an extra fgradInput argument to VolumetricConvolutionMM_updateOutput and VolumetricConvolutionMM_accGradParameters so that its buffer list lines up with the backward argument list. This makes it possible to write derivative for conv3d which previously was not supported (commented out in derivatives.yaml) - Extra double_backward declarations for all convolution backwards functions were added. - You can now use the syntax Tensor? in native_functions.yaml to indicate that a tensor argument is nullable. There are adjustments to propagate this to the Python argument parser. - NNPACK was ported to ATen, and ATen now builds and links against NNPACK if possible. New AT_NNPACK_ENABLED macro. The nnpack functions are nnpack_spatial_convolution. - Some modest CuDNN convolution refactoring to remove _forward from names. - There's a new cudnn_convolution_backward function to deal with the fact that CuDNN convolution double backward requires you to have computed all gradients in one go. - Variable set_flags now checks if the tensor is undefined, fixing a silent memory corruption. - checkSameType updated to not raise an exception if called with Variable arguments - "no ATen declaration found for" error message is improved to say what available declarations are - make_variable now accepts undefined tensors, and returns an undefined tensor in this case.	2017-12-20 14:19:27 -05:00
Sam Gross	b476d10c64	Move max_pool1d to ATen (#4257 )	2017-12-19 20:10:11 -05:00
Sam Gross	9495595520	Move reflection/replication padding to ATen (#4258 )	2017-12-19 18:57:14 -05:00
Sam Gross	227ef1fb60	Move adaptive avg pooling 2d/3d to ATen (#4254 ) Move adaptive avg pooling 2d/3d to ATen Also use ATen for softshrink	2017-12-19 15:45:33 -05:00
James Reed	cb4f6c3148	conv_tbc (#3730 ) attempt to rebase skip conv_tbc in preprocess_nn_functions Add conv_tbc symbolic Fix backward issue with dBias ConvTBC nn wrapper and unit test	2017-12-18 23:52:36 -05:00
Richard Zou	ccf4dc1525	Add reduce arg to BCELoss (#4231 ) * Add reduce arg to BCELoss * Fix test precision * reduce keyword for BCELoss in derivatives.yaml	2017-12-18 12:28:53 -05:00
Soumith Chintala	54d689253e	Revert "Add reduce arg to BCELoss" (#4221 ) * Revert "Add reduce arg to BCELoss (#3532)" This reverts commit `847c56aeb5`.	2017-12-18 03:13:09 -05:00
Richard Zou	847c56aeb5	Add reduce arg to BCELoss (#3532 ) * Add reduce arg to BCELoss * Fix test precision	2017-12-18 02:39:49 -05:00
Kevin Zakka	b86dc0c8ba	add reduce arg to PoissonNLLLoss (#3770 ) * add reduce arg to PoissonNLLLoss * fixed comments except reference function * fixed unit test * small indentation fix * fixing last comments by richard * lint check * another linting issue	2017-12-18 02:32:05 -05:00
Richard Zou	30e6898808	Implement NLLLossNd (#4035 ) * Implement NLLLossNd * Fix tests and typos * Fix tests	2017-12-18 02:16:16 -05:00
Emanuel Jöbstl	be1ef5e4a4	Added explicit tuple element-count to doc for Conv1d. (#4136 ) * Added explicit tuple element-count to doc for Conv1d.	2017-12-14 22:17:46 -05:00
Soumith Chintala	638b10d39b	fix softmax default dim for 1D Tensor	2017-12-01 19:20:04 -05:00
Edward Z. Yang	1c0fbd27a1	CuDNN bindings rewrite (into ATen) (#3666 ) * Comprehensive rewrite of Torch CuDNN bindings / a bit of ATen infra The executive summary is that this moves the torch/csrc/cudnn library into ATen, adding a number of new cudnn_ methods to ATen for batchnorm, convolution, affine grid generator and grid sampler. ATen infra changes: - TensorGeometry was moved to ATen - TensorGeometry was modified to make its interface resemble that of Tensor; in particular, sizes is no longer a field, it's a method. - AT_CUDA_ENABLED macro is set via ATen/Config.h header which is generated at cmake configure time. Fixes https://github.com/zdevito/ATen/issues/168 - Change AT_CUDA_ENABLED macro to be a function macro, so that we error if it is not defined - Introduce a new TensorArg class, which is a Tensor plus a little metadata. This helps us give good error messages when checking dimensions/shapes of tensors. Fixes https://github.com/zdevito/ATen/issues/169 - Also introduce a TensorGeometryArg class, for when you don't need the actual tensor data (which is most of the time.) - Add ATen/Check.h, which contains a number of utility functions for testing shapes, types and devices of input tensors. This will be particularly useful for native methods, which don't get code generated input testing code. These functions take a 'CheckedFrom' argument, at the moment just a string, which specifies some extra information about what function was doing the actual checking; this greatly improves error messages. - Many check functions take initializer lists, which let you test that all tensors have some property. This API is peculiar, in that we IGNORE undefined tensors in this case. This is handled by filterDefined. - Add AT_CUDNN_ENABLED macro - CuDNN linking from ATen was improved; for example, we now actually add the CuDNN headers to our include path. 
- Add some missing override specifiers to some methods - We now actually build tests with CUDA functionality accessible (previously, AT_CUDA_ENABLED was not defined, meaning that the headers were missing all CUDA-only functionality.) - Native functions now support giving explicit names to return outputs in yaml. This makes it possible to hook into the NN autogenerated derivatives codepath using native functions. CuDNN rewrite changes: - torch/csrc/cudnn now uses ATen (rather than passing around THVoidTensor) and lives in ATen. This lets us remove tensorPointer shenanigans. The functions are exposed to ATen as native functions described in aten/src/ATen/cudnn/cuDNN.yaml - ATen now builds and links against CuDNN when enabled. The cmake package script was taken from Caffe2. - Some header reorganization was done to help reduce dependencies on headers (this reorg is no longer used but I've kept it) - Rename CHECK to CUDNN_CHECK - Rip out old shape/type testing code in favor of modern ATen/Check.h interface using TensorArg. In many cases, increase the robustness of the checking code. - Change the inputs of the public facing functions, so that they can be bound by ATen - Delete THCState; this is retrieved from the global ATen context - Delete cudnnHandle_t, this is retrieved from the global Handles.h - Delete cudnnDataType_t, this is retrieved from the Tensor type - Delete Convolution class, instead its constituent arguments are passed individually - Change functions to return tensors, rather than take an appropriately sized output tensor as an input. - Redo how transposed convolution / backward convolution is implemented (knock on effect of returning tensors). Previously it was assumed that you would always pass an appropriately sized output tensor, but we don't want to do this anymore. For backwards, we instead give the desired output tensor (input, really) size, because that is readily available. 
For transposed* convolution, however, we take output_padding, and otherwise do the shape calculation. - Redo how legacy group convolution is implemented (knock on effect from porting cudnn to ATen.) Previously, group convolution was implemented by manually constructing sizes and strides and then outputting appropriate, with macros switching between individual groups and all-at-once based on CuDNN version. Now, the code looks exactly what you'd expect: there's a top-level wrapping function that supports group convolution no matter the version of CuDNN, and a low-level wrapper which supports only what CuDNN supports. The top-level function conditions on CuDNN version, and invokes the low-level interface 1 or n times. - There is now a debugging printer for tensor descriptors. - Convolution struct is replaced with ConvolutionArgs, which is not part of the public API but is used internally to conveniently pass around all of the arguments needed for Convolution. - Add some constexprs for well-known dimensions, reduce amount of magic numbers in code. - Put 'deterministic' in to ConvParams. Fixes #3659 - Lots more comments. - Some pessimizations, in the name of code clarity: - The descriptors are initialized on every invocation of convolution forward/backward. Previously, the descriptors were cached, so that you didn't have to initialize them again on backwards. This is difficult to support in the ATen interface so I didn't support it. - Legacy group convolution initializes its workspace for every group it performs. I did not feel motivated to fix this because the legacy codepath is already quite slow. - Affine grid generator and grid sampler automatically call contiguous on their arguments as necessary. 
- Batchnorm input checking is greatly beefed up, it now checks for the following input characteristics: - Definedness - GPU location - Type - Contiguity - Size PyTorch binding code changes - batchnorm now uses consistent var/data naming - batchnorm and convolution make use of new ATen bindings - Affine grid generator and grid sampler make use of ATen CuDNN bindings via derivatives.yaml. This means I had to restructure the code a little, since the THNN bindings still go through a legacy Python class. - I fixed some warnings: - s/friend class/friend struct/ on InterpreterStateImpl - Removed pessimizing move 'detached' in torch/csrc/autograd/variable.cpp - Removed unused pack_list on Scalar Signed-off-by: Edward Z. Yang <ezyang@fb.com> GCC 4.8 buildfix Signed-off-by: Edward Z. Yang <ezyang@fb.com> Add TensorGeometry to ATen.h Signed-off-by: Edward Z. Yang <ezyang@fb.com> CUDNN_CHECK Signed-off-by: Edward Z. Yang <ezyang@fb.com> Update TODO comment Signed-off-by: Edward Z. Yang <ezyang@fb.com> Delete return in cudnn_grid_sampler Signed-off-by: Edward Z. Yang <ezyang@fb.com> s/cudnnSetStreamToCurrent/setCuDNNStreamToCurrent/g Signed-off-by: Edward Z. Yang <ezyang@fb.com> Don't allocate a new vector when filtering defined. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Remove Check overloads, convert to pass references. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Some more microbenchmarking. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-11-30 23:06:58 -05:00
Sergey Zagoruyko	11c9bd6c98	Allow target.requires_grad in l1_loss and mse_loss (#3876 )	2017-11-27 10:59:16 -05:00
Richard Zou	5215640a41	Fix cosine_similarity's output shape (#3811 )	2017-11-21 18:33:41 -05:00
Sam Gross	9cb8b43778	Split off in-place NN functions (#3683 ) For example, this splits threshold into threshold(), which is now never in-place, and threshold_() which is always in-place. This simplifies the in-place vs. non-in-place logic in gen_variable_type.py, which was bug-prone.	2017-11-14 12:59:06 -05:00
josecabjim	e33df2b88a	Add border-padding for grid_sampler (#3599 ) * adds border padding to spatial grid sampler * fixes flake8 * adds docs	2017-11-12 18:46:49 -05:00
Edward Z. Yang	19515520bb	Make prelu an ATen op. This operator is a warmup I was doing before tackling convolution, as it has many properties that make it a "first" for implementing things. In particular, it is the first operator whose backwards have multiple returns; this means its double backwards is the first backwards for a function with multiple differentiable outputs. This exercises new code for output_mask and set_flags. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-11-10 09:58:40 +08:00
Ozan Çağlayan	dd6d04ddf2	doc: Normalize all true/false in docstrings to ``True|False`` (#3593 ) * doc: Normalize all true/false in docstrings to ``True|False`` This makes them more apparent in the documentation. * doc: fix flake8	2017-11-09 08:12:29 -05:00
Richard Zou	77ddd5130b	Add reduce keyword for KLDivLoss (#3330 )	2017-11-07 08:57:11 -05:00
Hugh Perkins	b043a74919	fix softmax doc (#3337 )	2017-11-01 08:47:51 -04:00
Gökçen Eraslan	638f0b5d78	Prevent numerical issues with poisson_nll_loss when log_input=False (#3336 ) * Prevent numerical issues with poisson_nll_loss when log_input=False Evaluation of the logarithm of the input variable in poisson negative log likelihood leads to NaN loss if variable being evaluated is zero. Small epsilon is added to prevent this. See equivalent Keras epsilon here: https://github.com/fchollet/keras/blob/master/keras/losses.py#L68 * PEP8 fix * Add epsilon support to PoissonNLLLoss in nn.modules.loss	2017-11-01 08:47:19 -04:00
Richard Zou	6214487fa7	Add reduce keyword to L1Loss (#3366 ) * Add reduce keyword to L1Loss * Fix legacy test for abscriterion * Address comments	2017-11-01 06:33:18 -04:00
Richard Zou	eac0942f6d	Add more nn docs (#3374 )	2017-10-30 18:37:36 -04:00
Ozan Caglayan	28f3d50f9d	doc: Replace nclasses with C	2017-10-30 12:06:20 -04:00
John Chiotellis	a0ce84e476	fix triplet margin loss documentation (#3339 )	2017-10-28 17:15:58 +02:00
SsnL	de1f4e69dd	raw text (#3327 )	2017-10-28 01:24:02 +05:30
Richard Zou	d8f3c601e4	Add reduce keyword to CrossEntropyLoss	2017-10-27 19:19:52 +02:00
Richard Zou	3853d5da97	Add reduce keyword to NLLLoss and NLLLoss2d (#3080 ) * API changes * Implement reduce for THNN ClassNLLCriterion * Implement reduce keyword for THCUNN ClassNLLCriterion * Implement reduce for THNN SpatialClassNLLCriterion * Implement reduce for THCUNN SpatialClassNLLCriterion * Make legacy NLLLoss work * Docs for NLLLoss reduce * reduce keyword for double backwards NLLLoss * reduce=False tests * Addressed comments * Fix trailing whitespace * Fix test failures in legacy nn * Rebase: add reduce keyword to aten declarations of NLLLoss * Add reference functions for all NLLLoss and NLLLoss2d test cases * Replaced slow get/set fns. Don't use int64_t in kernels. * Use TH_INDEX_BASE in NLLLoss for consistency * Fix legacy ClassNLLCriterion tests	2017-10-26 13:54:19 -04:00
Sam Gross	67839ce7bc	Delete unused Softmax code (#3220 ) Softmax and LogSoftmax are automatically bound and dispatched through VariableType.	2017-10-21 20:51:27 +02:00
Sam Gross	5989b05ecc	Enable ATen implementation of some NN functions and Variable methods	2017-10-20 15:38:01 -04:00
Adam Paszke	98e67448fa	Large Softmax and LogSoftmax refactor - Cleaned up THNN and THCUNN code and kernels - Improved THCUNN kernel performance 5x, making it match cuDNN performance - Added support for computing softmax over arbitrary dims NOTE: The default dim for 3D inputs is now 1 (used to be 0) - Both functions now accept inputs with arbitrarily many dimensions - Autograd functions no longer save the input (it's unnecessary) - Added cuDNN bindings for softmax, but they are unused as THCUNN matches or even exceeds cuDNN performance	2017-10-19 19:51:10 +02:00
Marcin Elantkowski	57ffe64cbe	Embedding related fixes (#3128 ) * Fix docs for nn.Embedding and F.embedding. - add description of 'sparse' argument (#3104) - fix F.embedding example (resulted in RuntimeError) * Make EmbeddingBag a New Style Function. * Add a functional interface for EmbeddingBag * Fix failing tests: add max_norm and norm_type to context, and fix typo in backend call. * Docfix: remove torch.manual_seed from example code. * Add a note about using sparse keyword in Embedding function.	2017-10-18 23:38:07 +02:00
Arthur Crippa Búrigo	17d68f824d	Fix typo. (#3140 )	2017-10-17 00:50:33 +02:00
SsnL	6dc67aef17	doc (#3110 )	2017-10-14 10:44:35 +02:00
Sam Gross	9437644f66	Replace softmin and softsign with simple differentiable expressions	2017-10-10 16:57:47 -04:00
Priya Goyal	2443fcac0b	Deterministic cudnn algorithms	2017-10-10 10:53:34 -04:00
SsnL	0eec332e14	assert reflection padding in range (#3008 )	2017-10-06 17:59:01 -04:00
Richard Zou	898c732293	Introduce a `reduce` keyword argument for MSELoss (#2878 ) * Add reduce keyword to MSECriterion API * Move gradOutput usage from py to backend * Implement reduce keyword for THNN MSECriterion * Implement reduce keyword for THCUNN MSECriterion * Implement reduce keyword for MSE double backwards * Tests for MSECriterion with reduce keyword * Documentation for reduce for MSELoss * Make legacy nn work with reduce keyword by ignoring it * Apply linter suggestions * Address comments (small changes) * Revert "Tests for MSECriterion with reduce keyword" This reverts commit 1c0be0defa49d336d023d7d9795db4037c92b6fe. * Undo changes to legacy nn tests * Reuse module test for MSELoss by creating a wrapper class for MSELoss * Address comments: refactor MSECriterion.cu to be nicer * Fix lint & build errors	2017-10-06 10:57:22 -04:00
SsnL	ba766ef39a	Fix BN size check in eval mode (#2977 )	2017-10-04 16:03:20 -04:00
SsnL	faa6fdfa18	Raise error when each channel only has 1 value in batch norm (#2961 ) * add error when each channel only has 1 value	2017-10-03 17:56:15 -04:00
SsnL	d5a7e304fa	added volumetric adaptive max pooling	2017-09-30 16:57:51 -04:00
Edward Z. Yang	9be8d0a9d2	Add a docstring for functional.linear. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-26 12:29:07 -04:00
SsnL	6a4ec4f9a8	VolumetricAdaptiveAveragePool	2017-09-25 15:12:44 -04:00
Luca Antiga	c580352aee	Adding 1d upsampling (#2846 )	2017-09-24 16:50:24 -04:00
Emanuel Jöbstl	39434ee2e4	Added LPPool1d. (#2783 )	2017-09-20 09:19:29 -04:00
David Pollack	c6ea6ed8ff	Add Nd Padding, Pad1d functions and ConstantPad3d (#2657 )	2017-09-18 14:48:49 -04:00
Gregory Chanan	d910a94b2b	Support AdaptiveMaxPool1d/2d double backwards.	2017-09-13 12:28:43 -04:00
Lu Fang	5294017d9f	Adding implicit padding for 3d average pooling	2017-08-26 14:45:19 -04:00
yunjey	153c9b0714	Add examples in functional.py and loss.py (#2371 ) * Add examples in functional.py Added examples for F.cross_entropy, F.binary_cross_entropy and F.binary_cross_entropy_with_logits. * Add ` for PyTorch docs Added ` for PyTorch docs. * Add examples in loss.py Added examples for nn.BCELoss and nn.BCEWithLogitsLoss.	2017-08-25 09:44:36 -04:00
Alykhan Tejani	30baba7d15	fix typo in docstring	2017-08-16 17:55:39 -04:00
Gregory Chanan	c92f229aa2	CosineEmbeddingLoss as a new style function.	2017-08-14 16:19:10 -04:00
Gregory Chanan	9bcb9658d5	MarginRankingLoss as new style function.	2017-08-14 16:19:10 -04:00
Gregory Chanan	7aeb837895	Implement HingeEmbeddingLoss double backwards.	2017-08-14 16:19:10 -04:00
Gregory Chanan	9a243abe5c	Implement Softmin double backwards.	2017-08-14 16:19:10 -04:00
Gregory Chanan	a6cccc8701	Implement RReLU double backwards.	2017-08-14 16:19:10 -04:00
Luca Antiga	cd5275e79f	Convert upsampling Functions to new style (#2372 )	2017-08-11 21:03:58 -04:00
Soumith Chintala	42328b70f7	fix another is_same_size call	2017-08-02 19:53:39 -04:00
Soumith Chintala	b3ca3da4b6	fix type mismatch	2017-08-02 10:18:03 -04:00
yunjey	e1ca722988	Add comments for default value (#2248 ) Added comments for default value in nn.functional	2017-08-01 14:27:46 +05:30
Alykhan Tejani	643f8d12ff	[bugfix] in bce_with_logits logsumexp calculation (#2221 ) * fix bug in bce_with_logits logsumexp calculation * flake8 fix	2017-07-27 05:58:56 +05:30
Gregory Chanan	bcea678e7b	Update rebased functions to call apply.	2017-07-25 07:37:25 +05:30
Gregory Chanan	1a52ca02ef	Always return indices from MaxPool autograd functions to simplify implementation; The callers (in functional.py) will filter out the return instead.	2017-07-25 07:37:25 +05:30
Gregory Chanan	291369ff1b	Convert pooling functions to new-style, once_differentiable functions.	2017-07-25 07:37:25 +05:30
Gregory Chanan	9608e37969	Implement double backwards for PReLU.	2017-07-25 07:37:25 +05:30
Gregory Chanan	ec7c510557	Implement Softsign double backwards.	2017-07-25 07:37:25 +05:30
Gregory Chanan	852dd5f011	Convert _WeightedLoss functions to new style autograd functions.	2017-07-25 07:37:25 +05:30
Gregory Chanan	085abee444	Rebase kl_div changes.	2017-07-25 07:37:25 +05:30
Gregory Chanan	45ce4df74c	Convert auto nn Functions (non-criterion) to new style.	2017-07-25 07:37:25 +05:30
Alykhan Tejani	112728cbe9	reformulate bce_with_logits to not use abs (#2195 ) * reformulate bce_with_logits to not use abs * flake8 fixes	2017-07-25 03:46:27 +05:30
Alykhan Tejani	35757af6f7	Add broadcasting of weights to bce/bce_with_logits (#2161 ) * added tests + removed explicit expand of weight in bce with logits * add auto broadcasting of weight to BCELoss * remove the need for _BCELoss * formatting of warning * remove TODO * move across assert from _functions/thnn/loss.py * flake8 fixes	2017-07-21 16:02:07 -04:00
yunjey	ea607afd06	Add comments in nn.Upsample (#2175 )	2017-07-21 14:34:58 -04:00
Edward Z. Yang	f3f478960e	Convert Embedding to new style. (#1916 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-07-20 02:35:21 -04:00
Hugh Perkins	e537023147	add functional embedding (#1987 )	2017-07-20 01:53:37 -04:00
Aron Barreira Bordin	11f3ccf98f	Add missing Modules to nn.functional (#1801 ) * add dropout2d and dropout3d to functional added some loss functions to functional added tests using dropout from backend added docs fixes * edited loss modules to call functional	2017-07-19 15:55:21 -04:00
Fisher Yu	d6bc2642e7	Add ignore_index to NLLLoss2d	2017-07-13 23:22:48 -04:00
Soumith Chintala	58e4caf80f	add missing docs	2017-07-13 01:01:04 -04:00
Soumith Chintala	169ca67a4e	Adding Spatial Transformers w/CuDNN support	2017-07-12 14:32:06 -04:00
yunjey	1ef1dd9cad	Add comments for readability (#2005 )	2017-07-10 23:02:56 -07:00
Leonid Vlasenkov	46a868dab7	[Ready] Limit docs line length (#1900 ) * some docs are ready * docs * docs * fix some more * fix some more	2017-07-10 10:24:54 -04:00
Gregory Chanan	f6578c1b24	Implement double backwards for Dropout and FeatureDropout.	2017-07-03 18:51:22 -04:00
Gregory Chanan	daa84e7663	Implement bilinear double backward.	2017-07-03 18:51:22 -04:00
Gregory Chanan	1aa145dbac	Implement ConstantPad2d double backwards.	2017-07-03 18:51:22 -04:00
Alykhan Tejani	457587088a	Fix broadcasting issues in binary_cross_entropy_with_logits (#1944 ) * done re-seed cuda device if in bad fork * avoid broadcasting in binary_cross_entropy_with_logits * assert input sizes for BCEWithLogitLoss * added check that BCEWithLogitsLoss == Sigmoid + BCELoss * fix flake8 issues * rename test_bce_with_logits_gives_same_result_as_bce_and_sigmoid -> test_bce_with_logits_gives_same_result_as_sigmooid_and_bce_loss * add warning in BCELoss about input shapes * fix lint	2017-07-01 23:06:36 -04:00
Sam Gross	da0fad8a7a	Use torch.matmul in nn.Linear (#1935 ) This takes advantage of the broadcasting behavior of torch.matmul to support inputs with more than two dimensions. The extra dimensions are treated like part of the batch dimension, much like nn.Bottle in Lua Torch. There are a few related small performance changes: * Addmm computes the gradient in column-major for inputs in column-major format * Variable.mm calls Addmm in-place with the desired output buffer	2017-06-30 16:53:26 -04:00
Sam Gross	4d5075add2	Add ignore_index to nll_loss and cross_entropy (#1937 )	2017-06-29 00:10:13 -04:00
Leonid Vlasenkov	ae61f3ff42	adds poisson NLL loss (#1779 )	2017-06-27 10:04:54 -04:00
Alykhan Tejani	67968cb60b	Add numerically stable BCELoss which takes logits as input (#1792 )	2017-06-19 22:05:51 -04:00
Francisco Massa	76ee014d10	Add documentation to SELU and AlphaDropout	2017-06-19 18:18:01 -04:00
Francisco Massa	f619ac6ac9	Quickfix for AlphaDropout on CUDA	2017-06-19 18:18:01 -04:00
Sam Gross	38b9598685	Added GLU (gated linear unit) From https://arxiv.org/abs/1612.08083	2017-06-13 20:48:19 -04:00
Francisco Massa	6626881e7a	Add Alpha Dropout (#1775 )	2017-06-13 00:39:49 +02:00
Francisco Massa	a24db91a38	Add SELU activation function (#1769 ) * Add SELU activation function * Remove unnecessary case * Add Function for SELU + tests and fix RReLU inplace * Fix extra line in doc * Fix tests Remove in-place tests for RReLU. For some reason they fail on legacy nn, but passes on nn * SELU in new-style Function It also supports double backprop, verifyed with gradgradcheck * Fix flake8	2017-06-11 10:07:48 +03:00
Luca Antiga	b9ab26765e	Add 3D upsampling (nearest and trilinear) with tests	2017-06-07 11:29:27 -04:00
Soumith Chintala	df7c47142d	fix for THNN NLLLoss signature change	2017-06-07 00:18:11 -04:00
Aron Barreira Bordin	d7db75c10f	added CosineSimilarity to nn.distance and updated docs (#1672 ) * added CosineSimilarity to nn.distance and updated docs	2017-06-06 22:53:21 -04:00
Marvin Cao	174c3cc399	Add support for double backward of LeakyReLU (#1714 )	2017-06-05 11:53:27 -04:00
Alykhan Tejani	f1c57ace1b	added input dim checks to convxD and conv_transposedxd (#1695 ) * add input dim check for conv2d * add None check to conv2d * added input dim checks to convxD and conv_transposedxd * flake8 fixes	2017-06-02 11:58:19 -04:00
Thomas Viehmann	6107d15d14	Twice differentiability of pointwise functions (#1531 )	2017-05-15 12:00:59 -06:00
Adam Paszke	6b84dc26f0	Add F.cosine_similarity (#1502 )	2017-05-15 11:12:54 -06:00
Marvin Cao	0ba20435ce	Add high order grad support for Some operator (#1507 )	2017-05-14 23:02:04 +02:00
Gregory Chanan	171638a451	Fix test_normalize NN test.	2017-05-09 14:25:06 -07:00
Gregory Chanan	ae2b2cbbec	Make keepdim work with autograd.	2017-05-09 14:15:59 -07:00
Sergey Zagoruyko	6d693fe413	Add F.normalize (#1467 )	2017-05-07 13:54:16 +02:00
Marvin CAO	e3f41a4962	Add high order gradient support for Sigmoid (#1496 )	2017-05-07 13:00:20 +02:00
Ankit Vani	4e18d89791	added twice differentiation for a bunch of ops (#1426 )	2017-05-04 06:47:14 -04:00
andrew giessel	2e7635b929	Add flexible bilinear upsampling aspect ratio redux (#1317 )	2017-05-03 08:46:28 -04:00
Soumith Chintala	ecd51f8510	docs fixes	2017-05-02 15:42:33 -04:00
Soumith Chintala	7dd8571bc6	fix avg_pool docs in nn.functional	2017-04-30 08:44:43 -04:00
Adam Paszke	457d78a7d9	Use THCUNN backward kernels for Tanh and Sigmoid in Autograd (#1399 )	2017-04-29 09:07:03 -04:00
Uridah Sami Ahmed	75f1989bec	Add nn.Bilinear and tests	2017-04-28 10:11:30 -04:00
Shubham Jain	a35f507532	Update functional.py (#1298 )	2017-04-19 11:07:12 -04:00
Edward Z. Yang	34546f022a	Expose dilated convolutions. Fixes #1225. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-04-18 17:13:02 -04:00
Edward Z. Yang	ab77742f6e	Add some missing documentation for arguments. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-04-18 17:13:02 -04:00
Christian Sarofeen	e9ff57176b	Fused pointwise kernels for GRU/LSTM	2017-04-11 13:42:06 -07:00
Christian Sarofeen	0b50f794e9	Use thnn version of Tanh/Sigmoid instead of autograd. (#1234 )	2017-04-11 12:49:57 -07:00
Edgar Riba	9504246c32	add triplet margin loss (#1165 )	2017-04-05 22:17:58 -04:00
Soumith Chintala	2979f4b989	add more functions to docs	2017-03-29 01:29:17 -04:00
Jason Kuen	f2c1071c33	Adaptive max and average pooling (1D & 2D) (#1084 )	2017-03-26 17:09:28 +02:00
Edgar Riba	63f6c0d692	add Pairwise distance (#835 )	2017-03-24 11:29:40 -04:00
ngimel	b3ab4b1094	Check torch.backends.cudnn.enabled, padding, and output_padding (#996 ) * Check torch.backends.cudnn.enabled * Don't allow negative padding and output_padding values	2017-03-22 19:42:11 -04:00
Kentaro Wada	7654b3f49e	Add function to compute cross_entropy for 2D image (#802 )	2017-03-16 17:34:04 +01:00
Soumith Chintala	13b1580613	add F.pad to docs	2017-03-15 00:09:14 -04:00
Sam Gross	34ce58c909	Parallelize backwards	2017-03-03 11:26:00 -08:00
Sergey Zagoruyko	12efd53dba	ConstantPad2d and F.pad (#856 )	2017-03-01 19:39:44 +01:00
Ofir Press	5e1d6a3691	Update functional.py (#862 ) Fixed documentation error in conv3d	2017-02-27 10:42:02 -05:00
陈云	838842d4b2	fix documentation error. [issue #790](https://github.com/pytorch/pytorch/issues/790) (#831 )	2017-02-23 08:59:29 +01:00
Joo-Kyung Kim	336eeee895	kernel_size as the default stride for avg_pool1d (#744 ) Following the documentation, let stride to be kernel_size if stride is not provided.	2017-02-15 13:12:18 +05:30
Soumith Chintala	d4c9a3782b	billinear -> bilinear, docs for upsampling, improved docs for Unpooling, pep8 tests fix (#617 ) * billinear -> bilinear, docs for upsampling, improved docs for Unpooling, pep8 tests fix	2017-01-30 05:08:48 +05:30
Luke Yeager	3ed720079e	[pep8] Fix most remaining lint manually	2017-01-28 01:15:51 +01:00
Luke Yeager	e7c1e6a8e3	[pep8] Fix most lint automatically with autopep8 Here's the command I used to invoke autopep8 (in parallel!): git ls-files | grep '\.py$' | xargs -n1 -P`nproc` autopep8 -i Several rules are ignored in setup.cfg. The goal is to let autopep8 handle everything which it can handle safely, and to disable any rules which are tricky or controversial to address. We may want to come back and re-enable some of these rules later, but I'm trying to make this patch as safe as possible. Also configures flake8 to match pep8's behavior. Also configures TravisCI to check the whole project for lint.	2017-01-28 01:15:51 +01:00
Adam Paszke	f8d4f980b3	Add upsampling modules and functions	2017-01-24 17:30:50 -05:00
Alykhan Tejani	f8e89fbe11	fix docs for torch.nn.functional.conv1d (#536 )	2017-01-21 10:41:52 -05:00
Adam Paszke	ee4c77c59f	Docs improvements (#512 ) * Always compile .numpy() for all types * Add torch.nn.functional docs and hidden headers * Use sphinx to generate torchvision docs * Remove unused import in ffi utils	2017-01-19 17:28:49 -05:00
Sergey Zagoruyko	9c218b419f	kl_div and docs (#429 )	2017-01-17 19:24:01 -05:00
Adam Paszke	1dbf44c00d	Add SmoothL1Loss to functional	2017-01-16 12:59:47 -05:00
Sam Gross	3a07228509	Add ConvTranspose1d module (#449 )	2017-01-13 15:22:57 -05:00
Sam Gross	24a2f2e3a0	Add MaxUnpool1d module (#447 )	2017-01-13 14:36:25 -05:00
Sam Gross	d5e45b2278	Add AvgPool1d which just uses AvgPool2d implementation (#439 )	2017-01-12 15:07:11 -05:00
Sam Gross	fd92470e23	Add cuDNN bindings for BatchNorm (#421 )	2017-01-07 15:35:24 -05:00
Adam Paszke	483490cc25	Move PixelShuffle implementation to functional	2016-12-30 23:02:57 +01:00
Adam Paszke	8d60e39fdc	Rename torch.nn.functions to torch.nn._functions	2016-12-30 23:02:57 +01:00
Sam Gross	c367e0b64e	Support dilated 1d and 3d convolutions (#372 ) Fixes #367	2016-12-29 18:20:32 -05:00
Sergey Zagoruyko	62af45d99f	Basic functional interface (#354 )	2016-12-29 22:53:57 +01:00

588 Commits