pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
gchanan	4f20a0e439	Fix various sparse transpose issues; remove dead code from Declaratio… (#7200 ) * Fix various sparse transpose issues; remove dead code from Declarations.yaml. 1) Fixes some checks in t_, transpose_ that don't allow transposing empty sparse tensors. 2) Remove out= variants from docs since they don't exist (and haven't since at least v0.3.1). 3) Unify implementations of t_, transpose_, t, transpose. 4) Move dead checking code from Declarations.cwrap to actual implementations. 5) Fix test which never tested transpose_. * Add test for error with t, t_. * Address review comments. * Fix jit tests. * Fix test_jit.	2018-05-18 19:51:41 +02:00
gchanan	7abdc303c6	Don't allow requires_grad to be set on integer Tensor constructors in… (#7185 ) * Don't allow requires_grad to be set on integer Tensor constructors in tensor_new. * Fix autograd test. * Fix test_distributions. * Fix test_jit. * Fix NN tests.	2018-05-18 19:45:10 +02:00
Seth Hendrickson	32b23a4bfc	Throw error on tensor creation when sequence shape cannot be determined (#7583 ) * first commit * unit test * minor style edits	2018-05-18 19:14:42 +02:00
Thomas Viehmann	bf95dff85b	Map digamma +/-inf results to nan in test (fixes #7651 ) (#7665 )	2018-05-18 16:35:00 +02:00
Thomas Viehmann	e1148db7f2	Implement logsumexp (fixes #2591 ) (#7254 ) * Implement logsumexp (fixes #2591) * Add logsumexp_backward, fix _out declaration. Thank you Simon and Edward for your comments!	2018-05-14 22:08:14 -04:00
Thomas Viehmann	cfc1d92975	Implement ellipses ('...') and diagonals (e.g. 'ii->i') in einsum. (#7173 ) This brings the two most important missing numpy einsum features to toch.einsum.	2018-05-12 23:39:37 -04:00
Richard Zou	eaa3f2e613	Fix advanced indexing with negative indices (#7345 ) * Fix advanced indexing with negative indices Fixes #7156 Here is some behavior before this PR: ``` In[1]: x = torch.arange(9).view(3, 3).contiguous() x[[0], [-1]] # Should be equivalent to x[0, -1] Out[1]: tensor([ 8]) ``` The bug is that negative indices are added to the computed linear index directly. In the above example, the linear index computed is "-1", which wraps around to "8", giving the last element of a flattened view of `x`. Instead, we should wrap negative indices around before adding them to the linear index. * Use toCLong()	2018-05-12 23:24:40 -04:00
Jon Walsh	857e3f4a5e	Throw error in tensor constructor when numpy strides mismatch (#7440 )	2018-05-11 11:00:43 +02:00
Ethan Steinberg	9fa1dff66a	Allow the use of torch.device for loading (#7339 ) * Allow using torch.device for loading * Make recommended changes * Better tests	2018-05-10 15:50:00 -04:00
Richard Zou	71626491c4	Add batched linear solver to torch.gesv() (#6100 ) * Add batched linear solver to torch.gesv() Fixes #3164 Picks up from #4502 I moved `gesv` to ATen. Adds bindings for MAGMA's `gesv_batched` function for CUDA. For CPU, runs `THLapack(gesv)` in a for loop. The new function supports arbitrary batch dimensions (and broadcasting of those dimensions). For example, the 4-d tensor `A x B x M x M` should be treated as having batch-size `(A x B)`. The overhead of creating the magma_queue_t is: ~350000 microseconds the first time it's called and ~6 microseconds every time after that. * Tests and docs * Address comments * Address comments * Rebase * Address comments * Fix rebase * Addressed comments * Address comments * Address comments * Addressed comments	2018-05-08 17:06:27 -04:00
Adam Paszke	8091388d0f	Add support for __floordiv__ and __rdiv__ for integral tensors (#7245 )	2018-05-03 23:34:59 +02:00
gchanan	681baa9254	Restore warning to torch.range. (#7194 ) Also, get rid of warning specification in Declarations.cwrap, which currently has no effect.	2018-05-02 21:53:00 -04:00
Thomas Viehmann	07513cfd1d	implement sum over multiple dimensions (fixes #2006 ) (#6152 )	2018-05-02 21:50:29 -04:00
cpuhrsch	88a705555a	Add SLEEF for float and double (#6725 )	2018-05-02 18:40:44 +00:00
gchanan	8031da5479	Implement torch.as_tensor, similar to numpy.asarray. (#7109 ) * Implement torch.as_tensor, similar to numpy.asarray. torch.as_tensor behaves like torch.tensor except it avoids copies if possible; so also somewhat like tensor.new but without the size overloads. I didn't add a requires_grad field, because we haven't decided on the semantics such as as_param. * Remove requires_grad for doc.	2018-05-01 12:54:43 -04:00
Thomas Viehmann	8fbab83c2a	only Tensors of floating point dtype can require gradients (see #7021 ) (#7034 )	2018-04-30 10:20:00 +02:00
gchanan	361648a4a7	Fix torch.tensor(...) device-type calculation when used with numpy an… (#6995 ) * Fix torch.tensor(...) device-type calculation when used with numpy and type inference. * Fix tensor device type inference as well. * Better variable type inference: infer cuda-ness only if device is not specified.	2018-04-27 18:12:33 -04:00
cpuhrsch	ae35e0e924	Support non-contiguous tensors for unary ops (#6119 )	2018-04-27 21:31:34 +02:00
gchanan	a6bfa16c17	torch.arange: add numpy-style type inference. (#7016 ) * torch.arange: add numpy-style type inference. This is a backwards-compatibility breaking change. * Fix flake8. * Use at::optional. * Remove unneeded header files. * Use reference wrapper. * Update arange for test. * Address review comments.	2018-04-27 15:11:45 -04:00
gchanan	18ed2160b0	Use Index rather than Long for IntList parsing (#6674 ) * Use Index rather than Long for IntList, so floating-point types convertible to ints fail the parsing. Basically, our unpackLong code works with floating-point types that are convertible to ints, but this isn't often what you want (because of truncation). What you actually want is to convert to an index, which will usually find such issues. I made this the minimal change I could because: 1) I didn't want to change unpackLong because the existing code call checkLong before unpackLong, so this should be a non-issue most of the time. And fixing this properly requires calling checkLong again, which will slow everything down. 2) An exception above is with IntList, which only checks that 1) it is a tuple or 2) it is a varargs tuple (i.e. torch.ones(1, 2, 3)). * Fix bug. * Don't conflict tensor and IntList bindings. * Change function to be consistent between python 2 and 3. * Check Index. * Move IntList overloads in legacy new functions to below Tensor overloads.	2018-04-26 19:13:23 -04:00
gchanan	a08091a42d	Implement matmul_out and dot_out. (#6961 ) * Implement matmul_out and dot_out. * Fix autograd by only calling _out variants if we have an out ourselves. * Disallow mismatched types in dot_out. * Make sure out variant doesn't have a method. * Do proper type conversion.	2018-04-26 16:52:58 -04:00
Thomas Viehmann	2b44c420c8	Enhance diagonal (fixes #6479 ) (#6718 ) * Enhance diagonal This patch - adds Tensor.diagonal to complement torch.diagonal - implements diagonal natively in ATen - makes diagonal a view - implements taking arbitrary diagonals - implements diagonal backward instead of referring to the (more limited) diag * add tests, copy diagonal code to backward for double differentiability * improve tests and doc comment. Thank you, Adam! * Mark diagonal as view function in gen_autograd.py, use simple backward.	2018-04-26 11:11:20 -04:00
Thomas Viehmann	f98b778086	Fix forward and backward for norm/renorm with infty norm (fixes #6817 ) (#6969 )	2018-04-26 12:54:53 +02:00
gchanan	3d907ef78e	Consistently check 'out' variants against specified dtype/layout/device parameters. (#6973 ) We were previously doing this in the most common cases, but not consistently.	2018-04-25 22:46:42 -04:00
Soumith Chintala	333e8c9b22	any/all returns LongTensor, make test expect that (#6957 )	2018-04-25 14:05:29 -04:00
Tao He	39d4814933	Make any and all on ByteTensor behave like sum/prod. (#4627 )	2018-04-25 10:25:38 +02:00
cpuhrsch	a8bdb561b7	Fix reductions on some contiguous tensors where size(dim) == 1 (#6815 )	2018-04-22 13:55:55 -04:00
Richard Zou	d1a992a85e	Disallow chunks that are <= in torch.chunk (#6761 ) Fixes #6759. Before, `tensor.chunk(0)` would cause a divide by 0. `tensor.chunk(-1)` would throw an error complaining that "split_size needs to be positive". This PR changes it so that the error message makes it clear that `chunks` has to be greater than 0.	2018-04-19 18:31:14 -04:00
MRuberry	9c47eb5548	Fixes test_torch.py so that all tests pass on Volta hardware. (#6736 ) Issue: "python3 test_cuda.py" currently results in a failure when using Volta hardware. The failure is in test_advancedindex, and is caused by two "sub-tests." At line 4651 a series of indices are used to compare PyTorch's and Numpy's indexing behavior. At least two of these indices index the same element of the reference tensor multiple times. These are: [slice(None), [[2]], [[0, 3], [4, 4]]] [slice(None), [[0, 1], [1, 0]], [[2, 3], [3, 0]]] The first index selects the 5th element of the third row twice, and the second index selects the 4th element of the second row twice. This causes the test to attempt to update the same index with two distinct values simultaneously. On my machine the Numpy created tensor will always take the "latter" of these two values, while the Volta tensor will always take the "former." (Not to say this behavior is guaranteed by either framework.) The fix is to remove these two indices from test_torch.py. This causes all tests to pass. While updating test_torch.py I also noticed that assert_get_eq(tensor, indexer) had a bug where it was referring to "reference" instead of "tensor." This bug had no impact on behavior. The fix is to have this function refer to its input tensor, "tensor," instead. All tests still pass after this fix.	2018-04-18 22:44:14 -04:00
Adam Paszke	d26ab68485	Sort declarations when generating Python bindings (#6701 ) * Sort declarations when generating Python bindings This helps resolve ambiguities in argument parsing according to any rules we will need. For now, this allows us to make scalar operations more conservarive wrt. argument types, but makes them commutative again. * Fix inconsistencies between mod with tensor and scalar * Fix a stupid mistake	2018-04-18 21:51:35 -04:00
Thomas Viehmann	bd0cc7d364	Implement torch.einsum (fixes #1889 ) (#6307 ) * start at generic trilinear * Implement einsum (fixes #1889) This provides a simple implementation of einsum. It is built on top of the work for computing bilinear (#6110). It uses a naive left-to-right resolution at the moment. Autograd is able to differentiate by itself. The obvious unsupported feature is taking diagonals (einsum('ii->i',(a,)). * add tests and docs * fix flake8 * clean diff * rebase on current master to resolve conflicting String wrapping * clean up after rebase * better commentary in einsum and sumproduct_pair * don't say fixme if it's fixed and rename num_outputs to num_output_dims * adapt python wrapper to use std::string instead of String to avoid typedef at::String * typos and some vector to array conversion * fix accidental python<->python3 change * really fix bad rebase	2018-04-18 13:41:27 +02:00
Francisco Massa	feb8522f99	randperm supports n=0 (#6656 ) This makes it compatible with arange and numpy.random.permutation	2018-04-17 19:03:57 +02:00
gchanan	30849eb668	Bind 0-dim variables without requires grad to int64/double similar to how we do with Scalar. (#6637 ) Note: - Only integral scalar types bind to int64 - Both integral and floating point scalar types bind to double (same rules as python numbers).	2018-04-17 09:54:49 -04:00
Du Phan	c345212c86	Support gpu triangle solve (#6648 ) * add cuda trtrs * remove queue * add test trtrs	2018-04-17 14:33:39 +02:00
gchanan	5ed3f3347a	Add dtypes (with reasonable defaults) to sum, prod, cumsum, cumprod. (#6573 ) * Add dtypes (with reasonable defaults) to sum, prod, cumsum, cumprod. This adds optional dtypes to torch.sum, torch.prod, torch.cumsum, torch.cumprod. By default, the dtype is torch.float64 for integral types, and the dtype of the input for floating point types. * Don't use optional<ScalarType>, because the jit can't handle it yet. Instead, we manually build the overloads. This is fairly painful because of default arguments, but should be easy to pull out once the jit can handle optional<ScalarType>. * Fix keepdim with out parameters. * Fix _cudnn_rnn_flatten_weight. * If dtype is provided to an out function, make sure it matches the dtype of the result. * Fix typo.	2018-04-16 23:52:59 -04:00
gchanan	d7cb78478f	Split set_default_tensor_type(dtype) into set_default_dtype(dtype). (#6599 ) * Split set_default_tensor_type(dtype) into set_default_dtype(dtype). * Fix flake8. The difference between this one and set_default_tensor_type is that it only sets scalar type what determines the type + device of a tensor returned from a factory function with defaults is the default tensor type + the current device (if the default tensor type is cuda). This just changes the scalar type of the default tensor type. We do eventually want to deprecate set_default_tensor_type; it is not clear how to do that in a sensible and backwards compatible way.	2018-04-16 13:49:00 -04:00
gchanan	46374ad5c8	Add tensor.to(device) method. (#6588 ) * Add tensor.on(device) and tensor.on_device_as(tensor) methods. * Rename {'on', 'on_device_as'} -> 'to'. * Fix test ordinal. * Fix device ordinal again.	2018-04-16 10:50:34 -04:00
Richard Zou	6c0f74089f	More precise digamma (#6517 ) * More precise digamma Fixes #6190. This is a rebase of #3955 with some tweaks for better performance around poles. The code is ported over from cephes with permission. By itself, the cephes code returns inf for the poles. For better performance around the poles with float32, one intermediate step is always computed with double precision, regardless of dtype. This step does `PI / tan(PI * input)`. This is necessary because small (1e-6) rounding errors for the inputs to tan have strong effects on the output (ie, the derivative of tan is very large at some points). * Replace usages of finite-differences digamma with newly implemented digamma * Better behavior near and at poles * ScalarConvert -> scalar_cast for readability	2018-04-13 11:49:09 -04:00
Tongzhou Wang	8aa0ae3836	Support arbitrary number of batch dimensions in *FFT (#6528 )	2018-04-12 15:03:22 -04:00
gchanan	749d51414a	Separate cuda-ness from dtype. (#6470 ) * Separate cuda-ness from dtype. There are no longer torch.cuda.int64, etc; only torch.int64 that correspond to at::ScalarType. At the python arg parser level, the corresponding ATen type is selected from the combination of (ScalarType, Layout, Device). There is also currently unused code in here for support ScalarType in native_functions; this will be used for specifying aggregate types on reduction functions. * Fix test_autograd. * Add defaults to randint_like. * Track is_cuda in py tensor types. * Fix test_sparse. * Fix multiprocessing. * Fix rnn. * Fix test_nn. * Fix flake8.	2018-04-12 14:05:44 -04:00
Tongzhou Wang	ca09e4a3c5	Fix THTensor_(take) negative index check (#6482 ) * fix THTensor_(take) negative index check * add tests * rename to invalidIdxPos	2018-04-11 12:12:35 -04:00
Tongzhou Wang	0dff2b5e35	[fft] [3 of 3] Implements backward of fft ifft rfft irfft (#5537 ) * change irfft signal_sizes arg to be the last * add docs for fft, ifft, rfft, irfft; update doc for stft * fix typo in window function docs * improve gradcheck error message * implement backward of fft, ifft, rfft, irfft * add grad tests for fft, ifft, rfft, irfft * fix nits and typos from #6118 * address comments	2018-04-10 22:09:36 -04:00
Tongzhou Wang	930f181255	Fix fft when any of the input dimensions is not aligned (#6118 ) * fix fft when any of the input dimensions is not like complex type; add test for ifft+fft * clarify the comments * Address comments: add note; add helper function * use at::nullopt * add notes on conjugate symmetry; fix complex-to-real cloning condition (should be advanced data layout rather than base_istride) * add at::sum_intlist and at::prod_intlist * revert optional<vector> helper due to windows compiler error	2018-04-10 13:11:05 -04:00
albanD	bb097e2a50	[pytorch] Fix signed random_ (#6463 ) * Fix cpu signed random * fix gpu signed tensor * add test for signed random_ * cleaner tests * fix lint	2018-04-10 13:07:04 -04:00
Naman Jain	acb7df11a2	Add torch.randint and torch.randint_like functions (#6136 ) Adds randint and randint_like to TensorFactories.cpp	2018-04-10 12:08:21 -04:00
Zhou Chang	d0f395f744	[pytorch] Fix clamp is missing kwarg out (#6028 ) (#6418 ) torch.clamp is out from template code, add it manually, same with auto generated code.	2018-04-09 13:39:31 -04:00
gchanan	87e369111a	Add string-style devices to all tensors. (#6283 ) * Add string-style devices to all tensors. Previously, tensors only had a 'get_device' method which would throw an exception on a CPU tensor. This made it necessary to if/else code that was meant to be device agnostic. This PR implements the following: 1) Adds a 'device' property to all tensors that returns a string representation of the device for all tensors. For cpu tensors this is 'cpu'. For cuda tensors this is 'cuda:X', where X is the cuda device ordinal. 2) Adds a DeviceSpec class. This is just a helper class for separating device_type and device_index specification and to allow partial specification. For example, you can call DeviceSpec('cuda'), DeviceSpec('cuda:0'), DeviceSpec('cuda', 1). Also has backwards compatibility support for specifying integers, which are treated as cuda devices. DeviceSpecs have the following properties: a) device_type: string representation of the device type (i.e. 'cpu' or 'cuda') b) device_index: integer for the device index (None if not specified) c) cuda_device_index: for backwards compatibility; behaves roughly like `get_device` did previously. I.e. if a function previously took integers for cuda devices, it can now take DeviceSpecs (or strings), and can maintain the old functionality by calling `old_index = DeviceSpec(old).cuda_device_index`. 3) tensor methods and torch. functions that took integer devices can now take integers, strings, or DeviceSpecs. For example: torch.randn((2,3), dtype=torch.cuda.float32, device='cuda:1') TODO in future PRs: A) Split out cuda from dtype so you don't need to overspecify cuda-ness B) We currently only support strings/DeviceSpecs in tensor methods and torch. functions. We should have equivalents torch.cuda.device(...), torch.cuda.device_of, etc. at the torch. level that work on strings/DeviceSpecs * Add deviceInt64 to python arg parser. * device_str. * Remove device_str. * remove device prefix from attributes. * Use const char * instead of string. * Move autogpu index out of Device. * comment on is_default. * Rename torch.DeviceSpec to torch.device. * comment. * Fix tests. * Fix flake8. * Fix sparse_coo_tensor parameter name. * Improve error message. * Remove device_ prefix from C++ device object. * Allocate static strings. * Return not implemented from rich compare. * Move torch::Device to THPDevice. * Remove cuda index. * Py_RETURN_NOTIMPLEMENTED doesn't exist in python2.	2018-04-06 15:12:05 -04:00
Tongzhou Wang	29c69f049e	add test for old tensor serialization (#6275 )	2018-04-05 17:00:30 -04:00
Vishwak Srinivasan	0aa35780bf	[ready] Implement log2 and log10 in PyTorch (#6272 ) * Implemented log2 and log10 * Re-add incorrectly removed files * Fix minor bugs * Fix log1p docs * Add a try-except for python2 math module in log2 test * Revert changes made to aten/doc/* * Fix docstring errors * Fix windows build	2018-04-05 14:28:37 -04:00
Peter Goldsborough	9ba70856a1	Add max_values and argmax convenience functions to ATen (#6201 ) * Add max_values and argmax convenience functions to ATen * Add documentation for torch.argmax/argmin and skip max_values * Add tests for argmax/argmin * Dont default the dim argument * Use dim=0 in test_torch.py for argmax tests * Implement argmin() and argmax() without dim * Call .contiguous() before .view(-1)	2018-04-04 15:53:26 -04:00

1 2 3 4 5 ...

314 Commits