The test_cuda.py setup purports to test half tensors, but actually just
re-tests FloatTensors because the keys in type_map were str instead of
type (see the sketch below). Testing HalfTensors properly is more
complicated, requiring changes to precision and the exclusion of some
unimplemented methods.
We should fully test half CUDA tensors. This change just deletes the
duplicate tests of FloatTensor.
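A minimal sketch of the failure mode (illustrative only; type_map is the
name from the test, the surrounding lookup is hypothetical):

    import torch
    # Buggy: keyed by str, but lookups use type(t), which is a type,
    # so the mapping never matches and the fallback (FloatTensor) wins.
    type_map = {'torch.cuda.FloatTensor': torch.cuda.HalfTensor}
    t = torch.cuda.FloatTensor(1)
    mapped = type_map.get(type(t), type(t))
    assert mapped is torch.cuda.FloatTensor  # HalfTensor never gets tested
    # Fix: key by type, e.g. {torch.cuda.FloatTensor: torch.cuda.HalfTensor}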
* Replace async with non_blocking for Python 3.7 upgrade
* Remove trailing whitespace
* Give _cuda and _type kwargs and accept async for compatibility (shim sketched below)
* Rename async to non_blocking in all C++ code
* Add entries for async in python_variable_methods
* Friendlier backward compatibility for cuda and type
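A minimal sketch of what such a compatibility shim can look like (the
_cuda call and its exact signature are assumptions, not the shipped code):

    def cuda(self, device=None, non_blocking=False, **kwargs):
        # 'async' is a reserved word in Python 3.7+, so the legacy keyword
        # can only arrive via **kwargs; map it onto non_blocking.
        if 'async' in kwargs:
            non_blocking = kwargs.pop('async')
        if kwargs:
            raise TypeError('unexpected keyword arguments: %s' % list(kwargs))
        return self._cuda(device, non_blocking)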
Variable.new() should default to the device of "self" if no device is
specified. Previously, we were using the current device. This now
matches Tensor.new().
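For example (illustrative session; assumes two visible GPUs):

    import torch
    x = torch.randn(2, 2).cuda(1)   # x lives on GPU 1
    with torch.cuda.device(0):      # current device is GPU 0
        y = x.new(2, 2)             # allocated on GPU 1, matching x
                                    # (previously: GPU 0, the current device)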
* Fix catArray in THTensor
Asserts that the inputs have the same size in every dimension except
the cat dimension, or are empty (or a mix of both); see the example
below.
* Fix catArray for THCTensor
* Document torch.cat shape checks
* Fix types
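An illustration of the documented shape checks (hypothetical session):

    import torch
    a = torch.randn(2, 3)
    torch.cat([a, torch.randn(4, 3)], 0)  # OK: shapes differ only in dim 0 -> (6, 3)
    torch.cat([a, torch.Tensor()], 0)     # OK: empty inputs are skipped
    torch.cat([a, torch.randn(2, 4)], 0)  # RuntimeError: sizes must match outside dim 0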
* Implement Variable.cuda using ATen
This adds an optional async flag to Tensor::copy_, which attempts a
non-blocking copy if one of the tensors is in pinned memory and the
other is a CUDA tensor.
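For example (sketch; the flag is passed positionally here because async
became a Python keyword, and it was later renamed non_blocking as noted
above):

    import torch
    src = torch.randn(1024).pin_memory()  # pinned host memory
    dst = torch.cuda.FloatTensor(1024)    # CUDA destination
    dst.copy_(src, True)                  # non-blocking: may overlap with compute
    dst.copy_(torch.randn(1024))          # not pinned -> ordinary blocking copy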
* Perform cross-device copy in CopyBackwards
Also call torch.cuda._lazy_init() from Variable.cuda()
* Implement Variable.type via ATen
* Changes from review:
- remove copy_out
- remove unnecessary include
- fix default device for .cuda()
* Combine if statements in dispatch_type
* Better error messages for blas ops with cuda.LongTensor
Fixes #4157
Test plan:
Try matrix-multiplying two cuda.LongTensors:
>>> import torch
>>> x = torch.randn(4, 4).long().cuda()
>>> y = torch.randn(4, 4).long().cuda()
>>> x.mm(y)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
RuntimeError: addmm for CUDA tensors only supports floating-point types. Try converting the tensors with .float() at /private/home/rzou/pytorch/pytorch/aten/src/THC/generic/THCTensorMathBlas.cu:381
* Use Welford's algorithm when reducing along the inner dimension for THCTensor's variance fn (sketched below)
* Use accreals in THCTensor's varInnermostDim
* Skip cuda tests if no cuda
* Variance testing
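A minimal Python sketch of Welford's single-pass algorithm (the actual
change lives in THC CUDA code; this just shows the technique):

    def welford_variance(xs, unbiased=True):
        # Numerically stable one-pass mean/variance; assumes len(xs) >= 2.
        mean, m2, n = 0.0, 0.0, 0
        for x in xs:
            n += 1
            delta = x - mean
            mean += delta / n
            m2 += delta * (x - mean)  # second factor uses the updated mean
        # Accumulators stay in double, mirroring THC's use of accreal.
        return m2 / (n - 1) if unbiased else m2 / n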
* Add torch.take and Tensor.put_
These are similar to numpy.take and numpy.put. The take function allows
you to linearly index into a tensor without viewing it as a 1D tensor
first. The output has the same shape as the indices. The put function
copies values into a tensor, also using linear indices.
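For example (matches the semantics described above):

    import torch
    src = torch.Tensor([[4, 3, 5],
                        [6, 7, 8]])
    idx = torch.LongTensor([0, 2, 5])
    torch.take(src, idx)                       # -> [4, 5, 8]; linear indices, no 1D view
    dst = torch.zeros(2, 3)
    dst.put_(idx, torch.Tensor([1., 2., 3.]))  # writes 1, 2, 3 at linear slots 0, 2, 5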
* tensor: Ensure that the tensor is contiguous before pinning (#3266)
pin_memory() was producing an out-of-order tensor when the given
tensor was transposed, i.e. stored in column-major order.
This commit fixes that by calling contiguous() before pinning.
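For example (illustrative):

    import torch
    x = torch.randn(3, 4).t()  # transposed view: non-contiguous, column-major
    p = x.pin_memory()         # with the fix, contiguous() runs first, so p's
                               # values line up with x element-for-element
    assert p[0][0] == x[0][0]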
* test: add contiguous test for pin_memory (#3266)
* For the size=1 case a single-point contiguity check is impossible, so replace it with isContiguousRange
* Fix the stride in the descriptor; fix an undefined-scope issue
* Add a cudnn test for this case
* assertTrue
The test_FloatTensor_qr_big test is still a bit flaky on K80 GPUs. Increase the tolerance to improve reliability, since the results for this test change as tests are moved around.