Sam Gross
b9379cfab7
Use cuDNN and NCCL symbols from _C library ( #1017 )
...
This ensures that we use the same library at the C++ level and with
Python ctypes. It moves the searching for the correct library from
run-time to compile-time.
2017-03-16 16:10:17 -04:00
soumith
7ad948ffa9
fix tests to not sys.exit(), also fix fatal error on THC initialization
2017-03-01 17:37:04 -05:00
Sam Gross
fc6fcf23f7
Lock the cudaFree mutex. ( #880 )
...
Prevents NCCL calls from overlapping with cudaFree(), which can lead to
deadlocks.
2017-03-01 11:29:25 -05:00
Adam Paszke
19a65d2bea
Expose stateless methods for torch.cuda.HalfTensor
2017-02-26 20:02:42 +01:00
Sam Gross
bd5303010d
Refactor autograd package to separate Python dependencies. ( #662 )
...
The core autograd Variable, Function, and Engine no longer depend on the
Python API. This lets us implement functions in C++. In the future, we
can also multithread the engine and release the GIL for most of the
non-Python backwards.
2017-02-13 16:00:16 -08:00
Zeming Lin
59d66e6963
Sparse Library ( #333 )
2017-01-05 00:43:41 +01:00
Sam Gross
20fffc8bb7
Fix torch.is_tensor for half tensors ( #322 )
...
Fixes #311
2016-12-19 15:27:47 +01:00
Sam Gross
1af9a9637f
Refactor copy and release GIL during copy ( #286 )
2016-12-11 21:54:58 +01:00
Sam Gross
0d7d29fa57
Enable caching allocator for CUDA pinned memory ( #275 )
...
Also add binding for CUDA "sleep" kernel
2016-12-02 01:33:56 -05:00
Adam Paszke
ebc70f7919
Look for libcudart in default CUDA installation paths ( #195 )
2016-11-02 19:36:10 -04:00
Sam Gross
ad5fdef6ac
Make every user-visible Tensor have a Storage ( #179 )
2016-10-31 12:12:22 -04:00
Sam Gross
79ead42ade
Add CUDA Stream and Event API ( #133 )
2016-10-18 12:15:57 -04:00
Sam Gross
8d39fb4094
Use new THC API for device allocator
2016-10-17 09:35:41 -07:00
Sam Gross
ee14cf9438
Add support for pinned memory: ( #127 )
...
torch.Storage/Tensor.pin_memory()
torch.Storage/Tensor.is_pinned()
2016-10-15 18:38:26 -04:00
Sam Gross
c20828478e
Update Module.cpp for THC changes
2016-09-30 11:13:14 -07:00
Adam Paszke
3f7ab95890
Finish implementation of PRNG-related functions
2016-09-29 11:33:25 -07:00
Sam Gross
4e9f0a8255
Use CUDA caching allocator
2016-09-26 13:12:39 -07:00
Adam Paszke
06ab3f962f
Refactor _C extension to export some utilities
2016-09-21 08:36:54 -07:00
Adam Paszke
3ea1da3b2c
Minor fix in CUDA module
2016-09-14 11:09:03 -04:00
soumith
1f2695e875
adding cuda driver check functions for runtime checking
2016-09-13 10:34:13 -07:00
Adam Paszke
8d933cbfc4
Fixes for OS X
2016-08-22 22:45:35 -04:00
Adam Paszke
12bed8dc0d
Add CUDA device selection
2016-08-12 07:46:46 -07:00
Adam Paszke
92e983a489
Fixes for Linux and new cutorch
2016-08-02 09:20:18 -07:00
Adam Paszke
c574295012
Various fixes
2016-07-19 10:45:59 -04:00
Adam Paszke
3a44259b32
Add support for CUDA
2016-07-19 10:45:59 -04:00