Commit Graph

322 Commits

Author SHA1 Message Date
SsnL
1661370ac5 Signal handling in DataLoader workers; Timeout option (#3474) 2017-11-29 23:52:14 +01:00
Sam Gross
4bce69be22
Implement Variable.storage() (#3765)
This still uses THPStorage, but avoids touching THPTensor
2017-11-20 14:18:07 -05:00
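A minimal, hedged sketch of the new Variable.storage() call described in the commit above (the tensor shape and printed values are illustrative, not from the commit):

  import torch
  from torch.autograd import Variable

  v = Variable(torch.randn(2, 3))
  s = v.storage()                  # the 1-D storage backing the Variable's data
  print(type(s), len(s))           # e.g. <class 'torch.FloatStorage'> 6
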
peterjc123
aa911939a3 Improve Windows Compatibility (for csrc/scripts) (#2941) 2017-11-08 19:51:35 +01:00
Holger Kohr
5e382894be add numpy() and from_numpy() to HalfTensor (#2953) 2017-11-08 15:01:29 +01:00
Sam Gross
7c0b16c140 Add torch.take and Tensor.put_ (#3263)
* Add torch.take and Tensor.put_

These are similar to numpy.take and numpy.put. The take function allows
you to linearly index into a tensor without viewing it as a 1D tensor
first. The output has the same shape as the indices. The put function
copies value into a tensor also using linear indices.
2017-11-01 06:04:44 -04:00
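A brief, hedged sketch of the linear-indexing behavior described in the commit above (values and shapes are illustrative):

  import torch

  src = torch.arange(0, 12).view(3, 4)        # 2-D source tensor
  idx = torch.LongTensor([[0, 5], [11, 3]])   # linear indices into src viewed as 1-D

  out = torch.take(src, idx)    # same shape as idx: [[0, 5], [11, 3]]

  dst = torch.zeros(3, 4)
  dst.put_(idx, out)            # copies 'out' into dst at the same linear positions
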
Sam Gross
a65db4e956 Use ATen for torch.cat, torch.addmm, and friends on Variables. (#3286)
This includes some changes to the dispatch code for torch.xxx functions:

 - Since Variable.addmm is an instance-method, the self argument has to
   come first. The dispatch code swaps the first two arguments if
   necessary to support the deprecated signatures where 'alpha' or 'beta'
   comes before the 'self' tensor.
 - Delete IMPLEMENT_STATELESS_REVERSED. These functions require output
   arguments to be passed in using the keyword 'out'. They were meant to
   handle torch.gt(out, a, b), but we haven't allowed that for a while.
2017-10-25 14:27:45 -04:00
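A small, hedged illustration of the self-first calling convention mentioned in the commit above (shapes are illustrative; the deprecated ordering is only referenced in a comment):

  import torch
  from torch.autograd import Variable

  M = Variable(torch.randn(2, 3))
  a = Variable(torch.randn(2, 4))
  b = Variable(torch.randn(4, 3))

  r1 = torch.addmm(M, a, b)   # torch.* form: the 'self' tensor comes first
  r2 = M.addmm(a, b)          # equivalent Variable instance-method form
  # Deprecated orderings such as torch.addmm(beta, M, alpha, a, b) are remapped
  # by the dispatch code, which swaps arguments so 'self' ends up first.
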
Sam Gross
f1f64c8d07 Generate autograd functions for NN / more refactors (#3136)
Generate autograd functions for NN and implement more derivatives in derivatives.yaml

A big refactor of gen_variable_type.py
2017-10-19 15:03:26 -04:00
Adam Paszke
f9ee52efa9 Update DLPack bindings 2017-10-19 10:06:53 -04:00
Sam Gross
47beb64b5c Use ATen generator as default CPU generator (#3135)
ATen has its own default CPU RNG. Use this as the default in PyTorch so
that random functions called through ATen have the same behavior as
random functions called through TensorMethods.
2017-10-16 14:22:58 -04:00
Priya Goyal
756ab3f24f Adding conversion from python tensor to dlpack tensor (#2933) 2017-10-04 08:35:42 -04:00
Soumith Chintala
b3bc5fe302 refactor THCP method defs into cuda/Module.cpp 2017-09-30 13:14:35 -07:00
IraKorshunova
2b9765ad02 Erf and erfinv (#2799) 2017-09-20 21:23:45 -04:00
Adam Paszke
8dae433de8 Move JIT passes to a separate directory 2017-09-19 10:53:32 -04:00
Edward Z. Yang
2e266837f5 Port TracingState to pybind11, new export() method.
Along the way I added converters for Variable and TracingInput.  Variable should
probably be moved to a more widely known spot.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>
2017-09-05 17:48:55 -04:00
Adam Paszke
594f98ce16 Support multi-stage AutogradClosures 2017-09-05 17:48:55 -04:00
Zach DeVito
a3fdb281d1 Python wrapper for Node IR using pybind11
Supports almost all of the IR API.
2017-09-05 17:48:55 -04:00
Adam Paszke
bdcbbeaf68 Remove GlobalTracingState 2017-09-05 17:48:55 -04:00
Adam Paszke
e186d16e6b Apply JIT optimizations from Python 2017-09-05 17:48:55 -04:00
Adam Paszke
ea05ac8f41 Move JIT-related files to jit dir. Remove IR interpreter 2017-09-05 17:48:55 -04:00
Edward Z. Yang
a797ab9343 Rewrite AST to a new, more functional representation.
Previously, our AST was a DAG, where shared Nodes indicated a computation
should be reused.  This commit rewrites the IR into a new functional
representation which represents sharing explicitly using variable
bindings.

We offer a few justifications for this new style:

1. The new representation is not all that different from the
old one; it is about as easy to construct, and the lack of an
explicit graph doesn't negatively impact our ability to interpret
the graph, since we've chosen, as a matter of design, to NOT have
the IR participate in the actual execution of a graph.

2. The new let-binding representation has an implicit ordering,
which we can use to conveniently keep track of the original order
the trace showed up as.  This automatically gives us a topsort,
and gives us an easier-to-read textual representation of our
IR:

  %14 = Embedding %11, %0, -1, None, 2, False, False
  %15 = Dropout %14, 0.2, True, False
  %16 = Index %12, 0
  %17 = Index %12, 1
  %18 = Index %13, 0
  %19 = Index %13, 1
  %20 = Index %15, 0
  %21 = Linear %20, %1, %3
  %22 = Linear %16, %2, %4

3. It moves us closer to a Futhark style language
(http://futhark-lang.org/publications/pldi17.pdf).

Major aspects of the diff

- Node is replaced with Expr and Arg, a pair of mutually recursive
  structures which represent our new language.  In BNF, the language
  looks like this:

    a ::= c | %i
    e ::= %i, ... = e
        | PyOp e, ...
        | Ret %i, ...

  Technically, Ret is not actually a return (no control flow is involved);
  it simply tuples up a series of tensors (identified by variables).

  One important invariant is that locals are always tensors; they
  are never constants (this is asymmetric with Args.)

- Arguments support Python constants.  This is an important piece because
  many operators take extra Python literals like integers and tuples in
  order to specify extra parameters about how an operator operates.  Adding
  this was essential to getting word_language_model to work.

- As both Expr and Arg have multiple variants, there is new infrastructure
  for doing case on the variants using ExprVisitor and ArgVisitor.  The
  strategy here is adapted from WebAssembly's visitors, although we have
  generalized to permit arbitrary argument forwarding, which is necessary
  to support tail-recursive visitor calls.  TCO is important because our
  interpreter may recurse arbitrarily deep into a stack of nested lets.
  If users wish, they can also manually case on the type tag.

- Tracing is now turned on and off using _tracer_enter/_tracer_exit in
  torch._C.  _tracer_enter accepts a list of variables which are to be
  treated as arguments; _tracer_exit accepts the list of traced variables
  which should be returned when you reexecute the trace, and returns
  the trace expression which can be reexecuted.  GlobalTracingState
  is a global variable which tracks whether or not we are currently tracing.

- You use run_forward to execute a trace on some set of parameters.

- While tracing, variables keep track, via trace_local, of the names of
  their corresponding variables in the IR.

Here is a simple runner which leaks memory but can be used to JIT models:

  import types

  import torch
  import torch._C
  import torch.autograd.function as F
  from torch.autograd import Variable

  def jit(model):
      real_forward = model.forward
      def forward(self, *args):
          def flatten(x):
              return tuple(F._iter_variables(x))
          if not hasattr(self, "saved_trace"):
              # First call: record a trace of the real forward pass.
              torch._C._tracer_enter(tuple(self.parameters()) + flatten(args))
              out = real_forward(*args)
              self.saved_trace = torch._C._tracer_exit(flatten(out))
              self.saved_outs = out
              return out
          else:
              # Later calls: replay the saved trace through the execution engine.
              flat_out = Variable._execution_engine.run_forward(
                  self.saved_trace, tuple(self.parameters()) + flatten(args))
              return F._unflatten(flat_out, self.saved_outs)
      # Bind the tracing forward onto the module instance.
      model.forward = types.MethodType(forward, model)
      return model

Major problems:

- Sanity checking is spotty at best, especially when users pass in variables.

- The interpreter leaks tensor memory from the store.  When we add back def-use
  we should be able to deallocate tensors as soon as we know they are no longer
  necessary.

- The interpreter needs to reach feature parity with the old execution engine.
  From there, we need to see if backwards can be subsumed as well.

- I still have no confidence in having memory managed everything correctly.
  This requires a close look.

- Rather than return an *open* expression as a trace, we should return a
  *lambda* instead, which knows about how many formal parameters it
  requires.

- The IR is not introspectable from Python at the moment, but this is simply a
  matter of implementing all the binding code.

- The tracer is NOT reentrant (you can't trace while you're inside a trace.)
  Furthermore, no sanity checking is done if you try to incorrectly reuse
  things from one trace in another.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>
2017-09-05 17:48:55 -04:00
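A hedged usage sketch of the jit runner quoted in the commit above (assumes the jit helper defined there; the nn.Linear model, shapes, and two-call pattern are illustrative placeholders, not from the commit):

  import torch
  import torch.nn as nn
  from torch.autograd import Variable

  model = jit(nn.Linear(10, 5))     # wrap the module so its forward pass is traced once

  x = Variable(torch.randn(3, 10))
  y1 = model(x)                     # first call records the trace (_tracer_enter/_tracer_exit)
  y2 = model(x)                     # later calls replay the saved trace via run_forward
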
Edward Z. Yang
e1b7872fc2 Make it possible to access IR from Python.
Also, add a new trace_fn field to attach forward IR to Variables.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>
2017-09-05 17:48:55 -04:00
Justin Johnson
94b5990201 Add torch.cuda.get_device_name function (#2540) 2017-08-26 15:06:37 -04:00
Alykhan Tejani
eb58740651 add ones_like and zeros_like 2017-08-25 14:11:04 -04:00
gchanan
c000d15058 Properly use Py_RETURN_True, Py_RETURN_False in back compatibility warnings. (#2345) 2017-08-08 21:54:20 -04:00
Zach DeVito
9d8cff9bc1 initialize aten and pytorch to share the same THCState 2017-07-11 10:35:03 -04:00
Adam Paszke
714351ff39 Officially enable process-group mode 2017-06-12 22:02:11 -04:00
Gregory Chanan
4f602a52b5 Use THPUtils_assert rather than THError in torch/csrc/Module. 2017-06-11 05:37:59 -04:00
Gregory Chanan
ffd808768e Remove raiseErrors from THTensor functions, have THStorage functions take an error_buffer to return a proper error message while being able to handle memory management correctly from calling function. 2017-06-11 05:37:59 -04:00
Gregory Chanan
177785eecf explicit Ptr constructors, fast transposed copy. 2017-06-11 05:37:59 -04:00
Gregory Chanan
be65f46c76 Add optional warning for backwards incompatible keepdim. Setting torch.utils.backcompat.keepdim.warning.enabled=True will cause Python warnings in the case where the default value of keepdim is used for 1-d reductions.
Also specify keepdim via kwargs within the library so these warnings produce
less noise.
2017-06-11 05:37:59 -04:00
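A hedged sketch of enabling the keepdim warning described above; the flag path follows the commit message's spelling and may differ in released versions, and the tensor shape is illustrative:

  import warnings

  import torch
  import torch.utils.backcompat  # exposes the back-compat warning flags

  # Flag path as spelled in the commit message above.
  torch.utils.backcompat.keepdim.warning.enabled = True

  x = torch.randn(4, 5)
  with warnings.catch_warnings(record=True) as caught:
      warnings.simplefilter("always")
      s = x.sum(0)                 # default keepdim squeezes the reduced dim -> (5,)
      print(s.size(), len(caught)) # the 1-d reduction with default keepdim triggers a warning
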
Gregory Chanan
3556d1b8a3 Add optional warning for backwards incompatible broadcast.
Setting torch.utils.backcompat.broadcast.warning.enabled=True
will cause Python warnings in the case where broadcast occurs
but previously 1-d view-style pointwise ops occurred.
2017-06-11 05:37:59 -04:00
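A hedged sketch of the broadcast warning described above; the flag path again follows the commit message's spelling, and the shapes are illustrative:

  import torch
  import torch.utils.backcompat  # exposes the back-compat warning flags

  # Flag path as spelled in the commit message above.
  torch.utils.backcompat.broadcast.warning.enabled = True

  a = torch.randn(4, 1)
  b = torch.randn(4)
  c = a + b          # now broadcasts to (4, 4); the old 1-d view-style pointwise op
                     # produced (4, 1), so this is exactly where the warning fires
  print(c.size())
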
Gregory Chanan
5af46cb352 Add broadcasting support for matmul. 2017-06-11 05:37:59 -04:00
Sam Gross
d81da41650 Make sure the number of MKL and OpenMP threads match
Otherwise, on many machines, the size of the OpenMP thread pool will
change between MKL and our OpenMP-enabled functions. The constant thread
creation and destruction results in worse performance and leaks memory
on GCC 5.4.
2017-06-07 14:53:29 -04:00
Adam Paszke
8ea7c87c29 Improve init methods 2017-06-02 23:42:11 +02:00
Adam Paszke
181d2f41bd Add initial Python wrappers for THDTensors 2017-06-02 23:42:11 +02:00
Trevor Killeen
05bc877a05 make THPPointer have explicit constructors (#1636) 2017-05-25 15:35:54 -04:00
ethanluoyc
d0504aa41d Implement lgamma function. 2017-05-08 16:21:26 -07:00
Sam Gross
4c1cdb6148 Refactor Python string utility function 2017-04-28 21:25:26 +02:00
Sam Gross
27990fee54 Use fully qualified name as tp_name for tensors and storages (#1379) 2017-04-27 16:26:44 -04:00
Martin Raison
cd3bbc9dfd more operations and optimizations (hspmm, reorder, ...) 2017-04-18 12:46:54 -07:00
albanD
71303b8af4 Autograd deadlock for recent glibc fix (#1243) 2017-04-12 22:24:31 +02:00
Adam Paszke
afeeb81e79 Add support for keyword arguments in torch.cat 2017-04-11 14:48:54 -07:00
Adam Paszke
91c4ba7980 Add torch.arange and deprecate torch.range 2017-04-03 10:38:58 -04:00
albanD
dfa2d26830 * make random_ range correct when both lower and upper are specified 2017-03-31 15:37:24 -04:00
Sergey Zagoruyko
8dc5d2a22e export current_blas_handle 2017-03-23 23:32:45 +01:00
Brandon Amos
bb353ccc17 Add batch triangular factorization and solves, add IntegerTensor to cwrap (#903) 2017-03-23 15:06:00 -04:00
Adam Paszke
faac0f5c25 Fix torch.cat bugs
Always use the PySequence API and disallow concatenating along nonexistent
dimensions.
2017-03-22 18:58:42 -04:00
Sam Gross
379ae6d865 Refactor out dispatchStateless (#1007)
Some of the error messages were incorrect due to erroneous
'tensor == THPDefaultTensorClass' checks
2017-03-15 16:24:55 -04:00
Martin Raison
f17cfe4293 sparse tensor operations (#735) 2017-03-03 18:37:03 +01:00
Zhou Chang
f366e5fc81 Support int16 numpy conversions
issue #891
2017-03-02 09:15:57 -05:00
Sam Gross
fc6fcf23f7 Lock the cudaFree mutex. (#880)
Prevents NCCL calls from overlapping with cudaFree() which can lead to
deadlocks.
2017-03-01 11:29:25 -05:00
Adam Paszke
67f94557ff Expose torch.HalfTensor 2017-02-27 19:35:47 -05:00
Sam Gross
bd5303010d Refactor autograd package to separate Python dependencies. (#662)
The core autograd Variable, Function, and Engine no longer depend on the
Python API. This lets us implement functions in C++. In the future, we
can also multithread the engine and release the GIL for most of the
non-Python backwards.
2017-02-13 16:00:16 -08:00
Sam Gross
712686ce91 Add cat, contiguous, squeeze, and unsqueeze to THPP
Use unsqueeze and view from TH/THC
2017-02-11 17:49:31 +01:00
Adam Paszke
79232c24e2 Fixes after rebase 2017-01-31 01:58:09 +01:00
Janusz Marcinkiewicz
76520512e7 DataChannel tests rewrite (#42); DataChannel isend and irecv implementation (#44) 2017-01-31 01:58:09 +01:00
Adam Paszke
60d1852c7b Major improvements to master-worker mode
* Fixed all undefined symbol errors
* Implemented storage interface and THStorage class
* RPC improvements
* Code refactor
2017-01-31 01:58:09 +01:00
Adam Paszke
55632d81d2 Add Python wrappers for process group mode 2017-01-31 01:58:09 +01:00
Sam Gross
c414bf0aaf Fix handling of unicode in torch._C._add_docstr (#487) 2017-01-18 17:22:30 -05:00
Sam Gross
9302f860ae Remove unused file TensorDocstrings.cpp (#481)
Tensor docstrings are created in _tensor_docs.py
2017-01-18 13:34:40 -05:00
Soumith Chintala
8aa8f791fc add more torch.* and Tensor docs (#476) 2017-01-18 08:39:33 -05:00
Sam Gross
14d5d52789 Add placeholder tensor documentation for methods that exist in torch. (#463) 2017-01-17 19:37:47 -05:00
Adam Paszke
f91bb96071 Remove cmin, cmax and cinv 2017-01-16 19:07:37 -05:00
Soumith Chintala
bdfef2975c adding more docs for torch.* functions 2017-01-11 08:19:49 -08:00
Zeming Lin
59d66e6963 Sparse Library (#333) 2017-01-05 00:43:41 +01:00
Soumith Chintala
6b4ed52f10 adding docs for some torch.* functions, removing all, any stateless methods 2017-01-03 18:29:50 -05:00
Sam Gross
849794cd2c Remove deprecated and unimplemented functions (#383) 2016-12-30 18:37:44 -05:00
Sam Gross
ab5776449c Add documentation for some torch.xxx functions (#382) 2016-12-30 17:01:47 -05:00
Adam Paszke
9b7eceddc8 Accept outputs in out argument 2016-12-29 12:25:59 +01:00
Sam Gross
24af02154c Use ForkingPickler for sharing tensor/storages across processes (#344)
This hooks into the (internal) ForkingPickler class in multiprocessing
to reduce tensors, storages, and CUDA events instead of our queue from
joblib. This makes it easier to use the standard multiprocessing classes
in later versions of Python.

This also exposes:

 - Tensor/Storage.share_memory_()
 - Module.share_memory()

These methods move the CPU tensors and storages to shared memory. If
you're using the "fork" method of multiprocessing, these objects can be
directly inherited instead of serialized through a queue.
2016-12-28 20:34:23 -05:00
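A small sketch of the sharing methods listed in the commit above (the worker function and tensor shape are illustrative; assumes the default "fork" start method on Linux):

  import torch
  import torch.multiprocessing as mp

  def worker(t):
      t.add_(1)                   # with "fork", the shared storage is inherited, not serialized

  if __name__ == "__main__":
      x = torch.zeros(3)
      x.share_memory_()           # move the underlying CPU storage into shared memory
      p = mp.Process(target=worker, args=(x,))
      p.start()
      p.join()
      print(x)                    # reflects the in-place update made in the child process
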
Sam Gross
126a1cc398 Add Sphinx docs 2016-12-28 00:03:39 +01:00
Sam Gross
e46d942ca6 Fix double initialization of HalfStorage (#331) 2016-12-19 15:19:41 -05:00
Adam Paszke
8e09f0590b Make sure that C extension was compiled with cuDNN before using it 2016-12-15 00:47:55 +01:00
Adam Paszke
28f0cf6cee Add docstring support to cwrap (#295) 2016-12-11 23:25:14 +01:00
Sam Gross
1af9a9637f Refactor copy and release GIL during copy (#286) 2016-12-11 21:54:58 +01:00
Sam Gross
0d7d29fa57 Enable caching allocator for CUDA pinned memory (#275)
Also add binding for CUDA "sleep" kernel
2016-12-02 01:33:56 -05:00
Adam Paszke
1f5951693a Change torch.randperm to return Long tensors 2016-12-01 23:14:41 +01:00
Adam Paszke
3928f7740a Implement functional interface for Variables (torch.*) 2016-11-08 16:13:25 -05:00
Adam Paszke
ebc70f7919 Look for libcudart in default CUDA installation paths (#195) 2016-11-02 19:36:10 -04:00
Sam Gross
f2d7e94948 Use torch.Size for Tensor sizes and tuple for strides
See issue #20

The torch.Size class is a tuple subclass which distinguishes sizes from
other tuples so that torch.Tensor(size) is interpreted as size instead
of data.
2016-10-28 19:37:09 +02:00
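A minimal sketch of the torch.Size behavior described in the commit above (shapes and values are illustrative):

  import torch

  x = torch.randn(2, 3)
  sz = x.size()
  print(sz)                       # torch.Size([2, 3]) -- a tuple subclass
  print(isinstance(sz, tuple))    # True
  print(x.stride())               # strides stay a plain tuple, e.g. (3, 1)

  y = torch.Tensor(sz)            # a torch.Size argument is interpreted as a shape...
  z = torch.Tensor([2, 3])        # ...while a plain list is interpreted as data
  print(y.size(), z.size())       # torch.Size([2, 3]) torch.Size([2])
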
Sam Gross
ad2d413c0b Add C++ bindings for cuDNN (#167)
The Python ctypes bindings overhead was high enough that it slowed down
multi-gpu training when using 4+ Maxwell GPUs.
2016-10-26 19:51:48 -04:00
Adam Paszke
9000f40e61 Add torch.from_numpy 2016-10-24 22:30:11 +02:00
Adam Paszke
f137c0c05a Improve error messages of stateless functions 2016-10-24 22:29:43 +02:00
Sam Gross
79ead42ade Add CUDA Stream and Event API (#133) 2016-10-18 12:15:57 -04:00
Sam Gross
3931beee81 Use THSetNumThreads instead of omp_set_num_threads
Set OMP num threads to one in the data loader.

Fixes #81
Fixes #82
2016-10-17 15:15:00 -04:00
Sam Gross
ee14cf9438 Add support for pinned memory: (#127)
torch.Storage/Tensor.pin_memory()
torch.Storage/Tensor.is_pinned()
2016-10-15 18:38:26 -04:00
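A minimal, hedged sketch of the pinned-memory API listed above (sizes are illustrative):

  import torch

  x = torch.randn(1000)
  print(x.is_pinned())            # False
  p = x.pin_memory()              # copy backed by page-locked host memory (enables async H2D copies)
  print(p.is_pinned())            # True

  s = torch.FloatStorage(10).pin_memory()    # storages expose the same pair of methods
  print(s.is_pinned())
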
Soumith Chintala
3d6ebde756 qr and ormqr tests and bugfix 2016-10-14 03:10:16 -04:00
Adam Paszke
0325e2f646 Major autograd refactor
Improves autograd performance by more than 2x and fixes a couple
of bugs. All core functions have been moved to C.
2016-10-13 17:17:49 -07:00
Adam Paszke
2acee24332 Add keyword argument support to most tensor functions 2016-10-13 12:32:04 -04:00
Adam Paszke
96f61bff30 Add LAPACK functions 2016-10-08 20:37:37 -07:00
Adam Paszke
dbe540e49f Use the custom TH error handler in all threads by default 2016-09-30 14:59:50 -07:00
Adam Paszke
3f7ab95890 Finish implementation of prng related functions 2016-09-29 11:33:25 -07:00
Adam Paszke
941cf4e63d Add ffi utils for user C extensions 2016-09-29 09:35:56 -07:00
Adam Paszke
1828e7c42f Add async CUDA copy 2016-09-27 15:12:48 -07:00
Adam Paszke
ddf1598ef8 Add a method for catching exceptions thrown in ctypes 2016-09-25 12:25:54 -07:00
Adam Paszke
e71204b52f Improve error messages in storage and tensor C functions 2016-09-23 17:17:35 -07:00
Adam Paszke
06ab3f962f Refactor _C extension to export some utilities 2016-09-21 08:36:54 -07:00
Adam Paszke
8fdec15a55 Codemod to remove camel case method naming 2016-09-20 08:40:28 -07:00
soumith
1f2695e875 adding cuda driver check functions for runtime checking 2016-09-13 10:34:13 -07:00
Adam Paszke
58f507f9e3 Add file descriptor sharing mode to multiprocessing 2016-09-08 11:23:33 -07:00
Adam Paszke
f9d186d33a Add initial version of multiprocessing module 2016-08-31 19:46:08 -07:00
Adam Paszke
1902bc0bfb Interface with numpy 2016-08-13 20:19:17 -07:00
Adam Paszke
12bed8dc0d Add CUDA device selection 2016-08-12 07:46:46 -07:00
Adam Paszke
e9f9fd3727 Major refactor 2016-08-10 09:24:53 -07:00
Adam Paszke
554a1d8336 Add optim 2016-07-21 16:42:06 -04:00
Adam Paszke
bc7bd7a8b3 Add unit tests and fix detected bugs 2016-07-21 13:46:59 -04:00
Adam Paszke
c574295012 Various fixes 2016-07-19 10:45:59 -04:00
Adam Paszke
3a44259b32 Add support for CUDA 2016-07-19 10:45:59 -04:00
Adam Paszke
93ed433de3 Add rand and randn 2016-07-18 23:59:27 -04:00
Adam Paszke
3cec305524 Restructure python code 2016-06-23 22:55:05 +02:00
Adam Paszke
486ea76b98 Add more Tensor methods 2016-06-19 00:24:18 +02:00
Adam Paszke
4f66ea42af Add random-related Tensor methods 2016-06-18 21:36:10 +02:00
Adam Paszke
857c32bc21 Add all mm methods 2016-06-16 23:40:35 +02:00
Adam Paszke
0eb2b9e756 Add more Tensor and Storage methods 2016-06-15 23:03:47 +02:00
Adam Paszke
fdfe9d836e Add index* Tensor methods 2016-06-13 13:58:09 +02:00
Adam Paszke
a9282edf79 Add THPPointer and more Tensor methods 2016-06-13 13:26:00 +02:00
Soumith Chintala
5ee3358a92 python 2 support 2016-06-08 19:14:57 -04:00
Adam Paszke
0b61c3f233 Add more Tensor methods 2016-05-13 22:38:51 +02:00
Adam Paszke
56c98f7897 Add more Tensor methods 2016-05-13 00:01:54 +02:00
Adam Paszke
c3f7aac4f9 Add logical functions 2016-05-12 01:22:51 +02:00
Adam Paszke
449ac4ca2a Add torch.* functions 2016-05-09 19:14:40 +02:00
Adam Paszke
842e1b6358 Add exception handling 2016-05-05 20:58:13 +02:00