pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Zachary DeVito	286cd04a20	JIT cleanup (#7631 ) Cleans up dead code in the JIT: * Remove interpreter_autograd_function * Remove Handles * Remove HandleBuilder * Remove creates_handles, and tracing_autograd_python_function flags * Remove unused var_args * Fix submodules	2018-05-21 10:06:29 -07:00
Adam Paszke	b45f2ff1ae	Remove CompiledFunction + clean up JIT tests (#7421 )	2018-05-16 20:03:04 +02:00
Zachary DeVito	f07f24db0b	Change unique name so that you are guarenteed: (#7166 ) ``` JIT_ASSERT(v->setUnique(x)->uniqueName() == x); ``` This works by changing any other value in the graph with name x to a different name. This mirrors llvm behavior and is useful when you want to ensure some names have particular values.	2018-05-02 07:32:01 -07:00
Zachary DeVito	b7487d42a0	Workaround to make PythonOps traced with torch.jit.trace work correctly. (#6738 ) The long-term fix is to remove the handling-creating pathways and remove all the modes from PythonOp making it into an op that simply calls a PyObject. Right now ONNX expects PythonOp to hold a nn.Function, not a generic callable, so completely removing the legacy pathway will also require changes to how ONNX symbolics are found.	2018-04-24 17:21:00 -07:00
Zachary DeVito	d985cf46f1	Add workaround to fix include warnings in Python 2 builds. (#6716 )	2018-04-24 12:30:19 -07:00
Zachary DeVito	825ce7f196	[jit][script] Allow tuples to be re-assigned (#6538 ) * Allow tuples to be re-assigned This commit improves our support of tuples by making them more first-class. In particular, it allows tuples to be re-assigned across loops and ifs. It does this by making them first-class values in the Graph IR, and then removing the tuples in a LowerTuples pass. An alternative approach would have added more support for desugaring tuples in the Environment object as they were emitted. Instead, the current approach was chosen anticipating a future when tuples are fully supported (including the interpreter). In that future, the current code can be completly reused with the LowerTuples pass just becoming a optimization that removes unneeded tuple allocations.	2018-04-13 17:34:50 -07:00
Luca Antiga	37a84dd40d	Move definitions of Kind out of NO_PYTHON block (#5914 )	2018-03-22 09:36:08 -04:00
Edward Z. Yang	acc409396b	Namespaced symbols (#5820 ) * Namespaced symbols - Our interned strings now have structure, "ns::symname" rather than just "symname" before. We support efficient namespace testing for uniques by encoding the namespace in one byte in the Symbol internal representation. See torch/csrc/jit/interned_strings.h for a more in-depth implementation discussion. - All uses of ksymbol are now attr::symbol (or some appropriate namespace). The valid namespaces are prim, attr, onnx and aten. - Symbol is bound in Python as a qualified string "attr::symbol", EXCEPT for the attribute setting/getting API, whose symbols must always be attr symbols; they get special cased to assume strings are passed. There's a little bit of naughtiness in the implementation, maybe you know how to solve it. - However, the g.op() convenience function assumes that you're generating ONNX operators, unless you explicitly qualify. - All ATen operators and nodes have built-in interned strings generated for them, so you should never have to write a string literal ever again. The tracing code is adjusted to use it. - ONNX exporter now properly tests to see that all operators are in onnx namespace before accepting the export. This is way more robust than the previous exporter, which would be willing to export capitalized operators which were not actually ONNX operators. - A slight organizational change for symbolic.py; this module now ONLY contains aten operators. In particular, the exporter for Constant has moved into utils.py (along with Undefined, from the C++ side), since primitive ops get "special treatment." - The un-inplacing logic in recording is more robust, so that we don't delete a trailing underscore from __and__. This never affected us before because we didn't have any tests for it. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2018-03-16 13:36:11 -04:00
Luca Antiga	396637cdd6	Python-free build of autograd + jit (#5356 ) This PR adds the possibility to build the C++ parts of autograd and jit, with no dependency on Python. The goal is to allow taking a PyTorch IR representation (a tree s-expr) and running it with provided inputs. Prerequisite: build PyTorch so that codegen runs once. Instructions: cd tools/cpp_build bash build_all.sh This will build libtorchjit and torchjit_test in tools/cpp_build/build/torchjit-build. The latter basically runs the code in test_jit.cpp for now. While writing the PR, it turned out that a few of Python.h includes were redundant. They were removed here (PyTorch tests still pass on my machine, we'll see CI). * Introduce Python-free builds of autograd and jit * Remove NO_PYTHON ifdef in functions/special	2018-03-08 15:13:10 -05:00
James Reed	55c64e5243	Add Python function calls to JIT script (#5445 ) * Add Python function calls to script * Script compiler gains a `Resolver` object that runs when it does not understand a function call. This decouples the python resolution from the conversion to IR.	2018-02-28 19:45:04 -08:00
Zachary DeVito	05269b582b	[JIT] Support shape propagation with control-flow (#5391 ) Support shape propagation with control-flow * This allows us to enable optimization in the GraphExecutor for most script tests. * Changes Type to always be present (non-null) on a Value, removing `hasType()` and `typeOption()`. A new type kind 'DynamicType' now represents when a specific type has not been determined. * If/Loop nodes propagate shapes/types in the simple cases where types of outputs do not change depending on where control flows. In other cases, we propagate DynamicType to indicate we do not know what the shape will be. * Remove the `cond` input to the body of Loop to simplify handling in interpreter and shape propagation. * Bugfix for zero-dim contiguousStridesOf	2018-02-26 15:24:05 -08:00
Zachary DeVito	8904616028	add control flow to interpreter (#5293 ) * Use stacks in the interpreter/aten_dispatch Rather than have separate input/output lists, the interpreter now works using a single stack. Operators in the interpreter push/pop from the stack. This allows ownership of tensors to transfer directly to an operator, and an operator can drop the reference to a tensors as soon as it is no longer needed. This is important for the GraphExecutor op, which recursively runs the interpreter. Once autograd is updated to pass variables to Function by value, we will be able to ensure that we release ownership as soon as possible. This commit also switches the interpreter to use a fake tensor 'ContainerTensor' rather than at::Retainable to hold non-tensor data in the interpreter. This allows us to use std::vector<at::Tensor> for all registers, which is significantly less confusing than the OwnedRetainables struct it was replacing. * Add If and Loop to interpreter * Preprocess loop to calculate where references to tensor should be dropped * Add control instructions JumpZ/JumpNZ/Jump * Switch from explicitly having stage structs to having a single list of instructions with Store/Load instructions to take values off the initial stack * Make the interpreter tests executable rather than use expect files * add a flag to interpreter code so that constants are variables if the interpreter is running on variables. * Add tensor_as to its own file	2018-02-22 19:56:15 -08:00
Zach DeVito	99ce581155	Add support for ::copy and ::createClone with blocks	2018-02-02 23:24:49 -08:00
Zach DeVito	0d748fac96	Add nested Blocks in IR This commit is getting the IR ready for representing ONNX control flow. It adds nested blocks to the IR. * Each node now has blocks(), addBlock(), and eraseBlock() similar to a node's output list. * Blocks are a property of every node rather than an attribute because to make it easier to manage the lifetime of the containing nodes and because the behavior of cloning Blocks will likely be different from the way we clone other attributes. * A block itself has a list of nodes, as well as inputs and outputs. The meaning of the nested input/output nodes are specific to the particular node kind containing the block. It is safe to assume inputs to a block will be in scope in the block. * Each Block has an owningNode() and each node has an owningBlock(). The owningNode of the top-most block is null. * Values are lexically scoped: nested blocks can use values from outer blocks that have been defined in previous nodes. Lint has been updated with these new scoping rules. * This change preserves almost all of the pre-Block API. No attempt has been made to make optimizations aware of Blocks. This will need to be done on a case-by-case basis as we make optimizations capable of handling Blocks.	2018-02-02 23:24:49 -08:00
Zachary DeVito	c308e03f3e	Initial GraphExecutor Implementation. (#4982 ) This adds the initial implementation of graph executor for the new JIT design. It includes a few python tests ensuring that nograd, backward, and double-backward cases work for simple examples and some corner cases. More work needs to be done to performance optimize as there are many extra copies and places where we hold onto variables longer than we should. These are noted in the comments.	2018-02-02 17:45:59 -08:00
Zachary DeVito	0ae5498079	[JIT] add create_autodiff_subgraphs (#4822 ) This pass splits differentiable subgraphs into their own Node, similar to a fusion group. This initial implementation does not create optimal subgraphs, but it works well in the case where most things are differentiable, and has the building blocks (`mergeNodes`) to extend to the better implementation.	2018-01-23 23:46:54 -05:00
Zachary DeVito	2da43bf6f1	Make Symbol a true struct (#4717 ) Previous Symbol was just a uint32_t and we converts symbolToString and stringToSymbol. Now Symbol is a struct with a toString method, and constructors from either BuiltinSymbols enums (e.g. kParam) or strings. Symbol is convertible to a uint32_t to ensure it can still be used in switch statement BuiltinSymbol case branches.	2018-01-17 21:49:28 -08:00
Zach DeVito	674ddf6b91	Fix multi-gpu fuser bug cuModuleLoad is only valid for a single device so we need to compile for the particular device that the fusion group will run on. CompiledFunction already specializes different traces for tensors, so we just need to have fusion_compiler produce the cuFunction on the right device.	2018-01-08 15:04:22 -08:00
Luca Antiga	4eb8e12765	Introduce scopes during tracing (#3016 )	2017-12-04 09:19:06 -08:00
Zachary DeVito	929a11f920	Add interpreter support for Handles/PythonOp/CppOp (#3866 ) * Add interpreter support for Handles/PythonOp/CppOp This treats Handles as a first-class type in the interpreter since this turned out to be conceptually simpler than treating them as a separate concept, which requires a second channel for register allocating and moving data from one op to the next. Notes: * The refcounting nature of tensors is factored into its own base type so that it can be shared with other refcounted types such as handle. * Some methods redundant with TensorBase have been deleted from Tensor * The interpreter uses raw refcounted handles. In addition to being able to treat Tensors and Handles as the same base object, it removes a lot of redundant refcounting as objects moved from tensors to input/ output lists. * aten_dispatch has been updated to work directly on the raw refcounted lists to avoid refcounting and duplicate lists. * Removing jit_closure.cpp, The interpreter can now handle all pathways. * Functions like `unsafeToTensorShare` describe how ownership transfers in the interpreter. The `Steal` variants take rvalue references as arguments, and invalidate those arguments to prevent potential problems. * Make TensorTemporary is not a subtype relationship because it is too easy to do something horribly unsafe: ``` void foo(at::Tensor bar) { // bar destructor call release on a temporary! } foo(TensorTemporary(retainable)); // structure slicing! ```	2017-11-29 11:38:57 -05:00
Zach DeVito	2300234c9c	Lint checks, small fixes	2017-11-15 11:47:18 -08:00
Zach DeVito	ef4b19f767	Refactor ir.h to distinguish Nodes and Values This commit adds a Value type similar to the one @ezyang suggested a while ago for handling multi-return nodes. Previously if we had a graph like: a = op1(b) c, d = op2(a) Then its in-memory format would look like: %0 = op1(b) %1 = op2(%0) %2 = select(%1, 0) %2 = select(%1, 1) Select nodes were used only to handle the multi-output case. In the single-output case ops referred directly to their uses. This required special handling for the single- and multi- output cases, and was confusing when used with ONNX which distinguishes values (the inputs/outputs of a node) from the nodes themselves (e.g. a Conv). This commit adds the Node/Value distinction to the IR. In the example above, `a`, `b`, `c`, and `d` are now Value objects, while `op1` and `op2` are now Node objects. Inputs/Outputs to the graph are values. * Nodes now always have multiple outputs, accessible through their `output()` method. * Methods exist for adding/removing outputs from a node. * Nodes own their output Values, destroying a node destroys its outputs and it is only valid to destroy a node when no uses of its outputs remain. * Unlike select, Values do not appear in the nodes list. * The method `node()` on `Value` retrieves its defining node. Calling it is always valid. For inputs, its kind is "Param". Like "Return" there is a single Param node representing all inputs. * For single-output Nodes, the method `output()` retrieves the single output Value, asserting that the node is in-fact single output. * Functions are the same, but some functions like `type()` have moved to Value. * `replaceAllUsesWith` is now sanely defined for both Values and Nodes. In the case of Nodes, it replaces all outputs of the node with the outputs of the replacement node. * stage is defined both on Node/Value. This is because Inputs require a stage. * Apart from changing data types from Node->Value most passes remain the same. Things that previously assumed single-output nodes now have to call output() to get the node. * This removes the uses = [...] field in the outputs because it was getting confusing even before this commit when uses would refer to nodes, but we print the names of Values. The lint pass validates the use list, so printing it out seems less necessary.	2017-11-15 11:47:18 -08:00
Sam Gross	a8efd88cac	Fix warning in jit/ir.cpp	2017-11-03 09:11:33 -07:00
Edward Z. Yang	b7f5bc506e	Make inputs/outputs return an ArrayRef. Some knock on effects: - at() is not supported on ArrayRef. I fixed this by adding a new overload for input() to access a specific input. I also filed https://github.com/zdevito/ATen/pull/152 - Need new overloads for fmap/filter, because template deduction won't attempt an implicit constructor in attempt to match the argument. - New overload in ir.cpp for printing ArrayRef. - When we pybind11 an ArrayRef, we convert it into an iterator. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-11-01 09:49:53 -04:00
Edward Z. Yang	247d50e2ad	Improve const-correctness of JIT. This started off as a minor fix based on Adam's question, "why is printing a graph not const" and snowballed into a giant yak shaving exercise. - The Graph and Node APIs now uniformly enforce deep constness; e.g., if you get a const Node* or const Graph, it is not possible to get a non-const Node/Graph* somewhere else in the graph (even though the member variables of these are non-const. Hooray for private access specifier.) - A big pile of functions got const versions, most notably the printing functions, and functions for accessing inputs(). - REALLY IMPORTANT, BC-BREAKING CHANGE: inputs() now returns a COPY of the inputs, rather than a reference to the underlying. I was forced to do this because there is no way to portably turn a std::vector<Node> into a std::vector<const Node>, which is necessary to provide a const-correct version of inputs() that enforces deep const-correctness. I then justified this choice to myself with the observation that outputs() returned a copy (by necessity), so this makes the API more uniform. But making this change uncovered two very subtle bugs: 1. If you change functions from returning a reference to returning a copy, the idiom node->inputs().begin() is no longer valid, because the memory the iterator points to immediately becomes invalid. THIS SUCKS. Honestly, we should add a lint rule rejecting calling begin()/end() on temporaries because this is very dangerous. To excise this pattern from the codebase, I added begin() and end() methods to Graph, so that we got rid of the graph->nodes().begin() idiom, which happens to be sound, despite not returning a reference, because graph_node_list is a non-owning reference. 2. pybind11 doesn't handle std::vector<Node> cast out of the box. Fortunately, I found a simple fix in the GitHub issues tracker that involved adding an extra type converter. And yes, this does mean that outputs() in Python never worked correctly. - New const_graph_node_list, which is a graph_node_list that gives you const Node There are some more miscellaneous improvements: - Applied CR comment fixes on export.cpp; using replaceInput, and renaming variables for clarity. - assertValidInput helper method added, and applied to replaceInput - Use an explicit function to print THPObjectPtr, otherwise we get the wrong overload. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-11-01 09:49:53 -04:00
Zachary DeVito	8cc30e4895	Fix the Fusion Pass (#3362 ) * update fuser to match ATen-formatted JIT ops * fix concat optimizations and add test * allow onnx export to work with single-export functions * fix onnx handling of multi-return nodes. * nits, format, vision test update * fix add constant * fix driver init issues * Add missing Neg symbolic. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-10-31 13:44:13 -04:00
Edward Z. Yang	40f7f6e095	Improve handling of 'expand' (broadcasting) in JIT and ONNX The pieces: - I improved the lint / asserts to catch some bugs which I committed while working on my export. There are two new properties which the linter checks now: (1) "Anticipated uses". If a node says that is used by M, M better appear later in the topsort. Previously, we only checked if it was in all_nodes. (2) If you are a select node, you better be a multi-type node; if you're not a select node, you better not be! And you should never have an input that is multi-type. - There is a new peephole optimization pass, for simple, local transformations to graphs. Right now, it implements a simple optimization: remove 'expand' invocations that are no-ops (the size before matches the size after), but we can add other things to it later. I needed this for ONNX because no-op expands show up in the left-hand argument, which we don't support. - There is now a broadcast fuser, which fuses ATen expand ops into broadcastable ONNX ops (Add, Div, Mul, Pow, Sub, Gemm.) It only fuses when the original size is a suffix of the new size, as per the ONNX spec. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-10-29 23:50:34 -04:00
Adam Paszke	fa0f3cf98a	Re-enable and fix most JIT tests	2017-10-27 02:40:09 +05:30
Edward Z. Yang	3696300fcf	Include Python.h less using a new stub header. In many "non-Python" headers, we include Python.h because we need to declare a pointer to PyObject, and solely because of that. It would be a lot better if we had a simpler version of Python.h that just declared PyObject available for pointers, without anything else. This is what torch/csrc/utils/python_stub.h does. The good thing about not including Python.h is that it is easy to be warning-less; no more ugly insertions of Python.h on headers where it has no good reason to be. This makes PyTorch warning clean again. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-10-19 23:04:19 -04:00
Edward Z. Yang	be04d5a347	Print small tensors in IR. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-10-05 15:27:49 -04:00
Zach DeVito	2996aad68c	remove dead code, add insertAt helper	2017-09-20 12:24:27 -04:00
Edward Z. Yang	ba95ffed97	Const correctness in IR and Attribute / linked list excision Since this code has been stable for a while, I think it's a good opportunity to make it const correct. There is only a slight increase in code size, which I hope will appease @zdevito. - consts were added to all methods which are logically const. Most notably, lint() is now declared const. - I made extra const versions of Node::iterator(), Node::reverseIterator(), Graph::nodes(), Attribute::find(), linked_list::begin(), linked_list::end(), linked_list::rbegin(), linked_list::rend(); in all cases these were one-liners except for find() (I spent a little time trying to make find() a one-liner but didn't think of a way to do it.). - graph_node_list got factored out into a new, templated type linked_list<T> (perhaps we should call it intrusive_list<T>). I had to template the iterator to define constant and non-constant iterators without duplicating code, and once I was there, I decided to templatize everything else. The code nicely factors out, although I wouldn't recommend using it for anything else without more refactoring. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-20 12:24:27 -04:00
Edward Z. Yang	670ec4bc59	Split Type into its own header file. No other substantive changes. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-20 12:24:27 -04:00
Zach DeVito	06903c3525	bugfix for word language model	2017-09-20 12:24:27 -04:00
Adam Paszke	e05cfb2064	Make sure passes don't mess up stages of nodes and graphs	2017-09-19 10:53:32 -04:00
Adam Paszke	aa1a94058b	Add AddConstant node to the JIT	2017-09-19 10:53:32 -04:00
Adam Paszke	2ae7d8e5f9	Fix Chunk heuristic in graph fuser	2017-09-19 10:53:32 -04:00
Edward Z. Yang	820143f4af	Drop L specifier; reimplement tuple printing in C++ When you call repr() on a long in Python 2, it prints a long suffix. This is annoying for tests which assert on the exact output. Use str() instead. But then there is a problem with Python 2's default tuple str() implementation, where it calls repr() on its arguments rather than str(). This means that if you have a tuple of longs, it will render as "(1L, 2L)" in Python 2. To solve this problem, we just reimplement tuple printing in C++. This is not a very robust fix (nested tuples, dictionaries, all these situations will fail) but in practice it hits the cases that matter. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-12 11:03:03 -04:00
Zach DeVito	a63d88c95b	print more detailed error message when trying to exported an unsupported operator	2017-09-05 17:48:55 -04:00
Edward Z. Yang	cb5fbe1944	Expunge %2.0 syntax. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-05 17:48:55 -04:00
Edward Z. Yang	394ff072eb	Update to latest ToffeeIR operator schema. - Conv no longer supports bias, so we create an explicit broadcasted addition afterwards. There is one minor problem, however, which is that ConvTranspose in Caffe2 has mandatory bias. So there's a hack. See Note [Caffe2ConvTranspose] for the details. - Squeeze: dims -> axes - Transpose: axes -> perm - Reshape lost its extra output (yay!) Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-05 17:48:55 -04:00
Edward Z. Yang	4174112b49	Add lint pass for handle invariant. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-05 17:48:55 -04:00
Priya Goyal	e05979c4ea	adding dummy bias for the conv transpose	2017-09-05 17:48:55 -04:00
Edward Z. Yang	3b478c17a0	JIT backward closure comments / Render stage changes in inputs. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-05 17:48:55 -04:00
Edward Z. Yang	b2e305e390	Lint after ToffeeIR, and subsequent fallout. I realized we weren't running the linter after ToToffeeIR, so I added a lint call. It thus emerged that the current implementation was using "Unused" nodes that were not added to the graph, which was tripping the lint. I fixed this a few ways: - BatchNorm and Conv primspecs were returning dead "unused" nodes for their (implicit) handle parameters. I removed them because setOutputs handles this already, and a dead unused node which is not attached to the graph violates the "no dead nodes" invariant. - OK, but MaxPool actually needs to return a unused node for the output which supported by PyTorch but not Toffee; we need to error if subsequently in the trace this output is used. The new strategy is to have MaxPool's primspec return a None at the unused position, and then immediately check if there are any uses of that output. If there are, that's an error! - I needed to adjust the Select invariant in the exporter loop: only if a Select node has uses is it mandatory for it to be defined in env. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-05 17:48:55 -04:00
Zach DeVito	c9f7f2eff4	Change pipeline for exporting to toffeeIR previously: PythonOp/CppOp Graph -> ToffeeIR, primspecs worked with protobufs now: PythonOp/CppOp --ToToffeIR--> jit::Graph of in-memory ToffeIR -> protobufs of ToffeIR This commit let's primspec functions work directly with JIT IR nodes, which makes it possible to do a lot more stuff in those functions.	2017-09-05 17:48:55 -04:00
Edward Z. Yang	d5d65080e3	Put every input on a new line. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-09-05 17:48:55 -04:00
Zach DeVito	a60d9bd022	Bind Attributes in python ir, and add test for python ir binding	2017-09-05 17:48:55 -04:00
Zach DeVito	a3fdb281d1	Python wrapper for Node IR using pybind11 Supports almost all of the IR API.	2017-09-05 17:48:55 -04:00
Priya Goyal	b0ba9a81d2	remove std::list, restore custom node list implementation.	2017-09-05 17:48:55 -04:00

1 2

76 Commits