pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Nikolay Korovaiko	f81395b3e3	Enable more passes in ProfilingGraphExecutor Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22079 Differential Revision: D16119322 Pulled By: Krovatkin fbshipit-source-id: 301fcc42d0e1f031d9de5bcd9679fb8c2d742fef	2019-07-10 10:44:18 -07:00
Sebastian Messmer	de85abf226	Allow default construction of Dict/List (#22084 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22084 For DictPtr/ListPtr, default construction was disallowed because it was ambigious if it's supposed to create an empty list or a nullptr. But since we renamed them to Dict/List, we can now allow default construction without ambiguity. Differential Revision: D15948098 fbshipit-source-id: 942a9235b51608d1870ee4a2f2f0a5d0d45ec6e6	2019-06-25 17:40:48 -07:00
Sebastian Messmer	275087383b	ListPtr->List DictPtr->Dict step 2 (#21937 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21937 This changes call sites to use the new naming scheme Reviewed By: zdevito Differential Revision: D15892404 fbshipit-source-id: 8d32aa90a0ead1066688166478f299fde9c2c133	2019-06-19 18:02:05 -07:00
Sebastian Messmer	b527e48588	Use c10::List (#21177 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21177 - Integrate c10::ListPtr into IValue and the c10 dispatcher. - Streamline conversion to/from IValue. Before, we had IValue::to<> and kernel_functor.h had its own ivalue_to_arg_type and return_type_to_ivalue. They are now unified. Also, this means that nested types like Dicts of Lists of Optional of Dict of ... do work as expected now Differential Revision: D15476433 fbshipit-source-id: bde9df80df20091aa8e6ae17ba7e90abd149b954	2019-06-12 13:58:24 -07:00
Zachary DeVito	ea822d9626	Interpreter support for CallFunction/CallMethod (#21562 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21562 ghimport-source-id: 17e5e183f730f50d97ef48973aafc6249d54978f Reviewed By: suo Differential Revision: D15729500 Pulled By: zdevito fbshipit-source-id: efa8a133b617b1498810392a8da6b513ce00b5eb	2019-06-09 15:28:26 -07:00
Zachary DeVito	ad0c08f950	Expose ExecutionPlan in prep for function calls (#21561 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21561 ghimport-source-id: 4bf28d8140610a0cefef0c0a17f0a513ae855dde Reviewed By: suo Differential Revision: D15729498 Pulled By: zdevito fbshipit-source-id: b26458336da1efaba71d8a577c3917c6622dae0d	2019-06-09 15:28:22 -07:00
Zachary DeVito	e616a5e8b8	Revert D15600067: Expose ExecutionPlan in prep for function calls Differential Revision: D15600067 Original commit changeset: 82b7de458dd6 fbshipit-source-id: ca26a362cd73bdb9e8c4eba15dd5c10986fa79fe	2019-06-07 22:20:44 -07:00
Zachary DeVito	bfb235b8c9	Revert D15618275: Interpreter support for CallFunction/CallMethod Differential Revision: D15618275 Original commit changeset: 038ae27e5416 fbshipit-source-id: 8dbe0f564ba103fe445dacc471085c659171705f	2019-06-07 22:20:40 -07:00
Zachary DeVito	5f6afafdef	Interpreter support for CallFunction/CallMethod (#21325 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21325 ghimport-source-id: eeca1176f5e00c85a69cd016acccf5105e670e02 Reviewed By: jamesr66a Differential Revision: D15618275 Pulled By: zdevito fbshipit-source-id: 038ae27e5416f1ce338009627c839a4d61a00658	2019-06-07 20:56:58 -07:00
Zachary DeVito	1517ff66a1	Expose ExecutionPlan in prep for function calls (#21273 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21273 ghimport-source-id: b92c1e07fbe4122467a21b98d29635295093e0c2 Reviewed By: jamesr66a Differential Revision: D15600067 Pulled By: zdevito fbshipit-source-id: 82b7de458dd65c175f55b0f383bfc3fcf4704032	2019-06-07 20:56:55 -07:00
Wanchao Liang	113a27ee45	bake constants into the traced graph, get rid of getNestedValueTrace (#21046 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21046 ghimport-source-id: 5cb3efb1896fbe42336e24c14fbf0bb5e646528e Differential Revision: D15530991 Pulled By: wanchaol fbshipit-source-id: b096ca5a1cdce496742b7f7e1de3ef8d21e9a8b0	2019-06-03 21:48:11 -07:00
Dmytro Dzhulgakov	c25e33789e	Lightweight at-most-once logging for API usage (#20745 ) Summary: Resubmit #20698 which got messed up. Idea is that when PyTorch is used in a custom build environment (e.g. Facebook), it's useful to track usage of various APIs centrally. This PR introduces a simple very lightweight mechanism to do so - only first invocation of a trigger point would be logged. This is significantly more lightweight than #18235 and thus we can allow to put logging in e.g. TensorImpl. Also adds an initial list of trigger points. Trigger points are added in such a way that no static initialization triggers them, i.e. just linking with libtorch.so will not cause any logging. Further suggestions of what to log are welcomed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20745 Differential Revision: D15429196 Pulled By: dzhulgakov fbshipit-source-id: a5e41a709a65b7ebccc6b95f93854e583cf20aca	2019-05-23 23:17:59 -07:00
Nikolay Korovaiko	f215db9b92	InsertGuards pass Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20438 Differential Revision: D15342655 Pulled By: Krovatkin fbshipit-source-id: a193e582d621b99f848573fb4478e7b62265dc9f	2019-05-20 10:49:19 -07:00
Edward Z. Yang	9b1dbffba5	Re-sync with internal repository (#20702 )	2019-05-20 09:22:57 -04:00
Dmytro Dzhulgakov	d3059b9c49	Lightweight logging for once-only API usage	2019-05-19 23:04:40 -07:00
Edward Yang	97e1f07ffc	Replace AT_CHECK with TORCH_CHECK [shard 10/10] Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20436 Reviewed By: jerryzh168 Differential Revision: D15318926 fbshipit-source-id: 71a43070cc50cc174f703ebc595f1d87c6fc1e91	2019-05-15 07:35:37 -07:00
Nikolay Korovaiko	9499c7b7ee	Profiling GraphExecutor Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19994 Differential Revision: D15307752 Pulled By: Krovatkin fbshipit-source-id: 7b35191042199ef16823487e15fe639968cbdc89	2019-05-10 23:05:47 -07:00
Wanchao Liang	4d676d53a6	split canonicalize_ops, make a decompose pass (#19988 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19988 ghimport-source-id: 1dbf39e07099fa24ef9a6c0221312bf01a8011b7 Differential Revision: D15190355 Pulled By: wanchaol fbshipit-source-id: 83f2b6557efd758810ccb4a4229d71fdebfd06e0	2019-05-08 17:21:59 -07:00
Wanchao Liang	8fbde94664	lower batchmm to non-diff optimization (#19987 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19987 ghimport-source-id: ca4c38312bd56d8a71f1925297deee7f64f573d3 Differential Revision: D15190356 Pulled By: wanchaol fbshipit-source-id: 761edb08c670fcbc24a06a5b11ceddf311f75884	2019-05-06 15:58:33 -07:00
Thomas Viehmann	5c9ab6f411	Specialize Optional[T] to T (or subtype for Tensor) or None when executing graph (#18407 ) Summary: This patch specializes `Optional[Tensor]` graph inputs to either a `DimensionedTensorType` (if a Tensor is passed) or `NoneType`. Other `Optional[T]` are specialized to `T` or `None`. - For unwrapping (checked and unchecked) we need to keep the output type, as IR code that follows unwrapping may not work with NoneType (just as it doesn't deal with Optional). While it would not be hit during execution, it will run against the (legitimate) assumptions of the analysis passes. - Function lookup currently will not match NoneType when it expects optional (I'm not entirely sure why this doesn't lead to unhappyness currently, but hey), I amend this at the level of the function matching code (`operator.cpp`), but see Adam's comments. We would run into trouble if we needed to select between functions whose signature only differs in Optional types with different subtypes, but we would have the same problem when calling them directly, so I would think this is OK. - It would enable throwing away branches we can't hit. This also reduces the "blockyness" of the graph, so it may be easier to apply optimizations (e.g. fuse things in `if t is None: ...` and outside the `if`. - Arguments passed into `Optional[Tensor]` arguments will get shape information, which is very handy. - It get's rid of the problem that tensors passed into Optional arguments get requires_grad set erroneously #18270 (though that also affects lists, which aren't fixed here). - `Optional[List[int]]` is needed for #18697. - We're changing typing in a more subtle way than the `TensorType`->`DimensionedTensorType`. - In particular, specializing to NoneType loses the Type information captured in the `OptionalType` element type. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18407 Reviewed By: zdevito Differential Revision: D15216808 Pulled By: eellison fbshipit-source-id: 01f1a7643deaf4962c3f55eff2070d54b0e54b69	2019-05-06 15:35:03 -07:00
Mikhail Zolotukhin	8b46938355	Cleanup includes in torch/csrc/jit/* (#19922 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19922 ghimport-source-id: 0434c46bf75621ff79ea27a18a2475e7f13e2487 Differential Revision: D15125015 Pulled By: ZolotukhinM fbshipit-source-id: 5685edfc94067f62e363a85e9badb7f757b1d321	2019-05-06 13:40:26 -07:00
Zachary DeVito	b9c20d5224	graph_for based on last_optimized_executed_graph (#19142 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19142 ghimport-source-id: 822013fb7e93032c74867fc77c6774c680aef6d1 Differential Revision: D14888703 Pulled By: zdevito fbshipit-source-id: a2ad65a042d08b1adef965c2cceef37bb5d26ba9	2019-04-16 09:17:53 -07:00
Zachary DeVito	1827ca4c35	Make debug subgraph inlining thread local (#19136 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19136 ghimport-source-id: 3a24ab36aa753ce5cce7bba3467bdbe88e5c7f60 Reviewed By: jamesr66a Differential Revision: D14885051 Pulled By: zdevito fbshipit-source-id: b39c6ceef73ad9caefcbf8f40dd1b9132bba03c2	2019-04-13 08:42:14 -07:00
Bram Wasti	b1539412db	Add pass registration mechanism (#18587 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18587 ghimport-source-id: 80d753f7046a2a719e0c076684f44fa2059a0921 Differential Revision: D14901227 Pulled By: bwasti fbshipit-source-id: 56511d0313419b63945a36b80e9ea51abdef2bd4	2019-04-12 15:32:00 -07:00
Zachary DeVito	1abbee0f8e	Allow Tensor lists to show up in symbolic differentiable graphs. (#16784 ) Summary: It is done by flattening all tensor lists that are inputs/outputs to the graph into the inputs/outputs list in the autograd graph. This is less desirable than simply allowing IValues to exist in the inputs/outputs of autograd::Function but it is substantially less intrusive. CaptureList describes the variables captured for backward in a single class. UnpackInstructs describes how the flattened inputs to backwards are re-packed into lists. ailzhang This PR is also part 2 of covering maskrcnn & bert AD formulas, following #16689. Ops added in this PR: ``` cat index meshgrid reshape split split_with_sizes stack unbind ``` I will also add a few perf numbers here. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16784 Differential Revision: D14104063 Pulled By: ailzhang fbshipit-source-id: 5ceadadfd67ccaac60c5fd6740786c5354e252b9	2019-04-10 18:16:20 -07:00
Michael Suo	ce67775f08	remove unused func (#18712 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18712 ghimport-source-id: e435150a501b20695a5276addee93d795e04b532 Stack from [ghstack](https://github.com/ezyang/ghstack): * #18712 [jit][easy] remove unused func * #18711 [jit] fix side-effects and aliasing for custom ops as title Differential Revision: D14730979 fbshipit-source-id: 381d16ea2a45779bf6d5fc6d90a4f8585461e902	2019-04-05 15:19:28 -07:00
Zachary DeVito	2d07993bcb	Add ability to specialize class types to ArgumentSpec (#18314 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18314 ghimport-source-id: 8cecb768d476ab19c9460f39c8f94a764e4cb052 Stack from [ghstack](https://github.com/ezyang/ghstack): * #18314 Add ability to specialize class types to ArgumentSpec * #18226 Add Slot type to abstract the raw pointers being used for slots. Differential Revision: D14574395 fbshipit-source-id: cc3af6e56e9ae52990f4a1ad56ecceaa2d493577	2019-04-02 17:35:57 -07:00
James Reed	85f36014e2	Experimental logging/counters API (#18235 ) Summary: This defines a generic counters API that users can utilize to provide monitoring functionality in e.g. a production service. We expose both counters for runtime internals as well as a TorchScript API to create user-defined counters. Synopsis of the API: - `torch/csrc/jit/script/logging.h` specifies the externally-facing API in C++ - `torch/jit/_logging.py` specifies the Python API We use an interface, `LoggerBase`, to define the interactions between users and a logging backend. Implementing a subclass of `LoggerBase` allows the user to handle these events in a custom way, such as logging into a DB or calling into an infra-specific counters API. From the frontend perspective, we can create log events in two ways: 1. We provide an `add_stat_value(name, val)` function. This calls into the Logger backend with a key/value pair. For example, we might call `add_stat_value('foo', 1)` to bump an event counter. 2. We provide a `time_point()` function to record a timestamp in nanoseconds. This can be used in conjunction with `add_stat_value` to record runtime wall clock durations. Examples of frontend usage can be found in `test_jit.py TestLogging`. We provide a trivial `LockingLogger` implementation as an example and for testing purposes. It is likely not ready for production usage. It demonstrates that a backend implementing the API can do things like specify aggregation types and report these aggregate stats via the `get_counters()` API. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18235 Differential Revision: D14545060 Pulled By: jamesr66a fbshipit-source-id: 04099543a1898cfdd411511e46e03d5dce9b4881	2019-03-29 17:14:03 -07:00
eellison	dc6b5b2a52	Optimize boolean expressions & unwraps (#18259 ) Summary: Simplify or eliminate boolean and/or expressions, optimize unwrapping a value that cannot be None, and optimize using `is` with a None and a non-None value Since peephole optimize is now introducing constants, i added another constant propagation pass after running it. Previously i had a PR that did this & optimized shape ops - i will add the shape optimizations in a separate PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18259 Differential Revision: D14602749 Pulled By: eellison fbshipit-source-id: 1c3f5a67067d8dfdf55d7b78dcb616472ea8a267	2019-03-25 21:50:57 -07:00
Wanchao Liang	6c9b312fd4	Add addcmul, lerp to fuser, enable scalar->float specialization in symbolic script (#18081 ) Summary: This PR did two things: 1. Enable scalar->float specialization in symbolic script, so AD formula that contains scalar in the schema, should write `float` instead. 2. add addcmul, lerp to AD and fuser. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18081 Differential Revision: D14490493 Pulled By: wanchaol fbshipit-source-id: b3b86d960d5f051b30733bc908b19786111cdaa4	2019-03-25 11:05:45 -07:00
Roy Li	7aae51cded	Replace tensor.type().scalarType() calls with tensor.scalar_type() Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17515 Reviewed By: ezyang Differential Revision: D14233250 fbshipit-source-id: 6c7af8d2291c0c2b148001b30cf03834f34366c0	2019-03-08 14:08:18 -08:00
Wanchao Liang	ab95b5c6cc	Rename prim::Undefined to prim::AutogradZero (#17611 ) Summary: supersedes #17245 Pull Request resolved: https://github.com/pytorch/pytorch/pull/17611 Differential Revision: D14283581 Pulled By: wanchaol fbshipit-source-id: 8022d02b8a021ea2fee9a18a2c8920eb123200c5	2019-03-01 15:13:18 -08:00
Ailing Zhang	b0545aa85f	maskrcnn & bert AD coverage part 1 (#16689 ) Summary: - Moved a few functions from `autograd` namespace to `aten` namespace to be visible from JIT nativeResolver. - Added a hack to loop up keyword only argument. Will add proper support for kw only later - Simulate function overload in aten using `_<number>` as function name suffix. - Even `forward` returns multiple outputs like in `kthvalue`, there's at most one requires grad that we currently support. - Removed the `TensorList` related ops here since partial `TensorList` support is prone to bugs. Our symbolic diff for `cat` was never tested with autodiff, and it seems broken. Need to find another proper way to support these ops(either by properly supporting `TensorList` or sth like `prim::ConstantChunk` and leave them for next PR. Ops supported in this PR: ``` erf expand_as index kthvalue mean permute pow rsub select sqrt squeeze t to topk transpose view var embedding logsumexp // grad is None _dim_arange contiguous nonzero ones_like ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/16689 Differential Revision: D14020806 Pulled By: ailzhang fbshipit-source-id: a5e2c144a7be5a0d39d7ac5f93cb402ec12503a5	2019-02-14 15:36:39 -08:00
Wanchao Liang	ac00e85e36	Remove undefined tensor in jit script (#16379 ) Summary: This PR is a follow up of #15460, it did the following things: * remove the undefined tensor semantic in jit script/tracing mode * change ATen/JIT schema for at::index and other index related ops with `Tensor?[]` to align with what at::index is really doing and to adopt `optional[tensor]` in JIT * change python_print to correctly print the exported script * register both TensorList and ListOfOptionalTensor in JIT ATen ops to support both * Backward compatibility for `torch.jit.annotate(Tensor, None)` List of follow ups: * remove the undefined tensor semantic in jit autograd, autodiff and grad_of * remove prim::Undefined fully For easy reviews, please turn on `hide white space changes` in diff settings. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16379 Differential Revision: D13855677 Pulled By: wanchaol fbshipit-source-id: 0e21c14d7de250c62731227c81bfbfb7b7da20ab	2019-02-07 11:02:14 -08:00
Mikhail Zolotukhin	0e6123fb8a	Remove dependency on ResourceGuard from IR.h. (#16351 ) Summary: It looks like `WithInsertionPoint` and `WithCurrentScope` can be easily implemented without `ResourceGuard` - that helps readability and removes one more dependency. Is there anything I'm missing? Pull Request resolved: https://github.com/pytorch/pytorch/pull/16351 Differential Revision: D13821826 Pulled By: ZolotukhinM fbshipit-source-id: b203200b345fb5508a97dc8656e6f51cde4cc21f	2019-01-29 00:21:32 -08:00
Mikhail Zolotukhin	47bf30661f	Directly include headers from ATen. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16287 Differential Revision: D13792949 Pulled By: ZolotukhinM fbshipit-source-id: d627d8dc469df048063c70d0b5b8d33fede809a3	2019-01-24 11:22:27 -08:00
Michael Suo	f636dc9276	clang format world (#15524 ) Summary: The PR clang-formats everything in `torch/csrc/jit/` and adds it to the pre-commit hook. Here is a list of non-mechanical changes: - I went over each file and fixed up whenever I could tell that clang-format was clobbering comment formatting. - Made the macros in register_prim_ops a little more clang-format friendly by omitting trailing commas - Refactored autodiff.cpp to use a helper class with explicit state rather than a bunch of capturing lambdas - Small improvements to the precommit hook clang-format Pull Request resolved: https://github.com/pytorch/pytorch/pull/15524 Differential Revision: D13547989 Pulled By: suo fbshipit-source-id: 3ff1541bb06433ccfe6de6e33f29227a2b5bb493	2018-12-26 06:55:01 -08:00
Zachary DeVito	0368054a6d	Split up compiler.cpp (#15355 ) Summary: This separates the different parts of compiler.cpp to make their relationship more clear. In particular it adds: * sugared_value.{h,cpp} - all the public SugaredValues that the compiler defines and a few that were inside compiler.cpp * type_parser.{h, cpp} - Turns TreeRef's defining types into TypePtr * schema_matching.{h, cpp} - infrastructure for matching arguments against overloaded schema and emitting builtin operators with a particular schema. Retains: * compiler.{h, cpp} - now responsible simply for the `defineMethodsInModule` infra structure. Some utility functions like inlineCallTo have moved to ir.h. Only thing that is not a move is some changes in module.h/cpp that remove multiple returns from `Method::emit_call_to`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15355 Reviewed By: suo, wanchaol Differential Revision: D13507524 Pulled By: zdevito fbshipit-source-id: 69ec936a9ff1a383c12a883616346b219c72e393	2018-12-18 19:43:35 -08:00
David Riazati	f3bff2d500	Add RNNCell modules to Script standard library (#14695 ) Summary: Adds RNNCell modules to script standard lib cc apaszke for argument_spec changes Pull Request resolved: https://github.com/pytorch/pytorch/pull/14695 Differential Revision: D13467680 Pulled By: driazati fbshipit-source-id: 13a14da87714325cc4c3d49e5fde8a850d5d757b	2018-12-18 17:28:28 -08:00
James Sun	e37a22128e	Allow tracing with fork/wait (#15184 ) Summary: There is still limitation on this: if a script module is somewhere in the trace, the inputs/outputs can only be tensors or tuples of tensors. resolves #15052 Pull Request resolved: https://github.com/pytorch/pytorch/pull/15184 Differential Revision: D13457691 Pulled By: highker fbshipit-source-id: 8fe46afc41357a0eb8eadd83f687b31d074deb0e	2018-12-17 20:34:26 -08:00
Peter Goldsborough	7a61306031	Enable all clang-tidy performance checks (#15198 ) Summary: This PR adds the final set of clang-tidy checks we should add for our codebase: a last set of performance-related checks. Most fixes here are around changing `auto` to `const auto&` in a few places where unnecessary copies were made, and adding `reserve()` calls before loops doing repeated `push_back()`. Also a few cases of calling `std::string::find` with a single-character string literal instead of a single char, which uses a less efficient string search algorithm meant for searching larger substrings. ![image](https://user-images.githubusercontent.com/6429851/49978940-adc1a780-ff01-11e8-99da-a4e431361f07.png) ezyang apaszke Pull Request resolved: https://github.com/pytorch/pytorch/pull/15198 Differential Revision: D13468797 Pulled By: goldsborough fbshipit-source-id: 2bed1ea1c7c162b7f3e0e1026f17125e88c4d5b2	2018-12-14 13:32:47 -08:00
Edward Yang	517c7c9861	Canonicalize all includes in PyTorch. (#14849 ) Summary: Anywhere we used #include "foo.h", we now say #include <foo.h> Paths are adjusted to be rooted out of aten/src, torch/lib, or the root level directory. I modified CMakeLists.txt by hand to remove TH and THC from the include paths. I used the following script to do the canonicalization: ``` import subprocess import re import os.path files = subprocess.check_output(['git', 'ls-files']).decode('utf-8').rstrip().split('\n') for fn in files: if not any(fn.endswith(suff) for suff in ['.cu', '.cpp', '.in', '.h', '.hpp', '.cu', '.cuh', '.cc']): continue if not any(fn.startswith(pref) for pref in ["aten/", "torch/"]): continue with open(fn, 'r') as f: c = f.read() def fmt(p): return "#include <{}>".format(p) def repl(m): p = m.group(1) if p in ["dlfcn.h", "unistd.h", "nvrtc.h", "cuda.h", "cuda_runtime.h", "cstdint", "cudnn.h", "Python.h", "cusparse.h", "cuda_runtime_api.h", "cuda_fp16.h", "cublas_v2.h", "stdint.h", "curand_kernel.h"]: return fmt(p) if any(p.startswith(pref) for pref in ["torch/csrc", "c10/", "ATen/", "caffe2/", "TH/", "THC/", "Eigen/", "gtest/", "zdl/", "gloo/", "onnx/", "miopen/"]): return fmt(p) for root in ["aten/src", "torch/lib", ""]: for bad_root in [os.path.dirname(fn), "aten/src/TH", "aten/src/THC", "torch/csrc"]: new_p = os.path.relpath(os.path.join(bad_root, p), root) if not new_p.startswith("../") and (os.path.exists(os.path.join(root, new_p)) or os.path.exists(os.path.join(root, new_p + ".in"))): return fmt(new_p) print("ERROR: ", fn, p) return m.group(0) new_c = re.sub(r'#include "([^"]+)"', repl, c) if new_c != c: print(fn) with open(fn, 'w') as f: f.write(new_c) ``` Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/14849 Reviewed By: dzhulgakov Differential Revision: D13363445 Pulled By: ezyang fbshipit-source-id: 52361f878a672785f9306c9e9ab2513128092b68	2018-12-08 19:38:30 -08:00
Adam Paszke	a60368982b	Batch more matrix multiplies (#13456 ) Summary: This handles the input pre-multiplication in RNNs, yielding pretty significant speedups in backward times. This pass depends on loop unrolling, so we'll batch only as many elements as the unrolling factor allows. cc mruberry ngimel zou3519 zdevito Pull Request resolved: https://github.com/pytorch/pytorch/pull/13456 Differential Revision: D12920339 Pulled By: zou3519 fbshipit-source-id: 5bcd6d259c054a6dea02ae09a9fdf9f030856443	2018-11-26 09:20:35 -08:00
Michael Suo	33d091f432	shape analysis fix (#14325 ) Summary: This PR is deceptively large because of an indenting change. The actual change is small; I will highlight it inline Pull Request resolved: https://github.com/pytorch/pytorch/pull/14325 Differential Revision: D13183296 Pulled By: suo fbshipit-source-id: fcbf6d5317954694ec83e6b8cc1c989f2d8ac298	2018-11-23 11:24:24 -08:00
Michael Suo	b149456645	alias analysis (#14018 ) Summary: First draft of an alias analysis pass. It's a big PR unfortunately; a rough table of contents/suggested order of review: 1. `AliasAnalysis` pass, which traverses the graph and builds an `AliasDb`. The basic strategy is to assign alias information to every value of mutable type (list/tuple/tensor), and use the alias annotations of each node's schema to assign alias info to the outputs based on the alias info the inputs. Nodes that aren't explicitly schematized have hand-written analysis rules. 2. Integration of aliasing information into `moveBefore/AfterTopologicallyValid()`. Basically, we pass in an alias DB when we ask for moveBefore/After. Similar to how we can boil down dependency analysis to "what nodes use this node", we can boil down mutability analysis to "what nodes write to an alias set input/output'd by this node". 3. Integration of alias analysis to optimization passes that need it. Right now, it is `GraphFuser`, `CreateAutodiffSubgraphs`, constant prop, and CSE. Not sure if any others need it. - Testing; still figuring out the best way to do this. - Eventually we want to integrate the alias db into the graph, but we shouldn't do that until we can guarantee that the information can stay up to date with mutations. - Do the same thing `python_printer` did for operators and force people to register alias analyzers if they can't schematize their op. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14018 Differential Revision: D13144906 Pulled By: suo fbshipit-source-id: 1bc964f9121a504c237cef6dfeea6b233694de6a	2018-11-21 17:48:46 -08:00
Michael Suo	7ea9c674bc	migrate subgraph slicing to use `moveBefore/moveAfter` (#13862 ) Summary: Migrate the `CreateAutodiffSubgraphs` pass to use topologically-safe moves instead of DynamicDAG. This is to unify the interface that we use for determining safe node moves to prepare for mutability. The pass looks a lot like GraphFuser now, and there's a lot of code duplication. I plan to pull common stuff out into a "subgraph manipulation utils" thing, but didn't want to clutter this PR. Future steps: - Get rid of code duplication (see above) - Use DynamicDAG to back the `moveBefore/After` calls. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13862 Differential Revision: D13072871 Pulled By: suo fbshipit-source-id: 92e7880ef444e0aefd51df60964bba7feaf42ae0	2018-11-14 17:33:36 -08:00
Thomas Viehmann	9ffabcfcaa	Use nested variant of getValueTrace to allow more flexible tracing script modules (#13597 ) Summary: When tracing scripted functions, we used to only allow Tensor arguments. This enables tracing script modules with List[Tensor] or Tuple[Tensor, Tensor] arguments (passing tuples). Fixes: #13566 Pull Request resolved: https://github.com/pytorch/pytorch/pull/13597 Differential Revision: D12990464 Pulled By: soumith fbshipit-source-id: fdce3afcb1e09f3c26d6ce834c01bf18d261f47c	2018-11-09 06:24:02 -08:00
Adam Paszke	f6ff5d8934	Append parameters when checking graphs for TorchScript Methods (#13553 ) Summary: Also, add an assertion in the GraphExecutor to make sure we don't access memory out of bounds. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13553 Differential Revision: D12924796 Pulled By: soumith fbshipit-source-id: ea2a134084538484178b8ebad33d6716a8e1d633	2018-11-05 16:07:36 -08:00
Peter Goldsborough	0479517325	Add modernize-* checks to clang-tidy (#13196 ) Summary: Enables almost all `modernize-*` checks in clang-tidy. This warns against things such as: - Use of `const std::string&` instead of new-style `std::string` + move, - Using old-style loops instead of range-for loops, - Use of raw `new` - Use of `push_back` instead of `emplace_back` - Use of `virtual` together with `override` (`override` is sufficient) ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/13196 Differential Revision: D12891837 Pulled By: goldsborough fbshipit-source-id: 4d0f782a09eb391ee718d3d66f74c095ee121c09	2018-11-02 20:30:40 -07:00
Michael Suo	5fbaf0eaf8	add augmented assignment ops (#13364 ) Summary: This PR changes the compiler to correctly emit in-place operators for augmented assignments (`+=` and friends). - To better match the Python AST structure, add an `AugAssign` tree view and make `Assign` apply only to `=` assignments. - Emit those `AugAssign` exprs in the compiler, dispatching to in-place aten ops for tensors and lowering to simple assignments for scalar types. - In order to preserve (suspect) ONNX export semantics, add a pass to lower the in-place operators to out-of-place operators. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13364 Differential Revision: D12899734 Pulled By: suo fbshipit-source-id: bec83be0062cb0235eb129aed78d6110a9e2c146	2018-11-02 00:01:07 -07:00

1 2

99 Commits