pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Raghavan Raman	3fe72d30dc	[NNC] Optimize conditionals that correspond to the form generated for aten::cat op. (#57673 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57673 Test Plan: Imported from OSS Reviewed By: bertmaher Differential Revision: D28231374 Pulled By: navahgar fbshipit-source-id: 1777a63df4e5ebed6d515683bd772a88be465b3a	2021-05-18 14:23:48 -07:00
Luca Wehrstedt	5a238eb96e	Fix deadlock in Future due to lock inversion with GIL (#58382 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58382 Calling markCompleted on a Future now first acquires the Future's mutex (as usual) but then sometimes tries to acquire the GIL during the DataPtr extraction while still holding the Future's mutex. (This happens when the value passed to markCompleted is a Python object). This can cause a deadlock if someone else calls any of the other methods of Future while holding the GIL. There are two solutions to this: avoid holding the Future's mutex when extracting DataPtrs, and avoid holding the GIL while invoking the Future's method. In this PR I'm going for the latter, because it's a very simple immediate fix, but I believe this is brittle and that we should probably also consider the former fix. ghstack-source-id: 129105358 Test Plan: The repro in https://github.com/pytorch/pytorch/issues/58239 now doesn't deadlock. Reviewed By: mrshenli Differential Revision: D28472816 fbshipit-source-id: 1bc9bca426dd004f9eb2568db1ffd38f014450e2	2021-05-17 10:53:19 -07:00
Lillian Johnson	9403fe17ce	[torch.package/TorchScript] logic to enable sharing of tensors on load (#57573 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57573 Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D28226975 Pulled By: Lilyjjo fbshipit-source-id: bc8cb3e8052fa18336c437e0601d8b0028fd1895	2021-05-14 08:21:43 -07:00
Lillian Johnson	3ad11803f7	[torch.Package/TorchScript] ScriptModuleSerializer add unified format (#56299 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56299 Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D27832545 Pulled By: Lilyjjo fbshipit-source-id: 1b2880a8458f99bd66a8c9656c5ca700f43cffe8	2021-05-14 08:21:40 -07:00
Lillian Johnson	07de11c26d	[torch.Package/TorchScript] TS serialization importer to handle unified format (#54891 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54891 Changed TorchScript's jit/serialization importer logic to handle both original TS serialization format and new unified TS format Original TS file format: ``` resnet.pt ├── data # tensor data │ ├── 94286146172688 │ ├── 94286146172784 │ └── ... ├── code/ # TorchScript code │ ├── __torch__ │ │ ├── torch │ │ │ └── nn ... │ │ └── torchvision ... │ ├── __torch__.py │ └── __torch__.py.debug_pkl ├── data.pkl # the ScriptModule object, pickled by the TS pickler ├── version # version metadata ├── constants.pkl # any tensor constants present in the TS code └── extra ├── name_of_file └── foo ``` Unified file format: ``` ─── package_name.pt ├── .data │ ├── ts_code # code shared between models │ │ ├── 0 │ │ │ ├── constants.pkl │ │ │ └── data.pkl │ │ ├── 1 │ │ │ ├── constants.pkl │ │ │ └── data.pkl │ │ └── code │ │ ├── __torch__ │ │ │ ├── torch │ │ │ │ └── nn ... │ │ │ └── torchvision ... │ │ ├── __torch__.py │ │ └── __torch__.py.debug_pkl │ ├── 0.storage │ ├── 1.storage │ ├── <many more storages> │ ├── 201.storage │ ├── extern_modules │ └── version └── res ├── mod.pkl # maps to ts_id 0 and .data/ts_code/0 └── mod2.pkl # maps to ts_id 1 and .data/ts_code/1 ``` Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D27832548 Pulled By: Lilyjjo fbshipit-source-id: 4a6e84c3a9bac8eed6a4e4afc2ac76dd691858b0	2021-05-14 08:20:34 -07:00
Dhruv Matani	38e606d056	[RFC] Add method torch.jit._clone_module_with_class (#56152 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56152 Currently, the Bundled Inputs API mutates the module in-place. It adds class methods and not instance methods. This results in a small problem that one can't re-run an already executed cell in Bento if the class has already been subject to bundled inputs. In addition, there is no way to add bundled inputs to a module that has bundled inputs added already. This API provides a way to solve this problem as well by adding an `ignored_methods` to the call to `clone()` by allowing the implementation of bundled inputs to pass in the methods that it will add as `ignored_methods` so that when it does try to add those methods, it will be able to do so successfully. We'll have to be careful when ignoring those methods during the call to `torch.jit._clone_module_with_class` since any bundled input that relies on a user-provided method will need to be preserved and not ignored during the clone. Looking for feedback on whether this is an acceptable direction. ghstack-source-id: 128908360 Test Plan: Added unit test and ran it as `buck test //caffe2/test:mobile` Also see this Bento Notebook: https://www.internalfb.com/intern/anp/view/?id=550829 Reviewed By: gmagogsfm Differential Revision: D27788394 fbshipit-source-id: 48109cd4583506d4efdb345e4ba31385db23a273	2021-05-13 22:31:05 -07:00
BowenBao	346dc88bfa	[ONNX] Support registering custom export for prim::PythonOp from torch.autograd.Function (#55630 ) (#57600 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57600 Demo script: ```python import torch class MyReLU(torch.autograd.Function): staticmethod def forward(ctx, input, scalar_tuple, scalar, scalar_list): ctx.save_for_backward(input) return input.clamp(min=scalar) staticmethod def backward(ctx, grad_output): input, = ctx.saved_tensors grad_input = grad_output.clone() grad_input[input < 0] = 0 return grad_input class MyModule(torch.nn.Module): def __init__(self): super().__init__() self.linear_a = torch.nn.Linear(2, 2) self.linear_b = torch.nn.Linear(2, 2) self.relu = MyReLU.apply def forward(self, x): h = self.linear_a(x) h = self.relu(h, (5, 3), 2, [1, 2, 3]) h = self.linear_b(h) return h """ User define how to export prim::PythonOp into custom op. """ def symbolic_pythonop(g, n, args, *kwargs): # Print information: print('arguments of ', kwargs['name'], ':') print('original node: ', n) for i, out in enumerate(n.outputs()): print('original output {}: {}, requires grad: {}'.format(i, out, out.requiresGrad())) import torch.onnx.symbolic_helper as sym_helper for i, arg in enumerate(args): print('arg {}: {}, requires grad: {}'.format(i, arg, arg.requiresGrad() if sym_helper._is_value(arg) else False)) for k, v in kwargs.items(): print('key: ', k, ' v: ', v) # TODO: all inputs (tensors and scalars) are in args. # backend can define CustomDomain::PythonOp and how info are stored however it deem fit. return g.op("CustomDomain::PythonOp", args[0], name_s=kwargs['name']) torch.onnx.register_custom_op_symbolic("::prim_PythonOp", symbolic_pythonop, 9) # Define input. x = torch.tensor([[0.3971, 0.7544], [0.5695, 0.4388]], requires_grad=True) model = MyModule() # Forward. y = model(x) torch.onnx.export(model, (x,), 'model.onnx', opset_version=12, verbose=True) ``` Test Plan: Imported from OSS Reviewed By: malfet Differential Revision: D28393528 Pulled By: SplitInfinity fbshipit-source-id: e0d55b7c737c5916fda08a3b26b3306037f970df Co-authored-by: BowenBao <bowbao@microsoft.com>	2021-05-13 13:42:49 -07:00
neginraoof	1de3525ca8	[ONNX] Handle PackedParams inputs for _propagate_and_assign_input_shapes (#56449 ) (#57079 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57079 Testing onnx 1.9 release, we see that the old bug is triggered for the caffe2 test: `pytest test/onnx/test_pytorch_onnx_caffe2_quantized.py::TestQuantizedOps::test_small_model` This is because the graph inputs ```python graph(%x.1 : Tensor, %conv1._packed_params : __torch__.torch.classes.quantized.Conv2dPackedParamsBase, %conv2._packed_params : __torch__.torch.classes.quantized.Conv2dPackedParamsBase, %fc.bias : Float(10, strides=[1], requires_grad=0, device=cpu), %fc.weight : Float(10, 72, strides=[72, 1], requires_grad=0, device=cpu)): ``` contains `Conv2dPackedParamsBase` which is a PackedParams. When we do flatten, we will flatten to several tensors, then the shape inference for input misaligned. This PR record how may tensors got flattened in PackeParams, and skip by these number rather than 1, then the UT passed. Note that tuple case should still follow the original logic. Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D28393949 Pulled By: malfet fbshipit-source-id: 98d48aad27e5ca03fb10d260f8e625478d996ee2 Co-authored-by: David <jiafa@microsoft.com>	2021-05-12 15:20:26 -07:00
Chen Lai	8c04593c0a	[PyTorch Edge] Add backport to export old bytecode models (#56802 ) Summary: Add an api to backport a model vn to model vi. It accept an input model (file or buffer) and output a model (file or buffer) with an expected bytecode version. In this change, the input is a model and it can come from a file or buffer. The output is a model and can be either file path or buffer. When backport fails, function return false with a warning message : ``` /Users/chenlai/pytorch/cmake-build-debug/bin/test_jit --gtest_filter=LiteInterpreterTest.BackPortByteCodeModelV4:LiteInterpreterTest/.BackPortByteCodeModelV4:/LiteInterpreterTest.BackPortByteCodeModelV4/:/LiteInterpreterTest/*.BackPortByteCodeModelV4 --gtest_color=no Testing started at 2:32 PM ... CUDA not available. Disabling CUDA and MultiCUDA tests [W backport.cpp:419] Warning: Backport doesn't support backport to version3 (function _backport_for_mobile_impl) Process finished with exit code 0 ``` ## Test 1. Run both `caffe2/test/cpp/jit/test_lite_interpreter.cpp` and `caffe2/test/mobile/test_bytecode.py`. 2. Run all prod models with backport api. Pull Request resolved: https://github.com/pytorch/pytorch/pull/56802 ghstack-source-id: 128425510 Test Plan: CI Reviewed By: raziel, iseeyuan Differential Revision: D27844651 fbshipit-source-id: 8a803cf6c76433ee0a3049b1a5570585d569f8d6	2021-05-07 18:14:33 -07:00
Luca Wehrstedt	36e47af58b	Pass reference to parent future in callbacks (#57635 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57635 Note: this PR looks massive, but it's just one simple change, codemodded many times. In many cases, a callback needs to access the value/error produced by the parent future. In Python this was easy because the callback was invoked with the parent future as argument, and could thus inspect it. In C++ the callbacks didn't take any arguments, thus in many cases we worked around this by capturing the future in its own callback. This is risky (leads to reference cycle and thus memory leak) and must be done carefully (spoiler: sometimes we weren't). ghstack-source-id: 128296580 Test Plan: CI Reviewed By: wanchaol Differential Revision: D28178783 fbshipit-source-id: 6de02c4568be42123372edc008f630d5ddae0081	2021-05-07 03:59:18 -07:00
Luca Wehrstedt	8e9bbd3113	Make DataPtr extraction in CUDAFuture faster for Python values (#56918 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56918 Re-importing a Python module each time is a bit expensive, and it's unnecessary because this is a private module which won't change and thus we can cache the value once we first extract it. ghstack-source-id: 128184666 Test Plan: CI Reviewed By: mrshenli Differential Revision: D27985910 fbshipit-source-id: be40ae9b67ab8ea6c07bc2cb9a78d2c2c30b35d3	2021-05-06 01:12:53 -07:00
Yi Huang (Symphony)	ba78bf1363	[standaloneRunner] fix another GIL mutithreading issue exposed by torch::jit::toIValue() (#57688 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57688 P412982836 says that `torch::jit::toIValue()` will also touch GIL through `torch::jit::createGenericDict()` (P412848640) So we have to move `torch::jit::toIValue()` out of multithreading execution Reviewed By: hyuen Differential Revision: D28236527 fbshipit-source-id: 43a33dbcfc828cc42c5e1230c8f5cb415bf7bde4	2021-05-05 21:41:04 -07:00
Chen Lai	fb9a32b7b4	[PyTorch][Edge] Add api to get bytecode model version (#56801 ) Summary: Add an api `_get_bytecode_version` to get version number given a bytecode model in both cxx and python, and the input can be both from file path and buffer. ## Test CI (new added unit test will run as part of `pytorch_core-buck`) 1. run test_lite_interpreter.cpp 2. `python test/mobile/test_bytecode.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56801 ghstack-source-id: 128169647 Test Plan: CI (new added unit test will run as part of `pytorch_core-buck`) 1. run test_lite_interpreter.cpp 2. `python test/mobile/test_bytecode.py` Reviewed By: iseeyuan Differential Revision: D27961417 fbshipit-source-id: f786cc9573d855feecff0b4fe8e5363e25f5728c	2021-05-05 09:17:26 -07:00
Luca Wehrstedt	58bc003487	Add pybind type caster for c10::Device (#57292 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57292 In Future (and soon in other places too) we need to receive a list of devices from Python-land. We don't want to just take their indices because we need full devices in order to infer the type from them. torch.device is not defined through pybind, it's defined through a plain `PyModule_AddObject` call with CPython, thus pybind isn't naturally able to understand and convert it. However we can provide a custom type caster which fixes that. We have this already for at::Tensor, at::Generator, ... ghstack-source-id: 127916268 Test Plan: CI Reviewed By: mrshenli Differential Revision: D28092732 fbshipit-source-id: 1c31d0b85a4d5c9e7bde8161efbb7574d505157c	2021-05-01 16:11:10 -07:00
Scott Wolchok	b87d3fa432	[PyTorch][jit] Don't allow create() on singleton types (#56807 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56807 If I understand correctly, there's no reason to create your own instance of these global singleton types. ghstack-source-id: 127312270 Test Plan: CI Reviewed By: SplitInfinity Differential Revision: D27973447 fbshipit-source-id: f12df69d185f1baaa45f2ac6eac70570a7a65912	2021-04-30 10:28:50 -07:00
Luca Wehrstedt	311ad5e3af	Merge CUDAFuture into ivalue::Future (#57052 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57052 This PR caps a stack whose goal was to merge CUDAFuture into ivalue::Future. CUDAFuture used to be a subclass of ivalue::Future, which was already pretty good, but it meant that in several places we needed `#ifdef`s or registries in order to create the right type of class, which was annoying. We've made CUDAFuture device-agnostic, by using generic helpers, so that it doesn't depend on CUDA. Now all its code can be inserted into ivalue::Future. This PR does this very naively, by copy-pasting CUDAFuture's code into the (previously empty) virtual methods of ivalue::Future. This helps ensure the correctness of this PR, as it's straightforward to see it behaves exactly like before. However we probably want to polish it a bit later to iron out so wrinkles. ghstack-source-id: 127713138 (Note: this ignores all push blocking failures!) Test Plan: CI Reviewed By: mrshenli Differential Revision: D28036829 fbshipit-source-id: 3e5b16402f5dc245c1fcb9d7bf06db64dcb0d2a3	2021-04-29 09:31:52 -07:00
Luca Wehrstedt	71c2f88b90	Make CUDAFuture handle any kind of device type (#57051 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57051 Make CUDAFuture autodetect the devicetype from its arguments (which thus change from DeviceIndices to full Devices). This in fact transforms CUDAFuture into a AnythingFuture, since it's not tied to CUDA in any way anymore. Having made it fully device-agnostic, we'll merge it into ivalue::Future in the next PR. ghstack-source-id: 127713134 (Note: this ignores all push blocking failures!) Test Plan: CI Reviewed By: mrshenli Differential Revision: D28032711 fbshipit-source-id: 8ba23b1b0d97f61db8693cd5f3c7bae7989a9bcd	2021-04-29 09:31:50 -07:00
Nikita Shulga	4cb534f92e	Make PyTorch code-base clang-tidy compliant (#56892 ) Summary: This is an automatic change generated by the following script: ``` #!/usr/bin/env python3 from subprocess import check_output, check_call import os def get_compiled_files_list(): import json with open("build/compile_commands.json") as f: data = json.load(f) files = [os.path.relpath(node['file']) for node in data] for idx, fname in enumerate(files): if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'): files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')] return files def run_clang_tidy(fname): check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname,"-s"]) changes = check_output(["git", "ls-files", "-m"]) if len(changes) == 0: return check_call(["git", "commit","--all", "-m", f"NOLINT stubs for {fname}"]) def main(): git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n") compiled_files = get_compiled_files_list() for idx, fname in enumerate(git_files): if fname not in compiled_files: continue if fname.startswith("caffe2/contrib/aten/"): continue print(f"[{idx}/{len(git_files)}] Processing {fname}") run_clang_tidy(fname) if __name__ == "__main__": main() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892 Reviewed By: H-Huang Differential Revision: D27991944 Pulled By: malfet fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179	2021-04-28 14:10:25 -07:00
Jacob Szwejbka	60a5ebfac2	[Pytorch Edge] Remove methods_to_optimize arg (#57045 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57045 Went back and adjusted the previous optimizations to just be applied to every function. Cleaned up api to match. ghstack-source-id: 127214412 ghstack-source-id: 127536155 Test Plan: unit test Reviewed By: kimishpatel Differential Revision: D27950859 fbshipit-source-id: 214e83d5a19b452747fe223615815c10fa4aee58	2021-04-27 14:54:13 -07:00
Pritam Damania	dc8a8cea79	Move caffe2 signal_handler to c10. (#56717 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56717 The signal_handler was under the caffe2 namespacee but was being used by PyTorch as well. I've fixed this my moving it to the c10 namespace where now both C2 and PyTorch can use it. The signal_handler interface in caffe2/utils/signal_handler.h is kept the same for backward compatiblity for C2, but most of the commmon code is moved to c10. ghstack-source-id: 127446929 Test Plan: waitforbuildbot Reviewed By: ezyang Differential Revision: D27946738 fbshipit-source-id: d6228d1a0108f4c807d405e7a0bb799c5375388f	2021-04-26 23:08:12 -07:00
Luca Wehrstedt	a688b29750	Support custom Python classes in CUDAFuture (#56516 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56516 One problem with CUDAFuture's extraction of DataPtrs from IValues is that it only supported Python objects that could be converted to "regular" IValues (e.g., lists/dicts/tuples of ints/strings/tensors/...). One notable exception are custom Python classes, which are in fact a very common data type transferred over RPC. The only solution we found for those is to use the Python pickler to extract the tensors contained in them. We can't insert a Python dependency directly into CUDAFuture, so instead I'm proposing to use the same indirection technique used to support `getSubValues` on Python objects: define some methods on the abstract class `PyObjectHolder` (which can be used by CUDAFuture) but only implement them in the concrete subclass `ConcretePyObjectHolder` (which is only built when Python support is enabled). I am a bit worried about the performance toll of this (pickling isn't exactly known to be cheap) but I think we should start by providing a functionally complete API. We already have ideas on how to make this faster if needed, for example by having users provide a custom DataPtr extractor tailored to their class via a decorator. (Or just use TorchScript). ghstack-source-id: 127295014 Test Plan: Added a test later in the stack Reviewed By: mrshenli Differential Revision: D27887189 fbshipit-source-id: 9d27e4e62390b836e5bb4f06f401cc002f0cf95b	2021-04-24 07:06:28 -07:00
Luca Wehrstedt	15ca379bde	Add CUDA support to a user-created torch.futures.Future (#56517 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56517 Currently a torch.futures.Future could wrap a CUDAFuture, but it could not create one from scratch. This prevented users from using CUDAFutures in some occasions, for example when using `rpc.functions.async_execution`, or in their own code. I don't see any reason for such a limitation, hence here I add support for this. ghstack-source-id: 127261554 Test Plan: Added a test later in the stack Reviewed By: mrshenli Differential Revision: D27887190 fbshipit-source-id: ecbb39c1ad7cd189d478ded9c361448f05a270ad	2021-04-23 08:13:56 -07:00
BowenBao	818ce1d0d2	Add standardOps match more input type in ORT (#53813 ) (#56172 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56172 Enable the standardOps include Add\Sub\Mul\Div\Gemm\Pow\Mod with low precision input in ORT Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D27866136 Pulled By: SplitInfinity fbshipit-source-id: f2cf5649fffefd68c0cc7b6dce94198751636727	2021-04-21 17:58:08 -07:00
BowenBao	9986b109d2	[ONNX] Fix assign input shape for tuple inputs & primitive type inputs (#54112 ) (#56164 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56164 Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D27866139 Pulled By: SplitInfinity fbshipit-source-id: c59f5a07df685e1ccdc4860d603ec422ec80d188	2021-04-20 23:00:37 -07:00
Zhengxu Chen	8176ab6ca0	[JIT] Put explicit error message on class attribute accesses. (#55723 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55723 Resolving https://github.com/pytorch/pytorch/issues/51139 Test Plan: python test/test_jit.py TestClassType.test_unresolved_attributes Imported from OSS Reviewed By: gmagogsfm Differential Revision: D27691960 fbshipit-source-id: 1d078a4ab25af1a73109ca6ef0333a67a634bff6	2021-04-16 15:47:10 -07:00
Bert Maher	8e82e932f3	Reland: D27652485: [nnc] Enable CPU fusion only when num_threads == 1" (#56120 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56120 This reverts commit `ad17fadbfc` (D27786457). The big annoyance here is that depending on the threading mode you may not be able to toggle num_threads at will, so the fusion tests won't fail. I hate this solution, but I'm adding a secondary override for the TE fuser. Now you need to both turn on fusion (_jit_override_can_fuse_on_cpu), and you're OK if you're running with 1 thread, or you can add `_jit_set_texpr_parallel_cpu_enabled` to enable it anyways. This is (a) mainly for tests, since a real user probably won't fiddle aimlessly with the thread count, and (b) will go away once NNC's threading support is fully baked. Test Plan: Imported from OSS Reviewed By: Krovatkin Differential Revision: D27788199 Pulled By: bertmaher fbshipit-source-id: 070d04474f15e9689dbdf8cc1fde43050c6506b1	2021-04-15 15:50:18 -07:00
Edward Yang	6ec71ed4f9	Replace all direct cdata access with THPVariable_Unpack (#55799 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55799 I'm going to change the implementation of cdata soon so I need to abstract over cdata access with a function. Additionally, many users are casting manually casting to THPVariable to access the member so I can remove these unsafe casts in the client code (the implementation, of course, is still doing an unsafe cast.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D27712130 Pulled By: ezyang fbshipit-source-id: 95fcc013bf3913d67f2c634068eb5b3aab144cb3	2021-04-15 08:57:04 -07:00
James Reed	71a5314591	Fix ScriptMethod dispatch on __torch_function__ (#56103 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56103 Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D27784142 Pulled By: jamesr66a fbshipit-source-id: 555dcb7c3a98b8fb9e9ca9b499cafad54e819aa7	2021-04-15 08:46:43 -07:00
Nikitha Malgi	88c06d9dfc	Add cuda device synchronization support in JIT (#55469 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55469 Test Plan: Imported from OSS Reviewed By: ZolotukhinM Differential Revision: D27749077 Pulled By: nikithamalgifb fbshipit-source-id: bce3d331ab781cf3232b47b4f02ef504b9eadc7e	2021-04-14 09:13:07 -07:00
Nikita Shulga	6a39613f35	[BE] Make torch/csrc/jit/tensorexpr/ clang-tidy clean (#55628 ) Summary: Mostly auto-generated changes using ``` python3 tools/clang_tidy.py -c build -x torch/csrc/jit/tensorexpr/eval.cpp -s ``` With following common patterns manually fixed - Use ` = default` instead of `{}` - deleted methods should be public - Use pass-by-value + std::move instead of pass-by-reference+copy Pull Request resolved: https://github.com/pytorch/pytorch/pull/55628 Reviewed By: walterddr Differential Revision: D27655378 Pulled By: malfet fbshipit-source-id: 92be87a08113435d820711103ea9b0364182c71a	2021-04-08 19:44:14 -07:00
Jacob Szwejbka	20d7916a6a	[Pytorch Mobile] Fold Conv BatchNorm for functions besides forward (#54619 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54619 Minor refactor to conv batchnorm folding to work on other functions besides forward ghstack-source-id: 125767010 Test Plan: unit test and {P339453712} Reviewed By: kimishpatel Differential Revision: D27301452 fbshipit-source-id: 4e0cc544a171a970583979a496b2908935124497	2021-04-06 13:07:12 -07:00
Nikitha Malgi	197f9f0826	Merge CUDA Streams and Events (#53902 ) Summary: ----------- - Updates current_stream and default stream API's to take `optional[device]` argument - Adds parsing logic to replace `torch.cuda.Stream` and `torch.cuda.Event` -> `torch.classes.cuda.Stream` and `torch.classes.cuda.Event` for JIT - Merges StreamContext manager for both Eager and JIT. Pull Request resolved: https://github.com/pytorch/pytorch/pull/53902 Test Plan: ------ Run JIT tests: python test/test_jit.py -v TestCUDA Run eager tests: python test/test_cuda.py -v TestCuda Reviewed By: glaringlee Differential Revision: D27494627 Pulled By: nikithamalgifb fbshipit-source-id: b30b0570e38a33fb335c83762eb06ffd46a44b5c	2021-04-05 08:19:55 -07:00
Mike Ruberry	c0ac0fef4e	Revert D27448156: irange for size_t Test Plan: revert-hammer Differential Revision: D27448156 (`041b4431b2`) Original commit changeset: 585da57d4de9 fbshipit-source-id: 8e047c29f391c0166e0a1a87c3fb2a0854377365	2021-04-03 19:14:00 -07:00
Richard Barnes	041b4431b2	irange for size_t (#55163 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55163 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D27448156 fbshipit-source-id: 585da57d4de91c692b6360d65f7b8a66deb0f8c1	2021-04-02 23:22:29 -07:00
Meghan Lele	6866c033d5	[JIT] Add recursive scripting for class type module attributes (#55124 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55124 Summary This commit modifies type inference (used by the module scripting code) so that it tries to script the type of any class instances that it encounters. This enables recursive, automatic scripting of class type module attributes. Test Plan This commit adds a test case for this to `TestClassType`. Test Plan: Imported from OSS Reviewed By: gmagogsfm Differential Revision: D23971883 Pulled By: SplitInfinity fbshipit-source-id: 7a5a2e7c12ee68cbdeb0a07e6aaf98734a79cb06	2021-04-02 12:16:21 -07:00
Negin Raoof	cd9dd653e9	[ONNX] Support primitive type input/outputs and attributes (#53550 ) (#54864 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54864 Support primitive type attributes. Needed for Silero model. Test Plan: Imported from OSS Reviewed By: nikithamalgifb Differential Revision: D27408982 Pulled By: SplitInfinity fbshipit-source-id: 16b291eedbe9f9bb31d7664a29a484555df53755	2021-03-31 21:14:20 -07:00
Rohan Varma	a37fbf9b45	[Futures] Bump log verbosity when ignoring cb errors in python future. (#54476 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54476 Per title. For `add_done_callback`, we log but swallow exceptions in order to keep consistent with what concurrent.futures python library does, see discussion in https://github.com/pytorch/pytorch/pull/45675. Although, it would be good to improve the verbosity here as this can be a source of confusion if users are setting a different future via `add_done_callback`, and an error is hit resulting in an unexpected hang (see https://github.com/pytorch/pytorch/issues/52132 for more details on how this can happen). ghstack-source-id: 125300389 Test Plan: CI Reviewed By: lw Differential Revision: D27253004 fbshipit-source-id: 72ed21c8fb6d27de5797c17fc46b762f893e6fea	2021-03-31 15:17:06 -07:00
Jianyu Huang	7fc03dd7c9	Back out "[pytorch][PR] Merge CUDA Streams and Events" (#54996 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54996 Original commit changeset: 45d9fee9a582 Test Plan: CI Reviewed By: jspark1105 Differential Revision: D27444718 fbshipit-source-id: deb627230817923eaf84ade50ecb14bfbce4e779	2021-03-31 10:21:35 -07:00
Michael Suo	8a170fbacd	[package] fix mangling issues with TorchScript (#54915 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54915 TorchScript and torch.package have different mangling schemes. To avoid them interfering with each other, we should undo the torch.package mangling before processing anything with TorchScript (since TS independently makes sure that no names collide). Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D27410472 Pulled By: suo fbshipit-source-id: d1cc013c532d9abb7fb9615122bc465ded4785bb	2021-03-31 00:58:05 -07:00
anjali411	1bccd48465	Allow creating SugaredValue for a complex valued IValue and deserialization logic for "infj" and "nanj" global constants (#54328 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54328 Test Plan: Imported from OSS Reviewed By: nikithamalgifb Differential Revision: D27369134 Pulled By: anjali411 fbshipit-source-id: aec26750a6fc8917ee15306684b743d13a91570c	2021-03-29 14:46:29 -07:00
Nikitha Malgi	416ba5c48f	Merge CUDA Streams and Events (#53902 ) Summary: ----------- - Updates current_stream and default stream API's to take `optional[device]` argument - Adds parsing logic to replace `torch.cuda.Stream` and `torch.cuda.Event` -> `torch.classes.cuda.Stream` and `torch.classes.cuda.Event` for JIT - Merges StreamContext manager for both Eager and JIT. Pull Request resolved: https://github.com/pytorch/pytorch/pull/53902 Test Plan: ------ Run JIT tests: python test/test_jit.py -v TestCUDA Run eager tests: python test/test_cuda.py -v TestCuda Reviewed By: SplitInfinity Differential Revision: D27285996 Pulled By: nikithamalgifb fbshipit-source-id: 45d9fee9a582b5f4c82330f5f99eb88584804270	2021-03-26 14:19:39 -07:00
anjali411	f9ca0d87a7	Teach Python TS frontend to parse complex literals (#52881 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52881 This PR adds: 1. logic to parse complex constants (complex literals of the form `bj`) 2. logic to parse complex lists 3. support for complex constructors: `complex(tensor/int/float/bool, tensor/int/float/bool)` 4. Limited operator support - `add`, `sub`, `mul`, `torch.tensor`, `torch.as_tensor` Follow-up work: 1. Add complex support for unary and other registered ops. 2. support complex constructor with string as input (this is supported in Python eager mode). 3. Test all emitXYZ for all XYZ in `ir_emitter.cpp` (currently only emitConst, emitValueToTensor are tested). e.g., test loops etc. 4. onnx doesn't support complex tensors, so we should error out with a clear and descriptive error message. Test Plan: Imported from OSS Reviewed By: bdhirsh Differential Revision: D27245059 Pulled By: anjali411 fbshipit-source-id: af043b5159ae99a9cc8691b5a8401503fa8d6f05	2021-03-24 08:12:17 -07:00
Christian Puhrsch	2668149b8c	Export torch::jit::toIValue (#54449 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/54448 Pull Request resolved: https://github.com/pytorch/pytorch/pull/54449 Reviewed By: SplitInfinity Differential Revision: D27243154 Pulled By: cpuhrsch fbshipit-source-id: fc21d6ce251b868356ad8ea13ae891fb56e311ce	2021-03-22 17:17:18 -07:00
Bin Bao	4626886f21	[JIT] Add CUDNN Conv-Add-Relu fusion for Frozen Model Optimization (#52102 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52102 Test Plan: Imported from OSS Reviewed By: eellison Differential Revision: D26646100 fbshipit-source-id: 7f7a82cc0b42c958b9e0c854b3b5dc6ea7cfff6c	2021-03-18 15:18:52 -07:00
James Reed	255b103c1b	[WIP] Function to retrieve inspect.Signature instances for PyTorch ops (#53830 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53830 Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D26982802 Pulled By: jamesr66a fbshipit-source-id: 18fddc9f3f34b09e173de59f2fe886f8eedd000e	2021-03-17 20:41:27 -07:00
Jacob Szwejbka	8f61b13e80	[Pytorch Mobile] Optimize Non Forward for Mobile (#53314 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53314 Introduction of api for optimizing non forward functions for mobile. As of this diff, all functions that you say to optimize will be preserved, and those functions will be run through canonical optimization. The intention is to stack each further optimization onto separate diffs since they touch multiple files, and it seems like it'd be a nightmare to review. ghstack-source-id: 123909414 Test Plan: torch.utils.mobile_optimizer.optimize_for_mobile(net, methods_to_optimize=["forward", "foo"]) runs fine torch.utils.mobile_optimizer.optimize_for_mobile(net, methods_to_optimize={"foo"}) optimizes just foo if the model doesnt define forward otherwise optimizes foo and forward torch.utils.mobile_optimizer.optimize_for_mobile(net, methods_to_optimize=["forward"]) runs fine torch.utils.mobile_optimizer.optimize_for_mobile(net) runs fine if the model defines forward, Throws otherwise Reviewed By: kimishpatel Differential Revision: D26618689 fbshipit-source-id: 5bff1fb3f3f6085c4a649a8128af9c10f0fa9400	2021-03-17 14:31:24 -07:00
Thomas Viehmann	fd5c1123e4	wrap AliasDb in Python (#51336 ) Summary: Also added a wrapper tlemo 's graphviz export to string. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51336 Reviewed By: ezyang Differential Revision: D26150809 Pulled By: eellison fbshipit-source-id: 9beafce5cbdc1785b986b71c3cd986c1087faa11	2021-03-17 12:55:22 -07:00
BowenBao	57d1df071f	[ONNX] Support inplace operations on inplace indexing (#52063 ) (#53306 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53306 * [ONNX] Fix for sequence of mutations in blocks (#51577) Fixes consecutive mutations in a tensor inside blocks. Also, support append and pop in blocks. * Support inplace operations + indexing * Clean up old pass for remove mutations * Add loop test * Fixes for set attr in loops * Removing the new jit API flag * [ONNX] Redesign onnx pass to enable shape type dependent pattern conversion - cont (#51795) With the introduction of ONNX shape inference, shape and type are inferred on the fly as operators get converted from ATen to ONNX when running symbolic function. This resolves the shape/type requirement for the symbolic functions. The pre-onnx passes however, can not be supported by shape inference, since at that stage the operators in the graph are still ATen operators. This PR is to update the design of ONNX pass, to enable a mechanism of capturing subgraphs of ATen operators of certain patterns, and convert them later, when shape/type information of upstream operators are available. The new design will require pre-onnx passes that need shape/type to be written in two parts, encapsulation and conversion. The encapsulation part will find the nodes of patterns, like how pre-onnx passes were written previously. But instead of converting the nodes, it will encapsulate them into a sub-block of a new placeholder node. This part is called before onnx pass, so it runs before calling symbolic functions. The conversion part will be called inside the onnx pass. In onnx pass, run_symbolic_func will be called for each node in topological order. When it reaches the placeholder node, the conversion part will be invoked. It will convert the nodes inside the sub-block based on pattern. By that time, it will have shape/type of upstream operators available. After the conversion is complete, the placeholder node will be removed, and nodes inside its sub-block converted. Run_symbolic_func will be called for these nodes, and they will be converted from ATen operator to ONNX operator. This PR includes several other fixes, listed below. * ~~replace helper.cpp with onnx_utils.cpp for holding utility functions.~~ * fix EraseNumberTypes on Bool type, the code was outdated that back then Bool type doesn't exist. * ~~enable onnx shape inference in export with parameter/initializer data.~~ * other code clean ups. * fix insertion of identity nodes for loop opset 13 sequence output. ~~PR depends on #51603~~ * Fix after merge * clang * Fix clang * Fix clang * Fix warning message. * Fixes for non-model param attributes * Fix for caffe2 * Additional test * clang * Skip test for lower opsets * fix clang-tidy * Update init.cpp * Update remove_inplace_ops_for_onnx.cpp * Update remove_inplace_ops_for_onnx.cpp * Update remove_inplace_ops_for_onnx.cpp * Fix for clang formatting Test Plan: Imported from OSS Reviewed By: pbelevich, malfet Differential Revision: D26922416 Pulled By: SplitInfinity fbshipit-source-id: e7108620b39b6404c594910786c4d275fee59d84 Co-authored-by: Bowen Bao <bowbao@microsoft.com>	2021-03-12 02:49:11 -08:00
BowenBao	3f9c803fe8	[ONNX] Redesign onnx pass to enable shape type dependent pattern conversion - cont (#51795 ) (#53304 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53304 With the introduction of ONNX shape inference, shape and type are inferred on the fly as operators get converted from ATen to ONNX when running symbolic function. This resolves the shape/type requirement for the symbolic functions. The pre-onnx passes however, can not be supported by shape inference, since at that stage the operators in the graph are still ATen operators. This PR is to update the design of ONNX pass, to enable a mechanism of capturing subgraphs of ATen operators of certain patterns, and convert them later, when shape/type information of upstream operators are available. The new design will require pre-onnx passes that need shape/type to be written in two parts, encapsulation and conversion. The encapsulation part will find the nodes of patterns, like how pre-onnx passes were written previously. But instead of converting the nodes, it will encapsulate them into a sub-block of a new placeholder node. This part is called before onnx pass, so it runs before calling symbolic functions. The conversion part will be called inside the onnx pass. In onnx pass, run_symbolic_func will be called for each node in topological order. When it reaches the placeholder node, the conversion part will be invoked. It will convert the nodes inside the sub-block based on pattern. By that time, it will have shape/type of upstream operators available. After the conversion is complete, the placeholder node will be removed, and nodes inside its sub-block converted. Run_symbolic_func will be called for these nodes, and they will be converted from ATen operator to ONNX operator. This PR includes several other fixes, listed below. * ~~replace helper.cpp with onnx_utils.cpp for holding utility functions.~~ * fix EraseNumberTypes on Bool type, the code was outdated that back then Bool type doesn't exist. * ~~enable onnx shape inference in export with parameter/initializer data.~~ * other code clean ups. * fix insertion of identity nodes for loop opset 13 sequence output. ~~PR depends on #51603~~ Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D26922417 Pulled By: malfet fbshipit-source-id: 14ed06158d539e2451c2e5e63ba1b32fb0f75095	2021-03-11 10:30:09 -08:00
Nikitha Malgi	cfaa0bf286	[JIT] Update Namespace from cuda to _cuda (#53378 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53378 Test Plan: Imported from OSS Reviewed By: navahgar Differential Revision: D26970607 Pulled By: nikithamalgifb fbshipit-source-id: 20a55dd9c0071c5870a4b176d30cb9c1e1496687	2021-03-11 00:52:01 -08:00

1 2 3 4 5 ...

366 Commits