pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Will Feng	415595ace4	[C++ API] Remove init-list form of at::indexing::Slice (#34255 ) Summary: The init-list form of `at::indexing::Slice` (i.e. `tensor.index({{1, None, 2}, ...})` instead of `tensor.index({Slice(1, None, 2), ...})`) in C++ API can be easily confused with the list-form indexing in Python API (e.g. `tensor[[1, 3, 2], ...]`), which is not good from readability perspective. This PR removes the init-list form of `at::indexing::Slice` to make the API less confusing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34255 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20290166 Pulled By: yf225 fbshipit-source-id: abbcbeca0b179219e5e1f196a33ef8aec87ebb76	2020-03-06 05:51:53 -08:00
meganset	b8fd88319a	C++ make torch::nn::Sequential push_back(AnyModule) methods public (#34208 ) Summary: Issue https://github.com/pytorch/pytorch/issues/33192 Moves Sequential::push_back methods with AnyModule from private -> public Allows adding an existing AnyModule via something like: ``` torch::nn::Sequential q; auto a=torch::nn::AnyModule(torch::nn::Linear(1,2)); q->push_back(a); q->push_back("fc",a); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34208 Differential Revision: D20300278 Pulled By: yf225 fbshipit-source-id: 4525319bb7fb6667e43a006c9f446a2193781005	2020-03-06 05:47:14 -08:00
Ilia Cherniavskii	b50825e011	Make RecordFunction more robust for async use cases (#34122 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34122 Earlier work added support for async rpc cases when RecordFunction's end callbacks might be called in a different thread; in addition some extra care was needed to handle pointer to parent function; This PR makes RecordFunction aware of potentially multiple threads in use, as well as removes unused parent() call and restricts current() RecordFunction to scope-based record functions (RECORD_FUNCTION macro) Test Plan: unit tests Differential Revision: D20297709 Pulled By: ilia-cher fbshipit-source-id: 46a59e1b2eea0bbd8a59630385e193b38d30f9d1	2020-03-05 22:28:53 -08:00
anjali411	76035f050b	[C++ API Parity] Adam: updated step and class design (#33730 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33730 Differential Revision: D20292073 Pulled By: anjali411 fbshipit-source-id: a7b4a70f29027ab355aebb91873ea55d5cb51783	2020-03-05 19:15:24 -08:00
Meghan Lele	5500c3de0a	Revert D20150304: [pytorch][PR] [JIT] Introduce a fake Tensor creation node for IR unit tests Test Plan: revert-hammer Differential Revision: D20150304 Original commit changeset: c88f5289055a fbshipit-source-id: 14ac0e46145e9fb4f200c6318b63edd541380aeb	2020-03-05 16:25:08 -08:00
Martin Yuan	ccf4d69b75	[Lite Interpreter] Enable __setstate__ (#33294 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33294 1. Serialize bytecode of __setstate__ and run it when loading the model. 2. One use case is quantization. To test this use case a few operators are registered temporarily for lite interpreter. The "_" prefix registration will be removed when the operators are all migrated to mobile. Test Plan: Imported from OSS Differential Revision: D20162898 Pulled By: iseeyuan fbshipit-source-id: 7a3180807bf38fbce594d86993896861f12bb58c	2020-03-05 15:24:21 -08:00
Meghan Lele	9ce833879f	[JIT] Introduce a fake Tensor creation node for IR unit tests (#33914 ) Summary: Summary There is often a need to create a Tensor when writing IR by hand for JIT optimisation pass unit tests. The only options for this today are real Tensor creation functions like `aten::ones`. Any test that uses these functions must also use the same default arguments as the Python/C++ API, which means that all of the tests have to be updated when the API is updated. This commit introduces a new primitive, `prim::MakeTestTensor` with schema `() -> Tensor` that should be used in unit tests instead of real Tensor creation functions. This new primitive has no public-facing API, so the maintenance burden is much lower. Testing This commit updates the alias analysis and DCE tests to use `prim::MakeTestTensor` instead of `aten::rand`, `aten::ones`, and `aten::zeros`. ``` $ ./bin/test_jit CUDA not available. Disabling CUDA and MultiCUDA tests Note: Google Test filter = -_CUDA:_MultiCUDA [==========] Running 75 tests from 1 test case. [----------] Global test environment set-up. [----------] 75 tests from JitTest [ RUN ] JitTest.ADFormulas [ OK ] JitTest.ADFormulas (82 ms) [ RUN ] JitTest.Attributes [ OK ] JitTest.Attributes (0 ms) ... ... ... [ RUN ] JitTest.LiteInterpreterPrim [ OK ] JitTest.LiteInterpreterPrim (0 ms) [ RUN ] JitTest.LiteInterpreterLoadOrigJit [ OK ] JitTest.LiteInterpreterLoadOrigJit (2 ms) [----------] 75 tests from JitTest (150 ms total) [----------] Global test environment tear-down [==========] 75 tests from 1 test case ran. (150 ms total) [ PASSED ] 75 tests. ``` Fixes* This pull request fixes https://github.com/pytorch/pytorch/issues/33500. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33914 Differential Revision: D20150304 Pulled By: SplitInfinity fbshipit-source-id: c88f5289055a02dc20b7a5dcdf87469f9816d020	2020-03-05 12:42:42 -08:00
Martin Yuan	f097ca503d	Add and test training in lite interpreter. (#32359 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32359 Test Plan: Imported from OSS Differential Revision: D19450614 Pulled By: iseeyuan fbshipit-source-id: 6bafff39d7880a5b7fb9cd70c33a4e584812be12	2020-03-03 23:33:43 -08:00
davidriazati	99e211e661	[jit] Add type tags to lists/dicts in pickle (#33255 ) Summary: Stacked PRs * #33474 - [jit] Remove list specializations from pickler * #33255 - [jit] Add type tags to lists/dicts in pickle This adds a global call to `torch.jit._pickle.restore_type_tags` for lists and dicts so that we can preserve their types after serialization. ](https://our.intern.facebook.com/intern/diff/19868637/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/33255 Pulled By: driazati Reviewed By: xman1979, Tianshu-Bao Differential Revision: D19868637 fbshipit-source-id: 2f1826e6679a786ca209198690269f399a542c04	2020-03-03 16:48:21 -08:00
Zachary DeVito	358450e02b	improved TorchScript traceback (#33834 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33834 This changes how we report Tracebacks to make them more clear when there are both serialized and non-serialized ranges. It now looks like: ``` Traceback (most recent call last): File "foo.py", line 25, in <module> s2(a, b) File "/scratch/zdevito/pytorch/torch/nn/modules/module.py", line 550, in __call__ result = self.forward(input, kwargs) RuntimeError: The following operation failed in the TorchScript interpreter. Traceback of TorchScript, serialized code (most recent call last): File "code/__torch__.py", line 7, in forward x: Tensor, y: Tensor) -> Tensor: return (self).bar(x, y, ) ~~~~~~~~~ <--- HERE def bar(self: __torch__.Moo, x: Tensor, File "code/__torch__.py", line 11, in bar x: Tensor, y: Tensor) -> Tensor: _0 = (self).baz(x, y, ) ~~~~~~~~~ <--- HERE _1 = torch.ones([3], dtype=None, layout=None, device=None, pin_memory=None) return torch.add(_0, _1, alpha=1) File "code/__torch__.py", line 17, in baz x: Tensor, y: Tensor) -> Tensor: return torch.add(x, y, alpha=1) ~~~~~~~~~ <--- HERE Traceback of TorchScript, original code (most recent call last): File "foo.py", line 11, in forward def forward(self, x, y): return self.bar(x, y) ~~~~~~~~ <--- HERE File "foo.py", line 9, in bar def bar(self, x, y): return self.baz(x, y) + torch.ones(3) ~~~~~~~~ <--- HERE File "foo.py", line 7, in baz def baz(self, x, y): return x + y ~~~~~ <--- HERE RuntimeError: The size of tensor a (4) must match the size of tensor b (5) at non-singleton dimension 1 ``` It follows Python convension of having the most important information last and reading from the bottom up. Changes: Moved the error message to the end, to copy Python * Report original traceback separate from serialized traceback * Make sure root functions have names in the interpreter trace. Test Plan: Imported from OSS Differential Revision: D20126136 Pulled By: zdevito fbshipit-source-id: fd01f9985e5d74e04c4d064c02e8bc320f4fac13	2020-03-03 12:27:38 -08:00
Wanchao Liang	f909b5535e	[autograd] fix allow_unused checking for C++ API (#34035 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34035 Bug for the conditon check in https://github.com/pytorch/pytorch/pull/24342, realized we don't have tests in either python or cpp to catch this, so added testes for both python and cpp. Thanks hczhu on capturing it! Test Plan: Imported from OSS Differential Revision: D20198837 Pulled By: wanchaol fbshipit-source-id: 33846a14c0a8e7aac2e8328189d10c38a0d7e6ee	2020-03-02 17:57:15 -08:00
Will Feng	1494005cfd	C++ tensor indexing: more indexing tests (#30427 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30427 Test Plan: Imported from OSS Differential Revision: D18695899 Pulled By: yf225 fbshipit-source-id: 74455fe52ef922556fabe65aefca9ec93fe2346d	2020-02-28 22:07:41 -08:00
Martin Yuan	55b44f6746	Throw an exception when method cannot be found from mobile module. (#33972 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33972 Test Plan: Imported from OSS Differential Revision: D20168965 Pulled By: iseeyuan fbshipit-source-id: 2efe5dcb1fb80407cd88a47c50cb382ecd8aa275	2020-02-28 18:28:09 -08:00
Meghan Lele	5eacdfb21f	Revert D20127441: [pytorch][PR] [JIT] Introduce a fake Tensor creation node for IR unit tests Test Plan: revert-hammer Differential Revision: D20127441 Original commit changeset: 56da4f23ac46 fbshipit-source-id: 7d4602e5011bec6f6871eab16af05a3198694e5d	2020-02-27 13:48:31 -08:00
Meghan Lele	390d4d6df3	[JIT] Introduce a fake Tensor creation node for IR unit tests (#33595 ) Summary: Summary There is often a need to create a Tensor when writing IR by hand for JIT optimisation pass unit tests. The only options for this today are real Tensor creation functions like `aten::ones`. Any test that uses these functions must also use the same default arguments as the Python/C++ API, which means that all of the tests have to be updated when the API is updated. This commit introduces a new primitive, `prim::MakeTestTensor` with schema `() -> Tensor` that should be used in unit tests instead of real Tensor creation functions. This new primitive has no public-facing API, so the maintenance burden is much lower. Testing This commit updates the alias analysis and DCE tests to use `prim::MakeTestTensor` instead of `aten::rand`, `aten::ones`, and `aten::zeros`. ``` $ ./bin/test_jit CUDA not available. Disabling CUDA and MultiCUDA tests Note: Google Test filter = -_CUDA:_MultiCUDA [==========] Running 75 tests from 1 test case. [----------] Global test environment set-up. [----------] 75 tests from JitTest [ RUN ] JitTest.ADFormulas [ OK ] JitTest.ADFormulas (82 ms) [ RUN ] JitTest.Attributes [ OK ] JitTest.Attributes (0 ms) ... ... ... [ RUN ] JitTest.LiteInterpreterPrim [ OK ] JitTest.LiteInterpreterPrim (0 ms) [ RUN ] JitTest.LiteInterpreterLoadOrigJit [ OK ] JitTest.LiteInterpreterLoadOrigJit (2 ms) [----------] 75 tests from JitTest (150 ms total) [----------] Global test environment tear-down [==========] 75 tests from 1 test case ran. (150 ms total) [ PASSED ] 75 tests. ``` Fixes* This pull request fixes https://github.com/pytorch/pytorch/issues/33500. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33595 Differential Revision: D20127441 Pulled By: SplitInfinity fbshipit-source-id: 56da4f23ac46335227254f606c6481718108f378	2020-02-27 13:10:20 -08:00
Michael Suo	dbe850af5b	[jit] do the code reorg (#33851 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33851 Rationale and context described in #33828. Script to reproduce the move: https://gist.github.com/suo/16cbefaaeb67ca5a7c6caffd49b7f6e9 ghstack-source-id: 99079645 Test Plan: Make sure CI passes Reviewed By: jamesr66a Differential Revision: D20133869 fbshipit-source-id: 390e9241a9c85366d9005c492ac31f10aa96488e	2020-02-27 13:02:51 -08:00
Omkar Salpekar	24dd800e6a	[Dist Autograd] Functional API for Dist Autograd and Dist Optimizer (#33711 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33711 Fixed #33480 This makes `dist_autograd.backward` and `dist_optimizer.step` functional by making the user explicitly pass in the `context_id` as opposed to relying on the confusing thread_local context_id. This diff incorporates these API changes and all places where these functions are called. More concretely, this code: ``` with dist_autograd.context(): # Forward pass. dist_autograd.backward([loss.sum()]) dist_optim.step() ``` should now be written as follows: ``` with dist_autograd.context() as context_id: # Forward pass. dist_autograd.backward(context_id, [loss.sum()]) dist_optim.step(context_id) ``` Test Plan: Ensuring all existing dist_autograd and dist_optimizer tests pass with the new API. Also added a new test case for input checking. Differential Revision: D20011710 fbshipit-source-id: 216e12207934a2a79c7223332b97c558d89d4d65	2020-02-26 19:08:28 -08:00
Will Feng	5c33d98b0d	Add assert_tensor_equal and assert_tensor_not_equal to test/cpp/api/support.h (#30426 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30426 This PR adds `assert_tensor_equal` and `assert_tensor_not_equal` to `test/cpp/api/support.h`, as better functions for testing whether two tensors are equal / not equal. Test Plan: Imported from OSS Differential Revision: D18695900 Pulled By: yf225 fbshipit-source-id: c19b9bc4c4e84d9f444015023649d27618fcbdf5	2020-02-26 13:25:25 -08:00
Will Feng	0dded4026e	[C++ API] Add PackedSequence / pack_padded_sequence / pad_packed_sequence / pack_sequence (#33652 ) Summary: Most of the function implementation and test code are translated from the Python version. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33652 Differential Revision: D20052211 Pulled By: yf225 fbshipit-source-id: ce6767db54364f91ef4f06674239a12278c2752a	2020-02-25 12:53:41 -08:00
Will Feng	36919278cc	C++ tensor multi-dim indexing: add index() and index_put_() overloads, simple indexing tests, merge with Python indexing path (#32841 ) Summary: This PR adds the following items: - 1st item: `ArrayRef<TensorIndex>` and `std::initializer_list<TensorIndex>` overloads for `Tensor::index` and `Tensor::index_put_`, to be used specifically for multi-dim indexing purpose. Design rationale: * C++ `Tensor::index` and `Tensor::index_put_` are both existing tensor APIs, and they currently (before this PR) only accept a list of tensors (i.e. `ArrayRef<Tensor>`) as indices. If we change their signatures to also accept non-tensors as indices (i.e. `ArrayRef<TensorIndex>`, and `TensorIndex` is convertible from `Tensor` / `Slice` / `None` / `Ellipsis`), it would slow down the original code path (since now it has to go through more steps), which is undesirable. To get around this problem, the proposed solution is to keep the original `ArrayRef<Tensor>` overload, and add `ArrayRef<TensorIndex>` and `std::initializer_list<TensorIndex>` overloads to `Tensor::index` and `Tensor::index_put_`. This way, the original code path won’t be affected, and the tensor multi-dim indexing API is only used when the user explicitly pass an `ArrayRef<TensorIndex>` or a braced-init-list of `TensorIndex`-convertible types to `Tensor::index` and `Tensor::index_put_` . Note that the above proposed solution would still affect perf for the user’s original `Tensor::index` or `Tensor::index_put_` call sites that use a braced-init-list of tensors as input, e.g. `tensor.index({...})` or `tensor.index_put_({...}, value)`, since now such function calls would take the multi-dim indexing path instead of the original advanced indexing path. However, there are only two instances of this in our codebase (one in ATen cpp test, one in a C++ API nn init function), and they can be easily changed to explicitly use `ArrayRef<Tensor>` as input (I changed them in this PR). For external user’s code, since this is part of the C++ frontend which is still considered experimental, we will only talk about this change in the release note, and ask users to switch to using `ArrayRef<Tensor>` explicitly if they want to keep using the original advanced indexing code path. - 2nd item: Mechanisms for parsing `ArrayRef<TensorIndex>` indices and performing indexing operations (mirroring the functions in `torch/csrc/autograd/python_variable_indexing.cpp`). - 3rd item: Simple tests to demonstrate that the `Tensor::index()` and `Tensor::index_put_()` APIs work. I will add more tests after the first few PRs are reviewed. - 4th item: Merge Python/C++ indexing code paths, for code simplicity. I tested locally and found that there is no perf regression resulting from the merge. I will get more concrete numbers for common use cases when we settle on the overall design. This PR supersedes https://github.com/pytorch/pytorch/pull/30425. Pull Request resolved: https://github.com/pytorch/pytorch/pull/32841 Differential Revision: D19919692 Pulled By: yf225 fbshipit-source-id: 7467e64f97fc0e407624809dd183c95ea16b1482	2020-02-24 22:04:00 -08:00
Suyash458	47e90d774e	C++/Python API Parity: add pad_sequence (#32387 ) Summary: - add `pad_sequence` and tests - related issue https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/32387 Differential Revision: D20025421 Pulled By: yf225 fbshipit-source-id: caa9ae2114bece8db387a3a1610f24a3e06b1324	2020-02-21 13:16:09 -08:00
Mikhail Zolotukhin	bb5181b716	[TensorExpr] Add IR Printer. (#33220 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33220 Test Plan: Imported from OSS Differential Revision: D19848379 Pulled By: ZolotukhinM fbshipit-source-id: 1c6ab4f63080d4506dedc3c47938de92fb4bfba2	2020-02-21 13:10:26 -08:00
Mikhail Zolotukhin	fc70fc3610	[TensorExpr] Add IR visitor, IR mutator, and IR evaluator. (#33219 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33219 Test Plan: Imported from OSS Differential Revision: D19848381 Pulled By: ZolotukhinM fbshipit-source-id: 44ca7cd99c25e290a8ffd8146785c19f9c785dfd	2020-02-21 13:10:22 -08:00
Mikhail Zolotukhin	49af9425a7	[TensorExpr] Add core classes for representing expressions and statements. (#33218 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33218 Test Plan: Imported from OSS Differential Revision: D19848378 Pulled By: ZolotukhinM fbshipit-source-id: 48399f8651324d5ad0607e08573d5d7b2026bb23	2020-02-21 13:10:17 -08:00
Mikhail Zolotukhin	1a4f997178	[TensorExpr] Add a class for representing data type. (#33217 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33217 Test Plan: Imported from OSS Differential Revision: D19848380 Pulled By: ZolotukhinM fbshipit-source-id: d8683f8fc4555d2456cd2a7c827d8e8231915b49	2020-02-21 13:10:12 -08:00
nicolov	e77abb9a5b	Normalize reward-to-go in C++ actor-critic (#33550 ) Summary: Comparing to the [Python implementation](https://github.com/pytorch/examples/blob/master/reinforcement_learning/actor_critic.py), it seems like the tensor of normalized reward-to-go is computed but never used. Even if it's just an integration test, this PR switches to the normalized version for better convergence. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33550 Differential Revision: D20024393 Pulled By: yf225 fbshipit-source-id: ebcf0fee14ff39f65f6744278fb0cbf1fc92b919	2020-02-21 09:19:39 -08:00
Jerry Zhang	8527ba8b70	[jit] Add None parameter as parameter instead of attributes (#32964 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32964 att Test Plan: . Imported from OSS Differential Revision: D19913188 fbshipit-source-id: 9cdd93cbaf9892f4311656c786637765a675a68c	2020-02-19 16:06:56 -08:00
Jerry Zhang	07e5e42713	[jit][fix] Remove slot in parameter slot (#32846 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32846 att Test Plan: build/bin/test_jit Imported from OSS Differential Revision: D19844711 fbshipit-source-id: 3d29e5e97e97781f5dc00069827971baed52d76e	2020-02-19 11:15:15 -08:00
Zachary DeVito	c59e35b147	interpreter handling for varargs to remove need for looking at Node (#32791 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32791 When a registered operator has varags (ends with ... in its schema), the interpreter now appends the number of arguments to the top of the stack before invoking the operator. This allows the removal of more uses of Node* in the interpreter. This PR also then cleans up the constructors for Operator to make it more likely someone chooses the correct one. After making these ops: ``` USES NODE: prim::TupleUnpack(...) -> (...) USES NODE: prim::TupleSlice(...) -> (...) USES NODE: prim::TupleConstruct(...) -> (...) USES NODE: prim::ListUnpack(...) -> (...) USES NODE: prim::ListConstruct(...) -> (...) USES NODE: prim::DictConstruct(...) -> (...) USES NODE: prim::Constant() -> (...) USES NODE: prim::isinstance(...) -> (...) USES NODE: prim::CreateObject(...) -> (...) USES NODE: prim::fork(...) -> (...) USES NODE: aten::warn(str message, *, int stacklevel=2) -> () # need stack level information, so ideally in interpreter so it can look at the stack ``` Into interpreter primitives, we can remove all but two constructors for operators: one that is (schema_string, operation), and one that is (symbol, op_creator) for the remaining weird primitives. Test Plan: Imported from OSS Differential Revision: D19673158 Pulled By: zdevito fbshipit-source-id: 95442a001538a6f53c1db4a210f8557ef118de66	2020-02-18 15:04:48 -08:00
Will Feng	a203dc2e6d	[C++ API] Allow skipping default arguments in module's forward method when module is used in Sequential (#33027 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33027 This PR allows default arguments in module's forward method to be skipped when module is used in `torch::nn::Sequential`, by introducing the `FORWARD_HAS_DEFAULT_ARGS` macro and requiring that all modules that have default arguments in its forward method must have a corresponding `FORWARD_HAS_DEFAULT_ARGS` macro call. Fixes issue mentioned in https://github.com/pytorch/pytorch/issues/30931#issuecomment-564144468. Test Plan: Imported from OSS Differential Revision: D19777815 Pulled By: yf225 fbshipit-source-id: 73282fcf63377530063e0092a9d84b6c139d2e32	2020-02-17 20:38:02 -08:00
Will Feng	4724964810	[C++ API] Expose AnyValue and AnyModuleHolder classes (#33026 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33026 This PR contains necessary changes to prepare for https://github.com/pytorch/pytorch/pull/33027. It exposes the following classes to public: 1. `torch::nn::AnyValue`, because if the user has optional arguments in their module's forward method, they must also use the `FORWARD_HAS_DEFAULT_ARGS` macro and pass in the default values for those optional arguments wrapped by `torch::nn::AnyValue`. 2. `torch::nn::AnyModuleHolder`, because `torch::nn::Module` needs to declare it as a friend class for it to be able to access `torch::nn::Module`'s protected methods such as `_forward_has_default_args` / `_forward_num_required_args` / `_forward_populate_default_args`. Test Plan: Imported from OSS Differential Revision: D19777814 Pulled By: yf225 fbshipit-source-id: 1c9d5aa24f0689154752c426a83ee98f64c9d02f	2020-02-17 20:35:22 -08:00
Will Feng	5d7f42847c	Add at::Tensor::retain_grad API (#33349 ) Summary: This PR adds `at::Tensor::retain_grad`, and its implementation mirrors the Python `torch.Tensor.retain_grad` API: `c6271c63f2/torch/tensor.py (L292-L315)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/33349 Differential Revision: D19944524 Pulled By: yf225 fbshipit-source-id: e61d5d761996b6d1b860c04c4b4650c1a49a6a8c	2020-02-17 20:03:48 -08:00
Mikhail Zolotukhin	806e7daa1f	Rename TorchScript compiler to IR emitter to better reflect its function. (#33127 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33127 Test Plan: Imported from OSS Differential Revision: D19806503 Pulled By: ZolotukhinM fbshipit-source-id: ab78bdbbac5f12dbcc6c2e2573f5862a16ffcf3d	2020-02-12 18:45:13 -08:00
anjali411	91744907d4	SGD: updated step and class design (#32592 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32592 Differential Revision: D19868154 Pulled By: anjali411 fbshipit-source-id: ce888efc68b1531d97e8b0abf2b146198e012d2f	2020-02-12 18:38:55 -08:00
Martin Yuan	0e753b2818	Fix SIGABORT caused by double exception in PyTorchStreamReader when file not found. (#33243 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33243 If a file does not exist in an archive, PyTorchStreamReader throws an exception. However, when PyTorchStreamReader is destructed another exception is thrown while processing the first exception. As a result of this double exception there is SIGABORT. Thanks dreiss for catching this bug and suggesting the fix. It happened when he used _load_for_mobile to load a torch script file without bytecode session. A unittest is added to test this case. Test Plan: Imported from OSS Differential Revision: D19859205 Pulled By: iseeyuan fbshipit-source-id: 8f96b6256f1a1f933fce1c256d64604c7e9269e4	2020-02-12 16:27:15 -08:00
Zachary DeVito	99349defc1	remove unnecessary Node* ops (#32760 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32760 Minor changes to the way ops are implemented to remove incidental use of Node* in the operator implementation. Current state for operators that previously took Node: ``` TBD: USES NODE: prim::DifferentiableGraph(...) -> (...) USES NODE: prim::profile(...) -> (...) USES NODE: prim::FusionGroup(...) -> (...) USES NODE: prim::PythonOp(...) -> (...) USES NODE: prim::ImplicitTensorToNum(Tensor a) -> Scalar # next PR Should be made interpreter primitives: USES NODE: prim::TupleUnpack(...) -> (...) USES NODE: prim::TupleSlice(...) -> (...) USES NODE: prim::TupleConstruct(...) -> (...) USES NODE: prim::ListUnpack(...) -> (...) USES NODE: prim::ListConstruct(...) -> (...) USES NODE: prim::DictConstruct(...) -> (...) USES NODE: prim::Constant() -> (...) USES NODE: prim::isinstance(...) -> (...) USES NODE: prim::CreateObject(...) -> (...) USES NODE: prim::fork(...) -> (...) USES NODE: aten::warn(str message, , int stacklevel=2) -> () # need stack level information, so ideally in interpreter so it can look at the stack Should be made into vararg operators, i.e. the operators last argument should be an IValue that contains the number of arguments. USES NODE: prim::FusedConcat(...) -> (...) USES NODE: prim::MMTreeReduce(...) -> (...) USES NODE: prim::MMBatchSide(...) -> (...) USES NODE: prim::ConstantChunk(...) -> (...) USES NODE: prim::AutogradAnyNonZero(...) -> bool USES NODE: prim::BroadcastSizes(...) -> (...) USES NODE: prim::ChunkSizes(...) -> (...) USES NODE: aten::format(str self, ...) -> str USES NODE: prim::Print(...) -> (...) fixed: USES NODE: aten::extend(Tensor[](a!) self, Tensor [] other) -> () USES NODE: aten::copy(Tensor[](a) self) -> Tensor[] USES NODE: aten::extend(int[](a!) self, int [] other) -> () USES NODE: aten::copy(int[](a) self) -> int[] USES NODE: aten::extend(float[](a!) self, float [] other) -> () USES NODE: aten::copy(float[](a) self) -> float[] USES NODE: aten::extend(bool[](a!) self, bool [] other) -> () USES NODE: aten::copy(bool[](a) self) -> bool[] USES NODE: aten::extend(t[](a!) self, t [] other) -> () USES NODE: aten::copy(t[](a) self) -> t[] USES NODE: aten::keys(Dict(str, t) self) -> str[]() USES NODE: aten::values(Dict(str, t) self) -> t[]() USES NODE: aten::dict((str, tVal)[] inputs) -> Dict(str, tVal) USES NODE: aten::keys(Dict(int, t) self) -> int[]() USES NODE: aten::values(Dict(int, t) self) -> t[]() USES NODE: aten::dict((int, tVal)[] inputs) -> Dict(int, tVal) USES NODE: aten::keys(Dict(float, t) self) -> float[]() USES NODE: aten::values(Dict(float, t) self) -> t[]() USES NODE: aten::dict((float, tVal)[] inputs) -> Dict(float, tVal) USES NODE: aten::keys(Dict(Tensor, t) self) -> Tensor[]() USES NODE: aten::values(Dict(Tensor, t) self) -> t[]() USES NODE: aten::dict((Tensor, tVal)[] inputs) -> Dict(Tensor, tVal) USES NODE: aten::test_vartype2(t a, t[] b) -> (t[]) USES NODE: aten::_ncf_unsqueeze(Tensor self, int ndim) -> Tensor USES NODE: aten::_ncf_view(Tensor self, int[] input_shape, int normalized_ndim) -> Tensor USES NODE: prim::is_none(int? a) -> bool USES NODE: aten::__interpolate(Tensor input, int? size = None, float[]? scale_factor = None, str mode = 'nearest', bool? align_corners = None, bool? recompute_scale_factor = None) -> Tensor USES NODE: aten::__interpolate(Tensor input, int[]? size = None, float[]? scale_factor = None, str mode = 'nearest', bool? align_corners = None, bool? recompute_scale_factor = None) -> Tensor USES NODE: aten::__interpolate(Tensor input, int? size = None, float? scale_factor = None, str mode = 'nearest', bool? align_corners = None, bool? recompute_scale_factor = None) -> Tensor USES NODE: aten::__interpolate(Tensor input, int[]? size = None, float? scale_factor = None, str mode = 'nearest', bool? align_corners = None, bool? recompute_scale_factor = None) -> Tensor USES NODE: aten::sorted(t[](a) self) -> (t[]) USES NODE: aten::sort(t[](a!) self, bool reverse=False) -> () USES NODE: aten::test_vartype(t[] a, t b) -> (t) USES NODE: prim::unchecked_unwrap_optional(t(a)? optional) -> t(a) USES NODE: prim::unchecked_cast(...) -> (...) USES NODE: aten::dict() -> Dict(str, Tensor) USES NODE: prim::Load(...) -> (...) USES NODE: prim::Store(...) -> (...) USES NODE: prim::Drop(...) -> (...) USES NODE: aten::tensor(t[] data, , ScalarType? dtype=None, Device? device=None, bool requires_grad=False) -> Tensor USES NODE: aten::as_tensor(t[] data, *, ScalarType? dtype=None, Device? device=None) -> Tensor ``` Test Plan: Imported from OSS Differential Revision: D19615387 Pulled By: zdevito fbshipit-source-id: 95298c3c4249b9f812c332d13f0fb79daeecb662	2020-02-12 14:49:02 -08:00
albanD	3655975565	Add allow_rebase_history flag and fix codegen functions for multiple views (#32790 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32790 Same as https://github.com/pytorch/pytorch/pull/31990 but without the first commit in the stack that is problematic for a lot of people. Test Plan: Imported from OSS Differential Revision: D19814116 Pulled By: albanD fbshipit-source-id: d104911a5b098a5807b4bc08b69803ebd4f69fa6	2020-02-11 07:16:02 -08:00
albanD	e1c53a5c86	Fix version counter bump in cpp Function (#33068 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33068 The version counter is already tracked if we use pytorch's functions but not if the user unpack the Tensor and modifies it by hand or with a third party library. Test Plan: Imported from OSS Differential Revision: D19791564 Pulled By: albanD fbshipit-source-id: a73c0f73d8fd0c0e5bf838f14bed54fa66937840	2020-02-10 07:22:29 -08:00
Pritam Damania	05d18ffaf5	Distributed Autograd: Allow multiple backward passes to accumulate gradients. (#32506 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32506 In this PR, we've introduced a `retain_graph` parameter to distributed autograd similar to `torch.autograd.backward`. In terms of design, this parameter is sent over RPC to all nodes and is used to create the GraphTask on the local nodes. This enables us to run `dist_autograd.backward()` multiple times in the same context. The use case currently for this is to benchmark only the backward pass for distributed autograd. We'd like to measure the QPS for the backward pass and as a result, running a single forward pass and multiple backward passes in a loop is one way to benchmark backward pass performance. ghstack-source-id: 97868900 Test Plan: waitforbuildbot Differential Revision: D19521288 fbshipit-source-id: 7ad8521059fd400d7b5a6ab77ce56e1927ced90a	2020-02-06 23:27:21 -08:00
James Reed	38820a7014	[JIT] Resolve custom classes in source importer (#32977 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32977 ghstack-source-id: 97736042 Test Plan: Imported from OSS Differential Revision: D19724588 fbshipit-source-id: b31b6ae14d2881d3604922e611fe4749108e674d	2020-02-06 10:45:40 -08:00
James Reed	23a4800708	[JIT] Make IRParser use op schema (#32854 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32854 ghstack-source-id: 97736043 Test Plan: Imported from OSS Differential Revision: D19656881 fbshipit-source-id: 509d09fdbd765ca5cd153bec6440aedfb4e6d23b	2020-02-04 19:29:50 -08:00
James Reed	bc4790b3aa	[JIT] Trace uses of torchbind classes as module attributes (#32833 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32833 ghstack-source-id: 97736046 Test Plan: Imported from OSS Differential Revision: D19645714 fbshipit-source-id: 10a7271f13c3588aea666b44b916e90ba7b3c666	2020-02-04 19:28:37 -08:00
Pavel Belevich	d141465713	Fix torch::allclose to handle std::numeric_limits<T>::lowest() for integral types (#32978 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32978 Fixes #32946 Test Plan: Imported from OSS Differential Revision: D19726013 Pulled By: pbelevich fbshipit-source-id: ada4aeabc8e39016d24f1a40f02fb7c56f069cd3	2020-02-04 19:06:52 -08:00
Kimish Patel	820410b505	Added upsample_neartest2d op for lite interpreter. (#32913 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32913 This enables mobile detection and tracking models. Test Plan: buck test caffe2/test/cpp/jit:jit -- JitTest.LiteInterpreterUpsampleNearest2d Reviewed By: iseeyuan Differential Revision: D19664502 fbshipit-source-id: 1c7270dcf394aba7b510c5aa80552c58a5038f24	2020-02-04 07:59:03 -08:00
Will Feng	b564eaf7a8	Bug fixes: torch::tensor(floating-point values) -> default dtype, and torch::tensor(integer values) ->at::kLong (#32367 ) Summary: Some of the `torch::tensor` behavior is updated to better match Python API. Fixes https://github.com/pytorch/pytorch/issues/32234. This PR is BC-breaking in the following way: - `torch::tensor({1.0f, 2.0f})`: float -> default dtype - `torch::tensor(at::ArrayRef<int>({1, 2, 3}))`: int -> at::kLong - `torch::tensor(std::vector<int>({1, 2, 3}))`: int -> at::kLong - `torch::tensor(at::ArrayRef<float>({1.f, 2.f, 3.f}))`: float -> default dtype - `torch::tensor(std::vector<float>({1.f, 2.f, 3.f}))`: float -> default dtype - `torch::tensor(at::ArrayRef<double>({1., 2., 3.}))`: double -> default dtype - `torch::tensor(std::vector<double>({1., 2., 3.}))`: double -> default dtype Pull Request resolved: https://github.com/pytorch/pytorch/pull/32367 Differential Revision: D19498484 Pulled By: yf225 fbshipit-source-id: 19c8dc2a56476266153cff4c404e7f84d309eb12	2020-02-01 15:00:07 -08:00
Elias Ellison	c729614997	[JIT] Improve May Contain Alias Using Contained Elements (#32326 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32326 Now that we have type-level granularity we can improve `mayContainAlias` queries. Each new values is initialized as containing the wildcard set of each contained mutable type. Whenever a value is added to a container it is set to the wildcard set. Now, to check if any two values contain overlapping values, we can just check if the `containedMemoryLocations` of two sets overlap. Test Plan: Imported from OSS Differential Revision: D19563262 Pulled By: eellison fbshipit-source-id: c6d7489749c14b2054a6d50ef75baca699ada471	2020-01-28 18:08:56 -08:00
Elias Ellison	25d33a2ee8	[JIT] Use Type Level Granularity in Alias Analysis Wildcards (#32251 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32251 Previously wildcard sets were associated by TypeKind, meaning all Lists were in one alias set, all Classes were in one alias set, etc. We can improve analysis by bucketing wildcard sets by TypePtr instead. Any two mutable types which can unify should be in the same wildcard set bucket. This also allows us do much simpler `mayContainAlias` analysis, and also improves `analyzeConservative` analysis because now we can recurse through all contained memory locations and mark writes, instead of just recursing only level deep in contained elements. Test Plan: Imported from OSS Differential Revision: D19563263 Pulled By: eellison fbshipit-source-id: 371a37d1a8596abc6c53f41c09840b6c140ea362	2020-01-28 18:07:48 -08:00
James Reed	465ebd58ba	[JIT] pickle serialization for custom bound classes Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32604 Test Plan: Imported from OSS Differential Revision: D19566633 fbshipit-source-id: 9387d3ff45cbd6ccde49ce190a52859481cc301c	2020-01-28 11:02:59 -08:00
James Reed	1719da13f9	[JIT] Support for registering C++ lambdas as methods on custom C++ class Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32553 Test Plan: Imported from OSS Differential Revision: D19543269 Pulled By: jamesr66a fbshipit-source-id: 7e566650295e9d1c4f2f716470e061308a6210a0	2020-01-28 11:01:07 -08:00
Alban Desmaison	db8ce7ea2d	Back out "Make autogen functions correct for multiple outputs and views" (#32681 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32681 Original commit changeset: a2b41c2d231e Test Plan: fb and oss tests Reviewed By: hudeven Differential Revision: D19591864 fbshipit-source-id: 7068b5563e37bc9a5d415fd535c73fd9d71fe131	2020-01-27 19:54:34 -08:00

1 2 3 4 5 ...

725 Commits