Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31970
Now that a ClassType can be shared among different module instances, we preserve
the sharing in clone as well: if the original module has a ClassType that is shared,
we clone that ClassType once and share it between the cloned module instances as well.
Test Plan:
build/test/test_jit
Imported from OSS
Differential Revision: D19406251
fbshipit-source-id: 2881c695f6e718e5432040a3817cf187a62017bf
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32185
Previously we would unify the contained types of dictionaries; however, this breaks type safety:
```
import torch
from typing import Dict

@torch.jit.script
def test(input: Dict[str, None], cond: bool):
    if cond:
        out = input
    else:
        out = {"1": 1}
    out["hi"] = 3  # unsound: `out` may be the Dict[str, None] that was passed in
```
This would only occur if a dictionary is being re-assigned across an if condition with different contained types, which is pretty unlikely. I tested `model_backward_compatibility` for all fb models and this didn't break anything. This PR is a precursor to alias analysis changes.
Also fixes `Future` type unification. Because `Future` is an immutable type, it is okay to unify the contained type.
Test Plan: Imported from OSS
Differential Revision: D19398585
Pulled By: eellison
fbshipit-source-id: ebc8812cdf5b6dba37b1cfbc2edc7d8c467b258c
Summary:
Currently, libtorch build and test are not running in macOS CI. This PR fixes the issue.
**Test Plan:**
Check that libtorch build and test are running again in macOS CI.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32072
Differential Revision: D19391909
Pulled By: yf225
fbshipit-source-id: 1ab345b099869f78e1124f1a8bd185fa51371b6a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30734
What are specialized lists?
The IValues that hold List[int], List[Tensor], and List[AnythingElse] are different C++ types.
e.g. List[int] has a std::vector<int> while List[AnythingElse] holds a std::vector<IValue>.
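For illustration, a rough conceptual sketch of the distinction (the struct names below are purely illustrative, not the real IValue/List layout):
```
#include <ATen/core/ivalue.h>

// Conceptual sketch only -- not the actual class layout.
// Before this change, the payload behind a list IValue depended on the element type:
struct IntListPayload     { std::vector<int64_t>     elements; };  // List[int]
struct TensorListPayload  { std::vector<at::Tensor>  elements; };  // List[Tensor]
struct GenericListPayload { std::vector<c10::IValue> elements; };  // List[AnythingElse]
```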
Why do we have specialized lists?
When we first created the JIT we needed to bind the ATen C++ API, which takes std::vector<int>
and std::vector<Tensor> as inputs. The easiest way to match this API was to make our IValues contain
these same types. Conversion was just unwrapping the IValue: very easy and cheap.
What is the problem with specialized lists?
We end up with significant special cases throughout the compiler. Other types like Dict are not
specialized, so in the Pickler, for instance, there is a single piece of logic to handle
their serialization. For Lists, we end up with multiple cases. Furthermore, it doesn't
match Python, leading to problems along translation boundaries. Our pickle serialization
is slightly different from Python's, so it is harder to load objects from our IValue serialization
as Python values.
They also make it harder to provide an easy-to-use user API. We'd like to match pybind11 for C++
bindings to TorchScript. This would entail having a single torch::List class (untemplated)
that can be used to construct inputs. This is made much harder if the underlying ivalue needs
to be different depending on the type inside the list. The ideal case would be to have a constructor like
```
template<typename T>
List(std::vector<T> foo);
```
It would then set up the type tags correctly based on type T, without the need for passing tags.
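For example, hypothetical usage of the constructor sketched above (this untemplated torch::List API does not exist yet; it is only being proposed here):
```
// Hypothetical: the type tag would be inferred from T, no explicit tag needed.
torch::List ints(std::vector<int64_t>{1, 2, 3});       // tag inferred: int
torch::List tensors(std::vector<at::Tensor>{t1, t2});  // tag inferred: Tensor (t1, t2 are some at::Tensor values)
```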
Do specialized lists improve perf?
Not in a way we have been able to measure. Our major concern initially was having to translate
a std::vector<IValue> to std::vector<int> to call ATen functions. This was especially a concern
for aten::_convolution which takes a number of mostly-constant lists of integers. However,
when we measure the effect of actually having to do this conversion for an aten::_convolution,
it does not take measurable time (benchmark results below).
This is true even if you use a trivial convolution (e.g. 1x1x1), and comment out the actual convolution code.
What are the issues removing them?
This PR removes list specialization but keeps the serialization format and IValue APIs almost exactly
the same. The only visible change is that toTensorListRef and family have turned into toTensorVector,
because they now return a copy of the list as a vector, by value.
Further PRs can then clean up the complexity issues that arose from specialization. This will likely
involve removing the isTensorList/isIntList functions and refactoring the code that used them to
work generically. At some point we will also change serialization to no longer write specialized
lists in the pickle binary. That is forward incompatible, so it will go in its own PR.
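For reference, a sketch of that one visible API change, assuming an IValue `iv` that is known to hold a List[Tensor]:
```
// Before: returned a reference to the specialized backing vector.
//   const std::vector<at::Tensor>& tensors = iv.toTensorListRef();
// After: returns a copy of the list as a vector, by value.
std::vector<at::Tensor> tensors = iv.toTensorVector();
```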
Benchmark:
```
import torch
import torch.nn as nn
import torch.nn.functional as F
import time
class MnistNet(nn.Module):
    def __init__(self):
        super(MnistNet, self).__init__()
        self.conv1 = nn.Conv2d(1, 1, kernel_size=1)
        self.conv2 = nn.Conv2d(1, 1, kernel_size=1)

    def forward(self, x):
        for i in range(10):
            x = F.relu(self.conv1(x))
            x = F.relu(self.conv2(x))
        return x

model = MnistNet()
x = torch.rand(1, 1, 1, 1)
r = torch.jit.trace(model, x)
r(x)
r(x)
r(x)
r(x)
print(torch.jit.last_executed_optimized_graph())
while True:
    b = time.time()
    for i in range(100):
        r(x)
    e = time.time()
    print(e - b)
```
Results (no observable difference):
```
Before (actual conv)
0.13251137733459473
0.13260436058044434
0.13276338577270508
0.1327497959136963
0.13250041007995605
0.13270330429077148
0.13290190696716309
0.13265132904052734
0.13274288177490234
0.1326758861541748
0.13253355026245117
0.13254785537719727
0.13260746002197266
0.13285017013549805
0.13264012336730957
0.132490873336792
0.13280034065246582
0.13243484497070312
0.1325232982635498
0.1326127052307129
0.13264131546020508
0.13274383544921875
0.13298296928405762
0.1326909065246582
-------------------
After (actual conv)
0.13127517700195312
0.13150334358215332
0.13092470169067383
0.13102364540100098
0.13134360313415527
0.13155555725097656
0.13314104080200195
0.13151955604553223
0.13160037994384766
0.1315293312072754
0.13137340545654297
0.13148093223571777
0.131455659866333
0.1327371597290039
0.13134026527404785
0.13152337074279785
0.13151192665100098
0.13165974617004395
0.13403725624084473
0.13251852989196777
0.13135504722595215
0.1315624713897705
0.1317615509033203
0.1314380168914795
0.13157200813293457
--------------------
The following replace the convolution operator with a no-op, to show
that even if the conv op was made faster, then we still would not see
a difference:
Before (fake conv)
0.0069539546966552734
0.0069522857666015625
0.007120847702026367
0.007344722747802734
0.007689952850341797
0.007932662963867188
0.00761723518371582
0.007501363754272461
0.007532835006713867
0.007141828536987305
0.007174253463745117
0.007114410400390625
0.007071495056152344
------------------
After (fake conv)
0.007458209991455078
0.007337093353271484
0.007268190383911133
0.007313251495361328
0.007306575775146484
0.007468700408935547
0.0073091983795166016
0.007308483123779297
0.007538318634033203
0.007356882095336914
0.007464170455932617
0.007372140884399414
```
Test Plan: Imported from OSS
Differential Revision: D18814702
Pulled By: zdevito
fbshipit-source-id: 0371c73b63068fdc12f24b801371ea90f23531a6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30424
`at::indexing::TensorIndex` is used for converting C++ tensor indices such as `{None, "...", Ellipsis, 0, true, {1, None, 2}, torch::tensor({1, 2})}` into its equivalent `std::vector<TensorIndex>`, so that further tensor indexing operations can be performed using the supplied indices.
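A rough usage sketch (assuming the `at::indexing` helpers referenced by this PR; see the PR tests for the exact spellings):
```
#include <torch/torch.h>

using namespace at::indexing;

// Each braced element converts implicitly into a TensorIndex.
std::vector<TensorIndex> indices = {
    None, "...", Ellipsis, 0, true,
    Slice(1, None, 2),           // the {1, None, 2} slice from the example above
    torch::tensor({1, 2})};
```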
Test Plan: Imported from OSS
Differential Revision: D18695902
Pulled By: yf225
fbshipit-source-id: d73e14a411cdbec815866b02e75ffd71a9186e89
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31800
If we know that two constants are the same object, we can ignore other constraints and pool them together. This fixes an issue introduced by the other PR where quantization relied on constant pooling happening for correctness.
Test Plan: Imported from OSS
Differential Revision: D19269499
Pulled By: eellison
fbshipit-source-id: 9d4396125aa6899cb081863d463d4f024135cbf4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31501
We have a number of places in our code base where we should be checking if it's safe to change the alias relationship between two sets of values. This PR adds an api to Alias Db to consolidate the logic, and refactors Constant Pooling and `CSE` to use the new api. Next steps: add api usage in peephole.cpp where applicable.
Happy to bikeshed `AliasDb::safeToChangeAliasingRelationship`. Previously I suggested `AliasDb::safeToIntroduceAliasing`, however that's not quite accurate, because this API also handles when it is unsafe to remove aliasing.
Alternate suggestions: `safeToChangeAliasing`, `validToChangeAliasing`, `validToChangeAliasingRelationship`
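A hypothetical call site, just to illustrate how a pass would consult the new api (argument usage assumed; see alias_analysis.h for the real signature):
```
// Hypothetical sketch: before de-duplicating node2 into node1, a pass such as
// CSE or Constant Pooling asks whether changing the aliasing relationship
// between the two sets of outputs is observable.
if (aliasDb.safeToChangeAliasingRelationship(node1->outputs(), node2->outputs())) {
  node2->output()->replaceAllUsesWith(node1->output());
  node2->destroy();
}
```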
Related: https://github.com/pytorch/pytorch/issues/28360
Test Plan: Imported from OSS
Differential Revision: D19254413
Pulled By: eellison
fbshipit-source-id: 17f7f52ad2d1526d303132767cbbb32f8189ae15
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31222
- When constructing torch::from_blob() in the case where the deleter is a nop, switch to using a nullptr context in the DataPtr (with a nop deleter)
- No real extra memory/cpu requirements here, actually saves a minor alloc.
Why? We're trying to get a signal that a Tensor might contain non-owned memory from
torch::from_blob(), by detecting the nullptr context.
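A small sketch of the signal this enables (a minimal example, assuming the DataPtr accessors below):
```
#include <torch/torch.h>

float buffer[16] = {0};
// No deleter passed => nop deleter => after this change, a nullptr context.
at::Tensor t = torch::from_blob(buffer, {16});
bool maybe_non_owned =
    t.storage().data_ptr().get_context() == nullptr;  // signal: possibly borrowed memory
```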
ghstack-source-id: 96336078
Test Plan:
buck test mode/dev caffe2/test/cpp/api/...
buck test mode/dev-nosan caffe2/test/...
Differential Revision: D18992119
fbshipit-source-id: 4eea642f82d0858b57fdfc6995364a760c10567d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30612
This is the first version that moves prim ops to c10 registration. Once the reviewers are fine with the initial changes, more operators will be moved in the same style.
Test Plan: Imported from OSS
Differential Revision: D19237648
Pulled By: iseeyuan
fbshipit-source-id: c5a519604efffb80564a556536f17d829f71d9f9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29219
We added class constants in previous PRs; this PR allows access to
class constants in the object API.
Test Plan:
build/bin/test_jit
python test/test_jit.py
Imported from OSS
Differential Revision: D18846851
fbshipit-source-id: 888a6517d5f747d1f8ced283c0c2c30b2f6c72c6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30787
This is needed when we fuse conv-bn modules,
where we need to rewrite a constant bias (None) of conv into an attribute
bias of type Tensor.
Test Plan:
build/bin/test_jit
Imported from OSS
Differential Revision: D18846850
fbshipit-source-id: 9fd5fe85d93d07226e180b75d2e068fe00ca25fe
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31012
- getConstant should throw when the item is not found
- add another getConstant overload which takes a slot index as argument
Test Plan:
test_class_type.cpp
Imported from OSS
Differential Revision: D18898418
fbshipit-source-id: d3a23a4896fdbf5fa98e1c55c9c4d6205840014b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31011
`getAttribute` is supposed to throw when the attribute is not
found rather than return a `nullptr`.
Test Plan:
.
Imported from OSS
Differential Revision: D18898417
fbshipit-source-id: 0fe7d824b978ad19bb5ef094d3aa560e9fc57f87
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31357
If a user selects a subset of a Tensor and sends it in an RPC, we would send
the whole original Tensor Storage over the network.
While this sounds reasonable, in practice we observed view-like Tensors being sent
over rpc where only 1% of the data in the provided Tensor's Storage was
actually used/needed.
The simple solution here is to just force a clone in the serializer code if we see that
less than (an arbitrary) half of the bytes are used and the tensor is larger than a nominal few KB.
Add related tests to ensure this doesn't break.
An alternate approach would be to modify the Pickler. That said, since Pickler is shared by more
components, the logic might be harder to tailor appropriately at that layer (particularly
given that the Pickler has explicit logic to share a single Storage* among several Tensors
that commonly point to the same Storage*).
It's possible that we might want to further refine the basic thresholds in this change.
In practice, we've seen a mostly bimodal distribution thus far for the percent of Tensor
Storage referred by a Tensor in observed rpcs (i.e. either 90%+ or sub-10% of the Storage
referenced), hence the existing 50% threshold here is probably not an unreasonable
starting point.
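An illustrative sketch of that heuristic (the helper name and the "few KB" constant below are hypothetical; the real check and thresholds live in the serializer code):
```
#include <ATen/ATen.h>

// Hypothetical helper: clone before serializing when the tensor refers to
// less than half of a reasonably large Storage.
bool shouldCloneBeforeSending(const at::Tensor& t) {
  const size_t used_bytes = t.numel() * t.element_size();
  const size_t storage_bytes = t.storage().nbytes();
  constexpr size_t kMinStorageBytes = 32 * 1024;  // "a nominal few KB" (assumed value)
  return storage_bytes > kMinStorageBytes && used_bytes * 2 < storage_bytes;
}
```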
ghstack-source-id: 95925474
Test Plan: buck test mode/dev caffe2/test/cpp/rpc/...
Differential Revision: D19137056
fbshipit-source-id: e2b3a4dd0cc6e1de820fd0740aa1d59883dbf8d4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30391
A Type parser to parse the python string of a Type. For example,
"Tuple[str, Optional[float], Dict[str, List[Tensor]], int]".
Please refer to test_type_parser.cpp for the usage.
One of the use cases is the lite interpreter, where types need to be serialized (directly calling python_str() on the Type) and deserialized (calling parseType(str)).
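A usage sketch based on the description above (the exact namespace and header may differ; test_type_parser.cpp is the authoritative reference):
```
// Round trip: the python_str() of a Type parses back into an equivalent TypePtr.
c10::TypePtr t =
    c10::parseType("Tuple[str, Optional[float], Dict[str, List[Tensor]], int]");
// t->python_str() round-trips back to the same annotation string.
```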
Test Plan: Imported from OSS
Differential Revision: D18924268
Pulled By: iseeyuan
fbshipit-source-id: 830d411563abfbeec023f01e7f8f4a1796f9a59a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31207
Cleanup after #30914.
In #30914, `autogradContext->addKnownWorkerId(dst);` was moved out of `addSendRpcBackward()`,
so `addSendRpcBackward()` no longer needs `dstId` as its argument.
ghstack-source-id: 95509218
Test Plan:
# Unit tests
```
buck test mode/dev-nosan //caffe2/test:dist_autograd_fork -- test_context_cleanup_tensor_no_grad
```
Differential Revision: D5742365
fbshipit-source-id: accd041a594ec18d369231f5590289828d87baa7
Summary:
Fix for https://github.com/pytorch/pytorch/issues/30015
We had a model that failed in shape propagation because we could not unify `Tensor` and `Optional[BoolTensor]`. Tensor not subtyping Optional[BoolTensor] was correct, but we should have unified those two types to `Optional[Tensor]`.
The fix here is that for immutable type containers (Optional, Tuple), we should first attempt to unify with complete shape information, and if that fails, then try to unify the types with the shape information stripped.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31076
Differential Revision: D18921802
Pulled By: eellison
fbshipit-source-id: aa6890277470c60b349ed1da4d81cc5d71d377f6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30637
The RequestCallback api currently forces work to always be synchronous, which,
as we scale, means we're going to need to throw a large number of (mostly
blocked) threads at the rpc problem. For some activities, like dependent
autograd rpcs, there's no inherent reason to block in these threads.
In this change, the RequestCallback api is updated to return a
shared_ptr<FutureMessage> rather than a Message:
```
std::shared_ptr<FutureMessage> operator()(Message& request) const;
```
With a futures-style api, RPC ops that wish to be async can then be async,
while short-lived blocking functions (or Python UDFs) can just block.
In this change, we keep all of the current ops as synchronous (i.e. we block
and then return a completed FutureMessage). We also update the rpc_agents in
a manner compatible with this sort of parallelism.
Here, we only want to incur overhead when we use the async behavior.
Some modest extra cost seems unavoidable here (e.g. the allocation for the
std::make_shared<>), but we can trivially detect the synchronous/completed
case in the rpc_agent and avoid the extra thread-switches/etc. in that case.
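A sketch of that synchronous/completed case (the concrete callback class and the blocking helper below are illustrative names, not the actual ones):
```
// Illustrative only: a synchronous op simply blocks as before, then hands back
// an already-completed FutureMessage, which agents can cheaply detect.
std::shared_ptr<FutureMessage> MyRequestCallback::operator()(Message& request) const {
  auto future = std::make_shared<FutureMessage>();
  future->markCompleted(processBlockingRpc(request));  // hypothetical blocking helper
  return future;
}
```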
ghstack-source-id: 95287026
Test Plan:
- Basic: buck test mode/dev-nosan caffe2/test/...
- Additional testcase in ThriftRpcAgentTest for deferred work.
Differential Revision: D18774322
fbshipit-source-id: cf49922a71707cfb1726de16f93af23b160385d8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30683
Assume that a node can work with autograd only if it is not a fusion
group and is in the prim or aten namespaces.
Test Plan: CI
Reviewed By: lly-zero-one
Differential Revision: D18795171
Pulled By: ilia-cher
fbshipit-source-id: 301090557e330b58be70e956784f7f0dc343c684
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30853
Right now we print a one-element tuple as `(val)`, which is interpreted
as `val` when parsing. This PR changes it
to `(val,)` so we can recognize the one-element tuple when parsing.
Test Plan:
.
Imported from OSS
Differential Revision: D18846849
fbshipit-source-id: 42959b9190c2567ef021a861497077c550324b7c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30914
When tensors don't require grad, we don't call `addSendRpcBackward`, where we record known workerIDs to clean up the dist autograd context later. But since https://github.com/pytorch/pytorch/pull/29781, we always include the autograd context ID in RPCs, even if tensors do not require grad. So, it could be possible that we don't release the contexts on some nodes.
This can contribute to OOMs since the contexts will not be cleaned up in this case, which can be verified by running the unit test without this patch. We can fix this issue by moving the `addKnownWorkerIds` call to the `getMessageWithAutograd` function.
ghstack-source-id: 95178561
Test Plan: Added a unit test: `test_context_cleanup_tensor_no_grad`
Differential Revision: D18869191
fbshipit-source-id: b80f66bfd0dd7d01960abe1691d3f44095bb1b2b
Summary:
Fixes https://github.com/pytorch/pytorch/issues/29161.
I looked a bit at the code changes related to this and think I have all of the use cases of `DeprecatedTypeProperties` covered in the message, but suggestions from someone with more context on this would be very much appreciated :)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30281
Differential Revision: D18830818
Pulled By: ezyang
fbshipit-source-id: 1a7fcee15354ae09e6644577e7fa33bd26acfe20
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29217
We want to preserve constant information in ClassType so that
users can access the constants in the module by name.
This is also used later for freezing some attributes (converting
attributes to constants).
Test Plan:
tbd
Imported from OSS
Differential Revision: D18799955
fbshipit-source-id: fbfbcd5d3f7f560368b96e2a87e270c822a3d03a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29785
TLDR: This change improves process_group's serialization speed:
Serialize_Tensor64: 12.38us -> 1.99us (~-84%)
Deserialize_Tensor64: 33.89us -> 5.62us (~-84%)
Serialize_Tensor1M: 525.74us -> 285.43us (~-45%)
Deserialize_Tensor1M: 892.61us -> 273.68us (~-70%)
After speaking with the jit team, we had consensus that torch::save()/load()
are somewhat high-overhead for RPC serialization, mostly intended for
persistent disk data.
(In particular, for large tensors, 35% of the time is spent in CRC checking, even
with the fb-side changes to substitute 40x faster SSE-accelerated CRC checking;
also, for small tensors, the zip container overhead is considerable, as is the
overhead of lexing/parsing an embedded text Python program for each RPC.)
The jit team encouraged us to use jit::pickler, with the WriteableTensorData
way of outputting result tensors (not the default side-tensor table, or
with pickling the actual tensors). This ends up just pickling some tensor
metadata, and giving us some tensor blobs that we can mindlessly
blit over the wire (they copy to cpu memory if needed).
There is as yet no standardized container format for the pickled data
(there is jit::pickle_save() checked in, but it's experimental and
no load function is provided yet), but they encouraged us to just use
something sensible for this, and possibly revisit later. For now, I made
the directory headers slightly http-inspired.
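A rough sketch of the pickling step described above (these are the public jit pickler entry points; the wire-format framing added in this diff is separate, and the header path reflects the current source layout):
```
#include <torch/torch.h>
#include <torch/csrc/jit/serialization/pickle.h>

// Pickle only the metadata; tensors go into a side table so their raw bytes
// can be blitted over the wire (copied to CPU memory if needed).
void sketchWireSerialize(const torch::IValue& value) {
  std::vector<at::Tensor> tensor_payloads;
  std::vector<char> metadata = torch::jit::pickle(value, &tensor_payloads);
  for (const at::Tensor& t : tensor_payloads) {
    auto wire = torch::jit::getWriteableTensorData(t);
    // send `metadata` plus wire.data() / wire.sizeInBytes() over the network
  }
}
```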
Note that serialization is just one component of the pipeline, but that
said, we also see reasonable reductions in end-to-end echo times (noisier):
ProcessGroupAgent_Echo(Tensor_Small) 855.25us -> 492.65us (~-42%)
ProcessGroupAgent_Echo(Tensor_1M) 10.82ms -> 6.94ms (~-35%)
ProcessGroupAgent_Echo(Small_NoTensor) 688.82us -> 301.72us (~-56%)
ProcessGroupAgent_Echo(1MB_NoTensor) 4.65ms -> 3.71ms (~-20%)
I moved the "wire serialization" logic to a separate file to assist with
unittesting.
ghstack-source-id: 94694682
Test Plan:
buck test mode/dev-nosan caffe2/test/cpp/api:serialize
buck test mode/dev-nosan caffe2/test/...
Differential Revision: D18493938
fbshipit-source-id: 07ddfe87dbe56472bc944f7d070627052c94a8f4
Summary:
This PR updates `torch::pickle_save` to use the new zipfile format introduced in #29232 and adds `torch::pickle_load` which can decode the zipfile format. Now that `torch.save/load` use this format as well (if the `_use_new_zipfile_serialization` flag is `True`), raw values saved in Python can be loaded in C++ and vice versa.
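A minimal sketch of the C++ side of that round trip:
```
#include <torch/torch.h>

// Save an IValue with the new zipfile format and load it back.
torch::IValue original = torch::ones({2, 2});
std::vector<char> bytes = torch::pickle_save(original);
torch::IValue loaded = torch::pickle_load(bytes);
at::Tensor round_tripped = loaded.toTensor();
// Per the description above, values saved this way also interoperate with
// Python's torch.save/torch.load when the zipfile format is enabled.
```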
Fixes #20356
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30108
Pulled By: driazati
Differential Revision: D18607087
fbshipit-source-id: 067cdd5b1cf9c30ddc7e2e5021a8cceee62d8a14
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30168
The previous implementation of `clone` in `script::Module` copies both the module instance and the
class type. After we enabled type sharing in https://github.com/pytorch/pytorch/pull/26666, we also
need a function that clones the instance only and shares the underlying class type.
Test Plan:
tbd
Imported from OSS
Differential Revision: D18631324
fbshipit-source-id: dbadcf19695faee0f755f45093b24618c047b9d1
Summary:
The original design of `torch::nn::utils::clip_grad_norm_` / `clip_grad_value_` takes input by non-const reference, which prevents users from passing an rvalue reference as input to the functions. This PR changes the functions to take input by value, which matches the Python version's semantics and also adheres to the C++ API convention that if a function modifies its input in-place, it should take that input by value.
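For example, after this change an rvalue can be passed directly (a minimal sketch):
```
#include <torch/torch.h>

void clipExample(torch::nn::Linear& model) {
  // model->parameters() returns a std::vector<torch::Tensor> by value (an
  // rvalue), which the by-value signatures now accept directly.
  torch::nn::utils::clip_grad_norm_(model->parameters(), /*max_norm=*/1.0);
  torch::nn::utils::clip_grad_value_(model->parameters(), /*clip_value=*/0.5);
}
```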
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30216
Differential Revision: D18632543
Pulled By: yf225
fbshipit-source-id: 97a09d6467f982fe9c8120f483a9c07fcf13699e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30146
This PR fixes naming for kl_div and binary_cross_entropy functional options, to be more consistent with the naming scheme of other functional options.
Test Plan: Imported from OSS
Differential Revision: D18618971
Pulled By: yf225
fbshipit-source-id: 2af62c1a0ace2cd0c36c2f1071639bf131d8fe61
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27921
InlinedCallStack serves a similar purpose to Scope, but instead of storing
the string names of the functions, it stores pointers to the Function objects
themselves. Currently, scopes are used in tracing and callstacks are
used in scripting; hopefully I will be able to merge them in the future.
gh-metadata: pytorch pytorch 27921 gh/ZolotukhinM/139/head
Differential Revision: D17914132
Test Plan: Imported from OSS
Pulled By: ZolotukhinM
fbshipit-source-id: b1daa6700199ee1a97a7f49a6fced9ac0dc13051
Summary:
Hi yf225,
I have a few doubts related to implementation:
1) What tests do I have to write?
2) What does `_load_state_from_dict` do?
3) Do I need to override the reset() function, as I cannot see its utility?
4) InstanceNormOptions could be replaced with BatchNormOptions, but I find that
`track_running_status` is not defined; instead `stateful` is defined.
InstanceNorm{1,2,3}d https://github.com/pytorch/pytorch/issues/25883
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28790
Differential Revision: D18588666
Pulled By: yf225
fbshipit-source-id: bb9b81f01f62c3fc8765fa0ba0716768087ee155