Summary:
This fixes the second issue reported in https://github.com/pytorch/pytorch/issues/29909: namely, a loop counter being assigned the wrong values after transitioning to a bailout graph.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30186
Differential Revision: D18646845
Pulled By: Krovatkin
fbshipit-source-id: 1f7c601dd9f35892979385ffa132fb0886a4f203
Summary:
This PR removes `namespace F = torch::nn::functional` from `torch/nn/modules/batchnorm.h`, so that users aren't forced to have `F` defined as an alias for `torch::nn::functional` if they don't want it.
Fixes https://github.com/pytorch/pytorch/issues/30682.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30684
Differential Revision: D18795717
Pulled By: yf225
fbshipit-source-id: c9feffbeb632cc6b4ce3e6c22c0a78533bab69ad
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30659
I could only find one usage of TupleParser, and it doesn't seem worth maintaining just for that one call site.
Test Plan: Imported from OSS
Differential Revision: D18795979
Pulled By: nairbv
fbshipit-source-id: 6e50d65fc8fade0944f36ab20d00f1539a3d4cb8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30498
Updated Int8SliceOp to accept dim, start, and end indices, similar to PyTorch.
Test Plan:
python test/onnx/test_pytorch_onnx_caffe2_quantized.py TestQuantizedOps.test_slice
Imported from OSS
Differential Revision: D18740519
fbshipit-source-id: 2313f37a4936edb150ce04911b241e591e191801
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30345
Skip ProcessGroupGlooAsyncTest if CUDA is not available; otherwise, on non-GPU Sandcastle hosts the test aborts when it fails to load the CUDA library.
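The test itself is C++, but for comparison, a minimal sketch of the analogous guard in a Python test (names illustrative):
```python
import unittest
import torch

@unittest.skipIf(not torch.cuda.is_available(), "CUDA not available")
class ProcessGroupGlooAsyncTest(unittest.TestCase):
    def test_async_allreduce(self):
        # Only runs on hosts where the CUDA library can actually be loaded.
        self.assertTrue(torch.cuda.is_available())
```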
ghstack-source-id: 94771241
Test Plan: test skipped on non GPU host
Differential Revision: D18665322
fbshipit-source-id: 8c7b89aeecc6ec007bee12d864a6058384254e61
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30636
Currently DeQuantStub is still in the whitelist because Python's set union (`|`) has lower precedence than set difference (`-`).
Fixes https://github.com/pytorch/pytorch/issues/29646.
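A minimal illustration of the precedence pitfall (set names here are illustrative, not the actual code):
```python
skip = {"DeQuantStub"}
base = {"Linear", "Conv2d", "DeQuantStub"}
extra = {"ReLU"}

# Intended: (base | extra) - skip
# Actual:   base | (extra - skip), because `-` binds tighter than `|`
wrong = base | extra - skip
right = (base | extra) - skip

assert "DeQuantStub" in wrong      # still whitelisted by mistake
assert "DeQuantStub" not in right  # parentheses fix it
```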
Test Plan:
verified locally that we don't attach qconfig for DeQuantStub
Imported from OSS
Differential Revision: D18775275
fbshipit-source-id: 8da07e40963555671b3d4326c9291706103f858e
Summary:
Convolution nodes are traced as aten::_convolution, which is already supported by the ONNX exporter.
Scripting convolution instead produces aten::conv<1,2,3>d nodes, which are currently not supported.
This PR adds the symbolics for aten::conv<1,2,3>d and aten::conv_transpose<1,2,3>d.
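A sketch of the newly exportable scripting path (script-mode ONNX export was experimental at the time; the example_outputs argument and exact flags are assumptions based on that era's API):
```python
import torch
import torch.nn as nn

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3)

    def forward(self, x):
        return self.conv(x)

# Scripting (unlike tracing) emits aten::conv2d rather than aten::_convolution.
scripted = torch.jit.script(Net())
x = torch.randn(1, 3, 32, 32)
torch.onnx.export(scripted, x, "conv.onnx", example_outputs=scripted(x))
```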
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30618
Reviewed By: hl475
Differential Revision: D18778145
Pulled By: houseroad
fbshipit-source-id: 4af0379f29974a1ce8443024d1d87b3eb8d2dd36
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30546
factor out this function for later support of quantizing shared types
Test Plan:
test_jit.py, test_quantization.py
Imported from OSS
Differential Revision: D18776304
fbshipit-source-id: f5a736b0f69019cefe17ec4517da1ae5462f78e1
Summary:
This test only seems to verify that we throw exceptions in the `WorkerInfo` constructor when invalid names are passed in, so I don't think we need to complicate it by initializing RPC and exposing ourselves to potential flakiness.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30620
Differential Revision: D18766955
Pulled By: rohan-varma
fbshipit-source-id: 11643de4d57431e5f46e096c7766de3ab0b9b05a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30527
When we introduced `dtype.is_signed`, we allowed it for
quantized types, but we're not sure what the correct result should be.
See discussion at https://github.com/pytorch/pytorch/pull/29511
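For reference, the non-quantized behavior is unambiguous; only the quantized case is in question. A quick sanity check:
```python
import torch

assert torch.int8.is_signed
assert not torch.uint8.is_signed
assert torch.float32.is_signed

# For quantized dtypes such as torch.qint8 / torch.quint8 the right answer
# is debatable; see the discussion linked above.
```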
Test Plan: Imported from OSS
Differential Revision: D18765410
Pulled By: nairbv
fbshipit-source-id: c87cfe999b604cfcbbafa561e04d0d5cdbf41e6d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30603
The Pickler object needs to be kept in scope until the data is written out to the
final serialized string. tensorData in particular is a reference to memory
owned by the descoped Pickler object.
Noticed this by inspection. In practice, this potential read-after-free
is limited to non-CPU tensors, and any such use happened very soon after the free.
ghstack-source-id: 94756036
Test Plan: existing test suite at buck test mode/dev-nosan caffe2/test:rpc_fork
Differential Revision: D18760463
fbshipit-source-id: 9de890d66626aa48f13ca376dd9bd50b92e0cb00
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30354
TCPStoreTest would timeout since the TCPStore constructor for the
server would block the main thread waiting for workers. The workers themselves
were spawned later on once the server store is created. As a result, this test
would always timeout.
To fix the test, I moved the server store to a thread so that the workers can
register with the server in parallel.
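A sketch of the fix's shape in Python terms (the actual test is C++; the Python `TCPStore` signature used here, host/port/world size/is_server, is an assumption):
```python
import threading
from torch.distributed import TCPStore

HOST, PORT, NUM_WORKERS = "127.0.0.1", 29500, 2
stores = []

def start_server():
    # The server-side constructor blocks until all workers register, so it
    # must not run on the thread that spawns those workers.
    stores.append(TCPStore(HOST, PORT, NUM_WORKERS, True))

t = threading.Thread(target=start_server)
t.start()
workers = [TCPStore(HOST, PORT, NUM_WORKERS, False) for _ in range(NUM_WORKERS)]
t.join()
```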
In addition to this, I made a few improvements to tcputils::connect. When
tcputils::connect() encountered an exception, it always looked at `errno` for
the error code. In some cases `errno` could be overwritten and the real error
code would be stored in `std::system_error`. As a result, I've modified the
code to look at the error code in `std::system_error` if we catch an exception
of that type.
ghstack-source-id: 94758939
Test Plan: waitforbuildbot
Differential Revision: D18668454
fbshipit-source-id: d5a3c57b066b094bfecda9a79d9d31bfa32e17f0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30490
Add symbolic mapping to Int8AvgPool2d and Int8Reshape op in C2
Test Plan:
python test/onnx/test_pytorch_onnx_caffe2_quantized.py TestQuantizedOps
Imported from OSS
Differential Revision: D18740520
fbshipit-source-id: 1606125500c4b549fbc984e7929b7fd5204396a0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29785
TLDR: This change improves process_group's serialization speed:
Serialize_Tensor64: 12.38us -> 1.99us (~-84%)
Deserialize_Tensor64: 33.89us -> 5.62us (~-84%)
Serialize_Tensor1M: 525.74us -> 285.43us (~-45%)
Deserialize_Tensor1M: 892.61us -> 273.68us (~-70%)
After speaking with the jit team, we had consensus that torch::save()/load()
are somewhat high-overhead for RPC serialization, mostly intended for
persistent disk data.
(Particularly, for large tensors, 35% of the time is spent in CRC checking, even
with the fb-side changes to substitute 40x faster SSE-accelerated CRC checking;
also, for small tensors, the zip container overhead is considerable, as is the
overhead of lexing/parsing an embedded text Python program for each RPC.)
The jit team encouraged us to use jit::pickler, with the WriteableTensorData
way of outputting result tensors (not the default side-tensor table, or
with pickling the actual tensors). This ends up just pickling some tensor
metadata, and giving us some tensor blobs that we can mindlessly
blit over the wire (they copy to cpu memory if needed).
There is as yet no standardized container format for the pickled data
(there is jit::pickle_save() checked in, but it's experimental and
no load function is yet provided), but they encouraged us to just use
something sensible for this, and possibly revisit later. For now, I made
the directory headers slightly HTTP-inspired.
Note that serialization is just one component of the pipeline, but that
said, we also see reasonable reductions in end-to-end echo times (noisier):
ProcessGroupAgent_Echo(Tensor_Small) 855.25us -> 492.65us (~-42%)
ProcessGroupAgent_Echo(Tensor_1M) 10.82ms -> 6.94ms (~-35%)
ProcessGroupAgent_Echo(Small_NoTensor) 688.82us -> 301.72us (~-56%)
ProcessGroupAgent_Echo(1MB_NoTensor) 4.65ms -> 3.71ms (~-20%)
I moved the "wire serialization" logic to a separate file to assist with
unittesting.
ghstack-source-id: 94694682
Test Plan:
buck test mode/dev-nosan caffe2/test/cpp/api:serialize
buck test mode/dev-nosan caffe2/test/...
Differential Revision: D18493938
fbshipit-source-id: 07ddfe87dbe56472bc944f7d070627052c94a8f4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30330
This is now possible due to previous changes made in `gloo` and `ProcessGroupGloo`. We `abort` the listener thread that is waiting for a message, and join all other threads. The API is changed so that the previous `wait_all_workers` does not destroy the agent, and this is now done in a new `shutdown` method. All callsites are updated appropriately.
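A sketch of the resulting API split, assuming the rpc surface of the time (init_rpc/rpc_sync):
```python
import torch
import torch.distributed.rpc as rpc

rpc.init_rpc("worker0", rank=0, world_size=2)
ret = rpc.rpc_sync("worker1", torch.add, args=(torch.ones(2), 1))

# wait_all_workers() now only synchronizes; tearing down the agent (and its
# listener/background threads) is the job of the new shutdown() call.
rpc.shutdown()
```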
ghstack-source-id: 94673884
Test Plan: Unit tests pass.
Reviewed By: mrshenli
Differential Revision: D18661775
fbshipit-source-id: 5aaa7c14603e18253394224994f6cd43234301c2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30361
### Summary
By default, the compiler will choose `clock_gettime` for the iOS build. However, that API is not available until iOS 10. Since the Facebook app still supports iOS 9.0, we have to use `gettimeofday` instead.
```shell
xplat/caffe2/torch/csrc/autograd/profiler.h:86:3: error: 'clock_gettime' is only available on iOS 10.0 or newer [-Werror,-Wunguarded-availability]
xplat/caffe2/torch/csrc/autograd/profiler.h:86:17: error: '_CLOCK_MONOTONIC' is only available on iOS 10.0 or newer [-Werror,-Wunguarded-availability]
```
P.S. The open-source version targets iOS 12.0 and above, so we don't have this problem.
### Test Plan
- buck build works
- Don't break CIs
Test Plan: Imported from OSS
Differential Revision: D18730262
Pulled By: xta0
fbshipit-source-id: fe6d954b8d3c23cbc9d1e25a2e72e0b0c1d4eaa9
Summary:
PyTorch's dim and ONNX's axis have different meanings.
ONNX only supports log_softmax with dim = -1; a Transpose must be added before and after log_softmax to support other cases.
This requires the input rank to be known at export time.
Fixes https://github.com/pytorch/pytorch/issues/17918
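A sketch of a case this enables (a hypothetical module with dim != -1 and a statically known input rank):
```python
import torch
import torch.nn.functional as F

class M(torch.nn.Module):
    def forward(self, x):
        # dim != -1 forces the exporter to wrap ONNX LogSoftmax in Transposes.
        return F.log_softmax(x, dim=1)

x = torch.randn(2, 3, 4)  # rank must be known at export time
torch.onnx.export(M(), x, "log_softmax.onnx")
```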
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30433
Reviewed By: hl475
Differential Revision: D18723520
Pulled By: houseroad
fbshipit-source-id: d0ed3b3f051d08d46495a7abfa854edd120dca3a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25768
The round robin process group can be constructed from multiple other
process groups. Every collective call against this new process group
is delegated to the specified process groups in a round robin fashion.
Doing so may benefit performance when calling into multiple NCCL
process groups. Instead of adding round-robin usage of NCCL
communicators to the NCCL process group itself, we achieve the same by
adding this wrapper class, leaving the NCCL process group unchanged.
The API to create this round robin process group is a bit harsh. If we
find it adds significant benefit we can revisit and make this a first
class citizen in the torch.distributed module.
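If it follows the shape of the test, usage might look like the sketch below; the helper name `_round_robin_process_groups` and its signature are assumptions, and a proper multi-process launch with MASTER_ADDR/MASTER_PORT set is presumed:
```python
import torch
import torch.distributed as dist

dist.init_process_group("nccl", init_method="env://")

# Several NCCL groups over the same ranks...
groups = [dist.new_group(backend="nccl") for _ in range(3)]

# ...wrapped so each collective is delegated to the next group in turn.
rr = dist.distributed_c10d._round_robin_process_groups(groups)
work = rr.allreduce([torch.ones(4).cuda()])
work.wait()
```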
ghstack-source-id: 94578376
Test Plan: The newly added test passes.
Reviewed By: chenyangyu1988
Differential Revision: D17226323
fbshipit-source-id: ec9f754b66f33b983fee30bfb86a1c4c5d74767d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30415
This enables subclassing of c10d.Store and implementing its interface in Python.
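A minimal sketch of what this enables, assuming the base class is exposed as torch.distributed.Store (only two interface methods shown):
```python
import torch.distributed as dist

class DictStore(dist.Store):
    """Toy in-memory implementation of the c10d Store interface."""

    def __init__(self):
        super().__init__()
        self._data = {}

    def set(self, key, value):
        self._data[key] = value

    def get(self, key):
        return self._data[key]

store = DictStore()
store.set("rank0/addr", b"127.0.0.1")
assert store.get("rank0/addr") == b"127.0.0.1"
```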
ghstack-source-id: 94586627
Test Plan: New tests passes.
Reviewed By: vladbelous
Differential Revision: D18693018
fbshipit-source-id: fa1eba4bd11cc09a3d6bf3f35369c885033c63c0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30120
The example given for functional conv2d didn't work. This diff fixes the example in the docs so that it works.
Fixes https://github.com/pytorch/pytorch/issues/29649
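For reference, a working example in the spirit of the fixed docs:
```python
import torch
import torch.nn.functional as F

filters = torch.randn(8, 4, 3, 3)  # (out_channels, in_channels/groups, kH, kW)
inputs = torch.randn(1, 4, 5, 5)   # (batch, in_channels, H, W)
out = F.conv2d(inputs, filters, padding=1)
print(out.shape)  # torch.Size([1, 8, 5, 5])
```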
ghstack-source-id: 94601559
Test Plan: Tried the example locally
Differential Revision: D18604606
fbshipit-source-id: ff1a4f903e2843efe30d962d4ff00e5065cd1d7e
Summary:
In ONNX opset 11, a series of sequence ops were added. Operators that are related to Tensor[] in PyTorch can be exported using these sequence ops.
In this PR, unbind/split (which produce Tensor[]) and __getitem__ (which takes Tensor[] as input) are exported correctly to ONNX opset 11.
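A sketch of one pattern this makes exportable:
```python
import torch

class M(torch.nn.Module):
    def forward(self, x):
        a, b = x.unbind(0)  # produces Tensor[], lowered to ONNX sequence ops
        return a + b

torch.onnx.export(M(), torch.randn(2, 3), "unbind.onnx", opset_version=11)
```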
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29136
Reviewed By: hl475
Differential Revision: D18309222
Pulled By: houseroad
fbshipit-source-id: be12c96bf8d0a56900683ef579f1c808c0a1af21
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30202
PyTorch's Upsample operator takes output_size as an argument.
For quantized tensor inputs we cannot get the input size to compute the width and height scale factors.
Instead we pass output_size directly to Caffe2, which computes the scale factors.
Test Plan:
python test/onnx/test_pytorch_onnx_caffe2_quantized.py TestQuantizedOps.test_upsample
Imported from OSS
Differential Revision: D18631478
fbshipit-source-id: 38a39129bc863f4ecf2293acc068e40ab7edc825
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30217
Before this commit, RRefContext throws an error if it detects any
RRef leak during shutdown. However, this requires applications to
make sure they have freed all references to RRefs in application
code, which can be a bad debugging experience for large
applications. Besides, this also relies on Python GC to free things
up in time, which might not always be true. After this commit,
RRefContext ignores leaking RRefs during shutdown, as shutdown
is called when the application has finished training and no longer
cares about local states. Hence, it should be OK to just ignore
those leaks and destroy OwnerRRefs. If an application would like to
enforce no leaks, it can set torch.distributed.rpc.api._ignore_rref_leak
to False.
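Opting back into strict leak checking is then a one-liner:
```python
import torch.distributed.rpc as rpc

# Re-enable the pre-change behavior: error out on leaked RRefs at shutdown.
rpc.api._ignore_rref_leak = False
```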
Test Plan: Imported from OSS
Differential Revision: D18632546
Pulled By: mrshenli
fbshipit-source-id: 2744b2401dafdd16de0e0a76cf8e07777bed0f38
Summary:
The PyTorch exporter does not add any names to the ONNX operators in the exported graph. A common request is to add names to op nodes by default. This helps the readability of the graph in visualization tools such as Netron, or when the ONNX graph is printed as a string. It also helps with the debuggability of the ONNX graph.
Therefore this PR adds names to operators in the exporter. The names follow a simple format, <op_type>_<index>. Expect files for tests in `test/onnx/test_operators.py` have been updated.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27342
Reviewed By: hl475
Differential Revision: D17790979
Pulled By: houseroad
fbshipit-source-id: 1eaae88b5f51f152735a2ff96e22827837e34d9d
Summary:
This should resolve https://github.com/pytorch/pytorch/issues/29008. This flag has two effects on the tracer.
- Remove the trailing underscore from in-place operators, e.g. index_put_ ==> index_put. This is handled separately in utils.py as well.
- Add out as an input for backward computation.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29466
Reviewed By: hl475
Differential Revision: D18422815
Pulled By: houseroad
fbshipit-source-id: 317b6a3c8a5751fe6fe49d7543e429d281ed0d6d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30357
Fix issue https://github.com/pytorch/pytorch/issues/29032: loading from a state dict for observers and fake quant modules.
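A round-trip sketch of the scenario being fixed (MinMaxObserver assumed as a representative observer; buffer names may differ by version):
```python
import torch
from torch.quantization import MinMaxObserver

obs = MinMaxObserver()
obs(torch.randn(16))          # record running min/max
state = obs.state_dict()

fresh = MinMaxObserver()
fresh.load_state_dict(state)  # min/max buffers should now load correctly
```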
ghstack-source-id: 94468814
Test Plan: Ensures that load/save of fake quant and observers with missing keys works correctly.
Differential Revision: D18668517
fbshipit-source-id: 0eda6f47c39102e55977fc548b9a03664f123ad7