Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19083
As we have discussed, there are too many AdjustBatch ops; they incur reallocation overhead and hurt performance. We will eliminate these ops by
- inlining the input adjust batch op into Glow
- inlining the output adjust batch op into OnnxifiOp, and doing so only conditionally.
This is the C2 part of the change and requires a corresponding change on the Glow side to work e2e.
Reviewed By: rdzhabarov
Differential Revision: D14860582
fbshipit-source-id: ac2588b894bac25735babb62b1924acc559face6
Summary:
Almost there; feel free to review.
These c10 operators are exported to the _caffe2 domain.
TODO:
- [x] let the onnx checker pass
- [x] test tensor list as argument
- [x] test caffe2 backend and converter
- [x] check the c10 schema can be exported to onnx
- [x] refactor the test case to share some code
- [x] fix the problem in ONNX_ATEN_FALLBACK
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18210
Reviewed By: zrphercule
Differential Revision: D14600916
Pulled By: houseroad
fbshipit-source-id: 2592a75f21098fb6ceb38c5d00ee40e9e01cd144
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18716
Might be useful as an intermediate stage for some systems that currently use Caffe2 nets as an execution mechanism.
Not sure it's a good idea altogether; please comment.
Limitations:
- only Tensor types as inputs/outputs
- the entire module is serialized as a zip archive inside a proto in a Caffe2 db; it'd be subject to the 4GB limit and is likely very slow. For small models it'd work though.
- no autograd, though it can be attached in principle
- no way to retrieve parameters inside the script module from C2 runtime perspective (though they potentially can be alias-fetched and stored as individual blobs)
- after deserialization, the returned python wrappers don't have the correct type (as we don't do the module_lookup trick)
Build-wise, I had to add a dependency from pybind_state to libtorch.so. I don't think we build the Caffe2 python frontend independently anymore, so it should be fine.
Reviewed By: amirshim, houseroad
Differential Revision: D14339599
fbshipit-source-id: 88a37a8abd1f1c4703e5ef937031f222535d4080
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18560
We have to import python protobuf here **before** we load the cpp extension.
Otherwise it breaks under certain build conditions if the cpp implementation of
protobuf is used. Presumably there's some registry in the protobuf library and
the python side has to initialize the dictionary first, before static
initialization in the python extension does so. Otherwise, duplicated protobuf
descriptors are created, which can lead to obscure errors like:
Parameter to MergeFrom() must be instance of same class: expected caffe2.NetDef got caffe2.NetDef.
I think it also fixes https://github.com/facebookarchive/caffe2/issues/1573
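A minimal sketch of the import order this enforces (a sketch, assuming the standard caffe2 module layout):
```python
# Sketch: import the python protobuf bindings first so protobuf's
# descriptor registry is populated by the python side ...
from caffe2.proto import caffe2_pb2

# ... and only then load the pybind cpp extension, whose static
# initialization would otherwise register duplicate descriptors.
from caffe2.python import core
```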
Reviewed By: ezyang, iroot900
Differential Revision: D14622054
fbshipit-source-id: 2499eb88ecdee85ff8d845859048f7ae5da2a480
Summary:
The mkldnn-bridge is upgraded in this PR to support DNNLOWP operators.
Meanwhile, the APIs in caffe2 have been updated to use the latest version.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16308
Differential Revision: D14697018
Pulled By: yinghai
fbshipit-source-id: ca952589098accb08295fd5aa92924c61e74d69c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18740
Test utilities for writing Caffe2/PyTorch performance microbenchmarks. A brief description of the file structure:
* benchmark_core.py : core utilities for running microbenchmark tests
* benchmark_caffe2.py : Caffe2-specific benchmark utilities
* benchmark_pytorch.py: PyTorch-specific benchmark utilities
* benchmark_runner.py : main function. Currently it can run the microbenchmark tests in a stand-alone mode. The next step is to integrate this with AI-PEP.
The utilities are located at https://github.com/pytorch/pytorch/tree/master/test to have access to both the Caffe2 and PyTorch Python frontends.
Includes two operator microbenchmarks, supporting both Caffe2 and PyTorch:
* MatMul
* Add
Reference: PyTorch benchmarks: https://github.com/pytorch/benchmark/tree/master/timing/python. In this work, we start with the two example binary operators MatMul and Add, but eventually we should also cover unary operators like those in the PyTorch benchmark repo.
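As a rough illustration of the stand-alone mode (not the benchmark_core.py API itself, which may differ), a minimal microbenchmark for the two ops could look like:
```python
import time
import torch

def benchmark(op, *args, iters=100):
    # warm up to exclude one-time allocation and dispatch costs
    for _ in range(10):
        op(*args)
    start = time.perf_counter()
    for _ in range(iters):
        op(*args)
    return (time.perf_counter() - start) / iters

a, b = torch.randn(256, 256), torch.randn(256, 256)
print("MatMul: %.6f s/iter" % benchmark(torch.matmul, a, b))
print("Add:    %.6f s/iter" % benchmark(torch.add, a, b))
```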
Reviewed By: zheng-xq
Differential Revision: D13887111
fbshipit-source-id: b7a56b95448c9ec3e674b0de0ffb96af4439bfce
Summary:
For MKL-DNN, the filter data is reordered to the primitive format, which takes a lot of time.
So this patch provides a method to convert the filter format before training.
Also, "OptimizeForIdeep" is renamed to "OptimizeForMkldnn" in this patch.
This patch depends on https://github.com/pytorch/pytorch/pull/12866
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15171
Differential Revision: D14590741
Pulled By: yinghai
fbshipit-source-id: 07971c9977edac3c8eec08ca2c39cda639683492
Summary:
In the blob feeder for the ideep device, the wrong device option was given, leading to a crash.
This patch corrects the device option to fix the bug.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18552
Differential Revision: D14679838
Pulled By: yinghai
fbshipit-source-id: bde11e6a6fe44822166881dcb7c9bd0b34b4ecf3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18512
Ceil and Floor have been supported since version 6 of ONNX: export them using the native ONNX ops instead of an ATen op.
Similarly, support for the Where op was added in version 9, so we don't need to wrap that op in an ATen op either.
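A quick way to see the new behavior (an illustrative sketch; exact node names depend on the exporter version):
```python
import io
import torch

class M(torch.nn.Module):
    def forward(self, x):
        return torch.ceil(x) + torch.floor(x)

buf = io.BytesIO()
torch.onnx.export(M(), torch.randn(2, 3), buf)
# the exported graph now contains native Ceil/Floor nodes
# instead of ATen fallback ops
```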
Reviewed By: houseroad
Differential Revision: D14635130
fbshipit-source-id: d54a2b6e295074a6214b5939b21051a6735c9958
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18494
Today we have some C2 end2end test runs that require reading model data from external filesystems (for example, Gluster and AWS). This can be a source of flaky tests when the external filesystems are unreachable during the run.
In this diff, we add try/catch logic around the places where we download models and open model files from external systems. If such an attempt fails, we catch the exception and let unittest skip the current test instead of failing it.
I also refactored the code a little by removing some duplicated logic for downloading and building the c2 model data; it had been duplicated across two classes and a few functions.
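The pattern is roughly the following (a sketch; the real helper wraps the Gluster/AWS download logic):
```python
import unittest
import urllib.request

class End2EndModelTest(unittest.TestCase):
    def _fetch_model(self, url):
        try:
            # stand-in for the real model download helper
            return urllib.request.urlopen(url, timeout=30).read()
        except Exception as e:
            # external filesystem unreachable: skip instead of failing
            self.skipTest("model download failed: %s" % e)
```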
Reviewed By: yinghai
Differential Revision: D14442241
fbshipit-source-id: da8bf56c8d096efa34ca2070de5cd10a18aad70c
Summary:
Argument order is allowed to differ.
ajyu
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18466
Differential Revision: D14627258
Pulled By: bddppq
fbshipit-source-id: 430e1fb1bea2c5639a547ae7c1652368788c86b9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18155
- Make a python decorator caffe2_flaky for caffe2 operator unit tests.
- The environment variable CAFFE2_RUN_FLAKY_TESTS is now used to mark flaky-test mode
During a test run:
- If flaky-test mode is on, only flaky tests are run
- If flaky-test mode is off, only non-flaky tests are run
Mark ctc_beam_search_decoder_op_test as flaky.
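A hypothetical sketch of the decorator (the shipped version may differ; the runner handles the complementary skip of non-flaky tests):
```python
import os
import unittest

def caffe2_flaky(test_item):
    # flaky tests run only when CAFFE2_RUN_FLAKY_TESTS is set
    flaky_mode = bool(os.getenv("CAFFE2_RUN_FLAKY_TESTS"))
    return unittest.skipUnless(flaky_mode, "flaky-test mode is off")(test_item)
```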
Reviewed By: ezyang, salexspb
Differential Revision: D14468816
fbshipit-source-id: dceb4a48daeb5437ad9cc714bef3343e9761f3a4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18129
A lot of tensor inference functions assume the operator passes the schema,
so call Verify to make sure this is actually the case.
I created a diff before to add checking in Concat (https://github.com/pytorch/pytorch/pull/17110), but I encountered a lot more places where this is assumed (for example, ElementwiseOpShapeInference).
Reviewed By: mdschatz
Differential Revision: D14503933
fbshipit-source-id: cf0097b8c3e4beb1cded6b61e092a6adee4b8fcb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18257
Support adding an op in global_init_net, because pred_init_net is per-thread and just doesn't cut it.
Reviewed By: jspark1105
Differential Revision: D14552695
fbshipit-source-id: 53dd44c84ad019019ab9f35fc04d076b7f941ddc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17905
Support adding an op in global_init_net, because pred_init_net is per-thread and just doesn't cut it.
Reviewed By: jspark1105
Differential Revision: D14114134
fbshipit-source-id: 112bb2ceb9d3d5e663dd430585567f4eaa2db35f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18040
Add a flag that fails the run if a floating point exception is detected during operator execution.
Sample exception:
Exception [enforce fail at operator.h:837] !std::fetestexcept(FE_DIVBYZERO). Division by zero floating point exception (FE_DIVBYZERO) reported.
Error from operator:
input: "1" input: "0" output: "out" name: "" type: "Div"
Reviewed By: jspark1105
Differential Revision: D14467731
fbshipit-source-id: fad030b1d619a5a661ff2114edb947e4562cecdd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18084
The data_strategy parameter was not used in some of the unit tests for optimizers.
Reviewed By: hyuen
Differential Revision: D14487830
fbshipit-source-id: d757cd06aa2965f4c0570a4a18ba090b98820ef4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18036
- Add macros to export c10 cuda operators to caffe2 frontend
- Instead of having a separate caffe2 registry for the c10 operator wrappers, use the existing caffe2 registries
Reviewed By: ezyang
Differential Revision: D14467495
fbshipit-source-id: 7715ed2e38d2bbe16f1446ae82c17193a3fabcb9
Summary:
According to https://docs.python.org/3/tutorial/inputoutput.html, it is good practice to use the "with" keyword when dealing with file objects; otherwise you should call f.close() to close the file and immediately free any system resources it uses. Thus, I change the file-opening code to use "with open() as f".
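For example:
```python
# the file is closed automatically, even if an exception is raised
with open("predict_net.pb", "rb") as f:
    data = f.read()
```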
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18017
Differential Revision: D14475112
Pulled By: ezyang
fbshipit-source-id: d1c0821e39cb8a09f86d6d08b437b4a99746416c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17548
Expose half-float operators to OSS.
common/math/Float16.h is the original implementation;
it is substituted by caffe2/c10/util/Half.h.
From the comments, it seems that neither implementation handles denormals.
Reviewed By: jspark1105
Differential Revision: D14244200
fbshipit-source-id: f90ba28c5bf6a2b451b429cc4925b8cc376ac651
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17726
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17725
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17461
Implementing a standalone LSTM operator in Caffe2, adapted from this ATen implementation: diffusion/FBS/browse/master/fbcode/caffe2/aten/src/ATen/native/RNN.cpp. The trickiest part of this exercise was that caffe2::Tensor has no copy constructor, which made it necessary to implement a custom templated copy constructor for the different tensor containers used in the code. Also, there was no easy way to use off-the-shelf C2 operators in my code, so I had to copy some code that does basic matmul, cat, split, transpose, and linear as utility functions.
Two things missing:
- Profiling this implementation against the current ONNXified LSTM op
- Making this operator available for use in PyTorch
Reviewed By: dzhulgakov
Differential Revision: D14351575
fbshipit-source-id: 3b99b53212cf593c7a49e45580b5a07b90809e64
Summary:
Observed the test `TestGroupConvolution.test_group_convolution` to fail with the following error:
```
Falsifying example: test_group_convolution(self=<caffe2.python.operator_test.group_conv_test.TestGroupConvolution testMethod=test_group_convolution>, stride=3, pad=0, kernel=5, size=8, group=4, input_channels_per_group=7, output_channels_per_group=8, batch_size=2, order='NHWC', engine='', use_bias=False, gc=, dc=[, device_type: 1])
You can reproduce this example by temporarily adding reproduce_failure('3.59.1', b'AAAA') as a decorator on your test case
```
The example generated by hypothesis has `group=2, order='NHWC', dc=[, device_type: 1]`.
I think this example should be skipped.
I have mimicked the change from [PR#13554](https://github.com/pytorch/pytorch/pull/13554) to skip this example.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17715
Differential Revision: D14346642
Pulled By: ezyang
fbshipit-source-id: b1f1fef09f625fdb43d31c7213854e61a96381ba
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17623
Despite its generic-sounding name, caffe2::DeviceGuard actually
only worked on CUDA devices. Rename it to something that more
clearly spells out its applicability.
I'm not sure if it's the right call, but in this patch I added
'using CUDAGuard = c10::cuda::CUDAGuard', as this seems to be more
in line with how the Caffe2 codebase is currently written. More
idiomatic c10 namespace style would be to say cuda::CUDAGuard.
Willing to change this if people shout.
This is a respin of D13156470 (#14284)
Reviewed By: dzhulgakov
Differential Revision: D14285504
fbshipit-source-id: 93b8ab938b064572b3b010c307e1261fde0fff3d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17461
Implementing a standalone LSTM operator in Caffe2, adapted from this ATen implementation: diffusion/FBS/browse/master/fbcode/caffe2/aten/src/ATen/native/RNN.cpp. The trickiest part of this exercise was that caffe2::Tensor has no copy constructor, which made it necessary to implement a custom templated copy constructor for the different tensor containers used in the code. Also, there was no easy way to use off-the-shelf C2 operators in my code, so I had to copy some code that does basic matmul, cat, split, transpose, and linear as utility functions.
Two things missing:
- Profiling this implementation against the current ONNXified LSTM op
- Making this operator available for use in PyTorch
Reviewed By: dzhulgakov
Differential Revision: D14160172
fbshipit-source-id: c33e3f9e8aeae578b64d97593cb031a251216029
Summary:
Because of two separate python extensions with different pybind
instances, I have to go through a void* conversion. Since it's hidden from
the user, it's fine.
New APIs added on the C2 side:
- workspace.FetchTorch('blob')
- workspace.Workspace.current.blobs['blob'].to_torch()
- workspace.FeedBlob('blob', pytorch_tensor)
Works on CPU and GPU.
The only glitches are with resizing, because of the variable/tensor split,
but data sharing works properly.
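A usage sketch of these APIs (assuming a CPU workspace and a blob named 'x'):
```python
import numpy as np
import torch
from caffe2.python import workspace

workspace.FeedBlob("x", np.random.randn(2, 3).astype(np.float32))
t = workspace.FetchTorch("x")                           # torch.Tensor view of the blob
t2 = workspace.Workspace.current.blobs["x"].to_torch()  # same, via the blob object
workspace.FeedBlob("y", torch.ones(2, 3))               # feed a pytorch tensor back
```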
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17190
Reviewed By: ezyang
Differential Revision: D14163882
Pulled By: dzhulgakov
fbshipit-source-id: d18e5b8fcae026f393c842a1149e972515732de2
Summary:
These were previously merged to resolve #17051. However, since that issue was resolved upstream, and the change was causing problems like https://github.com/abjer/tsds/issues/8, I think it's time to revert it.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17567
Differential Revision: D14265241
Pulled By: kostmo
fbshipit-source-id: 7fa2b7dd4ebc5148681acb439cf82d983898694e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17549
Currently Dropout is only enabled in training; we enable the option of having dropout in Eval as well.
This follows [1]. The functionality will be used for uncertainty estimation in the exploration project.
[1] Gal, Yarin, and Zoubin Ghahramani. "Dropout as a bayesian approximation: Representing model uncertainty in deep learning." international conference on machine learning. 2016.
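Conceptually (a PyTorch sketch of the same idea, not the Caffe2 operator change itself), dropout kept active at eval time yields Monte Carlo uncertainty estimates:
```python
import torch

model = torch.nn.Sequential(
    torch.nn.Linear(4, 8), torch.nn.ReLU(),
    torch.nn.Dropout(0.5), torch.nn.Linear(8, 1))
model.eval()
for m in model.modules():
    if isinstance(m, torch.nn.Dropout):
        m.train()  # keep dropout stochastic at eval time

x = torch.randn(16, 4)
with torch.no_grad():
    preds = torch.stack([model(x) for _ in range(20)])
mean, var = preds.mean(0), preds.var(0)  # predictive mean and uncertainty
```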
Reviewed By: Wakeupbuddy
Differential Revision: D14216216
fbshipit-source-id: 87c8c9cc522a82df467b685805f0775c86923d8b
Summary:
MKL-DNN supports multi-node mode but not multi-device mode; this commit adds multi-device support for MKL-DNN. This commit depends on https://github.com/pytorch/pytorch/pull/11330
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12856
Differential Revision: D13735075
Pulled By: ezyang
fbshipit-source-id: b63f92b7c792051f5cb22e3dda948013676e109b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16723
Removed the obsolete argument correct_transform_coords in the bbox_transform op.
* It existed only for backward compatibility; we should not have models using it now.
Differential Revision: D13937430
fbshipit-source-id: 504bb066137ce408c12dc9dcc2e0a513bad9b7ee
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17194
We found that there is a per-row absolute error due to the int8 quantization,
and a table-wide relative error when fp16 is used.
Reviewed By: csummersea
Differential Revision: D14113353
fbshipit-source-id: c7065aa9d15c453c2e5609f421ad0155145af889
Summary:
Similar to softmax, there are issues with randomly getting NaN.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17170
Differential Revision: D14110515
Pulled By: bddppq
fbshipit-source-id: 5c97661184d45a02122fd69d35a839fdf4520c8c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17062
From jiyan's training jobs, it seems we found a quantization bug:
- fp32 is fine
- fp32 -> rowwise int8 is fine
- fp16 is fine
- fp16 -> rowwise int8 is not fine
We preconvert everything to fp32 and use the existing code, so there is no need to change the epsilon for the fp16 case, since at conversion time everything is a float.
Reviewed By: jspark1105
Differential Revision: D14063271
fbshipit-source-id: 747297d64ed8c6fdf4be5bb10ac584e1d21a85e6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17158
Because of the Reshape op, the batch size can change. This diff addresses the first-order issues raised by a multiple-batch-size system. We need to export a different real_batch_size for each max_batch_size input and attach it to the right output.
It also fixes a false exception.
Reviewed By: ipiszy
Differential Revision: D14099541
fbshipit-source-id: 0fa9e86826f417a11d2b5dd2ee60dff64a7ce8c4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17074
There is some common functionality in backend lowering. This diff creates a base class that hosts this common code.
Reviewed By: ipiszy
Differential Revision: D14073192
fbshipit-source-id: 9617603d0e73db6f7fcc5572756b9dbab506dae5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17046
As we are moving to bound shape inference, we can remove the awkward fake-inference-run path and make the code cleaner.
Reviewed By: ipiszy
Differential Revision: D14061501
fbshipit-source-id: b3ace98b3dabef3c3359086a0bb1410518cefa26
Summary:
For >2D input, the code previously used the static shape captured during tracing and reshaped before/after `Gemm`.
Now we add `-1` to the first `Reshape`, and use `Shape(X) => Slice(outer) => Concat(with -1 for inner) => Reshape` for the second.
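The shape logic is illustrated by this numpy sketch (illustrative only; the actual change emits the corresponding ONNX nodes):
```python
import numpy as np

def gemm_nd(x, w, b):
    outer = x.shape[:-1]                       # Shape(X) => Slice(outer)
    y2d = x.reshape(-1, x.shape[-1]) @ w + b   # first Reshape uses -1
    return y2d.reshape(*outer, -1)             # Concat(outer, -1) => Reshape

y = gemm_nd(np.random.randn(2, 3, 4), np.random.randn(4, 5), np.zeros(5))
assert y.shape == (2, 3, 5)
```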
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16184
Differential Revision: D14070754
Pulled By: ezyang
fbshipit-source-id: 86c69e9b254945b3406c07e122e57a00dfeba3df
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16691
Previous diffs already introduced a macro that registers caffe2 CPU kernels with c10.
This now also registers the CUDA kernels with it.
Reviewed By: bwasti
Differential Revision: D13901619
fbshipit-source-id: c15e5b7081ff10e5219af460779b88d6e091a6a6
Summary:
The second input (`lengths`) is not supported.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16727
Differential Revision: D14054105
Pulled By: houseroad
fbshipit-source-id: 36b8d00460f9623696439e1bd2a6bc60b7bb263c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16932
During the onnxifi transformation, the net's SSA is rewritten. In the last step, the weight
names are changed back to what they were before. This diff keeps the weight
names unchanged throughout the process.
Reviewed By: yinghai
Differential Revision: D13972597
fbshipit-source-id: 7c29857f788a674edf625c073b345f2b44267b33
Summary:
Implement the ExpandDims op and fall back to CPU if needed.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15264
Differential Revision: D13808797
Pulled By: yinghai
fbshipit-source-id: 7795ec303a46e85f84e5490273db0ec76e8b9374
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16643
The test was disabled in D13908117 because it conflicted with another diff that was about to land.
The merge conflict is now fixed, and this re-lands it.
Reviewed By: ezyang
Differential Revision: D13911775
fbshipit-source-id: b790f1c3a3f207916eea41ac93bc104d011f629b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16548
With this macro, a caffe2 operator can now directly be registered with c10.
No need to write custom wrapper kernels anymore.
Differential Revision: D13877076
fbshipit-source-id: e56846238c5bb4b1989b79855fd44d5ecf089c9c
Summary:
This PR is a follow-up of #15460; it does the following:
* remove the undefined tensor semantic in jit script/tracing mode
* change the ATen/JIT schema for at::index and other index-related ops to `Tensor?[]`, to align with what at::index really does and to adopt `optional[tensor]` in JIT
* change python_print to correctly print the exported script
* register both TensorList and ListOfOptionalTensor in JIT ATen ops to support both
* keep backward compatibility for `torch.jit.annotate(Tensor, None)`
List of follow-ups:
* remove the undefined tensor semantic in jit autograd, autodiff and grad_of
* remove prim::Undefined fully
For easy review, please turn on `hide whitespace changes` in the diff settings.
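For reference, a minimal sketch of an optional-tensor annotation in JIT script (current syntax, not this PR's diff):
```python
import torch
from typing import Optional

@torch.jit.script
def masked_add(x: torch.Tensor, y: Optional[torch.Tensor] = None) -> torch.Tensor:
    # `y` has JIT type Tensor?; the None check refines it to Tensor
    if y is not None:
        return x + y
    return x
```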
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16379
Differential Revision: D13855677
Pulled By: wanchaol
fbshipit-source-id: 0e21c14d7de250c62731227c81bfbfb7b7da20ab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16676
This op is used for changing the batch size (first dimension) of a tensor.
Reviewed By: bertmaher, ipiszy
Differential Revision: D13929200
fbshipit-source-id: 4f2c3faec072d468be8301bf00c80d33adb3b5b3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16785
There's no EIGEN engine implemented for DeformConv, but the unit test was checking for it.
Reviewed By: BIT-silence
Differential Revision: D13967306
fbshipit-source-id: e29c19f59f5700fc0501c59f45d60443b87ffedc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16478
This diff includes an example registration of a caffe2 op in torch. A previous attempt ran into a static initialization order bug.
Reviewed By: smessmer
Differential Revision: D13854304
fbshipit-source-id: ec463ce2272126d08a5163d1599361ee5b718bbc
Summary:
Just noticed while building on a machine without cudnn present: the build succeeded, but the runtime failed since some methods weren't bound.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16701
Differential Revision: D13937247
Pulled By: dzhulgakov
fbshipit-source-id: c81f05be7a9e64a1a8591036dcf8692c0ed4064e
Summary:
Add the Winograd conv method. Users can select direct conv or Winograd conv in the model file.
We closed the original PR https://github.com/pytorch/pytorch/pull/12154 and created this new one for easier rebasing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15196
Differential Revision: D13463721
Pulled By: yinghai
fbshipit-source-id: c5cd5c8aa7622ae7e52aeabd3dbb8ffb99b9b4ee
Summary:
- Skip the test due to flaky behavior on AMD/ROCm
- The fix is expected in ROCm 2.2 (HSA runtime)
bddppq
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16639
Differential Revision: D13915231
Pulled By: bddppq
fbshipit-source-id: 66e1d275836337170b15ceb9d60cfdd3242d4df8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16630
Two PRs landed concurrently: one enforcing tensor constraints and one refactoring c10. Since it's not prod code, disable the test; I'll let Sebastian fix it properly.
Reviewed By: ezyang
Differential Revision: D13908117
fbshipit-source-id: 381c5626078b794afa1fc7a95cb1ea529650424c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15388
This is another pass to make the perfkernels code safer from illegal instruction errors.
Removed the dependency on c10/util/Logging.h.
We err on the safe side at the expense of some verbosity.
Reviewed By: dskhudia
Differential Revision: D13502902
fbshipit-source-id: 4f833115df885c5b4f8c1ca83b9badea1553f944
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16246
The op schema says it returns multiple values, so let's actually return multiple values instead of one tuple.
For some reason, this did work when called from python (probably some auto-unpacking),
but once called from JIT, it segfaulted. This diff fixes that.
Reviewed By: dzhulgakov
Differential Revision: D13780147
fbshipit-source-id: fe94f82f4c53b7454f77c4484fca4ac9dc444475
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16374
This fixes the original attempt in OSS (adds it to the CMake and python build files).
Reviewed By: smessmer
Differential Revision: D13821061
fbshipit-source-id: 82f0dade0145fd04bdf8e3cb3954b5790e918162
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16191
Logdevice-related modifications for the generic feature type.
We directly convert the generic feature structures to JSON strings, which corresponds to the column input in offline and dper.
Reviewed By: itomatik
Differential Revision: D13551909
fbshipit-source-id: 807830c50bee569de202530bc3700374757793a2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16350
Example usage of the new caffe2 integration
Reviewed By: smessmer
Differential Revision: D13408546
fbshipit-source-id: 87240ca7f48d653a70241d243aa0eb25efa67611
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16335
Group conv is not implemented with the EIGEN engine, so this diff disables the related tests.
Reviewed By: jamesr66a
Differential Revision: D13807204
fbshipit-source-id: 41f6de43da40882f57e64474520e185733caefb7
Summary:
Based on offline discussion, this should be less surprising to users of the existing code. Thus caffe2::Tensor is now a move-only class (as it used to be); explicit calls to UnsafeSharedInstance() are necessary to get shared_ptr behavior.
This change also identified a few places that misused the copy constructor; those are fixed.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15416
Reviewed By: Yangqing
Differential Revision: D13524598
fbshipit-source-id: aea12d6dff77342606fa88ce4ddddbff266245a7
Summary:
1. Add some gloo communication operators to the related fallback lists.
2. Work around compile errors when using fallback operators whose CPU operator inherits directly from 'OperatorBase', like PrefetchOperator.
3. Add new CPU-context support for some python module files and the resnet50 training example file.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11330
Reviewed By: yinghai
Differential Revision: D13624519
Pulled By: wesolwsk
fbshipit-source-id: ce39d57ddb8cd7786db2e873bfe954069d972f4f
Summary:
bypass-lint
- Change all Caffe2 builds to use setup.py instead of cmake
- Add a -cmake- Caffe2 build configuration that uses cmake and only builds cpp
- Move skipIfCI logic from onnx test scripts to the rest of CI logic
- Removal of old PYTHONPATH/LD_LIBRARY_PATH/etc. env management
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15917
Reviewed By: orionr
Differential Revision: D13637583
Pulled By: pjh5
fbshipit-source-id: c5c5639db0251ba12b6e4b51b2ac3b26a8953153
Summary:
This is a follow-up on #13945, where we had to turn off some TRT tests because some ops were not ready to accept ONNX opset 9+ models. This PR fixes Reshape.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15380
Differential Revision: D13649825
Pulled By: houseroad
fbshipit-source-id: b72e62803de5b63cc001c3fe4b3bf64dfa996e94
Summary:
Implement the LeakyRelu operator for MKL-DNN; the speed-up of a single operation is up to 10X on BDW.
Implement the reshape operator for MKL-DNN; it resolves an occasional crash when using the fallback reshape operator.
Implement the CreateBlobsQueue and SafeEnqueueBlobs operators; this resolves a crash when using the fallback operators.
Fall back for the CreateBlobsQueueDBOp, TensorProtosDBInput, and CloseBlobsQueue operators.
Implement the adam operator for MKL-DNN; the speed-up of a single operator is up to 6X on BDW.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11696
Reviewed By: yinghai
Differential Revision: D10100438
Pulled By: wesolwsk
fbshipit-source-id: 0b6e06897cc11e0a8e349d80a870b1e72e47f10d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15865
Factored out code used in the tests for the Add, Mul, and Sub operators
into two new methods: one to generate the test vectors, and a second
to run the actual test given a caffe2 operator and a python reference.
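A sketch of the resulting structure (helper names assumed):
```python
import numpy as np
from caffe2.python import core, workspace

def make_test_vectors(shape=(4, 4), seed=0):
    rng = np.random.RandomState(seed)
    return (rng.randn(*shape).astype(np.float32),
            rng.randn(*shape).astype(np.float32))

def run_binary_op_test(op_name, py_op):
    a, b = make_test_vectors()
    workspace.FeedBlob("a", a)
    workspace.FeedBlob("b", b)
    workspace.RunOperatorOnce(core.CreateOperator(op_name, ["a", "b"], ["c"]))
    np.testing.assert_allclose(workspace.FetchBlob("c"), py_op(a, b), rtol=1e-5)

for name, ref in [("Add", np.add), ("Mul", np.multiply), ("Sub", np.subtract)]:
    run_binary_op_test(name, ref)
```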
Reviewed By: houseroad
Differential Revision: D13526955
fbshipit-source-id: 8970ba5a1305ca19a54a14b51816d4a19f19d678
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15553
Add unit test and implementation of NHWC layout for Resize operator.
Also, add pragma parallel loop to old NCHWC layout.
Reviewed By: jspark1105
Differential Revision: D13540762
fbshipit-source-id: eebf252bf0d1efdff180a171d804181045f100a5