Summary:
Resubmit of #20698, which got messed up.
The idea is that when PyTorch is used in a custom build environment (e.g. Facebook), it's useful to track usage of various APIs centrally. This PR introduces a very simple, lightweight mechanism to do so - only the first invocation of a trigger point is logged. This is significantly more lightweight than #18235, so we can afford to put logging in e.g. TensorImpl.
Also adds an initial list of trigger points. Trigger points are added in such a way that no static initialization triggers them, i.e. just linking with libtorch.so will not cause any logging. Further suggestions of what to log are welcome.
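The mechanism itself lives in the C++ core; below is a minimal Python sketch of the log-once idea only (the event name and the `print` backend are illustrative, not the real API):
```python
import functools

@functools.lru_cache(maxsize=None)
def log_api_usage_once(event):
    # Only the first call per event name reaches the backend;
    # later calls hit the cache and are essentially free.
    print("API usage:", event)  # stand-in for a pluggable logging backend

log_api_usage_once("tensor.create")
log_api_usage_once("tensor.create")  # no-op: already logged once
```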
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20745
Differential Revision: D15429196
Pulled By: dzhulgakov
fbshipit-source-id: a5e41a709a65b7ebccc6b95f93854e583cf20aca
Summary:
As part of the Variable/Tensor merge work: https://github.com/pytorch/pytorch/issues/13638, we make the following changes in this PR:
1. Remove the `Variable::Impl` class and the `DifferentiableViewImpl` class
2. Change all `Variable.data()` call sites to either use `Variable` directly, or use `Variable.tensor_data()`
3. Remove the `Variable.data()` API
4. Add `Variable.variable_data()` that matches `tensor.data` in the Python API, which creates a new `Variable` that shares the same storage and tensor metadata with the original `Variable`, but with a completely new autograd history.
After this PR, Variable no longer wraps a Tensor internally; both Variable and Tensor use the same TensorImpl class as their `impl_`. The only difference is that a Variable always has AutogradMeta in its TensorImpl, while a Tensor doesn't.
**Note that this PR is BC-breaking in the following use cases:**
**Use Case 1:**
Previously, `x.data = y` works even if `x` and `y` are of different TensorImpl type (e.g. `x` is a CPU dense tensor whose impl is of type TensorImpl, while `y` is a CPU sparse tensor whose impl is of type SparseTensorImpl). However, after this PR, `x.data = y` doesn't work anymore if `x` and `y` are of different TensorImpl type, because the underlying implementation `variable.set_data(tensor)` no longer works if `variable` and `tensor` have different TensorImpl type.
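For illustration, a minimal repro of the pattern that stops working (variable names are ours):
```python
import torch

x = torch.randn(2, 2)              # dense CPU tensor, impl is TensorImpl
y = torch.randn(2, 2).to_sparse()  # sparse CPU tensor, impl is SparseTensorImpl
x.data = y  # worked before this PR; raises an error after it
```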
**Use Case 2:**
If a tensor `x`'s `grad` is sparse, accumulating dense gradients to `x` will change the tensor that `x.grad` is pointing to. This is better illustrated with the following example:
```python
params = torch.tensor([1.5, 1.5]).requires_grad_()
with torch.no_grad():
    # Change gradient to a sparse tensor
    params.grad = torch.sparse_coo_tensor(torch.tensor([[1, 1]]).long(), torch.tensor([1., 1.]))
grad_saved = params.grad
params.backward(torch.tensor([1.5, 1.5]))
assert id(grad_saved) == id(params.grad) # This will fail after this PR
```
The assertion in the last line will fail after this PR, because adding dense gradients to sparse gradients will change the `params.grad` tensor reference.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17072
Differential Revision: D14075257
Pulled By: yf225
fbshipit-source-id: 0e681df641270dea586042dd26db59f2e76b5957
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20802
Need this for sequence model
Reviewed By: dzhulgakov
Differential Revision: D15448529
fbshipit-source-id: cd5abe3b689fc0e02feff10faf8cd61c99369f4f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20786
Add a method to LayerModelHelper to filter metrics_schema. A general model builder may add metric schemas that are not needed in some situations; this change adds the ability to skip the unneeded ones.
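A hedged sketch of the kind of filtering this enables; the function name, and the assumption that `schema.Struct.get_children()` yields (name, field) pairs, are ours rather than the exact API added by this diff:
```python
from caffe2.python import schema

def filter_metrics_schema(metrics_schema, keep_fields):
    # Rebuild the Struct, keeping only the whitelisted top-level fields.
    kept = [(name, field)
            for name, field in metrics_schema.get_children()
            if name in keep_fields]
    return schema.Struct(*kept)
```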
Reviewed By: alex1o1o7cloud
Differential Revision: D15418140
fbshipit-source-id: 520f5dffd9938cf206cb1352e2953a4d4d2b6ab1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20502
Following D15307410, remove more floating point exceptions in unit tests.
Reviewed By: hx89
Differential Revision: D15340930
fbshipit-source-id: 269fc75e0800bc9d39126767a0f3ca15cd8b0cad
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20501
Fixing unit tests related to optimizer operators.
Reviewed By: hx89
Differential Revision: D15307410
fbshipit-source-id: e5400c26e08f26191ee542fe6b02e0a69bc4e1ae
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20020
Add shape inference for the LearningRate op. The output (lr) should have the same shape as the input (iteration), but not the same type (float vs. int).
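The rule itself is small enough to state directly (illustrative Python, not the actual C2 OpSchema code, which lives in C++):
```python
def infer_lr_output(iteration_shape):
    # lr mirrors the iteration blob's shape but is floating point, not integer.
    return list(iteration_shape), "float"

assert infer_lr_output([1]) == ([1], "float")
```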
Reviewed By: un-disclosed
Differential Revision: D15112300
fbshipit-source-id: 09969aefa15172a6f3c70cd9b2548e3020da5d7a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20108
Add cpp runs for c2, hooked up via pybinds, printing output to the terminal. This is not hooked up with the pep output yet because I'd like to verify the numbers first.
Note that this isn't quite the same mechanism as the PyTorch cpp hookup, which uses cpp_python_extensions. If I can use the same mechanism to pull all the inputs for c2 through cpp and do FeedBlobs in cpp, then I'll switch to that.
Reviewed By: zheng-xq
Differential Revision: D15155976
fbshipit-source-id: 708079dacd3e19aacfe43d70c5e5bc54da2cf9e3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19803
There is no reason to set a specific logging level for this module. Removing it to just use the default logging level.
Differential Revision: D15098834
fbshipit-source-id: 1654c04500c19690ddde03343f2e84b04bb0f1ef
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19660
Implementation of aggregated Scale operator.
The operator takes a list of tensors as input and scales all of them by the argument float value.
The tensor sizes can be different, so bookkeeping of the sizes and pointers to the tensors is
necessary for the GPU version of the kernel.
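A hypothetical usage sketch; the operator name and argument below are assumptions based on the description above, not verified against the final op schema:
```python
from caffe2.python import core

# Assumed: the aggregated variant maps each input i to output i and
# scales every element by the same float value.
op = core.CreateOperator(
    "Scale",
    ["x0", "x1", "x2"],   # inputs may have different sizes
    ["y0", "y1", "y2"],
    scale=0.5,
)
```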
Reviewed By: BIT-silence
Differential Revision: D14984233
fbshipit-source-id: 37cc97159a4f2c38cd6fff4f5710ab7d3a773611
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20062
Previously, the batch counter was incremented even if none of the readers had data. In this diff,
1) the limiter is applied to the last reader, so that the batch counter is not incremented unless the first N-1 readers have data;
2) the stop blob of the last reader is used as the stop blob of the task, so that it's checked before the counter is incremented.
Reviewed By: xianjiec
Differential Revision: D15099761
fbshipit-source-id: 47ed6c728118fe453cf57ac3457085867939485b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20044
We do not have a gating functor; this diff adds one. I'm leveraging the existing learning rate op because there are other policies I'll need to combine with it as a union.
* Since there are other policies in LearningRateOp that will be used together as a union, I chose to add this as a LearningRateOp policy.
* The existing constantwarmup policy cannot produce a step function that is nonzero first and zero later.
* There are multiple uses for it:
  * e.g. as a gating blob generator that is useful for turning things off.
  * e.g. as a learning rate switcher at a certain iteration.
* For generalizability, no regulation or constraint is applied on the range of the values.
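A hedged usage sketch: the gating behaviour plugs in as just another LearningRate policy, and the policy name and arguments below are hypothetical placeholders, not the ones defined by this diff:
```python
from caffe2.python import core

gate = core.CreateOperator(
    "LearningRate",
    ["iteration"],
    ["gate_value"],
    base_lr=1.0,
    policy="gate",    # hypothetical policy name
    num_iter=10000,   # hypothetical switch point: nonzero before, zero after
)
```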
Reviewed By: ccheng16
Differential Revision: D15178229
fbshipit-source-id: 1e66e9a4bc1bfb946a57f8aefc97d8170f6be731
Summary:
When output blob names are specified while load_all=1, the output blob names are ignored. However, this behavior is not documented. In this diff, we simply disallow users from providing output blob names when load_all=1.
See discussion at https://fb.workplace.com/groups/1405155842844877/permalink/2714909788536136/
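For reference, a sketch of the now-enforced usage (the db path is a placeholder):
```python
from caffe2.python import core

# With load_all=1 the op loads every blob found in the db, so no output
# names may be listed; passing them now raises instead of being silently
# ignored.
load_op = core.CreateOperator(
    "Load", [], [],
    db="/path/to/checkpoint",
    db_type="minidb",
    load_all=1,
)
```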
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19133
Reviewed By: dzhulgakov
Differential Revision: D14883698
Pulled By: chandlerzuo
fbshipit-source-id: 6e4171e36c4ccc4f857e79da98b858a06b7d8ad6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18499
If the init op is not fp16 compatible, it should throw.
However, in the special case where the original init op is UniformFill, we replace it with Float16UniformFill.
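The rule reduces to something like the following (illustrative sketch, not the actual helper's code):
```python
def fp16_init_op_type(original_type):
    # Special case: UniformFill has a direct fp16 counterpart.
    if original_type == "UniformFill":
        return "Float16UniformFill"
    raise TypeError("Init op {} is not fp16 compatible".format(original_type))
```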
Reviewed By: kennyhorror
Differential Revision: D14627209
fbshipit-source-id: eb427772874a732ca8b3a25d06670d119ce8ac14
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19442
For cases like CV, some ops like transpose and tile will mangle the batch size, so we don't know how to adjust the output batch size. In this case, the current solution is to just fix the input batch size statically and not adjust the output batch size.
Reviewed By: zrphercule
Differential Revision: D15007237
fbshipit-source-id: a21b943a52ee5462d9d7804dfae44360f579f8cf
Summary:
In this PR, the fusion algorithms are improved to support DNNLOWP:
1. Enabled conv fusions for DNNLOWP
2. Fused order switch op into the following quantize op
3. Improved conv+sum fusion to parse a larger scope/window
4. Reorganized the fusion code to fix a random crash caused by changing the graph
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18843
Differential Revision: D15021030
Pulled By: yinghai
fbshipit-source-id: 88d2199d9fc69f392de9bfbe1f291e0ebf78ab08
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19083
As we have discussed, there are too many AdjustBatch ops; they incur reallocation overhead and affect performance. We will eliminate these ops by
- inlining the input adjust batch op into Glow
- inlining the output adjust batch op into OnnxifiOp, and doing that only conditionally.
This is the C2 part of the change and requires a change on the Glow side to work e2e.
Reviewed By: rdzhabarov
Differential Revision: D14860582
fbshipit-source-id: ac2588b894bac25735babb62b1924acc559face6