Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27756
Implement approximate L0 norm for use in the dense feature regularizer that will be used for feature importance. The formula is as follows:
{F212246801}
Reviewed By: wx1988
Differential Revision: D17432708
fbshipit-source-id: 57d6c9c3dd1b4e210b9f10264075c57dbc9c8cb6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27592
The caffe2 data reader test `test_time_limit_reader_with_short_limit` is flaky as-written because it places an upper bound on how much can be read, but under stress it is possible for fewer records to be read. The fix is to make the assertion check a fuzzy/range check rather than exact equality, since there's not a straightforward way to precisely test a timer-based feature.
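The range-style check can be sketched as follows (names and limits are hypothetical; the real test lives in dataio_test):

```python
import time

def read_with_time_limit(reader, limit_s):
    """Count records read before a deadline; the exact count is timing-dependent."""
    deadline = time.monotonic() + limit_s
    count = 0
    for _ in reader:
        if time.monotonic() >= deadline:
            break
        count += 1
    return count

# Fuzzy/range check: a time limit guarantees only an upper bound on reads,
# so assert a range instead of exact equality.
n = read_with_time_limit(iter(range(1000)), 0.01)
assert 0 <= n <= 1000
```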
ghstack-source-id: 91543898
Test Plan:
`buck test mode/dev-tsan //caffe2/caffe2/python:dataio_test-2.7 -- --stress-runs 20` -> P117156924 (with fix, 100% pass)
P117158750 - without fix, lots of failures in this test
Reviewed By: boryiingsu
Differential Revision: D17816775
fbshipit-source-id: 2ab0d3304fbd9c9806d37a4fe2912c840616db61
Summary: This test was failing in 3.7; it turns out it was omitted by the test director in 3.6, so I added a skip for both versions.
Test Plan: the unit test is skipped in both 3.7 and 3.6; all other tests pass.
Reviewed By: tomdz
Differential Revision: D17820967
fbshipit-source-id: 571f0ec7fe1b0cb50ead4e0d18c00151a701f36a
Summary:
Support attention weights input to SparseLookup. In attention sum pooling, if attention weights can be pre-calculated before embedding lookup, they can be passed to SparseLookup and processed by SparseLengthsWeightedSum op. One example is id_score attention sum pooling.
Essentially the net is converted from:
LengthsSum(Mul(Gather(keys, w), att_weight))
to:
SparseLengthsWeightedSum(keys, w, att_weight)
It unblocks potential efficiency gains with distributed training.
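A numeric sketch (NumPy, hypothetical shapes) of why the fused form computes the same result as the Gather/Mul/LengthsSum chain:

```python
import numpy as np

emb = np.arange(12, dtype=np.float32).reshape(4, 3)   # 4 ids x dim 3
ids = np.array([0, 2, 1])                             # gathered keys
att = np.array([0.5, 1.0, 2.0], dtype=np.float32)     # precomputed attention
lengths = np.array([2, 1])                            # segment sizes
offs = np.concatenate([[0], np.cumsum(lengths)])

# Unfused: LengthsSum(Mul(Gather(emb, ids), att))
g = emb[ids] * att[:, None]
unfused = np.stack([g[a:b].sum(0) for a, b in zip(offs[:-1], offs[1:])])

# Fused: the weighted segment sum done in one op, as
# SparseLengthsWeightedSum would.
fused = np.stack([(emb[ids[a:b]] * att[a:b, None]).sum(0)
                  for a, b in zip(offs[:-1], offs[1:])])
assert np.allclose(unfused, fused)
```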
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26748
Test Plan: unit test
Reviewed By: chocjy
Differential Revision: D17553345
Pulled By: wheatkit
fbshipit-source-id: 60cc3c4b0bc1eade5459ac598e85286f3849a412
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27508
Implemented a simple exponential decay of the weight of lr loss function, with a lower bound.
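A minimal sketch of the schedule (parameter names are assumptions, not the dper API):

```python
def decayed_task_weight(w0, decay_rate, step, lower_bound):
    # Exponential decay of the loss weight, clamped at a lower bound.
    return max(w0 * decay_rate ** step, lower_bound)

assert decayed_task_weight(1.0, 0.5, 3, 0.2) == 0.2   # 0.125 clamped to 0.2
assert decayed_task_weight(1.0, 0.5, 1, 0.2) == 0.5   # above the bound
```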
Test Plan:
buck test //caffe2/caffe2/fb/dper/layer_models/tests:mtml_test -- test_task_weight_decay
https://our.intern.facebook.com/intern/testinfra/testrun/3377699729136308
canary: f140103452
Reviewed By: chenshouyuan
Differential Revision: D17524101
fbshipit-source-id: 9a653e21a4ecb74dfc4ac949c9e3388f36ef3a20
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26376
* Create the new dense_feature_reg (FCInputLpNorm) for feature importance to be applied to the fully-connected layer for feature-importance.
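A rough sketch of what a per-input-feature Lp norm over the FC weight matrix could look like (NumPy; the actual FCInputLpNorm op may differ):

```python
import numpy as np

def fc_input_lp_norm(W, p=2.0):
    # W: (num_outputs, num_inputs) FC weights. Returns one Lp norm per input
    # column, a proxy for that input feature's importance.
    return np.sum(np.abs(W) ** p, axis=0) ** (1.0 / p)

W = np.array([[3.0, 1.0], [4.0, 0.0]])
assert np.allclose(fc_input_lp_norm(W, 2.0), [5.0, 1.0])
```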
Test Plan: * Unit test located in: `caffe2/caffe2/fb/dper/layer_models/tests/split_1/sparse_nn_test.py`
Reviewed By: un-disclosed
Differential Revision: D17360361
fbshipit-source-id: 1a0e119eeb17199a13dfffe58b3036ea4255e301
Summary:
In some versions of Python, then_net and else_net may switch order. Let's make sure we are iterating over the right arg node.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26931
Reviewed By: hl475
Differential Revision: D17614829
Pulled By: houseroad
fbshipit-source-id: 3f1b4eb91ecf4d808f58c34896d3e628aa2e0af0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26654
As per the Python contract, __getattr__ may only raise AttributeError. Raising anything else breaks hasattr() and causes upstream issues.
A similar bug existed in PyTorch earlier.
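A minimal illustration of the contract (hypothetical class, not the code in this diff):

```python
class Record:
    def __init__(self, fields):
        self.__dict__["_fields"] = fields  # bypass __getattr__ recursion

    def __getattr__(self, name):
        try:
            return self._fields[name]
        except KeyError:
            # Raising anything but AttributeError here would break hasattr().
            raise AttributeError(name)

r = Record({"x": 1})
assert r.x == 1
assert hasattr(r, "x") and not hasattr(r, "y")
```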
Test Plan: builds
Differential Revision: D17529471
fbshipit-source-id: bb6ac6c9e3be8b80fa2967e6a2e293afd1594cf9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25426
Add embedding table 4bit quantization support.
* add the conversion from fp32 to int4.
* using brew to pass the context so that the 4bit operators are added when generating the predictor net.
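A rowwise sketch of the fp32-to-int4 conversion (the real fused 4bit ops also pack two 4-bit values per byte and store per-row scale/bias):

```python
import numpy as np

def quantize_row_4bit(row):
    # Affine-quantize one embedding row to the 16 levels an int4 can hold.
    lo, hi = float(row.min()), float(row.max())
    scale = (hi - lo) / 15.0 if hi > lo else 1.0
    q = np.clip(np.round((row - lo) / scale), 0, 15).astype(np.uint8)
    return q, scale, lo

def dequantize_row_4bit(q, scale, lo):
    return q.astype(np.float32) * scale + lo

row = np.array([-1.0, 0.0, 0.5, 2.0], dtype=np.float32)
q, scale, lo = quantize_row_4bit(row)
assert np.all(q <= 15)
# Round-trip error is bounded by half a quantization step.
assert np.max(np.abs(dequantize_row_4bit(q, scale, lo) - row)) <= scale / 2 + 1e-6
```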
Reviewed By: kennyhorror, chocjy
Differential Revision: D16859892
fbshipit-source-id: a06c3f0b56a7eabf9ca4a2b2cb6c63735030d70b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26227
In the previous implementation of composite lr, the lr_scale for each sub-policy was overwritten by the last lr_scale.
Due to another bug in the unit test (policy_lr_scale was the same for all sub-policies), this bug was not detected...
Fix: add an additional field in CompositeLearningRateItem so that we store the lr_scale value for each sub-policy.
With the unit test fixed, the error in the previous implementation:
https://fburl.com/testinfra/ikdbnmey
With the fix,
https://fburl.com/testinfra/m694ehl1
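The fixed behavior, sketched (hypothetical structure mirroring CompositeLearningRateItem):

```python
def composite_lr(step, policies):
    # policies: list of (num_iters, lr_scale, lr_fn); each sub-policy now
    # carries its own lr_scale instead of inheriting the last one.
    for num_iters, lr_scale, lr_fn in policies:
        if step < num_iters:
            return lr_scale * lr_fn(step)
        step -= num_iters
    raise ValueError("step beyond all sub-policies")

policies = [(10, 1.0, lambda s: 0.1), (10, 0.5, lambda s: 0.1)]
assert composite_lr(5, policies) == 0.1    # first sub-policy, scale 1.0
assert composite_lr(15, policies) == 0.05  # second sub-policy, scale 0.5
```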
Test Plan:
unittest
buck test caffe2/caffe2/python/operator_test:learning_rate_op_test -- test_composite_learning_rate_op
Reviewed By: chocjy, alex1o1o7cloud
Differential Revision: D17380363
fbshipit-source-id: 161e9cb71bb2ea7f0734a3361e270616057a08e4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26147
We may try to unpickle in py3 a byte string that was pickled from py2. Therefore we need to decode with encoding='latin1'.
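A concrete illustration; the byte string below is what Python 2's pickle (protocol 2) produces for a str containing a non-ASCII byte:

```python
import pickle

py2_payload = b'\x80\x02U\x04\x80abcq\x00.'  # py2: pickle.dumps('\x80abc', 2)

# The default encoding ('ASCII') fails on byte 0x80; latin1 maps all 256
# byte values losslessly, so the load succeeds.
obj = pickle.loads(py2_payload, encoding='latin1')
assert obj == '\x80abc'
```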
Reviewed By: kennyhorror
Differential Revision: D17305677
fbshipit-source-id: c0c8a51909629a65eb72bb81cccfbabaee9f8d01
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25908
Original commit changeset: f6e961e88c01
device_option propagation is completely broken in Caffe2 for cases when pass-through operators are used. As an example, the Gather operator doesn't have a gradient and passes through its inputs, which results in incorrect detection of the components for sparse parameter aggregation (the component will be empty instead of the real device).
This diff is trying to fix this issue.
The original diff had a problem: Caffe2 does not handle cases when a device option is present but contains only metadata (for example, the one for auto-generated reduction ops in the backward pass). This diff addresses that issue by merging device options during the backward pass.
Test Plan:
1. net_transform is finally working with Gather + FloatToHalf transformed model instead of failing because of incorrect number of components.
2. New unit-test.
3. Verify that previously broken benchmark is now passing
ezyang do you have suggestions what else I should test?
Reviewed By: ezyang
Differential Revision: D17281528
fbshipit-source-id: 4a1bc386f29f6a34fbf8008effde9d4890abebfa
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26153
I suspect that our multithreaded test system causes issues with dyndep if two places try to call InitOpsLibrary concurrently. So perhaps we just guard this with a lock. This is just a guess-fix, as it is impossible to repro.
Test Plan: sandcastle
Reviewed By: bddppq
Differential Revision: D17361310
fbshipit-source-id: 596634a2098b18881abbd26a5a727a5ba0d03b6e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26080
Will be used in c2 ctr_mbl_feed model to PyTorch conversion
Test Plan: Unit test
Reviewed By: yinghai
Differential Revision: D17337604
fbshipit-source-id: a90d9f5dc38301608d1562c6f2418e7f4616e753
Summary:
Just a tiny fix to make debugging easier (output errors to stderr and include them in the exception message)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25809
Reviewed By: zrphercule
Differential Revision: D17329957
Pulled By: houseroad
fbshipit-source-id: 0d73dd9f62c735fbc5096e6a7c0e5f58e4cd90ae
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25782
Enable variable size embedding for dot processor. We split the embedding matrix into multiple towers, based on the embedding size and perform dot product in a loop over each of the towers and finally concatenate all the dot product outputs.
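A NumPy sketch of the tower-wise dot product (tower shapes are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)
# Two towers: 5 features with dim 8, and 3 features with dim 16.
towers = [rng.normal(size=(5, 8)), rng.normal(size=(3, 16))]

outs = []
for emb in towers:
    dots = emb @ emb.T                    # pairwise dots within one tower
    iu = np.triu_indices(len(emb), k=1)   # each unordered pair once
    outs.append(dots[iu])
out = np.concatenate(outs)                # C(5,2) + C(3,2) = 10 + 3 pairs
assert out.shape == (13,)
```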
Test Plan:
buck test //caffe2/caffe2/fb/dper/layer_models/tests/split_1:
https://our.intern.facebook.com/intern/testinfra/testrun/3659174703037560
Specific unit tests --
buck test //caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_per_feature_emb_dim
https://our.intern.facebook.com/intern/testinfra/testrun/3377699726358808
Reviewed By: chenshouyuan
Differential Revision: D16690811
fbshipit-source-id: 8f5bce5aa5b272f5f795d4ac32bba814cc55210b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25203
device_option propagation is completely broken in Caffe2 for cases when pass-through operators are used. As an example, the Gather operator doesn't have a gradient and passes through its inputs, which results in incorrect detection of the components for sparse parameter aggregation (the component will be empty instead of the real device).
This diff is trying to fix this issue.
Test Plan:
net_transform is finally working with Gather + FloatToHalf transformed model
instead of failing because of incorrect number of components.
Reviewed By: dzhulgakov
Differential Revision: D16936041
fbshipit-source-id: 916551b933469f04e32ddf86ec4b2c07f76c9176
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24863
Add the sparse feature name in logging for ease of debugging
Test Plan:
./buck-out/gen/caffe2/caffe2/fb/dper/layer_models/sparse_nn/pooling_test#binary.par -r test_simple_sum_pooling_named_exception
Another test for id_score_list. The original sparse_key is equivalent to get_key(self.input_record)()
P98343716
./buck-out/gen/caffe2/caffe2/python/layers_test-2.7#binary.par -r test_get_key
Reviewed By: chocjy
Differential Revision: D16901964
fbshipit-source-id: 2523de2e290aca20afd0b909111541d3d152a588
Summary:
[Not in need of review at this time]
Support focal loss in MTML (effectively dper2 in general) as described in https://arxiv.org/pdf/1708.02002.pdf. Adopt an approach similar to Yuchen He's WIP diff D14008545
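The lr-loss-based focal loss can be sketched for the binary case as below (alpha/gamma defaults are from the paper; the names are not the dper option names):

```python
import math

def lr_focal_loss(p, y, gamma=2.0, alpha=0.25):
    # Focal loss (Lin et al., 2017): down-weights well-classified examples
    # via the modulating factor (1 - p_t)^gamma.
    pt = p if y == 1 else 1.0 - p
    a = alpha if y == 1 else 1.0 - alpha
    return -a * (1.0 - pt) ** gamma * math.log(pt)

# A confident correct prediction contributes far less than a wrong one.
assert lr_focal_loss(0.9, 1) < lr_focal_loss(0.1, 1)
```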
Test Plan:
Passed the following unit tests
buck test //caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_lr_loss_based_focal_loss
buck test //caffe2/caffe2/fb/dper/layer_models/tests:mtml_test_2 -- test_mtml_with_lr_loss_based_focal_loss
buck test //caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_lr_loss_based_focal_loss_with_stop_grad_in_focal_factor
Passed ./fblearner/flow/projects/dper/canary.sh; URL to track workflow runs: https://fburl.com/fblearner/446ix5q6
Model based on V10 of this diff
f133367092
Baseline model
f133297603
Protobuf of train_net_1 https://our.intern.facebook.com/intern/everpaste/?color=0&handle=GEq30QIFW_7HJJoCAAAAAABMgz4Jbr0LAAAz
Reviewed By: hychyc90, ellie-wen
Differential Revision: D16795972
fbshipit-source-id: 7bacae3e2255293d337951c896e9104208235f33
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24357
SparseNormalize does not need to know the gradient value to the lookup table, only the indices of the embeddings that need to be updated. By removing this input, we allow SparseNormalize to be used alongside SparseAdagradFusion
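The point can be sketched as follows; only the touched row indices are required, never the gradient values:

```python
import numpy as np

def sparse_normalize(table, indices, max_norm=1.0):
    # Project only the touched embedding rows onto the max_norm ball.
    for i in set(int(i) for i in indices):
        n = np.linalg.norm(table[i])
        if n > max_norm:
            table[i] *= max_norm / n

table = np.array([[3.0, 4.0], [0.3, 0.4]])
sparse_normalize(table, [0, 1])
assert np.allclose(table[0], [0.6, 0.8])   # norm 5 scaled down to 1
assert np.allclose(table[1], [0.3, 0.4])   # already within the bound
```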
Differential Revision: D16809919
fbshipit-source-id: cc19692ba4dea8854663ae1ed8cf9365e90c99bc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24439
Many papers mention that BPR is useful for improving recommendation quality. Add a BPR loss so that we can train TTSN with it. We would like to see if it can improve retrieval models.
reference: https://arxiv.org/pdf/1205.2618.pdf
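The pairwise loss from the paper, sketched:

```python
import math

def bpr_loss(pos_score, neg_score):
    # BPR (Rendle et al., 2012): -log sigmoid(s_pos - s_neg); minimized when
    # the positive item is scored above the sampled negative.
    return -math.log(1.0 / (1.0 + math.exp(-(pos_score - neg_score))))

# Ranking the positive higher gives a smaller loss than ranking it lower.
assert bpr_loss(2.0, 0.0) < bpr_loss(0.0, 0.0) < bpr_loss(0.0, 2.0)
```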
Reviewed By: dragonxlwang
Differential Revision: D16812513
fbshipit-source-id: 74488c714a37ccd10e0666d225751a845019eb94
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23983
While testing I realized that model layers can extract different types of features from the same column. For example, MultifeedFeaturesTransform uses float and ID list features from the "features" column.
get_accessed_features returns a map from column to AccessedFeatures, and AccessedFeatures only holds the feature IDs for one feature type. This is incompatible with having multiple types of features per column: one type ends up overwriting another in the map.
To fix this, I've modified get_accessed_features to return a map from column to a list of AccessedFeatures objects.
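A sketch of the new return shape (the AccessedFeatures tuples are hypothetical):

```python
from collections import defaultdict

# column -> list of AccessedFeatures, so different feature types accessed
# from the same column no longer overwrite each other.
accessed = defaultdict(list)
accessed["features"].append(("FLOAT", [101, 102]))   # hypothetical entries
accessed["features"].append(("ID_LIST", [7]))
assert [t for t, _ in accessed["features"]] == ["FLOAT", "ID_LIST"]
```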
Reviewed By: itomatik
Differential Revision: D16693845
fbshipit-source-id: 2099aac8dc3920dd61de6b6ad5cf343c864803bc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23848
Problem:
In an experiment running feed model 127607201 (/mnt/public/tracelog/feed_repro2/127607201_0.predictor), we encountered a blob dimensionality mismatch error when running the onnxified net. This is because the model initializes input blobs in the current workspace with blob size 0, and onnxifi() falsely identified those input blobs as weight blobs and assigned them wrong dimensions.
Solution:
Add option to pass correct weight blob names to onnxifi() instead of using all blobs in current workspace.
Reviewed By: yinghai
Differential Revision: D16661396
fbshipit-source-id: cabe44db6b64e6538bef4b65e380312214b3ba9f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23784
The backward path does nothing during the gradient pass when the input is empty; as a result, the workspace can preserve gradient values from a previous iteration, and some backward-pass operators get inconsistent inputs. This diff fixes the discrepancy by always reinitializing the output during the backward path.
Reviewed By: dzhulgakov
Differential Revision: D16646096
fbshipit-source-id: 8ca68dfad17a63fc87c033cce7b36b40bd77245c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23679
Full Canary: https://fburl.com/fblearner/sa1pkpya
Add LambdaRank DCG Loss Option
* when use_idcg_normalization == true, regular LambdaRank with NDCG loss
* when use_idcg_normalization == false, gradient and loss functions are not normalized by idcg.
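The normalization toggle, sketched (this mirrors the option's effect, not the actual op implementation):

```python
import math

def dcg(rels):
    # Discounted cumulative gain with the standard 2^rel - 1 gain.
    return sum((2 ** r - 1) / math.log2(i + 2) for i, r in enumerate(rels))

def lambdarank_scale(rels, use_idcg_normalization=True):
    # With normalization off, loss/gradients stay in raw DCG units.
    idcg = dcg(sorted(rels, reverse=True))
    return 1.0 / idcg if (use_idcg_normalization and idcg > 0) else 1.0

assert lambdarank_scale([3, 1, 0]) < 1.0
assert lambdarank_scale([3, 1, 0], use_idcg_normalization=False) == 1.0
```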
Differential Revision: D16605459
fbshipit-source-id: a16f071e69516974e48d27bef4ca179019ca4ae7