pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Omkar Salpekar	ae1ed27756	[codemod][numpy] replace np.str with str (#103931 ) Summary: `np.str` is removed from numpy 1.20.0. It was an alias to builtin `str` and it's safe to do the replacement. The whole changes is mechanical, generated using the following onliner: ``` fbgr -sl 'np\.str\b' \| xargs perl -pi -e 's,\bnp\.str\b,str,g' ``` Test Plan: sandcastle Differential Revision: D46586144 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103931 Approved by: https://github.com/huydhn	2023-06-21 18:16:42 +00:00
Xuehai Pan	8d45f555d7	[BE] [1/3] Rewrite `super()` calls in caffe2 and benchmarks (#94587 ) Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied. - #94587 - #94588 - #94592 Also, methods with only a `super()` call are removed: ```diff class MyModule(nn.Module): - def __init__(self): - super().__init__() - def forward(self, ...): ... ``` Some cases that change the semantics should be kept unchanged. E.g.: `f152a79be9/caffe2/python/net_printer.py (L184-L190)` `f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94587 Approved by: https://github.com/ezyang	2023-02-11 18:19:48 +00:00
Aaron Gokaslan	8fce9a09cd	[BE]: pyupgrade Python to 3.8 - imports and object inheritance only (#94308 ) Apply parts of pyupgrade to torch (starting with the safest changes). This PR only does two things: removes the need to inherit from object and removes unused future imports. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94308 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-07 21:10:56 +00:00
Aaron Gokaslan	748bac8757	[BE]: Apply pyupgrade yield from and unit test alias upgrades (#94309 ) Applies some more harmless pyupgrades. This one gets rid of deprecated aliases in unit_tests and more upgrades yield for loops into yield from generators which are more performance and propagates more information / exceptions from original generator. This is the modern recommended way of forwarding generators. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94309 Approved by: https://github.com/albanD	2023-02-07 20:08:58 +00:00
Nikita Shulga	1906eaf22f	[BE] Get rid of `future` (#92596 ) PyTorch has been Python-3.X+ for ages, so it's a shame to still rely on `future.utils` even in a deprecated Caffe2 codebase For the reference: https://peps.python.org/pep-0469/#migrating-directly-to-python-3 Pull Request resolved: https://github.com/pytorch/pytorch/pull/92596 Approved by: https://github.com/kit1980, https://github.com/orionr	2023-01-19 08:46:50 +00:00
Ram Rachum	351d73b97f	Fix exception causes all over the codebase (#90271 ) This is the continuation to #90134 and hopefully the final PR in this series. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90271 Approved by: https://github.com/kit1980	2022-12-07 04:29:00 +00:00
PyTorch MergeBot	8c1c6759b2	Revert "remove assert_allclose from torch.testing (#87974 )" This reverts commit `5669e10d37`. Reverted https://github.com/pytorch/pytorch/pull/87974 on behalf of https://github.com/mehtanirav due to Internal breakages from method removal	2022-11-04 19:12:37 +00:00
Philip Meier	5669e10d37	remove assert_allclose from torch.testing (#87974 ) See #87969 or #86586 for the reasoning. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87974 Approved by: https://github.com/mruberry	2022-11-02 14:05:01 +00:00
Philip Meier	bc73affdad	prepare removal of deprecated functionality in torch.testing (#87969 ) _Redo of #86586 with all BC breaking changes granularly placed into separate commits._ --- Per title. Deprecation happened on Feb 25, 2022 in `c6f1bbc0ac`, which made it into the 1.12 release. Since it is now 245 days later and the next release will be 1.14, the removals later in the stack comply with the [BC policy](https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#minimizing-the-disruption-of-bc-breaking-changes). Pull Request resolved: https://github.com/pytorch/pytorch/pull/87969 Approved by: https://github.com/mruberry	2022-11-02 14:04:48 +00:00
John Shahid	4766314de1	Disable GPU tests for the PiecewiseLinearTransform operator. (#75738 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75738 The tests are failing on platform010 and blocking the upgrade. Skip the tests given that Caffe2 on GPU is no longer supported. Test Plan: signals Reviewed By: ezyang Differential Revision: D35613544 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75771 Approved by: https://github.com/jamesr66a	2022-04-14 12:07:50 +00:00
John Shahid	b311f255d8	Disable GPU tests for the Dropout operator. (#75739 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75739 The tests are failing on platform010 and blocking the upgrade. Skip the tests given that Caffe2 on GPU is no longer supported. Test Plan: signals Reviewed By: ezyang Differential Revision: D35614159 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75767 Approved by: https://github.com/ezyang	2022-04-14 05:43:41 +00:00
Yulv-git	ac2d2e3a3d	Fix some typos. Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/75561 Approved by: https://github.com/albanD	2022-04-11 21:55:59 +00:00
Richard Barnes	c021824128	Clean up bisect_percentile_op (#73148 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73148 Makes a bunch of things const, eliminates extraneous variables Test Plan: Sandcastle Reviewed By: malfet Differential Revision: D34365183 fbshipit-source-id: 56e4c43e0c14d28f9d18903e9b05f993637489b1 (cherry picked from commit 51520edd16084270aefe8f8143799f918d7ae22d)	2022-02-25 04:33:45 +00:00
Xiaohan Wei	ca0ac3a74b	[caffe2] allow dropout to take 1.0 as dropout ratio to zero-out a layer (#72741 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72741 as titled. Context: This is useful in fast mitigating feature induced overfitting in the sense that we can do omni-transfer on a trained model and apply dropout with ratio = 1 on features resulting in overfitting. Directly removing the features would not be feasible on omni-transfer scenarios since the downstream FC sizes would change. Experimental records: https://fb.quip.com/npIkAgRc8jl9#temp:C:DWC050ceaba14424d23a78462c01 Doing dropout = 1 on selected features improves the eval NE over the next few hours (compared to v0 baseline) as is shown in the figures. Test Plan: ``` buck test caffe2/caffe2/python/operator_test:dropout_op_test ``` Reviewed By: ustctf Differential Revision: D34178732 fbshipit-source-id: 533feebe21bc582eefd756de397d5c7807c7438d (cherry picked from commit `5dabf9c484`)	2022-02-15 19:14:46 +00:00
Nikita Shulga	511ec7f366	Fix `sequence_ops_test` (#72844 ) Summary: Fuzzing gone bad again: `np.unique([])` returns array or float64, but `np.delete` expects array of int Fixes recent regressions in ONNX tests in OSS CI, see https://github.com/pytorch/pytorch/runs/5188636426?check_suite_focus=true for example Pull Request resolved: https://github.com/pytorch/pytorch/pull/72844 Reviewed By: gmagogsfm Differential Revision: D34235295 Pulled By: malfet fbshipit-source-id: 37ad39ac04f81ac519a5d4e4e8a86901944973bd (cherry picked from commit `683c767e72`)	2022-02-15 06:49:38 +00:00
Dmytro Dzhulgakov	6b24d7e4e5	[caffe2] Allow LpNorm to accept empty tensor (#72660 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72660 Sometimes it might happen when model gets an empty input. For consistency with numpy and torch we should just return 0 without averaging or NaN with averaging. Test Plan: Modified unittest Differential Revision: D33782786 fbshipit-source-id: 90d8d63d685c96acc903c08c59eb39fad39e493c (cherry picked from commit `ca85779a4e`)	2022-02-11 23:14:02 +00:00
Ziheng Huang	ae285d837e	[1/n][caffe2] Add session based margin loss function in caffe2 operator Summary: Add session based margin loss into caffe2 operator. This is the first diff make these 2 loss available to dper3 Test Plan: unit test succeeds with gradient check for both new loss function buck test //caffe2/caffe2/python/operator_test:softmax_l2r_operator_test buck test //caffe2/caffe2/python/operator_test:margin_loss_l2r_operator_test E2E test in bento notebook with model training in N1488923 margin loss model: f318207967 f318207399 Notice that the E2E test is run with dper change in D33532976 to change a full model Reviewed By: devashisht Differential Revision: D32902460 fbshipit-source-id: 8f21b9109f500583431156908b632e503ed90dbd (cherry picked from commit `1592111aa4`)	2022-01-21 23:13:36 +00:00
Hector Yuen	0fc6bd2e47	[gpu ne eval] disable adam decay unit test for gpu (#66056 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66056 keep running into this unrelated failure when landing diffs regarding the gpu inference project, disabling this operator unit test in gpu because it doesn't exist RuntimeError: [enforce fail at operator.cc:277] op. Cannot create operator of type 'SmartDecaySparseAdam' on the device 'CUDA'. Verify that implementation for the corresponding device exist. It might also happen if the binary is not linked with the operator implementation code. If Python frontend is used it might happen if dyndep.InitOpsLibrary call is missing. Operator def: input: "param" input: "mom1" input: "mom2" input: "last_seen" input: "indices" input: "grad" input: "lr" input: "iter" output: "param" output: "mom1" output: "mom2" output: "last_seen" name: "" type: "SmartDecaySparseAdam" arg { name: "beta1" f: 0 } arg { name: "beta2" f: 0.9 } arg { name: "epsilon" f: 1e-05 } device_option { device_type: 1 } https://www.internalfb.com/intern/testinfra/diagnostics/5910974579962988.562949996565057.1633122845/ Test Plan: sandcastle Reviewed By: jianyuh Differential Revision: D31364731 fbshipit-source-id: 7fbd994cbe7f6ca116f5f34506a1ed7f14759bdf	2021-10-03 07:40:23 -07:00
Tanvir Zaman	25e2578967	Fix bytes_written and bytes_read (#64244 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64244 Pull Request resolved: https://github.com/pytorch/pytorch/pull/64040 In operator cost inference functions, in many places we are using sizeof(x.data_type()). Since data_type() returns a 32 bit integer from [this enum](https://www.internalfb.com/code/fbsource/[15e7ffe4073cf08c61077c7c24a4839504b964a2]/fbcode/caffe2/caffe2/proto/caffe2.proto?lines=20), we are basically always getting 4 for sizeof(x.data_type()) no matter what actual data type x has. Big thanks to Jack Langman for specifically pointing to this bug. We would instead use the size in bytes based on actual data type. Test Plan: Added unit tests BatchMatMulMemCostTest: buck test //caffe2/caffe2/fb/fbgemm:batch_matmul_op_test -- BatchMatMulMemCostTest Extended existing unit test test_columnwise_concat for different data types: buck test //caffe2/caffe2/python/operator_test:concat_op_cost_test -- test_columnwise_concat Reviewed By: CrazySherman Differential Revision: D30656698 fbshipit-source-id: d42c0c9a0c5b0ddc5dba39e4994f1f85a5e618bf	2021-09-01 13:35:41 -07:00
Alban Desmaison	c3464e78a4	Revert D30561459: Fix bytes_written and bytes_read Test Plan: revert-hammer Differential Revision: D30561459 (`e98173ff34`) Original commit changeset: 976fa5167097 fbshipit-source-id: 43f4c234ca400820fe6db5b4f37a25e14dc4b0dd	2021-08-30 14:59:54 -07:00
Tanvir Zaman	e98173ff34	Fix bytes_written and bytes_read (#64040 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64040 In operator cost inference functions, in many places we are using sizeof(x.data_type()). Since data_type() returns a 32 bit integer from [this enum](https://www.internalfb.com/code/fbsource/[15e7ffe4073cf08c61077c7c24a4839504b964a2]/fbcode/caffe2/caffe2/proto/caffe2.proto?lines=20), we are basically always getting 4 for sizeof(x.data_type()) no matter what actual data type x has. Big thanks to Jack Langman for specifically pointing to this bug. We would instead use the size in bytes based on actual data type. Test Plan: Added unit tests BatchMatMulMemCostTest: buck test //caffe2/caffe2/fb/fbgemm:batch_matmul_op_test -- BatchMatMulMemCostTest Extended existing unit test test_columnwise_concat for different data types: buck test //caffe2/caffe2/python/operator_test:concat_op_cost_test -- test_columnwise_concat Differential Revision: D30561459 fbshipit-source-id: 976fa5167097a35af548498480001aafd7851d93	2021-08-30 12:57:31 -07:00
Tanvir Zaman	cc6b023cba	Add CostInferenceFunction for SplitOp (#63133 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63133 SplitOp is costly but missing cost inference function which hurts cost based balancing. Changes are: (1) Addition of CostInferenceFunction for SplitOp (2) Small fix in CostInferenceFunction for ConcatOp Test Plan: Added unit tests: buck test //caffe2/caffe2/python/operator_test:split_op_cost_test buck test //caffe2/caffe2/python/operator_test:concat_op_cost_test Reviewed By: smacke Differential Revision: D30247360 fbshipit-source-id: 989e962f3a981acc85b73aac3fb23e603b7d1591	2021-08-13 12:28:15 -07:00
Stephen Macke	174433267c	[dte] fastpath implementation for broadcast utility function (4/x) (#62493 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62493 This diff adds a broadcast fastpath for the caffe2 broadcast utility function, which just copies the contents of a smaller tensor into a larger one. We also update the tests to exercise the new functionality. Test Plan: unit tests + let CI run Differential Revision: D29938285 fbshipit-source-id: 543ecc548500380e307be91902696033454964a2	2021-07-30 16:15:10 -07:00
Stephen Macke	956c22b1f9	[dte] fastpath implementations for mulgrad / divgrad (3/x) (#62437 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62437 In this diff we add a broadcast fastpath for MulGradient and DivGradient ops, whose tests we update to exercise the new functionality. Test Plan: Added test cases to elementwise ops (which will exercise the new MulGradient / DivGradient broadcast fastpath functionality) that will be run by CI. It's worth noting there's still no code (outside of the new test cases) that takes the new code paths added -- the user must explicitly request allow_broadcast_fastpath=True, and nothing outside of the added tests currently does so. Differential Revision: D29938273 fbshipit-source-id: 281c1a109e38c25b9bf9ff8d832de60ac3c231a9	2021-07-30 00:05:34 -07:00
Stephen Macke	eef85f89b9	[dte] broadcast fastpath implementations for reduce utility functions (2/x) (#62428 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62428 In this diff we add a broadcast fastpath for reduce utility functions. These functions are used by various elementwise ops, whose tests we update to exercise the new functionality. Test Plan: Added test cases to elementwise ops (which will exercise the new reducer functionality) that will be run by CI. It's worth noting there's still no code (outside of the new test cases) that takes the new code paths added -- the user must explicitly request `allow_broadcast_fastpath=True`, and nothing outside of the added tests currently does so. Differential Revision: D29938264 fbshipit-source-id: 5d5542bd93afb85fd9f7a4073f766adc07eb3b65	2021-07-29 17:27:39 -07:00
Jamie King	1dfb687f3c	Fixed off-by-one bug in Adam Smart Decay (#62135 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62135 The initial implementation of Adam with Smart Decay had an off-by-one error. This was in the summation of the geometric series used to calculate how much built-up momentum would have been discharged in skipped minibatches. The unit tests should have caught these, but the testing strategy missed this because k, the "number of skipped minibatches" was always either 0 or so high that the impact of the bug was too small. The impact of the bug was proportional to 1/k. The testing strategy has also been adjusted to cover this bug. Differential Revision: D29889309 fbshipit-source-id: b086c0efed5c27f621061e726533c73658daffc6	2021-07-26 11:55:38 -07:00
Kaige Liu	094abf5fd0	[BE] Include a unit test for Save Operator with db_options Summary: A test case that triggers db_options with the save operator is missing. Test Plan: buck test Differential Revision: D29642719 fbshipit-source-id: 72b7374d40430398abac26dfe91538550525384d	2021-07-19 12:22:59 -07:00
Jamie King	c23db9327a	Smart Decay for Adam - Caffe2 (#61548 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61548 We want to decay learning parameters properly. Previously this was not done when a parameter is absent from a minibatch. We fix this by keeping track of missed minibatches and making decay catch up accordingly. The exponential moving averages (EMA) for the first and second moments used in Adam are updated only for parameters seen in a minibatch. Actually, for these parameters, 0 should be added to the EMAs and the EMAs should then be decayed by multiplying by beta1 and beta2 respectively. To avoid the computational overhead of touching every parameter for every minibatch, we: * keep track of the last time a parameter is seen * instead of decaying the EMAs by multiplying by beta1 and beta2, we multiply by beta1^k and beta2^k, where k is the number of minibatches since the parameter was last seen * we calculate the amount of momentum that would have been discharged over the missed minibatches and update the weight accordingly. Differential Revision: D29654246 fbshipit-source-id: 7a6cd7966eb1f31116d99dfce79a78b2d3ee9e3e	2021-07-14 10:22:38 -07:00
Kaige Liu	58adaaba60	Enable C2 load rate limiter [2/n] (#61551 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61551 We aim to enable rate limiter in C2 load, with a fix bandwidth limit. This diff update LoadOp to pass down the manifold db options. Test Plan: ``` buck test mode/opt caffe2/caffe2/python/operator_test:load_save_test ``` Differential Revision: D29639102 fbshipit-source-id: cf69549adadf4c7f12a8a2b7f3ca39092cab4b99	2021-07-14 08:27:05 -07:00
Nikita Shulga	f291b1899f	Revert D27978269: Smart Decay for Adam - Caffe2 Test Plan: revert-hammer Differential Revision: D27978269 (`aaa1e07609`) Original commit changeset: e47524101ddf fbshipit-source-id: 334824bbf9a6ed788e75af9c292754081f70a19b	2021-07-10 13:09:58 -07:00
Jamie King	aaa1e07609	Smart Decay for Adam - Caffe2 (#61488 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61488 We want to decay learning parameters properly. Previously this was not done when a parameter is absent from a minibatch. We fix this by keeping track of missed minibatches and making decay catch up accordingly. The exponential moving averages (EMA) for the first and second moments used in Adam are updated only for parameters seen in a minibatch. Actually, for these parameters, 0 should be added to the EMAs and the EMAs should then be decayed by multiplying by beta1 and beta2 respectively. To avoid the computational overhead of touching every parameter for every minibatch, we: * keep track of the last time a parameter is seen * instead of decaying the EMAs by multiplying by beta1 and beta2, we multiply by beta1^k and beta2^k, where k is the number of minibatches since the parameter was last seen. Differential Revision: D27978269 fbshipit-source-id: e47524101ddfcb281c46c505b9b7a8f0835bc64a	2021-07-09 18:28:21 -07:00
Feng Shi	b4a4a8434d	[1/n]support double for Caffe2 ScatterWeightedSum (#60402 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60402 Add float64 data type support for ScatterWeightedSum for cases that 10^7 precision is not sufficient. Test Plan: buck test caffe2/caffe2/python/operator_test:sparse_ops_test -- testScatterWeightedSum Reviewed By: jianyuh Differential Revision: D29190324 fbshipit-source-id: 871a60744694e901a2c7685a67350860745d6729	2021-06-29 14:17:04 -07:00
Adam Simpkins	fadaa52f64	[caffe2] add an EstimateAllBlobSizes operator (#59775 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59775 This operator is similar to `GetAllBlobNames` but also returns the estimated size required to serialize each node. One goal of this operator is to allow checkpoint saving logic to estimate the amount of space/bandwidth required to save a checkpoint when first starting training, without actually serializing any blobs yet. Currently the checkpointing logic uses `GetAllBlobNames` to determine the blobs to checkpoint. It can instead be updated to use `EstimateAllBlobSizes` to also get an estimate for how much space will be required for the checkpoint. ghstack-source-id: 132275153 Test Plan: Included a new unit test. Reviewed By: mraway Differential Revision: D29020227 fbshipit-source-id: 811e5d86c4b59183e84e6424c48c97739be09043	2021-06-24 16:55:22 -07:00
Baichuan Yuan	dca97b4394	Weighted decay with frequency (count-based) (#60382 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60382 Instead of setting weight_decay w uniformly for all ids, for each row i in the sparse embedding table, the actual weight_decay `w_i` becomes `wfreq_i` where `freq_i = halflife/counter_i \in [\log(2), halflife]`. Counter is from `rowwise_counter` with definition `counter_i = 1 + \exp(-iter_{\delta}\rho)*counter_i`. Test Plan: buck test //caffe2/caffe2/python/operator_test:adagrad_test -- test_row_wise_sparse_adagrad buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_weight_decay Reviewed By: 0x10cxR1 Differential Revision: D25581030 fbshipit-source-id: 54b3831b20516c76c559b13d8deb809e2ee3b446	2021-06-21 18:46:35 -07:00
Stephen Macke	769c299dcf	[caffe2] add tests for inplace elementwise ops (#60106 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60106 In Caffe2, some elementwise in-place compatible ops lack coverage for the in-place case. We add tests for a subset of them here and thereby increase coverage. Test Plan: ``` buck test //caffe2/caffe2/python/operator_test:elementwise_ops_test ``` Let CI run. Reviewed By: clrfb Differential Revision: D29143189 fbshipit-source-id: 83138ad8eff8fe95c40aece53714da3577396a23	2021-06-21 12:04:18 -07:00
Stephen Macke	e50f264b51	[caffe2] make MulGradient implementation in-place compatible (#60035 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60035 In Caffe2, the operator schema for the MulGradient op indicates that MulGradient may be performed in-place, overwriting one of its inputs as the output. The implementation is not safe to perform in-place however, due to an accidentally-introduced write-read dependency on the overwriten input in the in-place case. We fix it here. Test Plan: ``` buck test //caffe2/caffe2/python/operator_test:elementwise_ops_test ``` Note that the newly added test fails without this change, but passes with this change: ``` ✓ ListingSuccess: caffe2/caffe2/python/operator_test:elementwise_ops_test - main (24.992) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_exp (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_log1p (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_abs (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_bitwise_and (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_reciprocal (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_sqr (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_rsqrt (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_mul (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_sqrt (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_add (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_swish_gradient_inplace (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_sigmoid (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_bitwise_or (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_cbrt_grad (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_not (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_sub (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_div (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_eq (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_softsign (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_eq_bcast (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_powt (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) *********************************************************************************************************************************************************************************** *******************************<NEW_TEST_YAY>******************************************************************************************************************************** ********************************************************************************************************************************************************************************* ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_mul_gradient_inplace (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ********************************************************************************************************************************************************************************* *******************************</NEW_TEST_YAY>******************************************************************************************************************************* *********************************************************************************************************************************************************************************** ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_hard_sigmoid (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_bitwise_xor (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_log (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_cube (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_swish (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_cbrt (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - test_div_legacy_grad (caffe2.caffe2.python.operator_test.elementwise_ops_test.TestElementwiseOps) (125.898) ✓ Pass: caffe2/caffe2/python/operator_test:elementwise_ops_test - main (125.898) Summary Pass: 30 ListingSuccess: 1 ``` Reviewed By: clrfb Differential Revision: D29034265 fbshipit-source-id: 98550e1d5976398e45d37ff2120591af1439c42a	2021-06-15 20:26:04 -07:00
Wei Wen	3b0c6a7b50	fix AddPadding tensor shape inference (#59572 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59572 fix AddPadding tensor shape inference Test Plan: sandcastle Reviewed By: dehuacheng Differential Revision: D28686983 fbshipit-source-id: 03f70335fcfd94a1241562f8fbf12043a0deac2b	2021-06-08 11:02:33 -07:00
Jeongmin Lee	bca25d97ad	[itemwise-dropout][1/x][low-level module] Implement Itemwise Sparse Feature Dropout in Dper3 (#59322 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59322 Implement sparse feature dropout (with replacement) that can drop out individual items in each sparse feature. For example, the existing sparse feature dropout with replacement drops out whole feature (e.g., a list of page ids) when the feature is selected for drop out. This itemwise dropout assigns probability and drops out to individual items in sparse features. Test Plan: ``` buck test mode/dev caffe2/torch/fb/sparsenn:test ``` https://www.internalfb.com/intern/testinfra/testrun/281475166777899/ ``` buck test mode/dev //dper3/dper3/modules/tests:sparse_itemwise_dropout_with_replacement_test ``` https://www.internalfb.com/intern/testinfra/testrun/6473924504443423 ``` buck test mode/opt caffe2/caffe2/python:layers_test ``` https://www.internalfb.com/intern/testinfra/testrun/2533274848456607 ``` buck test mode/opt caffe2/caffe2/python/operator_test:sparse_itemwise_dropout_with_replacement_op_test ``` https://www.internalfb.com/intern/testinfra/testrun/8725724318782701 Reviewed By: Wakeupbuddy Differential Revision: D27867213 fbshipit-source-id: 8e173c7b3294abbc8bf8a3b04f723cb170446b96	2021-06-04 19:59:17 -07:00
Janet Yang	c06d2afa99	[caffe2] Add support for int32 lengths in BatchSparseToDense (#58062 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58062 Make templated function to make sure BatchSparseToDense supports int32 lengths/indices Test Plan: ```buck test //caffe2/caffe2/python/operator_test:batch_sparse_to_dense_op_test ``` Reviewed By: khabinov Differential Revision: D28271423 fbshipit-source-id: 41b88b7a3663616b533aaf4731ff35cdf6ec4c85	2021-05-26 10:33:32 -07:00
Natalia Gimelshein	db5e5781ad	replace all remaining occurrences of deadline=1000, to prevent test flakiness Summary: Per title Test Plan: Fixes existing tests Reviewed By: robieta Differential Revision: D28690296 fbshipit-source-id: d7b5b5065517373b75d501872814c89b24ec8cfc	2021-05-25 15:55:30 -07:00
Natalia Gimelshein	45aa54d83c	relax test deadlines Summary: Relax test deadlines for c2 tests. We run on loaded machines, and timings are unreliable. Test Plan: Fixes existing tests Reviewed By: mruberry Differential Revision: D28690006 fbshipit-source-id: 457707e81a1ec92548c1f23ea7a0022fa0a3bfda	2021-05-25 15:02:52 -07:00
Natalia Gimelshein	056287aec4	turn off deadline for adagrad test Summary: Tests are frequently failing with "exceeded the deadline of 1000.00ms", we expect this to happen, so remove the deadline Test Plan: N/A: Fix breakages Reviewed By: robieta Differential Revision: D28581051 fbshipit-source-id: 4825ada9af151fa5d57c45c549138c15ba613705	2021-05-20 13:47:02 -07:00
Taylor Robie	6989eb60e5	Remove timeouts for C2 tests Summary: When run on very heavily loaded machines, some of these tests are timing out. It's not an issue with the test, it's an issue with the environment. I've removed the timeout so we at least keep unit test coverage. Test Plan: N/A: Fix breakages Reviewed By: ngimel Differential Revision: D28492334 fbshipit-source-id: aed3ee371763161aab2d356f5623c7df053fda6f	2021-05-17 16:39:30 -07:00
Valentin Andrei	da06ae73a3	[c2] Fix flaky test_spatial_bn_multi_batch_grad Summary: Removed the deadline restriction since the first run can take more than the deadline, wile subsequent runs are shorter. Reviewed By: ngimel Differential Revision: D28260077 fbshipit-source-id: 8ed2f5c16bc184bf4fae0a59b662fa1da2d4dd0a	2021-05-06 12:50:53 -07:00
Yukio Siraichi	93bf0ae6fc	Remove legacy constructor calls from pytorch codebase. (#54142 ) Summary: Follow up from https://github.com/pytorch/pytorch/issues/53889 Related to https://github.com/pytorch/pytorch/issues/47112 Removing every occurrence of the legacy constructor call present in PyTorch at: - _docs_ - _benchmarks_ - _test_ - _caffe2_ - _CONTRIBUTING.md_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/54142 Reviewed By: ngimel Differential Revision: D27699450 Pulled By: mruberry fbshipit-source-id: 530aa3f5746cc8bc1407d5d51b2bbd8075e30546	2021-04-11 15:45:17 -07:00
Oleg Khabinov	6145ac07b5	[caffe2] Reintroduce Log1p operator (#55073 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55073 Original diff D27422219 (`d92e2520de`) was reverted, reintroducing this op again. Reviewed By: ChunliF Differential Revision: D27473735 fbshipit-source-id: 1af0281724e9ada699ebf2045d51f65083daf5b4	2021-03-31 22:29:23 -07:00
Alexander Golynski	25e07c6e91	Revert D27422219: [caffe2] Support Log1p operator Test Plan: revert-hammer Differential Revision: D27422219 (`d92e2520de`) Original commit changeset: f9eba82bf09c fbshipit-source-id: 7cd5b778ae5f296187f57b6efaa782de97a6f013	2021-03-31 06:03:45 -07:00
Oleg Khabinov	d92e2520de	[caffe2] Support Log1p operator (#54968 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54968 Support Log1p operator to add feature parity with PyTorch. NumPy doc https://numpy.org/doc/stable/reference/generated/numpy.log1p.html PyTorch doc https://pytorch.org/docs/stable/generated/torch.log1p.html Test Plan: ``` $ buck test mode/dev-nosan //caffe2/caffe2/python/operator_test:log1p_op_test ``` Differential Revision: D27422219 fbshipit-source-id: f9eba82bf09c1c440f11a33f8ae2bf8084609457	2021-03-30 16:38:37 -07:00
Sam Estep	5bcbbf5373	Lint trailing newlines (#54737 ) Summary: Context: https://github.com/pytorch/pytorch/issues/53406 added a lint for trailing whitespace at the ends of lines. However, in order to pass FB-internal lints, that PR also had to normalize the trailing newlines in four of the files it touched. This PR adds an OSS lint to normalize trailing newlines. The changes to the following files (made in 54847d0adb9be71be4979cead3d9d4c02160e4cd) are the only manually-written parts of this PR: - `.github/workflows/lint.yml` - `mypy-strict.ini` - `tools/README.md` - `tools/test/test_trailing_newlines.py` - `tools/trailing_newlines.py` I would have liked to make this just a shell one-liner like the other three similar lints, but nothing I could find quite fit the bill. Specifically, all the answers I tried from the following Stack Overflow questions were far too slow (at least a minute and a half to run on this entire repository): - [How to detect file ends in newline?](https://stackoverflow.com/q/38746) - [How do I find files that do not end with a newline/linefeed?](https://stackoverflow.com/q/4631068) - [How to list all files in the Git index without newline at end of file](https://stackoverflow.com/q/27624800) - [Linux - check if there is an empty line at the end of a file [duplicate]](https://stackoverflow.com/q/34943632) - [git ensure newline at end of each file](https://stackoverflow.com/q/57770972) To avoid giving false positives during the few days after this PR is merged, we should probably only merge it after https://github.com/pytorch/pytorch/issues/54967. Pull Request resolved: https://github.com/pytorch/pytorch/pull/54737 Test Plan: Running the shell script from the "Ensure correct trailing newlines" step in the `quick-checks` job of `.github/workflows/lint.yml` should print no output and exit in a fraction of a second with a status of 0. That was not the case prior to this PR, as shown by this failing GHA workflow run on an earlier draft of this PR: - https://github.com/pytorch/pytorch/runs/2197446987?check_suite_focus=true In contrast, this run (after correcting the trailing newlines in this PR) succeeded: - https://github.com/pytorch/pytorch/pull/54737/checks?check_run_id=2197553241 To unit-test `tools/trailing_newlines.py` itself (this is run as part of our "Test tools" GitHub Actions workflow): ``` python tools/test/test_trailing_newlines.py ``` Reviewed By: malfet Differential Revision: D27409736 Pulled By: samestep fbshipit-source-id: 46f565227046b39f68349bbd5633105b2d2e9b19	2021-03-30 13:09:52 -07:00
Lanlan Liu	695eef05a4	optimizer exploration - v1 and v2 + fix position_weighted optimizer + decoupled weight decay (#54042 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54042 Pull Request resolved: https://github.com/pytorch/pytorch/pull/53881 1. Fix position_weighted optimizer: Position weighted layer uses default optimizer but is actually gradient_slice, which will cause problem if we do not handle it properly in the new optimizier. The solution is to use sparseadagrad when it is gradient_slices. 2. Optimizer implementation of v1 and v2: using 1st momentum with/without bias_correction. 3. also implemented decoupled weight decay in the new optimizer. Test Plan: buck test //caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_2 -- test_mlp_optimization buck test //caffe2/caffe2/python:optimizer_test -- TestDecayAdagrad buck test //caffe2/caffe2/python/operator_test:decay_adagrad_test ctr_mbl_feed work flow: f255731660 oc work flow: f255739503 Reviewed By: 0x10cxR1 Differential Revision: D26839668 fbshipit-source-id: 2b6881c1a88540ef5766be40f5e80001257e2199	2021-03-27 23:03:29 -07:00

1 2 3 4 5 ...

1004 Commits