Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50393
Exponential Moving Average
Usage:
Add `ema_options` to the adagrad optimizer. For details, please refer to the test workflow setting.
If `ema_end == -1`, EMA never ends.
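The update these options gate can be sketched as follows; the names `alpha`, `ema_start`, and `ema_end` mirror the options described above, but the function itself is an illustrative sketch, not the actual Caffe2 kernel.

```python
def ema_update(ema_param, param, alpha, step, ema_start=0, ema_end=-1):
    """One EMA step over a parameter vector (illustrative names/signature).

    Outside the [ema_start, ema_end] window the shadow copy is left
    unchanged; ema_end == -1 means the window never closes.
    """
    if step < ema_start or (ema_end != -1 and step > ema_end):
        return ema_param
    # blend the shadow copy toward the live parameter
    return [alpha * e + (1.0 - alpha) * p for e, p in zip(ema_param, param)]
```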
Test Plan:
buck test caffe2/caffe2/fb/optimizers:ema_op_optimizer_test
buck test caffe2/caffe2/fb/optimizers:ema_op_test
f240459719
Differential Revision: D25416056
fbshipit-source-id: a25e676a364969e3be2bc47750011c812fc3a62f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49486
Remove code for Python 3.5 and lower.
There's more that can be removed/modernised, but sticking mainly to redundant version checks here, to keep the diff/PR smaller.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46579
Reviewed By: zou3519
Differential Revision: D24453571
Pulled By: ezyang
fbshipit-source-id: c2cfcf05d6c5f65df64d89c331692c9aec09248e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49591
A bunch of these tests are marked flaky, and have been since time immemorial. (Read: as far back as Buck will build.) However, closer inspection reveals that they fail if and only if they run on a GPU worker. What seems to be going on is that there are more jobs than GPUs, so the contention causes waits which register as timeouts on the tests.
This diff is kind of hacky, but it basically just drops deadlines if a GPU is present. Because Caffe2 is going away I'm not too terribly concerned about a beautiful solution, but we may as well keep some test coverage if it's easy.
CC Sebastian, Ilia, Min, and Hongzheng who also have tasks for what seems to be the same flakiness.
Test Plan: Turn the tests back on and see if they fall over. (The failure repros reliably on an OnDemand GPU and is fixed by this change, so it's not really just a hail Mary.)
Reviewed By: ngimel
Differential Revision: D25632981
fbshipit-source-id: 43dcce416fea916ba91f891e9e5b59b2c11cca1a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49402
In the case of NCCLAllReduce operations there can be non-trivial overhead for launching cooperative kernels (especially with async execution of different parts of the model). This diff revives this operator to make it possible to fuse multiple operations into a single kernel.
Test Plan:
Unit-test.
Used in a later diff.
Reviewed By: xianjiec
Differential Revision: D25531206
fbshipit-source-id: 64b1c161233a726f9e2868f1059316e42a8ea1fc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49322
In some cases async execution might lose dependencies (alias-like ops) or produce suboptimal scheduling when there is a choice of which parts to schedule first. An example of the latter behavior can happen in ModelParallel training, where a copy can get lower priority compared to the rest of the execution on the given GPU, which causes other GPUs to starve.
This operator makes it possible to address these issues by introducing extra explicit dependencies between ops.
Test Plan:
Unit-test.
E2E testing in future diffs.
Reviewed By: xianjiec
Differential Revision: D24933471
fbshipit-source-id: 1668994c7856d73926cde022378a99e1e8db3567
Summary:
+ Add ArgMin support to the Caffe2-to-PyTorch converter
+ Use hypothesis to parameterize different test conditions
Test Plan: buck test //caffe2/torch/fb/model_transform/c2_convert:c2_pt_converter_test
Reviewed By: houseroad
Differential Revision: D25016203
fbshipit-source-id: 94489fcf1ed3183ec96f9796a5b4fb348fbde5bc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48240
Adds support for converting the SparseLengthsSum4BitRowwiseSparse operator from Caffe2 to PyTorch as part of c2_pt_converter.
Test Plan:
Added a unit test
buck test //caffe2/torch/fb/model_transform/c2_convert:c2_pt_converter_test
Tests passed:
https://our.intern.facebook.com/intern/testinfra/testrun/2251799856412296
Reviewed By: houseroad
Differential Revision: D25067833
fbshipit-source-id: 45cbc331ca35bee27e083714e65a1e87a2a2d2e0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48340
This changes the context managed classes from using a decorator to define them to using inheritance. Inheritance allows the python static type checking to work correctly.
```
@context.define_context()
class Bar(object): ...

@context.define_context(allow_default=True)
class Foo(object): ...
```
becomes
```
class Bar(context.Managed): ...
class Foo(context.DefaultManaged): ...
```
Behavior differences:
* arg_name has been removed since it's not used anywhere
* classes need to call `super()` in `__enter__/__exit__` methods if they override (none do)
This also defines a context.pyi file to add types for python3. python2 support should not be affected
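The inheritance-based pattern can be sketched roughly as below. This is an illustrative simplification (per-class list instead of thread-local stacks, simplified names), not the actual `caffe2.python.context` implementation; note how a subclass that overrides `__enter__`/`__exit__` would need to call `super()`, as mentioned above.

```python
class Managed(object):
    """Sketch of inheritance-based context management (illustrative)."""

    @classmethod
    def _stack(cls):
        # one stack per concrete class
        if '_ctx_stack' not in cls.__dict__:
            cls._ctx_stack = []
        return cls._ctx_stack

    @classmethod
    def current(cls):
        stack = cls._stack()
        assert stack, 'no active %s context' % cls.__name__
        return stack[-1]

    def __enter__(self):
        type(self)._stack().append(self)
        return self

    def __exit__(self, *exc):
        type(self)._stack().pop()


class DefaultManaged(Managed):
    @classmethod
    def current(cls):
        stack = cls._stack()
        # allow_default behavior: fall back to a fresh default instance
        return stack[-1] if stack else cls()


class Bar(Managed):
    pass

with Bar() as b:
    assert Bar.current() is b
```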
Test Plan:
ci
buck test //caffe2/caffe2/python:context_test //caffe2/caffe2/python:checkpoint_test
Reviewed By: dongyuzheng
Differential Revision: D25133469
fbshipit-source-id: 16368bf723eeb6ce3308d6827f5ac5e955b4e29a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48407
T79817692: Fused8BitRowwiseQuantizedToFloat operator support for c2_pt_converter.
Also refactored some repeated code from the existing test functions. (Initial commit only has refactoring.)
Test Plan: buck test //caffe2/torch/fb/model_transform/c2_convert:c2_pt_converter_test
Reviewed By: bugra
Differential Revision: D25069936
fbshipit-source-id: 72f6a845a1b4639b9542c6b230c8cd74b06bc5a0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48404
On bento this prints a lot of messages like (see N408483 if you're an internal user)
```
W1123 120952.322 schema.py:811] Scalar should be considered immutable. Only call Scalar.set() on newly created Scalar with unsafe=True. This will become an error soon.
```
And it ignores the log level I set at the global level. Removing this line unless it's super important.
Test Plan: build a local dper package and verify
Differential Revision: D25163808
fbshipit-source-id: 338d01c82b4e67269328bbeafc088987c4cbac75
Summary: is_external_input doesn't check if the lookup tables are valid. Calling .Proto() should invalidate all lookup tables and have them rebuilt on call to any methods depending on them. This adds this check to is_external_input.
Test Plan: internal unit tests
Reviewed By: dzhulgakov, esqu1
Differential Revision: D25100464
fbshipit-source-id: d792dec7e5aa9ffeafda88350e05cb757f4c4831
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47023
DeviceType pretty clearly only needs 1 byte. DeviceIndex only needs 1 byte given that machines don't have anywhere near 255 GPUs in them as far as I know.
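A quick illustrative check of the size assumption, using Python's `struct` module to show that a (DeviceType, DeviceIndex) pair fits in two signed bytes; the helper name is hypothetical.

```python
import struct

def pack_device(device_type, device_index):
    # 'b' is a signed byte: struct.pack raises struct.error if either
    # value falls outside [-128, 127], mirroring the 1-byte assumption
    return struct.pack('bb', device_type, device_index)

packed = pack_device(1, 0)  # e.g. device type 1, index 0 (illustrative values)
assert len(packed) == 2
```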
ghstack-source-id: 116901430
Test Plan: Existing tests, added assertion to catch if my assumption about DeviceIndex is incorrect
Reviewed By: dzhulgakov
Differential Revision: D24605460
fbshipit-source-id: 7c9a89027fcf8eebd623b7cdbf6302162c981cd2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47768
This stores the next ID for a given NextName(prefix, output_id) so repeated calls to NextName are significantly faster; NextName accounts for ~65% of the time spent for large models.
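The caching idea can be sketched like this; the class and method names are illustrative, not the actual `core.Net` API.

```python
class NameCache(object):
    """Sketch: remember the next numeric suffix per (prefix, output_id)
    so repeated NextName-style calls avoid rescanning existing names."""

    def __init__(self):
        self._next_id = {}

    def next_name(self, prefix, output_id=0):
        key = (prefix, output_id)
        idx = self._next_id.get(key, 0)
        self._next_id[key] = idx + 1
        # first use keeps the bare prefix, later uses get _1, _2, ...
        return '%s_%d' % (prefix, idx) if idx else prefix
```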
Test Plan:
buck test //caffe2/caffe2/python/...
will launch canary job before landing to ensure no regressions + confirm speedup
Reviewed By: dzhulgakov
Differential Revision: D24876961
fbshipit-source-id: 668d73060d800513bc72d7cd405a47d15c4acc34
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48021
Extends the operator schema check from the simple memonger to the dag memonger as well. As part of this, a fix is made to handle in-place ops (ops with at least one output name matching an input blob). Earlier, all output blobs from ops were treated as shareable, but this failed the assertion that external input blobs with the same name are not allowed to share.
Test Plan: Added corresponding unit tests
Reviewed By: hlu1
Differential Revision: D24968862
fbshipit-source-id: b6679a388a82b0d68f65ade64b85560354aaa3ef
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47718
Distributed Inference splits a predict net into multiple parts, part0 being the main part which contains ops to make remote calls to other parts. part0 predict net may contain AsyncIf ops to optimize rpc call usage. AsyncIf ops have internal nets which may refer to memongered blobs. This change handles AsyncIf ops to update internal nets to refer to memongered blobs.
As part of this change, I am also updating the dag memonger traversal to always start from root ops, i.e. ops with 0 in-degree. The earlier logic started traversing ops based on input head blobs, and if one of the head inputs is used in a non-root op that gets visited before its parent, the traversal throws an assertion error here: https://fburl.com/diffusion/ob110s9z . For almost all distributed inference part0 nets, this assertion error was thrown.
Test Plan: Added corresponding tests in memonger_test.py . Could not find unit tests in c++ version of memonger.
Reviewed By: hlu1
Differential Revision: D24872010
fbshipit-source-id: 1dc99b2fb52b2bc692fa4fc0aff6b7e4c5e4f5b0
Summary: Added the MatMul operator for caffe2
Test Plan: buck test //caffe2/torch/fb/model_transform/c2_convert:c2_pt_converter_test
Reviewed By: bugra
Differential Revision: D24920937
fbshipit-source-id: 7ba09ba0439cb9bd15d6a41fd8ff1a86d8d11437
Summary: To support min/max/mean/std, SummarizeOp need to skip size checking (similar to the LpNorm error mentioned above) and accept multiple types
Test Plan:
unit test:
`buck test //caffe2/caffe2/fb/tensorboard/tests:tensorboard_accumulate_histogram_op_test`
https://our.intern.facebook.com/intern/testinfra/testrun/1407375057859572
`buck test //caffe2/caffe2/fb/tensorboard/tests:tensorboard_accumulate_histogram_op_test --stress-runs 1000`
https://our.intern.facebook.com/intern/testinfra/testrun/2533274832166362
Reviewed By: cryptopic
Differential Revision: D24605507
fbshipit-source-id: fa08372d7c9970083c38abd432d4c86e84fb10e0
Summary:
Distributed Inference splits a predict net into multiple parts, part0 being the main part which contains ops to make remote calls to other parts. part0 predict net may contain AsyncIf ops to optimize rpc call usage. AsyncIf ops have internal nets which may refer to memongered blobs. This change handles AsyncIf ops to update internal nets to refer to memongered blobs. Here is one reference part0 predict net with AsyncIf ops: https://www.internalfb.com/intern/paste/P145812115/
As part of this change, I am also updating the dag memonger traversal to always start from root ops, i.e. ops with 0 in-degree. The earlier logic started traversing ops based on input head blobs, and if one of the head inputs is used in a non-root op that gets visited before its parent, the traversal throws an assertion error here: https://fburl.com/diffusion/ob110s9z . For almost all distributed inference part0 nets, this assertion error was thrown.
Reviewed By: hlu1
Differential Revision: D24346771
fbshipit-source-id: ad2dd2e63f3e822ad172682f6d63f8474492255d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47541
The profiler has guided us to `schema.py`. Since these `Field`s are used everywhere and in huge quantities, we can easily make some system-wide optimizations by adding `__slots__`.
From StackOverflow, benefits include:
* faster attribute access.
* space savings in memory.
Read more: https://stackoverflow.com/a/28059785/
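A minimal sketch of the change; the attribute names are loosely borrowed from `schema.py` for illustration only.

```python
class FieldWithSlots(object):
    # Fixed attribute set: instances carry no per-instance __dict__,
    # which saves memory and speeds up attribute access.
    __slots__ = ('parent', '_field_offsets')

    def __init__(self, parent, field_offsets):
        self.parent = parent
        self._field_offsets = field_offsets
```

A side effect worth knowing: assigning an attribute outside `__slots__` raises `AttributeError`, which also catches typos that a plain class would silently absorb.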
Reviewed By: dzhulgakov
Differential Revision: D24771078
fbshipit-source-id: 13f6064d367440069767131a433c820eabfe931b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47542
The previous way of doing `Field.__init__(self, [])` is just wrong. Switching to the Python 2-compatible way: `super(ObjectName, self).__init__(...)`
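A small sketch of the pattern, with hypothetical class names; the cooperative `super(...)` call replaces the hard-coded base-class call described above.

```python
class Field(object):
    def __init__(self, children):
        self.children = list(children)


class Struct(Field):
    def __init__(self, *fields):
        # Python 2-compatible cooperative initialization,
        # instead of the hard-coded Field.__init__(self, [])
        super(Struct, self).__init__(fields)
```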
Reviewed By: dzhulgakov
Differential Revision: D24771077
fbshipit-source-id: d6798c72090c0264b6c583602cae441a1b14587c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47530
`Net.AddExternalInput` should raise if there are duplicate names. The previous code would only raise if the addition of duplicates was in separate calls, but not if it was in the same call.
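The fix can be sketched as follows (an illustrative helper, not the actual `core.Net.AddExternalInput` code): tracking a `seen` set catches duplicates both across calls and within a single call.

```python
def add_external_inputs(existing, new_inputs):
    """Append new_inputs to existing, raising on any duplicate name --
    whether it duplicates a prior call or another name in this call."""
    seen = set(existing)
    for name in new_inputs:
        if name in seen:
            raise ValueError('duplicate external input: %s' % name)
        seen.add(name)
    existing.extend(new_inputs)
```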
Test Plan:
Added two new regression tests
```
✓ Pass: caffe2/caffe2/python:core_test - testSetInputRecordWithBlobs (caffe2.caffe2.python.core_test.TestExternalInputs) (9.622)
✓ Pass: caffe2/caffe2/python:core_test - testAddExternalInputShouldRaiseIfDuplicate (caffe2.caffe2.python.core_test.TestExternalInputs) (9.639)
✓ Pass: caffe2/caffe2/python:core_test - testSetInputRecordWithoutBlobs (caffe2.caffe2.python.core_test.TestExternalInputs) (9.883)
✓ Pass: caffe2/caffe2/python:core_test - testAddExternalInputShouldRaiseIfDuplicateInSameCall (caffe2.caffe2.python.core_test.TestExternalInputs) (10.153)
```
Trained two test models with no issues
f230755456
f230754926
Reviewed By: dzhulgakov
Differential Revision: D24763586
fbshipit-source-id: c87088441d76f7198f8b07508b2607aec13521ed
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47512
I deleted the last line of `__init__` -- `self._field_offsets.append(offset)` -- and the unittests didn't fail.
So this diff is to improve test coverage.
Test Plan:
```
✓ Pass: caffe2/caffe2/python:schema_test - testInitShouldSetEmptyParent (caffe2.caffe2.python.schema_test.TestField) (8.225)
✓ Pass: caffe2/caffe2/python:schema_test - testInitShouldSetFieldOffsetsIfNoChildren (caffe2.caffe2.python.schema_test.TestField) (8.339)
✓ Pass: caffe2/caffe2/python:schema_test - testInitShouldSetFieldOffsets (caffe2.caffe2.python.schema_test.TestField) (8.381)
```
Reviewed By: dzhulgakov
Differential Revision: D24767188
fbshipit-source-id: b6ce8cc96ecc61768b55360e0238f7317a2f18ea
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47475
This improves the core.Net cloning/init performance by quite a bit. It makes set_input_record run in linear time instead of quadratic by checking the external_input map instead of regenerating the external inputs each time and iterating over them.
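The idea can be sketched as below; both helpers are illustrative, contrasting the regenerate-and-scan approach with a maintained membership set.

```python
def set_input_record_slow(net_inputs, record_blobs):
    # quadratic: rescans the external input list for every blob
    for blob in record_blobs:
        if blob not in list(net_inputs):
            net_inputs.append(blob)


def set_input_record_fast(net_inputs, record_blobs):
    # linear: one membership set maintained alongside the list
    seen = set(net_inputs)
    for blob in record_blobs:
        if blob not in seen:
            seen.add(blob)
            net_inputs.append(blob)
```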
Test Plan: unit tests + canary runs
Reviewed By: dzhulgakov
Differential Revision: D24765346
fbshipit-source-id: 92d9f6dec158512bd50513b78675174686f0f411
Summary:
Add `last_n_window_collector` as C2 supports and PyTorch currently does not have this operator: https://www.internalfb.com/intern/diffusion/FBS/browsefile/master/fbcode/caffe2/caffe2/operators/last_n_window_collector.cc?lines=139
## Problem that we are solving
This operator works on multiple pieces of data and collects the last `n` elements that have been seen.
If you have the following pieces of data passed in:
```
[1, 2, 3, 4]
[5, 6, 7]
[8, 9, 10, 11]
```
over 3 passes, and the collector size is given as 6, the expected result is:
```
[6, 7, 8, 9, 10, 11]
```
What this means is that we need something like a FIFO (first-in, first-out) mechanism: as we pass data through the collector, older data is pushed out at the other end.
In this particular example, in the first pass (the data is `[1, 2, 3, 4]`), we hold `[1, 2, 3, 4]` in the queue, as our queue size is 6.
In the second pass (the data is `[5, 6, 7]`), we hold `[2, 3, 4, 5, 6, 7]` in the queue; since 1 was inserted first, it is dropped due to the size limitation of the queue.
In the third pass (the data is `[8, 9, 10, 11]`), we hold `[6, 7, 8, 9, 10, 11]` in the queue, and `2, 3, 4, 5` are dropped due to the size of the queue.
For multidimension case, when we have the following data:
```
[[1, 2], [2, 3], [3, 4], [4, 5]]
[[5, 6], [6, 7], [7, 8]]
[[8, 9], [9, 10], [10, 11], [11, 12]]
```
and our queue size is 6.
In the first pass, we will have ` [[1, 2], [2, 3], [3, 4], [4, 5]]`
In the second pass, we will have `[[2, 3], [3, 4], [4, 5], [5, 6], [6, 7], [7, 8]]`
In the third pass, we will have `[[6, 7], [7, 8], [8, 9], [9, 10], [10, 11], [11, 12]]`
### The implementation
I am using the FIFO queue from Python's collections library (`collections.deque`). It accepts a `maxlen` argument which sets the size of the queue.
I take the last n entries of the tensor through list indexing, and in this operator I am not doing a copy.
In the test plan, I have both single dimension tensors as well as multi-dimension tensors.
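The collector semantics can be sketched with `collections.deque` as below; this is a standalone illustration of the behavior described above, not the SparseNN module itself.

```python
from collections import deque

def last_n_window_collector(batches, n):
    """Keep only the last n rows seen across a sequence of batches.
    With maxlen set, old rows automatically fall off the front."""
    window = deque(maxlen=n)
    for batch in batches:
        window.extend(batch)
    return list(window)
```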
### Benchmark
I used various different configurations and added a benchmark test. The PyTorch implementation is much faster than the Caffe2 implementation:
#### CPU Benchmark
```
torch_response.median
0.00019254473969340324
caffe_response.median
0.00030233583599794657
```
#### GPU Benchmark
```
torch_response.mean
0.000081007429903838786
caffe_response.mean
0.00010279081099724863
```
Test Plan:
### For CPU:
```
buck test //caffe2/torch/fb/sparsenn:test
```
### For GPU:
- Used an on-demand machine and did the following commands:
```
jf get D24435544
buck test mode/opt //caffe2/torch/fb/sparsenn:test
```
https://www.internalfb.com/intern/testinfra/testconsole/testrun/4222124688138052/
Reviewed By: dzhulgakov, radkris-git
Differential Revision: D24435544
fbshipit-source-id: 8193b4746b20f2a4920fd4d41271341045cdcee1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46590
This operator is very similar to LengthsToRanges but doesn't pack the offsets next to the original lengths.
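Assuming the operator computes offsets as an exclusive prefix sum of lengths (the usual lengths-to-offsets semantics, with the offsets kept separate rather than packed next to the lengths), a sketch:

```python
from itertools import accumulate

def lengths_to_offsets(lengths, include_last=False):
    """Exclusive prefix sum of lengths; the flag name is illustrative."""
    offsets = [0] + list(accumulate(lengths))
    # optionally keep the trailing total as a final offset
    return offsets if include_last else offsets[:-1]
```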
Reviewed By: yf225
Differential Revision: D24419746
fbshipit-source-id: aa8b014588bb22eced324853c545f8684086c4e4
Summary: I was reading/looking into how LocalSession works and realized that the workspace type being passed around was the bound function on TaskGroup instead of the actual type. This meant that all workspaces for LocalSession would always be global, because they'd never match the private workspace type.
Test Plan: <not sure, could use some suggestions>
Reviewed By: cryptopic
Differential Revision: D24458428
fbshipit-source-id: 0f87874babe9c1ddff25b5363b443f9ca37e03c1
Summary:
We've been seeing a lot of Hypothesis timeouts, and profiling a few of the failing tests shows that one of the contributing factors is the really slow grad checker. In short, it launches the whole op for each of the input elements, so the overall complexity is at least O(numel^2).
This applies a very unscientific hack to just run grad check on the first and last few elements. It's not ideal, but it's better than flaky tests. One can still explicitly opt in with the env var.
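The hack can be sketched as an index-selection helper (names and the edge count are illustrative; the env var opt-in restores full checking):

```python
def grad_check_indices(numel, edge=3):
    """Indices actually checked under the hack: only the first and last
    `edge` elements, instead of all numel elements."""
    if numel <= 2 * edge:
        return list(range(numel))  # small inputs: still check everything
    return list(range(edge)) + list(range(numel - edge, numel))
```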
Reviewed By: malfet
Differential Revision: D23336220
fbshipit-source-id: f04d8d43c6aa1590c2f3e72fc7ccc6aa674e49d2
Summary: Similar to If operator, AsyncIf also contains nets in args. It needs the same handling.
Test Plan:
New unit test test_control_op_remap
`buck test caffe2/caffe2/python:core_test`
Also it worked end to end in prototype of dist bulk eval workflow f226680903
Reviewed By: yyetim
Differential Revision: D24451775
fbshipit-source-id: 50594e2ab9bb457329ed8da7b035f7409461b5f6
Summary:
Follow-up of https://github.com/pytorch/pytorch/issues/46461 with a similar goal
Makes them more readable and possibly faster. Care has to be taken because `map` applies the function immediately while `(x for x in xs)` is a generator expression which gets evaluated later. This is a benefit in some cases where it is not required to actually create the list of values in memory (e.g. when passing to `tuple` or `extend` or `join`)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46462
Reviewed By: zou3519
Differential Revision: D24422343
Pulled By: ezyang
fbshipit-source-id: 252e33499c92ac0b15238f2df32681dbbda2b237
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46457
Wanted to see if a CopyMatrix specialized for float that uses mkl_somatcopy would be faster, but it wasn't. Still checking in the benchmark so it can be used later.
Test Plan: .
Reviewed By: dskhudia
Differential Revision: D24345901
fbshipit-source-id: d3e68dbb560e3138fda11c55789cd41bc0715c6d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45551
The FP16 version of SparseNormalize op in Caffe2 is missing. This Diff adds FP16 support to unblock MC process of adding FP16 to Dper3.
Check https://fb.quip.com/L0T2AXGwUY3n#EReACAeifk3 .
One open question is whether the pure FP16 SparseNormalize op will affect accuracy; maybe we should do the computation in the FP32 domain.
ghstack-source-id: 114184398
Test Plan:
```
buck run mode/opt //caffe2/caffe2/python/operator_test:sparse_normalize_test
```
```
buck run mode/opt -c python.package_style=inplace mode/no-gpu //caffe2/caffe2/python/benchmarks:sparse_normalize_benchmark -- --fp16
```
Reviewed By: jspark1105
Differential Revision: D24005618
fbshipit-source-id: 8b918ec4063fdaafa444779b95206ba2b7b38537
Summary: This diff adds a string equality checking operator.
Test Plan: Unit tests
Differential Revision: D24042344
fbshipit-source-id: c8997c6130e3438f2ae95dae69f76978e2e95527
Summary: `__repr__` calling self.tasks() ends up marking the instance as "used", which doesn't seem appropriate. I was debugging a value being passed around and then ran into `Cannot add Task to an already used TaskGroup.` because the value had been logged once.
Test Plan:
Added a unit test -- didn't see a clean public method to test it, but I'm happy to add one if that makes sense.
Will wait for sandcastle to trigger everything else; I'm not at all familiar with this code so any other recommendations would be great!
Reviewed By: cryptopic
Differential Revision: D23541198
fbshipit-source-id: 5d1ec674a1ddaedf113140133b90e0da6afa7270
Summary: Currently GetSingleArgument overflows, since it expects an int instead of an int64, when using a 1cycle (hill policy) annealing schedule
Test Plan:
unittest
buck test caffe2/caffe2/python/operator_test:learning_rate_op_test
Differential Revision: D23938169
fbshipit-source-id: 20d65df800d7a0f1dd9520705af31f63ae716463
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45315
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45314
In D23858329 (721cfbf842), we put the PriorCorrectionCalibrationPrediction unit test in an OSS file, which causes test failures in public trunk.
This diff moves it to the FB-only test file.
Test Plan:
```
buck test //caffe2/caffe2/python/operator_test:torch_integration_test -- test_gather_ranges_to_dense_op
buck test //caffe2/caffe2/fb/python/operator_test:torch_integration_test -- test_prior_correct_calibration_prediction_op
```
all pass.
Reviewed By: houseroad
Differential Revision: D23899012
fbshipit-source-id: 1ed97d8702e2765991e6caf5695d4c49353dae82
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45178
## Motivation
* To be able to make C2 ops cancellable so we can safely exit.
* Some C2 operators are now blocking thus being non-cancellable. If an error
occurs we need to be able to safely stop all net execution so we can throw
the exception to the caller.
## Summary
* Adds a hypothesis test for queue ops cancellation.
Test Plan:
## Unit test added to verify that queue ops propagate errors
```
buck test caffe2/caffe2/python:hypothesis_test
buck test caffe2/caffe2/python:hypothesis_test -- test_safe_dequeue_blob__raises_exception_when_hang --stress-runs 1000
```
```
Summary
Pass: 1000
ListingSuccess: 1
```
Reviewed By: d4l3k
Differential Revision: D23847576
fbshipit-source-id: 2fc351e1ee13ea8b32d976216d2d01dfb6fcc1ad
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45231
Two operators, `PriorCorrectionCalibrationPrediction` and `GatherRangesToDense`, are not supported in PT, which prevents GLOW from working.
To unblock, we first use C2->PT conversion. In the long term, we need to implement PT custom ops.
This diff does this conversion to unblock the current project.
Test Plan:
Run unit tests; the test input is from the current DPER example.
All pass.
```buck test //caffe2/caffe2/python/operator_test:torch_integration_test -- test_prior_correct_calibration_prediction_op --print-passing-details
> c2 reference output
> [0.14285715 0.27272728 0.39130434 0.5 ]
> PT converted output
> tensor([0.1429, 0.2727, 0.3913, 0.5000])
buck test //caffe2/caffe2/python/operator_test:torch_integration_test -- test_gather_ranges_to_dense_op --print-passing-details
c2 reference output
> [array([[6, 5, 4, 3], [0, 0, 0, 0]], dtype=int64)]
> PT converted output
> [tensor([[6, 5, 4, 3], [0, 0, 0, 0]])]
```
Reviewed By: allwu, qizzzh
Differential Revision: D23858329
fbshipit-source-id: ed37118ca7f09e1cd0ad1fdec3d37f66dce60dd9
Summary:
The `2to3` tool has a `future` fixer which you can target specifically to remove these; the `caffe2` directory has the most redundant imports:
```2to3 -f future -w caffe2```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45033
Reviewed By: seemethere
Differential Revision: D23808648
Pulled By: bugra
fbshipit-source-id: 38971900f0fe43ab44a9168e57f2307580d36a38
Summary:
## Motivation
* To be able to make C2 ops cancellable so we can safely exit.
* Some C2 operators are now blocking thus being non-cancellable. If an error
occurs we need to be able to safely stop all net execution so we can throw
the exception to the caller.
* When an error occurs in a net or it got cancelled, running ops will have the
`Cancel` method called.
* This diff adds `Cancel` method to the `SafeEnqueueBlobsOp`
and `SafeDequeueBlobsOp` to have the call queue->close() to force all the
blocking ops to return.
* Adds unit test that verified the error propagation.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44495
Test Plan:
## Unit Test added to verify that queue ops propagate errors
```
buck test caffe2/caffe2/python:hypothesis_test
```
Reviewed By: dzhulgakov
Differential Revision: D23236088
Pulled By: dahsh
fbshipit-source-id: daa90d9ee32483fb51195e269a52cf5987bb0a5a
Summary:
Make `gcs_cuda_only` and `gcs_gpu_only` return empty device lists if CUDA/GPU (CUDA or ROCm) is not available
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44578
Reviewed By: walterddr
Differential Revision: D23664227
Pulled By: malfet
fbshipit-source-id: 176b5d964c0b02b8379777cd9a38698c11818690
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44540
Support fp16 as the output type for UniformFill
Reviewed By: jianyuh
Differential Revision: D23558030
fbshipit-source-id: 53a5b2c92cfe78cd11f55e6ee498e1bd682fe4a1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44089
Add support for fp16 as the input type in the SparseLengthsSum/Mean caffe2 operators
Reviewed By: xianjiec
Differential Revision: D23436877
fbshipit-source-id: 02fbef2fde17d4b0abea9ca5d17a36aa989f98a0
Summary:
Expose the interface of `nesterov` of SGD Optimizer from caffe2 to dper.
The dper sgd optimizer (https://fburl.com/diffusion/chpobg0h) already refers to the NAG sgd optimizer in caffe2 (https://fburl.com/diffusion/uat2lnan), so we just need to add the parameter `nesterov` to the dper sgd optimizer.
Analysis of run results: N345540.
- train_ne increases as momentum (m) decreases.
- for m=0.95, 0.9: eval_ne is lower with NAG than production (no NAG, m = 0.95).
- for m=0.99: eval_ne with or without NAG is higher than production. It indicates larger variance in validation and overfit in training (lower train_ne).
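For reference, the update the flag toggles can be sketched as below. This is the common NAG formulation (as used in `torch.optim.SGD`), shown for illustration rather than as caffe2's exact kernel; all names are illustrative.

```python
def sgd_momentum_step(param, grad, velocity, lr, momentum, nesterov):
    """One SGD step with classical vs. Nesterov momentum (scalar sketch)."""
    velocity = momentum * velocity + grad
    if nesterov:
        # look-ahead: apply the gradient plus the momentum-discounted velocity
        param = param - lr * (grad + momentum * velocity)
    else:
        param = param - lr * velocity
    return param, velocity
```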
Test Plan:
1. unit tests:
`buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_sgd_without_nesterov`
`buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_sgd_with_nesterov`
.
1. build dper front end package: `flow-cli canary ads.dper3.workflows.sparse_nn.train --mode opt --entitlement ads_global --run-as-secure-group team_ads_ml_ranking`. The build result (refreshed) is here https://www.internalfb.com/intern/buck/build/2a368b55-d94b-45c1-8617-2753fbce994b. Flow package version is ads_dper3.canary:856b545cc6b249c0bd328f845adeb0d2.
.
2. To build dper back end package: `flow-cli canary dper.workflows.dper3.train --mode opt --entitlement ads_global --run-as-secure-group team_ads_ml_ranking`. The build result (refreshed) is here: https://www.internalfb.com/intern/buck/build/70fa91cd-bf6e-4a08-8a4d-41e41a77fb52. Flow package version is aml.dper2.canary:84123a34be914dfe86b1ffd9925869de.
.
3. Compare prod with NAG-enabled runs:
a) refreshed prod run (m=0.95): f213877098
NAG enabled run (m=0.95): f213887113
.
b) prod run (m=0.9): f214065288
NAG enabled run (m=0.9): f214066319
.
c) prod run (m=0.99): f214065804
NAG enabled run (m=0.99): f214066725
.
d) changed the data type of nesterov to `bool` and launched a validation run
NAG enabled (m=0.95): f214500597
Reviewed By: ustctf
Differential Revision: D23152229
fbshipit-source-id: 61703ef6b4e72277f4c73171640fb8afc6d31f3c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44043
To invoke `cancel` from the net instance in Python, we expose it through pybind state.
Reviewed By: dzhulgakov
Differential Revision: D23249660
fbshipit-source-id: 45a1e9062dca811746fcf2e5e42199da8f76bb54
Summary: Exports the operator to PyTorch, to be made into a low-level module.
Test Plan:
```
buck test //caffe2/caffe2/python/operator_test:torch_integration_test -- test_learning_rate
```
Reviewed By: yf225
Differential Revision: D23545582
fbshipit-source-id: 6b6d9aa6a47b2802ccef0f87c1263c6cc2d2fdf6
Summary: Integrate aot flow with model exporter.
Test Plan:
buck test dper3/dper3_backend/delivery/tests:dper3_model_export_test
replayer test see D23407733
Reviewed By: ipiszy
Differential Revision: D23313689
fbshipit-source-id: 39ae8d578ed28ddd6510db959b65974a5ff62888
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43938
resubmit
Test Plan: unit test included
Reviewed By: mruberry
Differential Revision: D23443493
fbshipit-source-id: 7b68f8f7d1be58bee2154e9a498b5b6a09d11670
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43591
100 randomized inputs vs. 50 doesn't change the balance that much, but this speeds up test runtime
Test Plan: CI
Reviewed By: orionr, seemethere
Differential Revision: D23332393
fbshipit-source-id: 7a8ff9127ee3e045a83658a7a670a844f3862987
Summary:
Separate user embeddings and ad embeddings in blobsOrder. New order:
1. meta_net_def
2. preload_blobs
3. user_embeddings (embeddings in remote request only net)
4. ad_embeddings (embeddings in remote other net)
Add a field requestOnlyEmbeddings in meta_net_def to record user_embeddings.
This is for flash verification.
Test Plan:
buck test dper3/dper3_backend/delivery/tests:blob_reorder_test
Run a flow with canary package f211282476
Check the net: n326826, request_only_embeddings are recorded as expected
Reviewed By: ipiszy
Differential Revision: D23008305
fbshipit-source-id: 9360ba3d078f205832821005e8f151b8314f0cf2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43205
A number of tests that forward to `TestLoadSaveBase.load_save` are all marked as flaky due to them regularly taking much longer to start up than hypothesis' default timeout of 200ms. This diff fixes the problem by removing the timeout for `load_save`. This is alright as these tests aren't meant to be testing the performance of these operators.
I would set the deadline to 60s if I could, but it appears that the caffe2 GitHub CI uses a different version of hypothesis that doesn't allow using `dateutil.timedelta`, so instead of trying to figure out an approach that works on both I've just removed the deadline time.
I've also tagged all existing tasks WRT these failures.
Differential Revision: D23175752
fbshipit-source-id: 324f9ff034df1ac4874797f04f50067149a6ba48
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42927
Added fp16 fusion to net transforms.
Refactored the transforms as well as glow_transform out of opt/custom so that the OSS builds pass.
Test Plan: added net runner tests for this
Reviewed By: yinghai
Differential Revision: D23080881
fbshipit-source-id: ee6451811fedfd07c6560c178229854bca29301f
Summary:
1. Fix an illegal memory access issue for the SplitByLengths operator in the CUDA context.
2. Add support for a scaling lengths vector in the SplitByLengths operator.
3. Add tests for the SplitByLengths operator in the CUDA context.
Example for SplitByLengths operator processing scaling lengths vector:
value vector A = [1, 2, 3, 4, 5, 6]
length vector B = [1, 2]
after execution of SplitByLengths operator,
the output should be [1,2] and [3,4,5,6]
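The scaling behavior can be sketched in plain Python (a hypothetical `split_by_lengths` helper, not the real operator): when the lengths sum to a divisor of the value count, each length is scaled up by the same factor before splitting.

```python
def split_by_lengths(values, lengths):
    """Sketch of SplitByLengths with a scaling lengths vector.

    If sum(lengths) evenly divides len(values), each length is scaled by
    len(values) // sum(lengths) before splitting.
    """
    total = sum(lengths)
    assert len(values) % total == 0, "lengths must evenly divide values"
    scale = len(values) // total
    out, start = [], 0
    for n in lengths:
        out.append(values[start:start + n * scale])
        start += n * scale
    return out

# A = [1, 2, 3, 4, 5, 6], B = [1, 2]  ->  [[1, 2], [3, 4, 5, 6]]
```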
Test Plan: buck test mode/dev-nosan caffe2/caffe2/python/operator_test:concat_split_op_test
Reviewed By: kennyhorror
Differential Revision: D23079841
fbshipit-source-id: 3700e7f2ee0a5a2791850071fdc16e5b054f8400
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42763
add the fp16 fusions as net transforms:
- layernorm fused with mul+add
- swish int8
Test Plan: added unit test, ran flows
Reviewed By: yinghai
Differential Revision: D23002043
fbshipit-source-id: f0b13d51d68c240b05d2a237a7fb8273e996328b
Summary:
Enforce counter value to double type in rowwise_counter.
**Context:**
The existing implementation uses float type for the counter value, but due to the precision limit of single-precision floating point [1], we observed in earlier experiments that the counter value can't increment beyond 16777216.0 (i.e., the max value is 16777216.0). We decided to enforce double type to avoid this issue.
[1] https://stackoverflow.com/questions/12596695/why-does-a-float-variable-stop-incrementing-at-16777216-in-c
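The float32 ceiling described above is easy to demonstrate with the standard library by rounding a Python double to single precision via `struct` (a sketch; the real counter lives in a caffe2 tensor, not a Python float):

```python
import struct

def as_float32(x):
    """Round a Python float (a C double) to single precision,
    the way a float32 counter would store it."""
    return struct.unpack('f', struct.pack('f', x))[0]

# A float32 counter gets stuck at 2**24 = 16777216: 16777217 is not
# representable, so incrementing rounds back down to 16777216.
stuck = as_float32(16777216.0 + 1.0)
# A double keeps counting integers exactly up to 2**53.
fine = 16777216.0 + 1.0
```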
Test Plan:
op test
```
ruixliu@devvm1997:~/fbsource/fbcode/caffe2/caffe2/python/operator_test(f0b0b48c)$ buck test :rowwise_counter_test
Trace available for this run at /tmp/testpilot.20200728-083200.729292.log
TestPilot test runner for Facebook. See https://fburl.com/testpilot for details.
Testpilot build revision cd2638f1f47250eac058b8c36561760027d16add fbpkg f88726c8ebde4ba288e1172a348c7f46 at Mon Jul 27 18:11:43 2020 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/887/t.par
Discovering tests
Running 1 test
Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/7881299364977047
✓ caffe2/caffe2/python/operator_test:rowwise_counter_test - test_rowwise_counter (caffe2.caffe2.python.operator_test.rowwise_counter_test.TestRowWiseCounter) 0.265 1/1 (passed)
✓ caffe2/caffe2/python/operator_test:rowwise_counter_test - main 14.414 (passed)
Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/7881299364977047
Summary (total time 18.51s):
PASS: 2
FAIL: 0
SKIP: 0
FATAL: 0
TIMEOUT: 0
OMIT: 0
```
optimizer test
```
ruixliu@devvm1997:~/fbsource/fbcode/caffe2/caffe2/python(7d66fbb9)$ buck test :optimizer_test
Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/7036874434841896
Summary (total time 64.87s):
PASS: 48
FAIL: 0
SKIP: 24
caffe2/caffe2/python:optimizer_test - testGPUDense (caffe2.caffe2.python.optimizer_test.TestMomentumSgd)
caffe2/caffe2/python:optimizer_test - testGPUDense (caffe2.caffe2.python.optimizer_test.TestGFtrl)
caffe2/caffe2/python:optimizer_test - test_caffe2_cpu_vs_numpy (caffe2.caffe2.python.optimizer_test.TestYellowFin)
caffe2/caffe2/python:optimizer_test - testGPUDense (caffe2.caffe2.python.optimizer_test.TestSparseRAdam)
caffe2/caffe2/python:optimizer_test - testGPUDense (caffe2.caffe2.python.optimizer_test.TestRowWiseAdagradWithCounter)
caffe2/caffe2/python:optimizer_test - testGPUDense (caffe2.caffe2.python.optimizer_test.TestAdagrad)
caffe2/caffe2/python:optimizer_test - test_caffe2_gpu_vs_numpy (caffe2.caffe2.python.optimizer_test.TestYellowFin)
caffe2/caffe2/python:optimizer_test - testDense (caffe2.caffe2.python.optimizer_test.TestRowWiseAdagrad)
caffe2/caffe2/python:optimizer_test - testGPUDense (caffe2.caffe2.python.optimizer_test.TestFtrl)
caffe2/caffe2/python:optimizer_test - testSparse (caffe2.caffe2.python.optimizer_test.TestRmsProp)
...and 14 more not shown...
FATAL: 0
TIMEOUT: 0
OMIT: 0
```
param download test
```
ruixliu@devvm1997:~/fbsource/fbcode/caffe2/caffe2/fb/net_transforms/tests(7ef20a38)$ sudo buck test :param_download_test
Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/6473924481526935
```
e2e flow:
f208394929
f207991149
f207967273
ANP notebook to check the counter value loaded from the flows
https://fburl.com/anp/5fdcbnoi
screenshot of the loaded counter (note that counter max is larger than 16777216.0)
{F250926501}
Reviewed By: ellie-wen
Differential Revision: D22711514
fbshipit-source-id: 426fed7415270aa3f276dda8141907534734337f
Summary:
1. Fix an illegal memory access issue for the SplitByLengths operator in the CUDA context.
2. Add support for a scaling lengths vector in the SplitByLengths operator.
3. Add tests for the SplitByLengths operator in the CUDA context.
Example for SplitByLengths operator processing scaling lengths vector:
value vector A = [1, 2, 3, 4, 5, 6]
length vector B = [1, 2]
after execution of SplitByLengths operator,
the output should be [1,2] and [3,4,5,6]
Test Plan: buck test mode/dev-nosan caffe2/caffe2/python/operator_test:concat_split_op_test
Reviewed By: kennyhorror
Differential Revision: D22780307
fbshipit-source-id: c5ca60ae16b24032cedfa045a421503b713daa6c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42249
Main change is to bring Caffe2's superior error messages for cuda initialization into c10 and use them in all code paths.
Basic logic:
| Case | Call to device_count() | init_cuda, e.g. allocating tensor |
| -- | -- | -- |
| all good | non-zero | just works |
| no gpus | 0, no warning | throw exception with good message |
| driver issues | 0, produce warning | throw exception with good message |
| out of memory with ASAN | 0, produce warning | throw exception with ASAN message |
Previously, the error thrown from init_cuda was very generic and the ASAN warning (if any) was buried in the logs.
Other clean up changes:
* cache device_count() always in a static variable
* move all asan macros in c10
Test Plan:
Hard to unittest because of build modes. Verified manually that the behavior from the table above holds by running the following script in different modes (ASAN/no-ASAN, CUDA_VISIBLE_DEVICES=):
```
print('before import')
import torch
print('after import')
print('devices: ', torch.cuda.device_count())
x = torch.tensor([1,2,3])
print('tensor creation')
x = x.cuda()
print('moved to cuda')
```
Reviewed By: ngimel
Differential Revision: D22824329
fbshipit-source-id: 5314007313a3897fc955b02f8b21b661ae35fdf5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42516
att. We need it for some scripts.
Reviewed By: houseroad
Differential Revision: D22918112
fbshipit-source-id: 8a1696ceeeda67a34114bc57cb52c925711cfb4c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42421
Previously, we can only feed shape info from Python with float dtype, and batch based dim type when we do onnxifi from Python. This diff removes this limitation and uses TensorBoundShapes protobuf as a generic shape info struct. This will make the onnxifi interface in Python more flexible.
Reviewed By: ChunliF
Differential Revision: D22889781
fbshipit-source-id: 1a89f3a68c215a0409738c425b4e0d0617d58245
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42381
Introduce new tag to support distributed hogwild.
Reviewed By: boryiingsu
Differential Revision: D20484099
fbshipit-source-id: 5973495589e0a7ab185d3867b37437aa747f408a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42380
[Caffe2] Remove explicitly divide by zero in SpatialBN training mode
Test Plan: buck test mode/dev-nosan //caffe2/caffe2/python/operator_test:spatial_bn_op_test
Reviewed By: houseroad
Differential Revision: D22873214
fbshipit-source-id: 70b505391b5db02b45fc46ecd7feb303e50c6280
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42219
Introduce a new extra info that is tagged on the forward net for operators sharing the same input. The effect is that the auto-generated sum of gradients for the input will not follow the device tags of the operators in the forward net. This allows more flexible device allocation.
Test Plan:
# unit test
`./buck-out/gen/caffe2/caffe2/python/core_gradients_test#binary.par -r testMultiUseInputAutoGenSumDevice`
Reviewed By: xianjiec, boryiingsu
Differential Revision: D22609080
fbshipit-source-id: d558145e5eb36295580a70e1ee3a822504dd439a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42151
Previously our Caffe2 SpatialBN op impl computed running_var incorrectly, without the unbiased coefficient. It should actually have failed the test, because the output differs from CuDNN's output; however, our tests were too weak to catch this bug. This diff fixes all of them.
Test Plan: buck test mode/dev-nosan //caffe2/caffe2/python/operator_test:spatial_bn_op_test
Reviewed By: houseroad
Differential Revision: D22786127
fbshipit-source-id: db80becb67d60c44faae180c7e4257cb136a266d
Summary:
Found while trying to get RocM Caffe2 CI green
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42168
Reviewed By: seemethere
Differential Revision: D22791879
Pulled By: malfet
fbshipit-source-id: 8f7ef9711bdc5941b2836e4c8943bb95c72ef8af
Summary:
Found while trying to get RocM Caffe2 job green
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42169
Reviewed By: seemethere
Differential Revision: D22791896
Pulled By: malfet
fbshipit-source-id: 9df6233876aec5ead056365499bab970aa7e8bdc
Summary: we need this op to avoid splicing a dense tensor and then using the Mergesinglescaler op
Test Plan: integrated test with dper2
Differential Revision: D22677523
fbshipit-source-id: f4f9a1f06841b0906ec8cbb435482ae0a89e1721
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41482
This adds a new tag for use with pipeline parallelism.
Test Plan: CI
Reviewed By: heslami
Differential Revision: D22551487
fbshipit-source-id: 90910f458a9bce68f7ef684773322a49aa24494a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41687
Specifically, this makes a new library (lazy), which can be used from both core
and workspace.
This allows workspace.CreateNet to trigger lazy loading of dyndep dependencies.
Test Plan: Added a unit test specifically for workspace.CreateNet
Reviewed By: dzhulgakov
Differential Revision: D22441877
fbshipit-source-id: 3a9d1af9962585d08ea2566c9c85bec7377d39f2
Summary:
## TLDR
Support using NaN default value for missing dense features in RawInputProcessor for DPER2. In preparation for subsequent support for null flag features in compute meta. For train_eval this is already supported in DPER3 and we do not plan to support this in DPER2 train eval.
Differential Revision: D22439142
fbshipit-source-id: 99ae9755bd41a5d5f43bf5a9a2819d64f3883005
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41343
Currently caffe2.InitOpLibrary does the dll import unilaterally. Instead, if we make a lazy version and use it, then many pieces of code which do not need the caffe2 operators get a lot faster.
On a real test, the import time went from 140s to 68s.
This also cleans up the algorithm slightly (although it makes a very minimal
difference), by parsing the list of operators once, rather than every time a
new operator is added, since we defer the RefreshCall until after we've
imported all the operators.
The key way we maintain safety, is that as soon as someone does an operation
which requires a operator (or could), we force importing of all available
operators.
Future work could include trying to identify which code is needed for which
operator and only import the needed ones. There may also be wins available by
playing with dlmopen (which opens within a namespace), or seeing if the dl
flags have an impact (I tried this and didn't see an impact, but dlmopen may
make it better).
Note that this was previously landed and reverted. The issue was that if a import failed and raised an exception, the specific library would not be removed from the lazy imports. This caused our tests which had libraries that failed to poison all other tests that ran after it. This has been fixed and a unit test has been added for this case (to help make it obvious what failed).
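The poisoning fix can be sketched with a hypothetical registry class (names are illustrative, not the real `dyndep` API): a library is removed from the pending set before its import runs, so a failing import raises once and does not poison later loads.

```python
class LazyOpLibraries:
    """Sketch: defer library imports until an operator is needed, and
    drop a library from the pending list even when its import raises."""

    def __init__(self, import_fn):
        self._pending = []
        self._import = import_fn

    def register(self, path):
        self._pending.append(path)

    def ensure_loaded(self):
        # Called as soon as any operation might need an operator.
        while self._pending:
            path = self._pending.pop(0)  # removed first, even if import fails
            self._import(path)

loaded = []
def fake_import(path):
    if path == "bad":
        raise ImportError(path)
    loaded.append(path)

libs = LazyOpLibraries(fake_import)
for p in ("good1", "bad", "good2"):
    libs.register(p)
try:
    libs.ensure_loaded()       # raises on "bad", but "bad" is already dropped
except ImportError:
    pass
libs.ensure_loaded()           # remaining imports proceed normally
```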
Test Plan:
I added a new test a lazy_dyndep_test.py (copied from all_compare_test.py).
I'm a little concerned that I don't see any explicit tests for dyndep, but this
should provide decent coverage.
I've added a specific test to handle the poisoning issues mentioned above, which caused the previous version to get reverted.
Differential Revision: D22506369
fbshipit-source-id: 7395df4778e8eb0220630c570360b99a7d60eb83
Summary: Adding shape inference for SparseToDense. The proposed shape inference implementation only works when data_to_infer_dim is given; otherwise the SparseToDense output dimension depends on the max value of the input tensor.
Test Plan:
buck test //caffe2/caffe2/python:sparse_to_dense_test
buck test //caffe2/caffe2/python:hypothesis_test -- test_sparse_to_dense
Dper3 Changes:
f204594813
buck test dper3/dper3_models/ads_ranking/model_impl/sparse_nn/tests:sparse_nn_lib_test
Reviewed By: zhongyx12, ChunliF
Differential Revision: D22479511
fbshipit-source-id: 8983a9baea8853deec53ad6f795c874c3fb93de0
Summary:
nccl tests and parallelize_bmuf_distributed test are failing on rocm3.5.1. Skipping these tests to upgrade the CI to rocm3.5.1
jeffdaily sunway513
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41409
Reviewed By: orionr
Differential Revision: D22528928
Pulled By: seemethere
fbshipit-source-id: 928196b7a62a441d391e69f54b278313ecc75d77
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41313
This diff backs out the backout diff. The failure was due to C++ `or`
not being supported in MSVC. This is now replaced with ||
Original commit changeset: fc7f3f8c968d
Test Plan: Existing unit tests, check github CI.
Reviewed By: malfet
Differential Revision: D22494777
fbshipit-source-id: 3271288919dc3a6bfb82508ab9d021edc910ae45
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40875
This op uses the given num_bins and a spacing strategy to automatically bin and compute the histogram of given matrices.
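A minimal sketch of the auto-binning idea, assuming a linear (or log) spacing strategy over the data range (hypothetical helper, not the actual op):

```python
import math

def auto_histogram(values, num_bins, spacing="linear"):
    """Sketch: derive bin edges from the data range via a spacing
    strategy, then count values per bin. Log spacing assumes
    strictly positive values."""
    lo, hi = min(values), max(values)
    if spacing == "log":
        lo, hi = math.log(lo), math.log(hi)
        values = [math.log(v) for v in values]
    width = (hi - lo) / num_bins or 1.0  # degenerate range -> one bin
    counts = [0] * num_bins
    for v in values:
        idx = min(int((v - lo) / width), num_bins - 1)  # clamp max value
        counts[idx] += 1
    return counts
```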
Test Plan: Unit tests.
Reviewed By: neha26shah
Differential Revision: D22329069
fbshipit-source-id: 28406b94e284d52d875f73662fc82f93dbc00064
Summary:
unique op test failure in caffe2 blocks upgrading CI to rocm3.5.1. Skipping the test to unblock; will re-enable after root-causing and fixing the issue.
jeffdaily sunway513
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41219
Differential Revision: D22471452
Pulled By: xw285cornell
fbshipit-source-id: 9e503c8b37c0a4b92632f77b2f8a90281a9889c3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39488
Currently caffe2.InitOpLibrary does the dll import unilaterally. Instead, if we make a lazy version and use it, then many pieces of code which do not need the caffe2 operators get a lot faster.
On a real test, the import time went from 140s to 68s.
This also cleans up the algorithm slightly (although it makes a very minimal
difference), by parsing the list of operators once, rather than every time a
new operator is added, since we defer the RefreshCall until after we've
imported all the operators.
The key way we maintain safety, is that as soon as someone does an operation
which requires a operator (or could), we force importing of all available
operators.
Future work could include trying to identify which code is needed for which
operator and only import the needed ones. There may also be wins available by
playing with dlmopen (which opens within a namespace), or seeing if the dl
flags have an impact (I tried this and didn't see an impact, but dlmopen may
make it better).
Test Plan:
I added a new test a lazy_dyndep_test.py (copied from all_compare_test.py).
I'm a little concerned that I don't see any explicit tests for dyndep, but this
should provide decent coverage.
Differential Revision: D21870844
fbshipit-source-id: 3f65fedb65bb48663670349cee5e1d3e22d560ed
Summary:
This PR contains the following updates:
1. MIOpen 3D pooling enabled in Caffe2.
2. Refactored the MIOpen pooling code in caffe2.
3. Enabled unit test cases for 3D pooling.
CC: ezyang jeffdaily ashishfarmer
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38260
Differential Revision: D21524754
Pulled By: xw285cornell
fbshipit-source-id: ddfe09dc585cd61e42eee22eff8348d326fd0c3b
Summary: Export logit op to pt for better preproc perf
Test Plan:
unit test
Also tested with model re-generation
Reviewed By: houseroad
Differential Revision: D22324611
fbshipit-source-id: 86accb6b4528e5c818d2c3f8c67926f279d158d6
Summary:
Original commit changeset: 46c59d849fa8
The original commit is breaking DPER3 release pipeline with the following failures:
https://www.internalfb.com/intern/chronos/jobinstance?jobinstanceid=9007207344413239&smc=chronos_gp_admin_client&offset=0
```
Child workflow f 202599639 failed with error: c10::Error: [enforce fail at operator.cc:76] blob != nullptr. op Save: Encountered a non-existing input blob: feature_preproc/feature_sparse_to_dense/default_float_value
```
https://www.internalfb.com/intern/chronos/jobinstance?jobinstanceid=9007207344855973&smc=chronos_gp_admin_client&offset=0
```
Child workflow f 202629391 failed with error: c10::Error: [enforce fail at operator.cc:76] blob != nullptr. op Save: Encountered a non-existing input blob: tum_preproc/inductive/feature_sparse_to_dense/default_float_value
```
Related UBN tasks: T69529846, T68986110
Test Plan: Build a DPER3 package on top of this commit, and check that DPER3 release test `model_deliverability_test` is passing.
Differential Revision: D22396317
fbshipit-source-id: 92d5b30cc146c005d6159a8d5bfe8973e2c546dd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40856
Add a new activation function - Mish: A Self Regularized Non-Monotonic Neural Activation Function https://arxiv.org/abs/1908.08681
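For reference, Mish is defined as x * tanh(softplus(x)); a minimal standard-library sketch (not the caffe2 operator):

```python
import math

def mish(x):
    # Mish(x) = x * tanh(softplus(x)), where softplus(x) = ln(1 + e^x).
    # log1p keeps the softplus accurate for small exp(x).
    return x * math.tanh(math.log1p(math.exp(x)))
```

It is smooth, non-monotonic, and behaves like the identity for large positive inputs.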
Test Plan:
buck test //caffe2/caffe2/python/operator_test:elementwise_ops_test -- 'test_mish'
{F242275183}
Differential Revision: D22158035
fbshipit-source-id: 459c1dd0ac5b515913fc09b5f4cd13dcf095af31
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40925
The normalization operator does not handle empty tensors correctly; this is a fix.
Test Plan: unit tests
Differential Revision: D22330340
fbshipit-source-id: 0bccf925bb768ebb997ed0c88130c5556308087f
Summary:
## TLDR
Support using NaN default value for missing dense features in RawInputProcessor for DPER2. In preparation for subsequent support for null flag features in compute meta. For train_eval this is already supported in DPER3 and we do not plan to support this in DPER2 train eval.
## Overview
Intern project plan to support adding dense flags for missing feature values instead of replacing with zero.
## Project plan :
https://docs.google.com/document/d/1OsPUTjpJycwxWLCue3Tnb1mx0uDC_2KKWvC1Rwpo2NI/edit?usp=sharing
## Code paths:
See https://fb.quip.com/eFXUA0tbDmNw for the call stack for all affected code paths.
Test Plan:
## fblearner flow test
1. `flow-cli clone f197867430 --run-as-secure-group ads_personalization_systems --force-build` to build a ephemeral package and start a fblearner flow run (may fail)
2. Clone the new run and change the secure_group to `XXXX` and entitlement to `default` in the UI
3. Adds explicit_null_min_coverage flag
4. Optionally reduce `max_examples` since we only test pass/fail instead of quality.
5. Submit the run to test the change
Example:
f198538878
## compare output coverages to daiquery runs
1. Randomly select null flag features from compute meta workflow output
2. Look up the feature id in feature metadata using feature name
3. Check against a daiquery sample of coverage to see if the coverage falls within guidelines.
https://www.internalfb.com/intern/daiquery/workspace/275342740223489/192619942076136/
## Sampled features:
GFF_C66_ADS_USER_SUM_84_PAGE_TYPE_RATIO_EVENT_LIKE_IMPRESSION: 15694257
- original feature compute meta coverage: 0.999992
- daiquery feature coverage (10k rows): 0.69588
- null flag compute meta coverage: 0.293409
GFF_R1303_ADS_USER_SUM_7_PAGE_TYPE_COUNTER_CONVERSION: 16051183
- original feature compute meta coverage: 0.949868
- daiquery feature coverage: 0.82241
- null flag compute meta coverage: 0.151687
## Unit tests:
`buck test fblearner/flow/projects/dper/tests/workflows:ads_test`
https://www.internalfb.com/intern/testinfra/testconsole/testrun/6192449504303863/
Differential Revision: D22026450
fbshipit-source-id: 46c59d849fa89253f14dc2b035c4c677cd6e3a4c
Summary: Use the newly added counter op in sparse adagrad
Reviewed By: chocjy, ellie-wen
Differential Revision: D19221100
fbshipit-source-id: d939d83e3b5b3179f57194be2e8864d0fbbee2c1
Summary:
## TLDR
Support using NaN default value for missing dense features in RawInputProcessor for *DPER2*. In preparation for subsequent support for null flag features in *compute meta*. For train_eval this is already supported in DPER3 and we do not plan to support this in DPER2 train eval.
## Overview
Intern project plan to support adding dense flags for missing feature values instead of replacing with zero.
Project plan :
https://docs.google.com/document/d/1OsPUTjpJycwxWLCue3Tnb1mx0uDC_2KKWvC1Rwpo2NI/edit?usp=sharing
## Code paths:
See https://fb.quip.com/eFXUA0tbDmNw for the call stack for all affected code paths.
Test Plan:
# A. DPER3 blob value inspection
## 1. Build local bento kernel in fbcode folder
`buck build mode/dev-nosan //bento/kernels:bento_kernel_ads_ranking`
## 2. Use kernel `ads_ranking (local)` to print dense feature blob values
n280239
## 2.1 Try `default_dense_value = "0.0"` (default)
```
preproc_6/feature_preproc_6/dper_feature_processor_7/raw_input_proc_7/float_feature_sparse_to_dense_7/float_features [[0. ]
[0. ]
[0. ]
[0. ]
[0. ]
[0. ]
[0. ]
[1. ]
[1.7857143]
[1.7777778]
[1. ]
[0. ]
[0.5625 ]
[0. ]
[0. ]
[0.8 ]
[0. ]
[1. ]
[0.56 ]
[0. ]]
```
## 2.2 Try `default_dense_value = "123"`
```
preproc_2/feature_preproc_2/dper_feature_processor_3/raw_input_proc_3/float_feature_sparse_to_dense_3/float_features [[123. ]
[123. ]
[123. ]
[123. ]
[123. ]
[123. ]
[123. ]
[ 1. ]
[ 1.7857143]
[ 1.7777778]
[ 1. ]
[123. ]
[ 0.5625 ]
[123. ]
[123. ]
[ 0.8 ]
[123. ]
[ 1. ]
[ 0.56 ]
[123. ]]
```
## 2.3 Try `default_dense_value = float("nan")`
```
RuntimeError: [enforce fail at enforce_finite_op.h:40] std::isfinite(input_data[i]). Index 0 is not finite (e.g., NaN, Inf): -nan (Error from operator:
input: "unary_4/logistic_regression_loss_4/average_loss_4/average_loss" name: "" type: "EnforceFinite" device_option { random_seed: 54 })
```
which is expected due to nan input.
# B. Unit test
`buck test fblearner/flow/projects/dper/tests/preprocs:raw_feature_extractor_test`
https://www.internalfb.com/intern/testinfra/testconsole/testrun/5348024586274923/
{F241336814}
Differential Revision: D21961595
fbshipit-source-id: 3dcb153b3c7f42f391584f5e7f52f3d9c76de31f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40379
The current sum operator doesn't support Long, hence modify the code to support it.
Test Plan: Write a test case
Reviewed By: jspark1105, yinghai
Differential Revision: D21917365
fbshipit-source-id: b37d2c100c70d17d2f89c309e40360ddfab584ee
Summary:
Pull Request resolved: https://github.com/pytorch/FBGEMM/pull/387
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39985
avx2 optimized 2/4-bit row-wise quantization/dequantization in perfkernels.
This diff slightly changes the numerics of quantization by multiplying by the inverse of the scale instead of dividing by the scale.
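The numerics change can be sketched with a hypothetical row-wise quantizer (illustrative, not the perfkernels implementation): precomputing `inv_scale` trades a per-element division for a multiply, which is why the rounded result can differ in the last bit from dividing by `scale`.

```python
def quantize_rowwise(row, num_bits=4):
    """Sketch of row-wise n-bit quantization: map [min, max] of the row
    onto integer codes [0, 2**num_bits - 1] using a precomputed inverse
    scale instead of dividing by the scale."""
    qmax = (1 << num_bits) - 1
    lo, hi = min(row), max(row)
    scale = (hi - lo) / qmax if hi > lo else 1.0
    inv_scale = 1.0 / scale            # one division per row...
    codes = [int(round((x - lo) * inv_scale)) for x in row]  # ...multiplies per element
    return codes, scale, lo

def dequantize_rowwise(codes, scale, bias):
    return [c * scale + bias for c in codes]
```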
Test Plan:
In my devserver
for i in 2 4 8; do echo $i; buck run mode/opt :fused_rowwise_nbit_conversion_bench -- --bit-rate=$i; done
Before this diff
2-bit
3.35394 ms. 100%. FloatToFused2BitRowwiseQuantized
4-bit
3.60351 ms. 100%. FloatToFused4BitRowwiseQuantized
8-bit
0.434467 ms. 100%. FloatToFused8BitRowwiseQuantized
After this diff
2-bit
0.606386 ms. 100%. FloatToFused2BitRowwiseQuantized
4-bit
0.446683 ms. 100%. FloatToFused4BitRowwiseQuantized
8-bit
0.4349 ms. 100%. FloatToFused8BitRowwiseQuantized
Reviewed By: choudharydhruv, jianyuh
Differential Revision: D22033195
fbshipit-source-id: d3a219e47b8345268d90a160c9314ed0d5b71467
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40081
Adding the functionality to enable timeout of OnnxifiOp run. In the case of backend hanging, it can error out quickly.
Test Plan:
```
buck test glow/fb/test:test_onnxifinnpi -- test_timeout
```
Reviewed By: jackm321
Differential Revision: D22064533
fbshipit-source-id: 25487287c10ab217eb95692f09d48e13e19436ab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39407
- support passing a single element tensor as k for topk module
- support passing a single element tensor to constant fill output
Test Plan:
buck test dper3/dper3/modules/tests:core_modules_test -- test_topk_gating_without_split_examples_tensor_k
buck test caffe2/caffe2/python:hypothesis_test -- test_constant_fill_from_tensor
Reviewed By: huayuli00
Differential Revision: D21843739
fbshipit-source-id: 0c5f5c03e9f57eeba40c0068784625164c2527ec
Summary:
Changes in PR https://github.com/pytorch/pytorch/issues/39759 broke HIP caffe2.
hipify for caffe2 renames CUDA to HIP; torch does not.
If caffe2 calls into torch, it needs to use CUDA-named functions.
CC ezyang xw285cornell sunway513 houseroad dzhulgakov
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39801
Differential Revision: D21982493
Pulled By: xw285cornell
fbshipit-source-id: 8e88e0fb80c71f0342e23ef0214a42d5542bdc70
Summary:
THCAllocator functionality is pretty obscure and it's hard to get it working with HIP because of how Caffe2/PyTorch rules are set up (see https://github.com/pytorch/pytorch/issues/39801). Let's just disable the test.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39843
Reviewed By: zou3519
Differential Revision: D21998687
Pulled By: dzhulgakov
fbshipit-source-id: cd12ba30cdfee658b98393ed3a72e83f4ecf1c9c
Summary:
# Motivations
As explained in the [link](https://stats.stackexchange.com/questions/86991/reason-for-not-shrinking-the-bias-intercept-term-in-regression/161689#161689), regularizing biases will cause mis-calibration of predicted probabilities.
In SparseNN, the unary processor may use 1d embedding tables for the sparse features to serve as biases.
In this diff, the regularization term is automatically skipped for the 1d sparse parameters to avoid regularizing biases.
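The skip rule can be sketched in plain Python, using nested lists to stand in for parameter tensors (hypothetical helper, not the DPER API): 1d parameters are treated as biases and excluded from the L2 term.

```python
def l2_penalty(params, weight_decay):
    """Sketch: accumulate lambda/2 * ||w||^2, skipping 1d (bias-like)
    parameters so that biases are not regularized."""
    def ndim(p):
        return 2 if p and isinstance(p[0], list) else 1
    total = 0.0
    for p in params:
        if ndim(p) == 1:
            continue  # 1d sparse params serve as biases; leave them alone
        total += sum(x * x for row in p for x in row)
    return 0.5 * weight_decay * total
```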
# Experiments
Experiments were conducted to verify that it has no significant impact on the NE to skip the regularization on 1d sparse parameters.
Baseline.1 (no L2 regularization): f193105372
Baseline.2 (L2 regularization in prod): f193105522
Treatment (skipping L2 regularization on 1d sparse params): f193105708
{F239859690}
Test Plan:
Experiments were conducted to verify that it has no significant impact on the NE to skip the regularization on 1d sparse parameters using a canary package: `aml.dper2.canary:9efc576b35b24361bb600dcbf94d31ea`.
Baseline.1 (no L2 regularization): f193105372
Baseline.2 (L2 regularization in prod): f193105522
Treatment (skipping L2 regularization on 1d sparse params): f193105708
Reviewed By: zhongyx12
Differential Revision: D21757902
fbshipit-source-id: ced126e1eab270669b9981c9ecc287dfc9dee995
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39373
Line 114 is the only actual change. Other changes are just formatting.
Test Plan: CI
Reviewed By: zrphercule
Differential Revision: D21830893
fbshipit-source-id: 83e49b1b3c48f6bc6de3c48ccce60c84aa49339b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39759
Caffe2 has a mode where it uses PT's caching allocator. Somehow we were not calling the initialization explicitly.
Now, I have no idea why it worked before. Probably worth running a bisect separately.
Reviewed By: houseroad
Differential Revision: D21962331
fbshipit-source-id: f16ad6b27a67dbe0bda93939cca8c94620d22a09
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38582
Adding LpNorm regularization for sparse features in DPER3. This is done using a sparse regularization op with run_after_optimizer (see D21003029).
* Added code calling new caffe2 operator from D21003029 to caffe2/python/regularizer.py
* Added l1norm and l2norm to sparse regularizer thrift definition.
* Added the new regularization references to test utils.
* Added a new file for unit tests "sparse_nn_sparse_reg_test.py"
Test Plan:
buck test mode/dev //caffe2/caffe2/fb/dper/layer_models/tests:sparse_nn_sparse_reg_test
buck test mode/dev //caffe2/caffe2/fb/dper/layer_models/tests:sparse_nn_reg_test
DPER canary: https://fburl.com/fblearner/rcp5yzeh
New DPER canary: https://fburl.com/fblearner/0krgd74x
Differential Revision: D20704248
fbshipit-source-id: 7e3d5013b3ff3da95ea027f0f2dd855f3ae8e41d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39372
we only bump the submodule in oss to unblock some works
Test Plan: ci
Reviewed By: hl475
Differential Revision: D21830800
fbshipit-source-id: fb4a716992efcd71926f7bba24a7c24422c17e38
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38574
Adding sparse L1 and L2 regularization operator to Caffe2. This doesn't work using run_on_loss, only run_after_optimize. Applying it to run_after_optimize rather than run_on_loss was easier to implement, particularly for the L1 norm which is preferable in some cases and is non-differentiable at zero.
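The run_after_optimize approach for L1 can be sketched as soft-thresholding applied after the optimizer step, touching only the rows seen in the minibatch (hypothetical helper; the real op operates on caffe2 tensors). This sidesteps the non-differentiability of the L1 norm at zero.

```python
def l1_shrink_rows(weights, touched_rows, reg_lambda, lr):
    """Sketch of sparse L1 regularization run after the optimizer step:
    soft-threshold each touched row toward zero by lr * reg_lambda."""
    for r in touched_rows:
        weights[r] = [
            max(abs(w) - lr * reg_lambda, 0.0) * (1.0 if w >= 0 else -1.0)
            for w in weights[r]
        ]
    return weights
```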
Test Plan: Wrote and ran unit tests in operator_test:sparse_lp_regularizer_test.
Differential Revision: D21003029
fbshipit-source-id: 81070a621752560ce03e320d065ce27807a5d278
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39297
The histogram op doesn't have a GPU implementation, which is breaking the CI GPU test. Make the test run CPU-only.
Test Plan: CI
Reviewed By: hwangjeff
Differential Revision: D21800824
fbshipit-source-id: 9c835786f22bac7d420ce610397a6ee69084c19a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38514
this diff introduces the `Histogram` caffe2 op, which computes a histogram tensor for a list of input tensors. the bin edges of the histogram are defined by arg `bin_edges`.
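The fixed-edges semantics can be sketched with `bisect` (a hypothetical helper; values outside the edge range are assumed to be ignored, which may differ from the real op):

```python
import bisect

def histogram_with_edges(tensors, bin_edges):
    """Sketch: one shared histogram over a list of input tensors,
    with bins defined by the given (sorted) bin_edges."""
    counts = [0] * (len(bin_edges) - 1)
    for t in tensors:
        for v in t:
            i = bisect.bisect_right(bin_edges, v) - 1
            if 0 <= i < len(counts):  # drop values outside the edges
                counts[i] += 1
    return counts
```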
Test Plan: tests
Reviewed By: chocjy
Differential Revision: D21553956
fbshipit-source-id: fc98c8db691d66d2dad57b6ad14867109913cb6f
Summary:
Previously we got a CI issue in the original submission (D21562485), so we backed out the original diff (D21588831). Resubmitting here to reproduce the CI issue and ask a caffe2 dev to take a look.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38566
Original commit changeset: 6dda4b71904d
Test Plan: buck test
Reviewed By: houseroad
Differential Revision: D21589352
fbshipit-source-id: de40ff2884019e14476e31c4c952f24d6e438f5f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37727
Check if the file exists locally only for `log_file_db` db_type. Reader files in other `db_type` like `manifold_log_file_db` are excluded from this check.
Test Plan: Verified that files stored in manifold can be loaded using `DBFileReader`.
Reviewed By: hbjerry
Differential Revision: D21329671
fbshipit-source-id: bbc0e88851783ca3f78f7c61bfe84b480c09b5ac
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38518
as title
Test Plan: buck test
Reviewed By: olittle
Differential Revision: D21562570
fbshipit-source-id: 3a2e8dea3d821a2bdb9f30db25816a2bfa6c5dcf
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38517
as title
Test Plan: buck test
Reviewed By: olittle
Differential Revision: D21562485
fbshipit-source-id: 573419e5a8dae4121d99d5b72ed3960a92db7a54
Summary: Issue was introduced in D21258652. We need to make sure it compiles in opt mode. We may still have some leftover py2 packages, so let's just use a format that works with both.
Test Plan: ci
Reviewed By: xush6528
Differential Revision: D21457394
fbshipit-source-id: cde79a0fc6b4feba307bd9d45e1a1d4a42de9263
Summary:
Skip the tests if the network is inaccessible and the model cannot be downloaded
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37972
Differential Revision: D21441996
Pulled By: malfet
fbshipit-source-id: 5ce59764584974aee9195572338ada1fa0351a75
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37705
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37372
Posted note: [Regularizing SparseNN Against Over-fitting](https://fb.workplace.com/notes/taiqing-wang/regularizing-sparsenn-against-over-fitting/220306075902708/)
**Problem formulation**
L(w) = J(w) + lambda/2 * ||w||^2
J(w) is the empirical loss, and ||w||^2 is the squared L2 norm of the parameters, a.k.a. the L2 regularizer.
dL(w)/dw_i = dJ(w)/dw_i + lambda * w_i
dL(w)/dw_i is the gradient of L(w) w.r.t. w_i.
To implement the L2 regularizer, lambda * w_i is added to the gradient of J(w) w.r.t. w_i. lambda is referred to as weight_decay in this implementation.
**Code changes**
* In the initialization method of AdagradOptimizer, a new input argument, weight_decay, is added.
* In the _run function of AdagradOptimizer, the weight decay will be skipped for 1d bias vectors.
* In the parameter update functions of Adagrad, the gradient is incremented by weight_decay * w_i. The default value of weight_decay is zero.
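The update above can be sketched as a plain Adagrad step with L2 weight decay folded into the gradient. This is a hedged, hypothetical illustration under the stated formulation, not the Caffe2 `AdagradOptimizer` code; the function name and hyperparameter defaults are illustrative.

```python
import numpy as np

# Hypothetical sketch: weight_decay (lambda) * w is added to the
# gradient of the empirical loss, then the usual Adagrad step runs.
def adagrad_step(w, grad, h, lr=0.01, weight_decay=0.0, epsilon=1e-8):
    g = grad + weight_decay * w              # dL/dw_i = dJ/dw_i + lambda * w_i
    h = h + g * g                            # accumulate squared gradients
    w = w - lr * g / (np.sqrt(h) + epsilon)  # Adagrad parameter update
    return w, h

w, h = adagrad_step(np.array([1.0]), np.array([0.5]), np.array([0.0]),
                    weight_decay=0.1)
```

Note that, as described above, a real implementation would skip the `weight_decay * w` term for 1d bias vectors.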
Test Plan:
```
buck build caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_weight_decay
```
```
./buck-out/gen/caffe2/caffe2/fb/dper/layer_models/tests/split_1/sparse_nn_test_weight_decay#binary.par
```
Reviewed By: jspark1105
Differential Revision: D21258652
fbshipit-source-id: d2366ddcd736a03205a2d16f914703b16d9fce8f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37591
Skip the tests since Gluster is gone.
Test Plan: ci
Reviewed By: ezyang
Differential Revision: D21330359
fbshipit-source-id: a4e158fb72eddb08ba49fcfa9541569a150f8481
Summary:
This PR fixes a couple of syntax errors in `torch/` that prevent MyPy from running, fixes simple type annotation errors (e.g. missing `from typing import List, Tuple, Optional`), and adds granular ignores for errors in particular modules as well as for missing typing in third party packages.
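Granular per-module ignores of this kind typically live in a mypy config file. A hedged example of the pattern (the module names below are illustrative, not the exact config added by this PR):

```ini
[mypy]
files = torch, aten/src/ATen/function_wrapper.py

# Third-party packages without type stubs
[mypy-tensorboard.*]
ignore_missing_imports = True

[mypy-onnx.*]
ignore_missing_imports = True

# Silence errors in generated protobuf modules
[mypy-caffe2.proto.*]
ignore_errors = True
```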
As a result, running `mypy` in the root dir of the repo now runs on:
- `torch/`
- `aten/src/ATen/function_wrapper.py` (the only file already covered in CI)
In CI this runs on GitHub Actions, job Lint, sub-job "quick-checks", task "MyPy typecheck". It should give (right now): `Success: no issues found in 329 source files`.
Here are the details of the original 855 errors when running `mypy torch` on current master (after fixing the couple of syntax errors that prevent `mypy` from running through):
<details>
```
torch/utils/tensorboard/_proto_graph.py:1: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.node_def_pb2'
torch/utils/tensorboard/_proto_graph.py:2: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.attr_value_pb2'
torch/utils/tensorboard/_proto_graph.py:3: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.tensor_shape_pb2'
torch/utils/backcompat/__init__.py:1: error: Cannot find implementation or library stub for module named 'torch._C'
torch/for_onnx/__init__.py:1: error: Cannot find implementation or library stub for module named 'torch.for_onnx.onnx'
torch/cuda/nvtx.py:2: error: Cannot find implementation or library stub for module named 'torch._C'
torch/utils/show_pickle.py:59: error: Name 'pickle._Unpickler' is not defined
torch/utils/show_pickle.py:113: error: "Type[PrettyPrinter]" has no attribute "_dispatch"
torch/utils/tensorboard/_onnx_graph.py:1: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.graph_pb2'
torch/utils/tensorboard/_onnx_graph.py:2: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.node_def_pb2'
torch/utils/tensorboard/_onnx_graph.py:3: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.versions_pb2'
torch/utils/tensorboard/_onnx_graph.py:4: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.attr_value_pb2'
torch/utils/tensorboard/_onnx_graph.py:5: error: Cannot find implementation or library stub for module named 'tensorboard.compat.proto.tensor_shape_pb2'
torch/utils/tensorboard/_onnx_graph.py:9: error: Cannot find implementation or library stub for module named 'onnx'
torch/contrib/_tensorboard_vis.py:10: error: Cannot find implementation or library stub for module named 'tensorflow.core.util'
torch/contrib/_tensorboard_vis.py:11: error: Cannot find implementation or library stub for module named 'tensorflow.core.framework'
torch/contrib/_tensorboard_vis.py:12: error: Cannot find implementation or library stub for module named 'tensorflow.python.summary.writer.writer'
torch/utils/hipify/hipify_python.py:43: error: Need type annotation for 'CAFFE2_TEMPLATE_MAP' (hint: "CAFFE2_TEMPLATE_MAP: Dict[<type>, <type>] = ...")
torch/utils/hipify/hipify_python.py:636: error: "object" has no attribute "items"
torch/nn/_reduction.py:27: error: Name 'Optional' is not defined
torch/nn/_reduction.py:27: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/nn/_reduction.py:47: error: Name 'Optional' is not defined
torch/nn/_reduction.py:47: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/utils/tensorboard/_utils.py:17: error: Skipping analyzing 'matplotlib.pyplot': found module but no type hints or library stubs
torch/utils/tensorboard/_utils.py:17: error: Skipping analyzing 'matplotlib': found module but no type hints or library stubs
torch/utils/tensorboard/_utils.py:18: error: Skipping analyzing 'matplotlib.backends.backend_agg': found module but no type hints or library stubs
torch/utils/tensorboard/_utils.py:18: error: Skipping analyzing 'matplotlib.backends': found module but no type hints or library stubs
torch/nn/modules/utils.py:27: error: Name 'List' is not defined
torch/nn/modules/utils.py:27: note: Did you forget to import it from "typing"? (Suggestion: "from typing import List")
caffe2/proto/caffe2_pb2.py:17: error: Unexpected keyword argument "serialized_options" for "FileDescriptor"; did you mean "serialized_pb"?
caffe2/proto/caffe2_pb2.py:25: error: Unexpected keyword argument "serialized_options" for "EnumDescriptor"
caffe2/proto/caffe2_pb2.py:31: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:35: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:39: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:43: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:47: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:51: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:55: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:59: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:63: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:67: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:71: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:75: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:102: error: Unexpected keyword argument "serialized_options" for "EnumDescriptor"
caffe2/proto/caffe2_pb2.py:108: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:112: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:124: error: Unexpected keyword argument "serialized_options" for "EnumDescriptor"
caffe2/proto/caffe2_pb2.py:130: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:134: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:138: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:142: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:146: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:150: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:154: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:158: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:162: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:166: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:170: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:174: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:178: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:182: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:194: error: Unexpected keyword argument "serialized_options" for "EnumDescriptor"
caffe2/proto/caffe2_pb2.py:200: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:204: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:208: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:212: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:224: error: Unexpected keyword argument "serialized_options" for "EnumDescriptor"
caffe2/proto/caffe2_pb2.py:230: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:234: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:238: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:242: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:246: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:250: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:254: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/caffe2_pb2.py:267: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:274: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:281: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:288: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:295: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:302: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:327: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:334: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:341: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:364: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:371: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:378: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:385: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:392: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:399: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:406: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:413: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:420: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:427: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:434: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:441: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:448: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:455: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:462: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:488: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:495: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:502: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:509: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:516: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:523: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:530: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:537: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:544: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:551: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:558: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:565: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:572: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:596: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:603: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:627: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:634: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:641: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:648: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:655: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:662: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:686: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:693: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:717: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:724: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:731: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:738: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:763: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:770: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:777: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:784: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:808: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:815: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:822: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:829: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:836: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:843: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:850: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:857: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:864: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:871: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:878: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:885: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:892: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:916: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:923: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:930: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:937: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:944: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:951: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:958: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:982: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:989: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:996: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1003: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1010: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1017: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1024: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1031: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1038: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1045: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1052: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1059: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1066: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1090: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1097: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1104: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1128: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1135: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1142: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1166: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1173: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1180: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1187: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1194: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1218: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1225: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1232: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1239: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1246: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1253: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1260: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1267: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1274: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1281: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1305: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1312: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1319: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1326: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1333: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1340: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1347: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1354: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1361: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1368: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1375: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1382: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1389: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1396: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1420: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1427: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1434: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1441: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1465: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1472: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1479: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1486: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1493: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1500: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1507: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1514: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1538: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/caffe2_pb2.py:1545: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1552: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1559: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1566: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/caffe2_pb2.py:1667: error: "GeneratedProtocolMessageType" has no attribute "Segment"
torch/multiprocessing/queue.py:4: error: No library stub file for standard library module 'multiprocessing.reduction'
caffe2/proto/torch_pb2.py:18: error: Unexpected keyword argument "serialized_options" for "FileDescriptor"; did you mean "serialized_pb"?
caffe2/proto/torch_pb2.py:27: error: Unexpected keyword argument "serialized_options" for "EnumDescriptor"
caffe2/proto/torch_pb2.py:33: error: Unexpected keyword argument "serialized_options" for "EnumValueDescriptor"
caffe2/proto/torch_pb2.py:50: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:57: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:81: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:88: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:95: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:102: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:109: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:116: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:123: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:130: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:137: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:144: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:151: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:175: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:182: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:189: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:196: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:220: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:227: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:234: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:241: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:265: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:272: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:279: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:286: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:293: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:300: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:307: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:314: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:321: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:328: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:335: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:342: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:366: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:373: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:397: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/torch_pb2.py:404: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:411: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:418: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:425: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/torch_pb2.py:432: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:17: error: Unexpected keyword argument "serialized_options" for "FileDescriptor"; did you mean "serialized_pb"?
caffe2/proto/metanet_pb2.py:29: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/metanet_pb2.py:36: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:43: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:50: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:57: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:64: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:88: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/metanet_pb2.py:95: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:102: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:126: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/metanet_pb2.py:133: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:140: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:164: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/metanet_pb2.py:171: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:178: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:202: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/metanet_pb2.py:209: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:216: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:240: error: Unexpected keyword argument "serialized_options" for "Descriptor"
caffe2/proto/metanet_pb2.py:247: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:254: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:261: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:268: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:275: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:282: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:289: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
caffe2/proto/metanet_pb2.py:296: error: Unexpected keyword argument "serialized_options" for "FieldDescriptor"
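The long run of `Unexpected keyword argument "serialized_options"` errors above is a stub-version mismatch rather than a bug in the generated code: `serialized_options` was added to the protobuf descriptor constructors in protobuf 3.6, so type stubs generated against an older protobuf reject it. Since `*_pb2.py` files are machine-generated anyway, one common workaround (sketched here as an assumption; the section names mirror the paths in the log, not a confirmed change in this PR) is to exclude them from analysis:

```ini
# mypy.ini (illustrative): silence errors in generated protobuf modules
[mypy-caffe2.proto.*]
ignore_errors = True
```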
caffe2/proto/__init__.py:13: error: Skipping analyzing 'caffe2.caffe2.fb.session.proto': found module but no type hints or library stubs
torch/multiprocessing/pool.py:3: error: No library stub file for standard library module 'multiprocessing.util'
torch/multiprocessing/pool.py:3: note: (Stub files are from https://github.com/python/typeshed)
caffe2/python/scope.py:10: error: Skipping analyzing 'past.builtins': found module but no type hints or library stubs
caffe2/python/__init__.py:7: error: Module has no attribute "CPU"
caffe2/python/__init__.py:8: error: Module has no attribute "CUDA"
caffe2/python/__init__.py:9: error: Module has no attribute "MKLDNN"
caffe2/python/__init__.py:10: error: Module has no attribute "OPENGL"
caffe2/python/__init__.py:11: error: Module has no attribute "OPENCL"
caffe2/python/__init__.py:12: error: Module has no attribute "IDEEP"
caffe2/python/__init__.py:13: error: Module has no attribute "HIP"
caffe2/python/__init__.py:14: error: Module has no attribute "COMPILE_TIME_MAX_DEVICE_TYPES"; maybe "PROTO_COMPILE_TIME_MAX_DEVICE_TYPES"?
caffe2/python/__init__.py:15: error: Module has no attribute "ONLY_FOR_TEST"; maybe "PROTO_ONLY_FOR_TEST"?
caffe2/python/__init__.py:34: error: Item "_Loader" of "Optional[_Loader]" has no attribute "exec_module"
caffe2/python/__init__.py:34: error: Item "None" of "Optional[_Loader]" has no attribute "exec_module"
caffe2/python/__init__.py:35: error: Module has no attribute "cuda"
caffe2/python/__init__.py:37: error: Module has no attribute "cuda"
caffe2/python/__init__.py:49: error: Module has no attribute "add_dll_directory"
torch/random.py:4: error: Cannot find implementation or library stub for module named 'torch._C'
torch/_classes.py:2: error: Cannot find implementation or library stub for module named 'torch._C'
torch/onnx/__init__.py:1: error: Cannot find implementation or library stub for module named 'torch._C'
torch/hub.py:21: error: Skipping analyzing 'tqdm.auto': found module but no type hints or library stubs
torch/hub.py:24: error: Skipping analyzing 'tqdm': found module but no type hints or library stubs
torch/hub.py:27: error: Name 'tqdm' already defined (possibly by an import)
torch/_tensor_str.py:164: error: Not all arguments converted during string formatting
torch/_ops.py:1: error: Cannot find implementation or library stub for module named 'torch._C'
torch/_linalg_utils.py:26: error: Name 'Optional' is not defined
torch/_linalg_utils.py:26: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_linalg_utils.py:26: error: Name 'Tensor' is not defined
torch/_linalg_utils.py:63: error: Name 'Tensor' is not defined
torch/_linalg_utils.py:63: error: Name 'Optional' is not defined
torch/_linalg_utils.py:63: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_linalg_utils.py:70: error: Name 'Optional' is not defined
torch/_linalg_utils.py:70: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_linalg_utils.py:70: error: Name 'Tensor' is not defined
torch/_linalg_utils.py:88: error: Name 'Tensor' is not defined
torch/_linalg_utils.py:88: error: Name 'Optional' is not defined
torch/_linalg_utils.py:88: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_linalg_utils.py:88: error: Name 'Tuple' is not defined
torch/_linalg_utils.py:88: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
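The repeated `Name 'Optional' is not defined` / `Name 'Tensor' is not defined` errors share one cause: the annotations use names that were never imported at module scope. A minimal stdlib-only sketch of the pattern and its fix (`Tensor` here is a stand-in class for `torch.Tensor`, so the example is self-contained):

```python
from typing import Optional, Tuple


class Tensor:
    """Stand-in for torch.Tensor, for illustration only."""
    pass


# Without the `typing` imports above, mypy reports
# "Name 'Optional' is not defined" on annotations like these:
def matmul(a: Tensor, b: Optional[Tensor] = None) -> Tensor:
    return a if b is None else b


def lstsq(a: Tensor, b: Optional[Tensor] = None) -> Tuple[Tensor, Tensor]:
    return (a, a if b is None else b)


t = Tensor()
print(isinstance(matmul(t), Tensor))  # True
```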
torch/_jit_internal.py:17: error: Need type annotation for 'boolean_dispatched'
torch/_jit_internal.py:474: error: Need type annotation for '_overloaded_fns' (hint: "_overloaded_fns: Dict[<type>, <type>] = ...")
torch/_jit_internal.py:512: error: Need type annotation for '_overloaded_methods' (hint: "_overloaded_methods: Dict[<type>, <type>] = ...")
torch/_jit_internal.py:648: error: Incompatible types in assignment (expression has type "FinalCls", variable has type "_SpecialForm")
torch/sparse/__init__.py:11: error: Name 'Tensor' is not defined
torch/sparse/__init__.py:71: error: Name 'Tensor' is not defined
torch/sparse/__init__.py:71: error: Name 'Optional' is not defined
torch/sparse/__init__.py:71: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/sparse/__init__.py:71: error: Name 'Tuple' is not defined
torch/sparse/__init__.py:71: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/nn/init.py:109: error: Name 'Tensor' is not defined
torch/nn/init.py:126: error: Name 'Tensor' is not defined
torch/nn/init.py:142: error: Name 'Tensor' is not defined
torch/nn/init.py:165: error: Name 'Tensor' is not defined
torch/nn/init.py:180: error: Name 'Tensor' is not defined
torch/nn/init.py:194: error: Name 'Tensor' is not defined
torch/nn/init.py:287: error: Name 'Tensor' is not defined
torch/nn/init.py:315: error: Name 'Tensor' is not defined
torch/multiprocessing/reductions.py:8: error: No library stub file for standard library module 'multiprocessing.util'
torch/multiprocessing/reductions.py:9: error: No library stub file for standard library module 'multiprocessing.reduction'
torch/multiprocessing/reductions.py:17: error: No library stub file for standard library module 'multiprocessing.resource_sharer'
torch/jit/_builtins.py:72: error: Module has no attribute "_no_grad_embedding_renorm_"
torch/jit/_builtins.py:80: error: Module has no attribute "stft"
torch/jit/_builtins.py:81: error: Module has no attribute "cdist"
torch/jit/_builtins.py:82: error: Module has no attribute "norm"
torch/jit/_builtins.py:83: error: Module has no attribute "nuclear_norm"
torch/jit/_builtins.py:84: error: Module has no attribute "frobenius_norm"
torch/backends/cudnn/__init__.py:8: error: Cannot find implementation or library stub for module named 'torch._C'
torch/backends/cudnn/__init__.py:86: error: Need type annotation for '_handles' (hint: "_handles: Dict[<type>, <type>] = ...")
torch/autograd/profiler.py:13: error: Name 'ContextDecorator' already defined (possibly by an import)
torch/autograd/function.py:2: error: Cannot find implementation or library stub for module named 'torch._C'
torch/autograd/function.py:2: note: See https://mypy.readthedocs.io/en/latest/running_mypy.html#missing-imports
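The `Cannot find implementation or library stub for module named 'torch._C'` errors (and the `tqdm` / `past.builtins` skips) occur because these modules are C extensions or untyped third-party packages mypy cannot analyze. Until stubs exist, the standard per-module suppression looks like this (illustrative sketch, not necessarily the configuration this PR lands):

```ini
# mypy.ini (illustrative): torch._C is a C extension with no Python
# source, so mypy cannot follow the import.
[mypy-torch._C]
ignore_missing_imports = True

[mypy-tqdm.*]
ignore_missing_imports = True
```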
torch/autograd/function.py:109: error: Unsupported dynamic base class "with_metaclass"
torch/serialization.py:609: error: "Callable[[Any], Any]" has no attribute "cache"
torch/_lowrank.py:11: error: Name 'Tensor' is not defined
torch/_lowrank.py:13: error: Name 'Optional' is not defined
torch/_lowrank.py:13: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_lowrank.py:14: error: Name 'Optional' is not defined
torch/_lowrank.py:14: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_lowrank.py:14: error: Name 'Tensor' is not defined
torch/_lowrank.py:82: error: Name 'Tensor' is not defined
torch/_lowrank.py:82: error: Name 'Optional' is not defined
torch/_lowrank.py:82: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_lowrank.py:82: error: Name 'Tuple' is not defined
torch/_lowrank.py:82: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/_lowrank.py:130: error: Name 'Tensor' is not defined
torch/_lowrank.py:130: error: Name 'Optional' is not defined
torch/_lowrank.py:130: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_lowrank.py:130: error: Name 'Tuple' is not defined
torch/_lowrank.py:130: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/_lowrank.py:167: error: Name 'Tensor' is not defined
torch/_lowrank.py:167: error: Name 'Optional' is not defined
torch/_lowrank.py:167: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/_lowrank.py:167: error: Name 'Tuple' is not defined
torch/_lowrank.py:167: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/quantization/observer.py:45: error: Variable "torch.quantization.observer.ABC" is not valid as a type
torch/quantization/observer.py:45: note: See https://mypy.readthedocs.io/en/latest/common_issues.html#variables-vs-type-aliases
torch/quantization/observer.py:45: error: Invalid base class "ABC"
torch/quantization/observer.py:127: error: Name 'Tensor' is not defined
torch/quantization/observer.py:127: error: Name 'Tuple' is not defined
torch/quantization/observer.py:127: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/quantization/observer.py:172: error: Module has no attribute "per_tensor_symmetric"
torch/quantization/observer.py:172: error: Module has no attribute "per_channel_symmetric"
torch/quantization/observer.py:192: error: Name 'Tensor' is not defined
torch/quantization/observer.py:192: error: Name 'Tuple' is not defined
torch/quantization/observer.py:192: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/quantization/observer.py:233: error: Module has no attribute "per_tensor_symmetric"
torch/quantization/observer.py:233: error: Module has no attribute "per_channel_symmetric"
torch/quantization/observer.py:534: error: Name 'Tensor' is not defined
torch/quantization/observer.py:885: error: Name 'Tensor' is not defined
torch/quantization/observer.py:885: error: Name 'Tuple' is not defined
torch/quantization/observer.py:885: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/quantization/observer.py:894: error: Cannot determine type of 'max_val'
torch/quantization/observer.py:894: error: Cannot determine type of 'min_val'
torch/quantization/observer.py:899: error: Cannot determine type of 'min_val'
torch/quantization/observer.py:902: error: Name 'Tensor' is not defined
torch/quantization/observer.py:925: error: Name 'Tensor' is not defined
torch/quantization/observer.py:928: error: Cannot determine type of 'min_val'
torch/quantization/observer.py:929: error: Cannot determine type of 'max_val'
torch/quantization/observer.py:946: error: Argument "min" to "histc" has incompatible type "Tuple[Tensor, Tensor]"; expected "Union[int, float, bool]"
torch/quantization/observer.py:946: error: Argument "max" to "histc" has incompatible type "Tuple[Tensor, Tensor]"; expected "Union[int, float, bool]"
torch/quantization/observer.py:1056: error: Module has no attribute "per_tensor_symmetric"
torch/quantization/observer.py:1058: error: Module has no attribute "per_channel_symmetric"
torch/nn/quantized/functional.py:76: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:76: error: Name 'BroadcastingList2' is not defined
torch/nn/quantized/functional.py:259: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:259: error: Name 'Optional' is not defined
torch/nn/quantized/functional.py:259: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/nn/quantized/functional.py:289: error: Module has no attribute "ops"
torch/nn/quantized/functional.py:290: error: Module has no attribute "ops"
torch/nn/quantized/functional.py:308: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:326: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:356: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:371: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:400: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:400: error: Name 'Optional' is not defined
torch/nn/quantized/functional.py:400: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/nn/quantized/functional.py:430: error: Name 'Tensor' is not defined
torch/nn/quantized/functional.py:448: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/linear.py:26: error: Module has no attribute "ops"
torch/nn/quantized/modules/linear.py:28: error: Module has no attribute "ops"
torch/nn/quantized/modules/functional_modules.py:40: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:47: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:54: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:61: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:68: error: Name 'List' is not defined
torch/nn/quantized/modules/functional_modules.py:68: note: Did you forget to import it from "typing"? (Suggestion: "from typing import List")
torch/nn/quantized/modules/functional_modules.py:68: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:75: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:140: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:146: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:151: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:157: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:162: error: Name 'List' is not defined
torch/nn/quantized/modules/functional_modules.py:162: note: Did you forget to import it from "typing"? (Suggestion: "from typing import List")
torch/nn/quantized/modules/functional_modules.py:162: error: Name 'Tensor' is not defined
torch/nn/quantized/modules/functional_modules.py:168: error: Name 'Tensor' is not defined
torch/multiprocessing/spawn.py:9: error: Module 'torch.multiprocessing' has no attribute '_prctl_pr_set_pdeathsig'
torch/multiprocessing/__init__.py:28: error: Module has no attribute "__all__"
torch/jit/frontend.py:9: error: Cannot find implementation or library stub for module named 'torch._C._jit_tree_views'
torch/jit/annotations.py:6: error: Module 'torch._jit_internal' has no attribute 'BroadcastingList2'; maybe "BroadcastingList1" or "BroadcastingListCls"?
torch/jit/annotations.py:6: error: Module 'torch._jit_internal' has no attribute 'BroadcastingList3'; maybe "BroadcastingList1" or "BroadcastingListCls"?
torch/jit/annotations.py:9: error: Cannot find implementation or library stub for module named 'torch._C'
torch/distributions/distribution.py:16: error: Need type annotation for 'arg_constraints' (hint: "arg_constraints: Dict[<type>, <type>] = ...")
torch/distributions/distribution.py:74: error: Name 'arg_constraints' already defined on line 16
torch/distributions/distribution.py:84: error: Name 'support' already defined on line 15
torch/functional.py:114: error: Name 'Tuple' is not defined
torch/functional.py:114: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/functional.py:114: error: Name 'Optional' is not defined
torch/functional.py:114: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:189: error: Incompatible types in assignment (expression has type "None", variable has type "Tensor")
torch/functional.py:200: error: Argument 1 to "_indices_product" has incompatible type "Tuple[int, ...]"; expected "List[int]"
torch/functional.py:204: error: No overload variant of "__setitem__" of "list" matches argument types "Tensor", "int"
torch/functional.py:204: note: Possible overload variants:
torch/functional.py:204: note: def __setitem__(self, int, int) -> None
torch/functional.py:204: note: def __setitem__(self, slice, Iterable[int]) -> None
torch/functional.py:204: error: No overload variant of "__getitem__" of "list" matches argument type "Tensor"
torch/functional.py:204: note: def __getitem__(self, int) -> int
torch/functional.py:204: note: def __getitem__(self, slice) -> List[int]
torch/functional.py:207: error: "Tensor" has no attribute "copy_"
torch/functional.py:212: error: No overload variant of "__setitem__" of "list" matches argument types "Tensor", "int"
torch/functional.py:212: note: Possible overload variants:
torch/functional.py:212: note: def __setitem__(self, int, int) -> None
torch/functional.py:212: note: def __setitem__(self, slice, Iterable[int]) -> None
torch/functional.py:212: error: No overload variant of "__getitem__" of "list" matches argument type "Tensor"
torch/functional.py:212: note: def __getitem__(self, int) -> int
torch/functional.py:212: note: def __getitem__(self, slice) -> List[int]
torch/functional.py:215: error: Incompatible types in assignment (expression has type "None", variable has type "Tensor")
torch/functional.py:334: error: Name 'Optional' is not defined
torch/functional.py:334: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:429: error: Argument 2 to "pad" has incompatible type "Tuple[int, int]"; expected "List[int]"
torch/functional.py:431: error: Module has no attribute "stft"
torch/functional.py:766: error: Module has no attribute "cdist"
torch/functional.py:768: error: Module has no attribute "cdist"
torch/functional.py:770: error: Module has no attribute "cdist"
torch/functional.py:775: error: Name 'Optional' is not defined
torch/functional.py:775: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:780: error: Name 'Optional' is not defined
torch/functional.py:780: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:780: error: Name 'number' is not defined
torch/functional.py:780: error: Name 'norm' already defined on line 775
torch/functional.py:785: error: Name 'Optional' is not defined
torch/functional.py:785: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:785: error: Name 'number' is not defined
torch/functional.py:785: error: Name 'norm' already defined on line 775
torch/functional.py:790: error: Name 'Optional' is not defined
torch/functional.py:790: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:790: error: Name 'norm' already defined on line 775
torch/functional.py:795: error: Name 'norm' already defined on line 775
torch/functional.py:960: error: Name 'Any' is not defined
torch/functional.py:960: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Any")
torch/functional.py:960: error: Name 'Tuple' is not defined
torch/functional.py:960: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/functional.py:1036: error: Argument 1 to "len" has incompatible type "int"; expected "Sized"
torch/functional.py:1041: error: Name 'Optional' is not defined
torch/functional.py:1041: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:1041: error: Name 'Tuple' is not defined
torch/functional.py:1041: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/functional.py:1056: error: Name 'Optional' is not defined
torch/functional.py:1056: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/functional.py:1056: error: Name 'Tuple' is not defined
torch/functional.py:1056: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Tuple")
torch/distributions/von_mises.py:87: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/negative_binomial.py:25: error: Incompatible types in assignment (expression has type "_IntegerGreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/multivariate_normal.py:116: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/laplace.py:23: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/independent.py:34: error: Need type annotation for 'arg_constraints' (hint: "arg_constraints: Dict[<type>, <type>] = ...")
torch/distributions/cauchy.py:28: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/poisson.py:28: error: Incompatible types in assignment (expression has type "_IntegerGreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/one_hot_categorical.py:32: error: Incompatible types in assignment (expression has type "_Simplex", base class "Distribution" defined the type as "None")
torch/distributions/normal.py:27: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/lowrank_multivariate_normal.py:79: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/gamma.py:30: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/exponential.py:23: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/fishersnedecor.py:25: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/dirichlet.py:44: error: Incompatible types in assignment (expression has type "_Simplex", base class "Distribution" defined the type as "None")
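The long run of `torch/distributions/*` errors of the form `base class "Distribution" defined the type as "None"` has a single root cause: the base class initializes class attributes to a bare `None`, so mypy infers their type as `None` and rejects every subclass override. A stdlib-only sketch of the pattern and its usual fix (`Constraint` is a stand-in for `torch.distributions.constraints.Constraint`):

```python
from typing import Dict, Optional


class Constraint:
    """Stand-in for torch.distributions.constraints.Constraint."""
    pass


class Distribution:
    # Annotating with Optional[...] (instead of a bare `= None`)
    # lets subclasses assign concrete Constraint values without
    # "Incompatible types in assignment" errors:
    support: Optional[Constraint] = None
    arg_constraints: Dict[str, Constraint] = {}


class Normal(Distribution):
    support = Constraint()  # accepted under the annotated base class
    arg_constraints = {"loc": Constraint(), "scale": Constraint()}


print(isinstance(Normal.support, Constraint))  # True
```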
torch/nn/quantized/dynamic/modules/rnn.py:230: error: Incompatible types in assignment (expression has type "int", variable has type "Tensor")
torch/nn/quantized/dynamic/modules/rnn.py:232: error: Incompatible types in assignment (expression has type "int", variable has type "Tensor")
torch/nn/quantized/dynamic/modules/rnn.py:236: error: Incompatible return value type (got "Tuple[Any, Tensor, Any]", expected "Tuple[int, int, int]")
torch/nn/quantized/dynamic/modules/rnn.py:351: error: Incompatible types in assignment (expression has type "Type[LSTM]", base class "RNNBase" defined the type as "Type[RNNBase]")
torch/nn/quantized/dynamic/modules/rnn.py:381: error: Module has no attribute "quantized_lstm"
torch/nn/quantized/dynamic/modules/rnn.py:385: error: Module has no attribute "quantized_lstm"
torch/nn/quantized/dynamic/modules/rnn.py:414: error: Argument 1 to "forward_impl" of "LSTM" has incompatible type "PackedSequence"; expected "Tensor"
torch/nn/quantized/dynamic/modules/rnn.py:416: error: Incompatible types in assignment (expression has type "PackedSequence", variable has type "Tensor")
torch/nn/quantized/dynamic/modules/rnn.py:418: error: Incompatible return value type (got "Tuple[Tensor, Tuple[Tensor, Tensor]]", expected "Tuple[PackedSequence, Tuple[Tensor, Tensor]]")
torch/nn/quantized/dynamic/modules/rnn.py:420: error: Argument 1 of "permute_hidden" is incompatible with supertype "RNNBase"; supertype defines the argument type as "Tensor"
torch/nn/quantized/dynamic/modules/rnn.py:420: error: Return type "Tuple[Tensor, Tensor]" of "permute_hidden" incompatible with return type "Tensor" in supertype "RNNBase"
torch/nn/quantized/dynamic/modules/rnn.py:426: error: Argument 2 of "check_forward_args" is incompatible with supertype "RNNBase"; supertype defines the argument type as "Tensor"
torch/nn/intrinsic/qat/modules/conv_fused.py:232: error: Incompatible types in assignment (expression has type "Type[ConvBnReLU2d]", base class "ConvBn2d" defined the type as "Type[ConvBn2d]")
torch/distributions/beta.py:27: error: Incompatible types in assignment (expression has type "_Interval", base class "Distribution" defined the type as "None")
torch/distributions/geometric.py:31: error: Incompatible types in assignment (expression has type "_IntegerGreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/continuous_bernoulli.py:38: error: Incompatible types in assignment (expression has type "_Interval", base class "Distribution" defined the type as "None")
torch/distributions/bernoulli.py:30: error: Incompatible types in assignment (expression has type "_Boolean", base class "Distribution" defined the type as "None")
torch/quantization/fake_quantize.py:126: error: Module has no attribute "per_tensor_symmetric"
torch/quantization/fake_quantize.py:132: error: Module has no attribute "per_channel_symmetric"
torch/distributions/transformed_distribution.py:41: error: Need type annotation for 'arg_constraints' (hint: "arg_constraints: Dict[<type>, <type>] = ...")
torch/jit/__init__.py:1: error: Cannot find implementation or library stub for module named 'torch._C'
torch/jit/__init__.py:15: error: Module 'torch.utils' has no attribute 'set_module'
torch/jit/__init__.py:70: error: Name 'Attribute' already defined on line 68
torch/jit/__init__.py:213: error: On Python 3 '{}'.format(b'abc') produces "b'abc'"; use !r if this is a desired behavior
torch/jit/__init__.py:215: error: On Python 3 '{}'.format(b'abc') produces "b'abc'"; use !r if this is a desired behavior
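The `On Python 3 '{}'.format(b'abc') produces "b'abc'"` errors flag a real behavior change from Python 2: formatting a bytes object into a string embeds its repr, `b'...'` quoting included. A minimal illustration of the pitfall and the usual fix of decoding first:

```python
payload = b"abc"

# The flagged pattern: str.format on bytes embeds the repr.
bad = "got: {}".format(payload)
print(bad)  # got: b'abc'

# The usual fix: decode the bytes explicitly before formatting.
good = "got: {}".format(payload.decode("utf-8"))
print(good)  # got: abc
```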
torch/jit/__init__.py:1524: error: Unsupported dynamic base class "with_metaclass"
torch/jit/__init__.py:1869: error: Name 'ScriptModule' already defined on line 1524
torch/jit/__init__.py:1998: error: Need type annotation for '_jit_caching_layer'
torch/jit/__init__.py:1999: error: Need type annotation for '_jit_function_overload_caching'
torch/distributions/relaxed_categorical.py:34: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/relaxed_categorical.py:108: error: Incompatible types in assignment (expression has type "_Simplex", base class "Distribution" defined the type as "None")
torch/distributions/relaxed_bernoulli.py:31: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/relaxed_bernoulli.py:114: error: Incompatible types in assignment (expression has type "_Interval", base class "Distribution" defined the type as "None")
torch/distributions/logistic_normal.py:31: error: Incompatible types in assignment (expression has type "_Simplex", base class "Distribution" defined the type as "None")
torch/distributions/log_normal.py:26: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/half_normal.py:27: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/half_cauchy.py:28: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/gumbel.py:28: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/nn/quantized/modules/conv.py:18: error: Module 'torch.nn.utils' has no attribute 'fuse_conv_bn_weights'
torch/nn/quantized/modules/conv.py:209: error: Name 'Optional' is not defined
torch/nn/quantized/modules/conv.py:209: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/nn/quantized/modules/conv.py:214: error: Module has no attribute "ops"
torch/nn/quantized/modules/conv.py:321: error: Name 'Optional' is not defined
torch/nn/quantized/modules/conv.py:321: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/nn/quantized/modules/conv.py:323: error: Module has no attribute "ops"
torch/nn/quantized/modules/conv.py:447: error: Name 'Optional' is not defined
torch/nn/quantized/modules/conv.py:447: note: Did you forget to import it from "typing"? (Suggestion: "from typing import Optional")
torch/nn/quantized/modules/conv.py:449: error: Module has no attribute "ops"
torch/nn/quantized/modules/conv.py:513: error: Name 'nn.modules.conv._ConvTransposeNd' is not defined
torch/nn/quantized/modules/conv.py:525: error: Name 'List' is not defined
torch/nn/quantized/modules/conv.py:525: note: Did you forget to import it from "typing"? (Suggestion: "from typing import List")
torch/nn/quantized/modules/conv.py:527: error: Name 'List' is not defined
torch/nn/quantized/modules/conv.py:527: note: Did you forget to import it from "typing"? (Suggestion: "from typing import List")
torch/nn/intrinsic/quantized/modules/conv_relu.py:8: error: Module 'torch.nn.utils' has no attribute 'fuse_conv_bn_weights'
torch/nn/intrinsic/quantized/modules/conv_relu.py:21: error: Incompatible types in assignment (expression has type "Type[ConvReLU2d]", base class "Conv2d" defined the type as "Type[Conv2d]")
torch/nn/intrinsic/quantized/modules/conv_relu.py:62: error: Incompatible types in assignment (expression has type "Type[ConvReLU3d]", base class "Conv3d" defined the type as "Type[Conv3d]")
torch/distributions/weibull.py:25: error: Incompatible types in assignment (expression has type "_GreaterThan", base class "Distribution" defined the type as "None")
torch/distributions/kl.py:35: error: Need type annotation for '_KL_MEMOIZE' (hint: "_KL_MEMOIZE: Dict[<type>, <type>] = ...")
torch/distributions/studentT.py:27: error: Incompatible types in assignment (expression has type "_Real", base class "Distribution" defined the type as "None")
torch/distributions/mixture_same_family.py:48: error: Need type annotation for 'arg_constraints' (hint: "arg_constraints: Dict[<type>, <type>] = ...")
torch/distributions/__init__.py:158: error: Name 'transforms' is not defined
torch/onnx/utils.py:21: error: Cannot find implementation or library stub for module named 'torch._C'
torch/distributed/rendezvous.py:4: error: Cannot find implementation or library stub for module named 'urlparse'
torch/distributed/rendezvous.py:4: error: Name 'urlparse' already defined (possibly by an import)
torch/distributed/rendezvous.py:4: error: Name 'urlunparse' already defined (possibly by an import)
torch/distributed/rendezvous.py:9: error: Module 'torch.distributed' has no attribute 'FileStore'
torch/distributed/rendezvous.py:9: error: Module 'torch.distributed' has no attribute 'TCPStore'
torch/distributed/rendezvous.py:65: error: On Python 3 '{}'.format(b'abc') produces "b'abc'"; use !r if this is a desired behavior
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'AllreduceOptions'; maybe "ReduceOptions" or "AllreduceCoalescedOptions"?
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'AllreduceCoalescedOptions'; maybe "AllreduceOptions"?
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'AllToAllOptions'
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'BroadcastOptions'
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'GatherOptions'; maybe "ScatterOptions"?
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'ReduceOptions'; maybe "AllreduceOptions", "ReduceScatterOptions", or "ReduceOp"?
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'ReduceScatterOptions'; maybe "ScatterOptions" or "ReduceOptions"?
torch/distributed/distributed_c10d.py:11: error: Module 'torch.distributed' has no attribute 'ScatterOptions'; maybe "ReduceScatterOptions" or
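One recurring pattern flagged in the log above is Python 3 bytes formatting; a minimal illustration of the warning and its suggested `!r` fix:

```python
payload = b"abc"

# On Python 3, str.format interpolates the repr of a bytes object,
# which is usually not the intended text.
assert "{}".format(payload) == "b'abc'"

# If the repr really is desired, say so explicitly with !r ...
assert "{!r}".format(payload) == "b'abc'"

# ... otherwise decode the bytes first.
assert "{}".format(payload.decode()) == "abc"
```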
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36584
Reviewed By: seemethere, ailzhang
Differential Revision: D21155985
Pulled By: ezyang
fbshipit-source-id: f628d4293992576207167e7c417998fad15898d1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35615
Python 2 has reached end-of-life and is no longer supported by PyTorch.
Now we can clean up a lot of cruft that we put in place to support it.
These changes were all done manually, and I skipped anything that seemed
like it would take more than a few seconds, so I think it makes sense to
review it manually as well (though using side-by-side view and ignoring
whitespace change might be helpful).
Test Plan: CI
Differential Revision: D20842886
Pulled By: dreiss
fbshipit-source-id: 8cad4e87c45895e7ce3938a88e61157a79504aed
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36741
Create a child workspace that shares the parent workspace's blobs. Register the child workspace in the registrar so we can switch into it and feed blobs to it alone.
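The sharing semantics can be sketched with a `ChainMap` standing in for the C++ `Workspace` (a toy model, not the actual implementation): lookups fall through to the parent, while feeds go to the child alone.

```python
from collections import ChainMap

parent = {"weights": [1.0, 2.0]}
child = ChainMap({}, parent)          # child resolves misses via parent

child["activations"] = [0.5]          # fed to the child workspace only
assert child["weights"] == [1.0, 2.0]  # parent blobs are visible
assert "activations" not in parent     # parent stays untouched
```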
Test Plan: numeric suite unit tests in stacked diff
Reviewed By: hx89
Differential Revision: D21055567
fbshipit-source-id: 374b12aef75a4c58452c271f8961ee156ce6c559
Summary: It was always skipped for the last 1.5 years (since D10372230 landed)
Test Plan: CI
Reviewed By: ailzhang
Differential Revision: D21036194
fbshipit-source-id: 9ace60b45a123a9372a88310b91f33a69ae8880c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36399
Added caffe2 python wrapper and unit test for the STORM C++ operator.
Test Plan:
All newly added unit tests passed using "buck test //caffe2/caffe2/python:optimizer_test -- TestStorm"
{F233644598}
Reviewed By: chocjy
Differential Revision: D18841013
fbshipit-source-id: f692bc18412839db140202ec9a971e556db0e54f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36225
Implemented the [STORM](https://arxiv.org/abs/1905.10018) optimizer operator for dense and sparse cases.
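The recursive-momentum update at the core of STORM (from the linked paper) can be sketched as follows; this is a simplified, deterministic-gradient toy version, not the Caffe2 operator, which also handles the adaptive step size and the sparse case:

```python
def storm_step(x, d_prev, x_prev, grad, a=0.1, lr=0.1):
    # Recursive momentum: d_t = g(x_t) + (1 - a) * (d_{t-1} - g(x_{t-1})).
    # With stochastic gradients both g terms use the same sample,
    # which is what gives the variance reduction.
    d = grad(x) + (1.0 - a) * (d_prev - grad(x_prev))
    return x - lr * d, d

grad = lambda x: 2.0 * x  # toy objective f(x) = x^2
x = x_prev = 5.0
d = grad(x)
for _ in range(200):
    x_new, d = storm_step(x, d, x_prev, grad)
    x_prev, x = x, x_new
```

On this deterministic quadratic the iterate contracts geometrically toward the minimum at 0.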
Test Plan:
All newly added unit tests passed using "buck test //caffe2/caffe2/python/operator_test:storm_test".
{F233643713}
Reviewed By: chocjy
Differential Revision: D18702897
fbshipit-source-id: d25eeb492aa2a03c69754d3f076a8239230b3bf4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36558
In the log we frequently see a large chunk of "Using engine xx for rowWise Adagrad" messages, but without information on which parameter it is applied to.
Test Plan: Should be covered by existing testing that use optimizer
Reviewed By: chocjy
Differential Revision: D20985176
fbshipit-source-id: 6eb4e19e5307db53fc89b38594a3f303f1492a1c
Summary:
Make the e2e FakeLowP python tests work with Glow lowering in OSS environment. Added a README.md as a guideline.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36525
Reviewed By: hyuen
Differential Revision: D21004706
Pulled By: yinghai
fbshipit-source-id: d182152e4a1a3368640bd7872cb9ea4d4bff4b02
Summary:
We open sourced the FakeLowp ops as a reference implementation of fp16 ops. This PR makes them buildable.
```
USE_CUDA=0 USE_ROCM=0 USE_FAKELOWP=ON python setup.py install
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36170
Test Plan:
Build Onnxifi library in Glow.
```
cp ${GLOW}/build/lib/Onnxifi/libonnxifi-glow.so ${MY_PATH}/libonnxifi.so
LD_LIBRARY_PATH=${MY_PATH} python pytorch/caffe2/python/fakelowp/test_sls_nnpi_fp16.py
```
It doesn't run successfully right now because we need to open source the glow gflags and some other ops like `FbgemmPack`.
Reviewed By: houseroad
Differential Revision: D20980681
Pulled By: yinghai
fbshipit-source-id: 6dd31883a985850a77261bcc527029479bbc303f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36409
Pull Request resolved: https://github.com/pytorch/glow/pull/4409
Since glow OSS doesn't really ship with python, it's much easier to do it in pytorch. All the glow dependencies can be resolved through LD_LIBRARY_PATH in OSS.
Test Plan:
```
buck test caffe2/caffe2/python/fakelowp:
```
Reviewed By: amylittleyang
Differential Revision: D20969308
fbshipit-source-id: 06a02d23f4972a92beb18e1d052e27d8724539d0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34527
Adding support for prune_delays and prune ratios in the Adagrad optimizer.
Test Plan:
Tested via unit tests in masked_adagrad_optimizer_test. Added unit test for prune_delay versions of MaskedAdagrad
buck build caffe2/caffe2/fb/optimizers:masked_adagrad_optimizer_test; buck-out/gen/caffe2/caffe2/fb/optimizers/masked_adagrad_optimizer_test#binary.par
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- 'test_pruning'
All Dper tests passed https://our.intern.facebook.com/intern/testinfra/testrun/7599824380741217
Reviewed By: chocjy
Differential Revision: D20313419
fbshipit-source-id: 5c2c8d4e0fc2ec538bcd6f145c6b87a2381f90f3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36267
This makes PythonOp throw the original python exception instead of wrapping it in a c10::Error type. This allows throwing exceptions from Python and preserving the type when they're caught again in Python. This is important for structured logging and handling non-retryable error types.
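The difference in caller-visible behavior can be sketched in pure Python (the names here are illustrative, not the actual PythonOp internals):

```python
class NonRetryableError(Exception):
    """Hypothetical error type a PythonOp body might raise."""

def op_body():
    raise NonRetryableError("bad input")

def run_wrapped(body):
    # Old behavior (sketch): wrapping loses the original exception type.
    try:
        body()
    except Exception as e:
        raise RuntimeError("PythonOp failed: {}".format(e))

def run_passthrough(body):
    # New behavior (sketch): the exception propagates unchanged, so
    # callers can dispatch on its type (e.g. to decide retryability).
    body()

try:
    run_wrapped(op_body)
except NonRetryableError:
    caught = "typed"
except RuntimeError:
    caught = "wrapped"
assert caught == "wrapped"

try:
    run_passthrough(op_body)
except NonRetryableError:
    caught = "typed"
assert caught == "typed"
```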
Test Plan: buck test caffe2/caffe2/python:python_op_test
Reviewed By: wenqicaofb
Differential Revision: D20928098
fbshipit-source-id: 001747f022c657b420f8450b84d64f4d57f6cdf6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35763
Adds inference function and test for ScatterAssign
Test Plan: Updated unit test
Reviewed By: yyetim, shunting1986
Differential Revision: D20501079
fbshipit-source-id: 7ec6ef0127a151250dd699c90c2b80c35cfb1fe4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35857
This fixes a lot of common ops for InferBlobShapesAndTypes and adds support for testing the inferred shapes and types of gradient ops.
Ops:
* Concat
* Split
* LeakyReLU
* Relu
* Prelu
* Gelu
* Elu
* Sinh, Tanh, Cosh
* Abs
* ... and a number of other simple element wise ops
Test Plan:
Added support to hypothesis test to check the shape and type of gradient ops.
Enabled it for all the ops I fixed the shape and type inference for.
buck test caffe2/caffe2/python/operator_test:
Reviewed By: pradeepd24
Differential Revision: D20806284
fbshipit-source-id: 77f796d9ff208e09e871bdbadf9a0a7c196b77f2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35555
Att. So that we can lower the SparseLengthsSum* part of SparseLengthsSum*Sparse. We update the tying policy between Gather and SparseLengthsWeightedSum* so that we don't bother lowering a single Gather into the backend, which is inefficient to execute on card and creates bubbles between continuous lowering graphs.
Test Plan:
```
buck test glow/fb/test:test_onnxifinnpi
```
Reviewed By: ipiszy
Differential Revision: D20688525
fbshipit-source-id: cb8e38239057ff13a8d385ed09d0d019421de78b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35507
We want to split up the SparseLengthsSumSparse op into an indirection op and the SparseLengthsSum op so that we can lower the latter part. The indirection part is a plain impl for now.
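The two-stage split can be sketched in NumPy (names are illustrative; handling of pruned entries, which the real op marks with -1 in the compression mapping, is omitted):

```python
import numpy as np

def sparse_lengths_sum(table, indices, lengths):
    # Reference SparseLengthsSum: gather rows, then sum per segment.
    out, pos = [], 0
    for ln in lengths:
        out.append(table[indices[pos:pos + ln]].sum(axis=0))
        pos += ln
    return np.stack(out)

def sparse_lengths_sum_sparse(table, indices, lengths, compression_map):
    # Stage 1 (the indirection): translate raw ids into rows of the
    # compressed table via the compressed-indices mapping.
    remapped = compression_map[indices]
    # Stage 2 is then a plain SparseLengthsSum, which is the part
    # that can be lowered to the backend.
    return sparse_lengths_sum(table, remapped, lengths)

table = np.array([[1.0], [2.0], [3.0]])       # compressed (pruned) table
compression_map = np.array([0, 0, 1, 1, 2, 2])  # raw id -> compressed row
out = sparse_lengths_sum_sparse(table, np.array([0, 2, 4]),
                                np.array([2, 1]), compression_map)
```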
Test Plan:
```
for i in `seq 10`; do buck test caffe2/caffe2/python/operator_test:lengths_reducer_fused_nbit_rowwise_ops_test -- test_sparse_lengths_sum_rowwise_sparse; done
```
Reviewed By: jspark1105
Differential Revision: D20683478
fbshipit-source-id: 509effe88719d20aa0c4783bbe0ce1f183ee473c
Summary:
## Motivation
This PR upgrades MKL-DNN from v0.20 to DNNL v1.2 and resolves https://github.com/pytorch/pytorch/issues/30300.
DNNL (Deep Neural Network Library) is the new brand of MKL-DNN, which improves performance, quality, and usability over the old version.
This PR focuses on the migration of all existing functionalities, including minor fixes, performance improvements and code clean-up. It serves as the cornerstone of our future efforts to accommodate new features like OpenCL support, BF16 training, INT8 inference, etc., and to let the PyTorch community derive more benefits from the Intel Architecture.
<br>
## What's included?
Even though DNNL has many breaking changes to the API, we managed to absorb most of them in ideep. This PR contains minimal changes to the integration code in pytorch. Below is a summary of the changes:
<br>
**General:**
1. Replace op-level allocator with global-registered allocator
```
// before
ideep::sum::compute<AllocForMKLDNN>(scales, {x, y}, z);
// after
ideep::sum::compute(scales, {x, y}, z);
```
The allocator is now registered at `aten/src/ATen/native/mkldnn/IDeepRegistration.cpp`. Thereafter all tensors derived from the `cpu_engine` (by default) will use the c10 allocator.
```
RegisterEngineAllocator cpu_alloc(
ideep::engine::cpu_engine(),
[](size_t size) {
return c10::GetAllocator(c10::DeviceType::CPU)->raw_allocate(size);
},
[](void* p) {
c10::GetAllocator(c10::DeviceType::CPU)->raw_deallocate(p);
}
);
```
------
2. Simplify group convolution
We had a scenario in convolution where the ideep tensor shape mismatched the aten tensor: when `groups > 1`, DNNL expects weight tensors to be 5-d with an extra group dimension, e.g. `goihw` instead of `oihw` in the 2d conv case.
As shown below, a lot of extra checks came with this difference in shape before. Now we've completely hidden this difference in ideep and all tensors are going to align with pytorch's definition. So we could safely remove these checks from both aten and c2 integration code.
```
// aten/src/ATen/native/mkldnn/Conv.cpp
if (w.ndims() == x.ndims() + 1) {
AT_ASSERTM(
groups > 1,
"Only group _mkldnn_conv2d weights could have been reordered to 5d");
kernel_size[0] = w.get_dim(0) * w.get_dim(1);
std::copy_n(
w.get_dims().cbegin() + 2, x.ndims() - 1, kernel_size.begin() + 1);
} else {
std::copy_n(w.get_dims().cbegin(), x.ndims(), kernel_size.begin());
}
```
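The layout difference being hidden here is just a reshape between the two weight views; a small NumPy illustration of the shapes (shape names only, not the actual ideep code):

```python
import numpy as np

groups, out_ch, in_per_group, kh, kw = 2, 4, 3, 3, 3

# PyTorch keeps grouped conv weights 4-d (oihw) ...
w_oihw = np.arange(out_ch * in_per_group * kh * kw,
                   dtype=np.float32).reshape(out_ch, in_per_group, kh, kw)

# ... while DNNL expects a 5-d layout with an explicit leading group
# dimension (goihw). Both views hold the same data.
w_goihw = w_oihw.reshape(groups, out_ch // groups, in_per_group, kh, kw)

assert w_goihw.shape == (2, 2, 3, 3, 3)
```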
------
3. Enable DNNL built-in cache
Previously, we stored DNNL jitted kernels along with intermediate buffers inside ideep using an LRU cache. Now we are switching to the newly added DNNL built-in cache, and **no longer** caching buffers in order to reduce memory footprint.
This change will mainly be reflected as lower memory usage in memory profiling results. On the code side, we removed a couple of lines of `op_key_` that depended on the ideep cache before.
------
4. Use 64-bit integer to denote dimensions
We changed the type of `ideep::dims` from `vector<int32_t>` to `vector<int64_t>`. This renders ideep dims no longer compatible with the 32-bit dims used by caffe2. So we use something like `{stride_.begin(), stride_.end()}` to cast the parameter `stride_` into an int64 vector.
<br>
**Misc changes in each commit:**
**Commit:** change build options
Some build options were slightly changed, mainly to avoid name collisions with other projects that include DNNL as a subproject. In addition, DNNL built-in cache is enabled by option `DNNL_ENABLE_PRIMITIVE_CACHE`.
Old | New
-- | --
WITH_EXAMPLE | MKLDNN_BUILD_EXAMPLES
WITH_TEST | MKLDNN_BUILD_TESTS
MKLDNN_THREADING | MKLDNN_CPU_RUNTIME
MKLDNN_USE_MKL | N/A (not use MKL anymore)
------
**Commit:** aten reintegration
- aten/src/ATen/native/mkldnn/BinaryOps.cpp
Implement binary ops using new operation `binary` provided by DNNL
- aten/src/ATen/native/mkldnn/Conv.cpp
Clean up group convolution checks
Simplify conv backward integration
- aten/src/ATen/native/mkldnn/MKLDNNConversions.cpp
Simplify prepacking convolution weights
- test/test_mkldnn.py
Fixed an issue in the conv2d unit test: it didn't compare conv results between the mkldnn and aten implementations before. Instead, it compared mkldnn with mkldnn, since the default cpu path also goes through mkldnn. Now we use `torch.backends.mkldnn.flags` to fix this issue
- torch/utils/mkldnn.py
Prepack the weight tensor on module `__init__` to significantly improve performance
------
**Commit:** caffe2 reintegration
- caffe2/ideep/ideep_utils.h
Clean up unused type definitions
- caffe2/ideep/operators/adam_op.cc & caffe2/ideep/operators/momentum_sgd_op.cc
Unify tensor initialization with `ideep::tensor::init`. Obsolete `ideep::tensor::reinit`
- caffe2/ideep/operators/conv_op.cc & caffe2/ideep/operators/quantization/int8_conv_op.cc
Clean up group convolution checks
Revamp convolution API
- caffe2/ideep/operators/conv_transpose_op.cc
Clean up group convolution checks
Clean up deconv workaround code
------
**Commit:** custom allocator
- Register c10 allocator as mentioned above
<br><br>
## Performance
We tested inference on some common models based on user scenarios, and most performance numbers are either better than or on par with DNNL 0.20.
ratio: new / old | Latency (batch=1 4T) | Throughput (batch=64 56T)
-- | -- | --
pytorch resnet18 | 121.4% | 99.7%
pytorch resnet50 | 123.1% | 106.9%
pytorch resnext101_32x8d | 116.3% | 100.1%
pytorch resnext50_32x4d | 141.9% | 104.4%
pytorch mobilenet_v2 | 163.0% | 105.8%
caffe2 alexnet | 303.0% | 99.2%
caffe2 googlenet-v3 | 101.1% | 99.2%
caffe2 inception-v1 | 102.2% | 101.7%
caffe2 mobilenet-v1 | 356.1% | 253.7%
caffe2 resnet101 | 100.4% | 99.8%
caffe2 resnet152 | 99.8% | 99.8%
caffe2 shufflenet | 141.1% | 69.0% †
caffe2 squeezenet | 98.5% | 99.2%
caffe2 vgg16 | 136.8% | 100.6%
caffe2 googlenet-v3 int8 | 100.0% | 100.7%
caffe2 mobilenet-v1 int8 | 779.2% | 943.0%
caffe2 resnet50 int8 | 99.5% | 95.5%
_Configuration:
Platform: Skylake 8180
Latency Test: 4 threads, warmup 30, iteration 500, batch size 1
Throughput Test: 56 threads, warmup 30, iteration 200, batch size 64_
† Shufflenet is one of the few models that require temp buffers during inference. The performance degradation is an expected issue since we no longer cache any buffers in ideep. As a solution, we suggest users opt for a caching allocator like **jemalloc** as a drop-in replacement for the system allocator in such heavy workloads.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32422
Test Plan:
Perf results: https://our.intern.facebook.com/intern/fblearner/details/177790608?tab=Experiment%20Results
10% improvement for ResNext with avx512, neutral on avx2
More results: https://fb.quip.com/ob10AL0bCDXW#NNNACAUoHJP
Reviewed By: yinghai
Differential Revision: D20381325
Pulled By: dzhulgakov
fbshipit-source-id: 803b906fd89ed8b723c5fcab55039efe3e4bcb77
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35430
This fixes and adds tests for several commonly used operators.
There's some formatting differences due to running clang-format on one of the files.
Test Plan: buck test //caffe2/caffe2/fb/operators:hypothesis_test //caffe2/caffe2/python/operator_test:utility_ops_test //caffe2/caffe2/python/operator_test:concat_split_op_test
Reviewed By: yyetim
Differential Revision: D20657405
fbshipit-source-id: 51d86d0834003b8ac8d6acb5149ae13d7bbfc6ab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35346
The weight scale op doesn't have a GPU implementation. This is breaking OSS CI from D20506032. Making it CPU-only.
Test Plan: OSS CI
Reviewed By: ustctf
Differential Revision: D20637440
fbshipit-source-id: 9aa6cce63ce637ab7856788e5d02f527decb2a26
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34394
# SWA operator
In this diff, we added a new operator `SWA` which will be used in `AdaGradOptimizer`.
The algorithm looks like:
{F230902995}
# Background
In our testing, we found that this operator considerably improves our models' reproducibility (KT: 0.86 -> 0.92).
So we hope to land this operator and, in the future, enable it by default in our models.
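The operator itself is defined in the diff; the running-average update at the heart of stochastic weight averaging can be sketched as (a generic sketch, not the Caffe2 `SWA` op):

```python
import numpy as np

def swa_update(w_avg, w, n_averaged):
    # Running average of the optimizer iterates:
    #   w_avg <- (w_avg * n + w) / (n + 1)
    return (w_avg * n_averaged + w) / (n_averaged + 1)

w_avg = np.zeros(3)
for n, w in enumerate([np.full(3, v) for v in (1.0, 2.0, 3.0)]):
    w_avg = swa_update(w_avg, w, n)
# After three iterates 1, 2, 3 the average is their mean, 2.0.
```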
Test Plan:
Local build `aml.dper3:30f068668cfb408fbb40141fb17129f2` and bento kernel.
- Local test: n215857
- f174600345
Reviewed By: chocjy
Differential Revision: D20165239
fbshipit-source-id: c03cdd048cb10b091e5f06323f4c0f3999f95d8a
Summary: Add transfer_learning_blob_name_mappings into layer_model_helper to support layer model transfer learning
Reviewed By: mraway
Differential Revision: D20286298
fbshipit-source-id: de3e029611d843f38d3f42ecd4148358f7e14a2b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34903
Reattempt of D20461609
Moving 2/4-bit SLS and row-wise 2/4-bit conversion operator to open source to be used by DLRM
Test Plan: CI
Reviewed By: jianyuh
Differential Revision: D20495304
fbshipit-source-id: 66a99677583f50fd40e29c514710c7b1a8cdbc29
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34783
Moving 2/4-bit SLS and row-wise 2/4-bit conversion operator to open source to be used by DLRM
Test Plan: CI
Reviewed By: yinghai
Differential Revision: D20461609
fbshipit-source-id: b3ef73ff10f2433afe06ffa73fe1145282d9ec4c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34515
Once upon a time we thought this was necessary. In reality it is not, so
removing it.
For backcompat, our public interface (defined in `api/`) still has
typedefs to the old `script::` names.
There was only one collision: `Pass` as a `Stmt` and `Pass` as a graph
transform. I renamed one of them.
Test Plan: Imported from OSS
Differential Revision: D20353503
Pulled By: suo
fbshipit-source-id: 48bb911ce75120a8c9e0c6fb65262ef775dfba93
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34318
Stop checking whether we have AMD GPU devices on the host, because we may be constructing a net on a machine without a GPU and running the net on another one with a GPU
Reviewed By: ajauhri
Differential Revision: D20269562
fbshipit-source-id: 1f561086cacdcead3ce7c03c2d02c25336c8b11a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33977
Removing python2 from operator_test so we can retire python2 support for PyTorch.
Test Plan: waitforsandcastle
Reviewed By: seemethere
Differential Revision: D20129500
fbshipit-source-id: d4c82e4acfc795be9bec6a162c713e37ffb9f5ff
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33431
Some elementwise operators don't have shape and type inference specified for the output tensor: `BitwiseOr`, `BitwiseAnd`, `BitwiseXor`, `Not`, `Sign`.
This change fixes this issue:
- For `Not` and `Sign` operators, the output has the same type and shape as the input, so `IdenticalTypeAndShapeOfInput` function is used to specify that.
- For bitwise operators created by `CAFFE2_SCHEMA_FOR_BINARY_BITWISE_OP` macro, the type and shape inference rules should be the same as for other binary element-wise operators, so `TensorInferenceFunction(ElementwiseOpShapeInference)` is used to specify that.
Also some tests were modified to ensure that the shape and type are inferred (`ensure_outputs_are_inferred` parameter)
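The rules being encoded can be illustrated directly, with NumPy standing in for the operators:

```python
import numpy as np

# Unary ops like Sign (and Not): the output has the same shape and
# type as the input, which is what IdenticalTypeAndShapeOfInput states.
x = np.array([[-2, 0, 3]], dtype=np.int64)
assert np.sign(x).shape == x.shape and np.sign(x).dtype == x.dtype

# Binary bitwise ops: the same elementwise broadcasting rule as other
# binary elementwise operators.
a = np.zeros((3, 1), dtype=np.int32)
b = np.ones((1, 4), dtype=np.int32)
assert (a | b).shape == (3, 4)
```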
Test Plan:
```
CAFFE2_ASSERT_SHAPEINFERENCE=1 buck test caffe2/caffe2/python/operator_test:elementwise_ops_test
CAFFE2_ASSERT_SHAPEINFERENCE=1 buck test caffe2/caffe2/python/operator_test:math_ops_test
```
Note that the tests have to be executed with `CAFFE2_ASSERT_SHAPEINFERENCE=1` in order to fail upon shape inference failure.
Reviewed By: idning
Differential Revision: D19880164
fbshipit-source-id: 5d7902e045d79e5669e5e98dfb13a39711294939
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33426
Make 2/4/8-bit fused rowwise conversion operators more general to work for N-dim tensors
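A sketch of the fused-rowwise idea and how N-dim inputs reduce to it (this is an illustrative NumPy model, not the FBGEMM kernel): each row carries its own scale and bias next to the quantized bytes, and an N-dim tensor is handled by collapsing all leading dimensions into rows.

```python
import numpy as np

def fused_rowwise_quantize_8bit(x):
    # Collapse leading dims: every row of the 2-d view is quantized
    # independently with its own (scale, bias) pair.
    rows = x.reshape(-1, x.shape[-1]).astype(np.float32)
    bias = rows.min(axis=1, keepdims=True)
    scale = (rows.max(axis=1, keepdims=True) - bias) / 255.0
    scale[scale == 0.0] = 1.0            # guard constant rows
    q = np.round((rows - bias) / scale).astype(np.uint8)
    return q, scale, bias

def fused_rowwise_dequantize(q, scale, bias, shape):
    return (q * scale + bias).reshape(shape)

np.random.seed(0)
x = np.random.rand(2, 3, 8).astype(np.float32)   # 3-d input
q, scale, bias = fused_rowwise_quantize_8bit(x)
x_hat = fused_rowwise_dequantize(q, scale, bias, x.shape)
# Round-trip error is bounded by half a quantization step per row.
```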
Test Plan: CI
Reviewed By: ellie-wen
Differential Revision: D19943136
fbshipit-source-id: 47008544dd7e1d11a346d34f35449e0fcc0e7ee0
Summary: In dper2, the local net is hard-coded by whitelisting some layers. Add the SparseFeatureGating-related layers to the local net explicitly.
Test Plan:
* workflow: f167812211
* QRT: fall back looks normal
{F228442018}
Differential Revision: D19852280
fbshipit-source-id: 6fecc3d745c3f742d029575a7b9fe320618f1863
Summary:
For both the Caffe2 and PyTorch backends, enable 3D convolutions through MIOpen.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33067
Reviewed By: BIT-silence
Differential Revision: D19880495
Pulled By: bddppq
fbshipit-source-id: 8f6f970910654c1c5aa871b48a04c1054875691c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32271
Use the 2-stage EmbeddingSpMDM interface in D19425982 to reduce the overhead of code cache lookup and lock contention.
Fix an issue in sparse_lengths_sum_benchmarks that generated empty indices when the average length is small, like 1.
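The 2-stage pattern (look up or generate the specialized kernel once, then call it on the hot path) can be sketched as follows; the cache and "kernel" here are stand-ins for the FBGEMM codegen cache, not its API:

```python
import functools
import numpy as np

@functools.lru_cache(maxsize=None)
def get_embedding_kernel(block_size):
    # Stage 1: produce a kernel specialized for this configuration
    # (block_size is only the cache key in this sketch). Caching it
    # means the hot loop pays no per-call codegen lookup or lock.
    def kernel(table, indices, lengths):
        out, pos = [], 0
        for ln in lengths:
            out.append(table[indices[pos:pos + ln]].sum(axis=0))
            pos += ln
        return np.stack(out)
    return kernel

# Stage 2: fetch once, call many times on the hot path.
kernel = get_embedding_kernel(block_size=4)
out = kernel(np.eye(4), np.array([0, 1, 3]), np.array([2, 1]))
```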
Test Plan: CI
Reviewed By: dskhudia
Differential Revision: D19425987
fbshipit-source-id: d5c5f0d46e0072403901809c31d516fa0f4b9b31
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32448
Using binary search to compute the value for the given quantile among the input tensors.
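The binary-search idea can be sketched as follows (an illustrative NumPy version, not the C++ operator): bisect the value range, keeping the fraction of elements not exceeding the candidate on the correct side of the target quantile.

```python
import numpy as np

def quantile_by_binary_search(tensors, quantile, tol=1e-6):
    # Approximate the smallest value v such that at least `quantile`
    # fraction of all elements across `tensors` satisfy x <= v.
    flat = np.concatenate([np.asarray(t).ravel() for t in tensors])
    lo, hi = float(flat.min()), float(flat.max())
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        # Fraction of elements not exceeding the candidate value.
        if np.count_nonzero(flat <= mid) / flat.size >= quantile:
            hi = mid        # candidate is high enough; tighten from above
        else:
            lo = mid
    return hi

# The 0.5-quantile of 1..10 under this definition is 5.
median = quantile_by_binary_search([np.arange(1, 11)], 0.5)
```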
Test Plan: Newly added unittests;
Reviewed By: jspark1105
Differential Revision: D19487604
fbshipit-source-id: 0dc6627b78d1310ac35b3f1d53b89cc89a697ece
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32475
As title
Test Plan: CI
Reviewed By: houseroad
Differential Revision: D19508778
fbshipit-source-id: fd9ad63607535980505d155f3e3c3b7c6b95daf7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32086
np.clip(1, num_indices // 2, 10) -> np.clip(num_indices // 2, 1, 10)
Also change batchsize -> num_rows to match what the variable actually does
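For reference, `np.clip(a, a_min, a_max)` clamps its first argument into `[a_min, a_max]`, so the intent of the corrected call is to keep `num_indices // 2` between 1 and 10:

```python
import numpy as np

def clipped_half(num_indices):
    # Corrected argument order: clamp num_indices // 2 into [1, 10].
    return np.clip(num_indices // 2, 1, 10)

assert clipped_half(0) == 1      # floored at 1
assert clipped_half(10) == 5     # passes through unchanged
assert clipped_half(100) == 10   # capped at 10
```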
Test Plan: CI
Reviewed By: hx89
Differential Revision: D19361521
fbshipit-source-id: 9ce864c7d7da046dc606afa5207da677ccf80f52
Summary:
Per discussion with Fei Tian, we need to add a `scale_init_value` to scale down the output of normalization such as batch-norm and layer-norm.
Currently we have `sparse_normalization_options` to normalize embedding pooling output. By default scale = 1.0; we found it's better to set scale between 0.025 and 0.1 https://fb.quip.com/MiKUAibEaYhH
Besides, I am removing the tags from normalizers because it makes more sense to calculate norm ops in distributed trainers, not ps.
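What a smaller scale initialization does to, e.g., layer-norm output can be sketched as follows (an illustrative NumPy model; the parameter name mirrors `scale_init_value` but the code is not the dper implementation):

```python
import numpy as np

def layer_norm(x, scale, eps=1e-5):
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    # The learnable scale multiplies the normalized output; initializing
    # it below 1.0 damps the normalization output at the start of training.
    return scale * (x - mu) / np.sqrt(var + eps)

rng = np.random.RandomState(0)
x = rng.randn(64, 32)
y_default = layer_norm(x, scale=1.0)   # output std ~ 1.0
y_damped = layer_norm(x, scale=0.05)   # output std ~ 0.05
```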
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31983
Test Plan:
Testing LN and BN after sum-pooling --
baseline f160348514
LN: f160348609
BN: f160348710
{F226106518}
Layer norm after sum-pooling fwd_net https://fburl.com/sa4j207n
Layer norm after dot-prod fwd_net https://fburl.com/twggwyvb
## Unit Tests
Testing normalization after pooling
```
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_4 -- test_sparse_pooling_batch_normalization
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_4 -- test_dense_sparse_pooling_batch_normalization
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_4 -- test_sparse_pooling_layer_normalization
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test_4 -- test_dense_sparse_pooling_layer_normalization
```
Testing normalization after dot-prod
```
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_last_layer_use_batch_norm
buck test caffe2/caffe2/fb/dper/layer_models/tests/split_1:sparse_nn_test -- test_last_layer_use_layer_norm
```
Differential Revision: D19277618
Pulled By: SilunWang
fbshipit-source-id: ea323e33e3647ba55d2e808ef09d94ad7b45b934