Summary:
Fixes https://github.com/pytorch/pytorch/issues/27655
This PR adds a C++ and Python version of ReflectionPad3d with structured kernels. The implementation uses lambdas extensively to share code between the forward and backward passes.
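A minimal usage sketch of the new module (the shapes here are illustrative, not from this PR):
```python
import torch

m = torch.nn.ReflectionPad3d(1)  # pad all six sides by 1, reflecting values at the borders
x = torch.arange(8, dtype=torch.float32).reshape(1, 1, 2, 2, 2)
print(m(x).shape)  # torch.Size([1, 1, 4, 4, 4])
```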
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59791
Reviewed By: gchanan
Differential Revision: D29242015
Pulled By: jbschlosser
fbshipit-source-id: 18e692d3b49b74082be09f373fc95fb7891e1b56
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59728
I noticed Sandcastle jobs failing with:
```
fbcode/caffe2/torch/csrc/api/include/torch/nn/modules/rnn.h:19:35: error: using namespace directive in global context in header [-Werror,-Wheader-hygiene]
using namespace torch::nn::utils::rnn;
```
(cf. V3 of D28939167 or https://www.internalfb.com/intern/sandcastle/job/36028797455955174/).
Removing `using namespace ...` fixes the problem.
~~... also applied code formatting ...~~
Test Plan: Sandcastle
Reviewed By: jbschlosser
Differential Revision: D29000888
fbshipit-source-id: 10917426828fc0c82b982da435ce891dc2bb6eec
Summary:
Switches most of the simple for loops outside of `jit` directories to use `c10::irange`.
Generated with D28874212.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59481
Test Plan: Sandcastle
Reviewed By: ngimel
Differential Revision: D28909681
fbshipit-source-id: ec9ab1bd602933238d9d0f73d4d8d027b75d9d85
Summary:
This PR is focused on the API for `linalg.matrix_norm` and delegates computations to `linalg.norm` for the moment.
The main difference between the norms is when `dim=None`. In this case
- `linalg.norm` will compute a vector norm on the flattened input if `ord=None`, otherwise it requires the input to be either 1D or 2D in order to disambiguate between vector and matrix norm
- `linalg.vector_norm` will flatten the input
- `linalg.matrix_norm` will compute the norm over the last two dimensions, treating the input as batch of matrices
In future PRs, the computations will be moved to `torch.linalg.matrix_norm`, and `torch.norm` and `torch.linalg.norm` will delegate computations to either `linalg.vector_norm` or `linalg.matrix_norm` based on the arguments provided.
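A small sketch of the `dim=None` behaviors described above; since the Frobenius matrix norm equals the 2-norm of the flattened matrix, all three calls agree here:
```python
import torch

A = torch.randn(3, 4)
torch.linalg.norm(A)         # 2D input: computes the Frobenius matrix norm
torch.linalg.vector_norm(A)  # flattens A, then takes the vector 2-norm
torch.linalg.matrix_norm(A)  # Frobenius norm over the last two dimensions
```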
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57127
Reviewed By: mrshenli
Differential Revision: D28186736
Pulled By: mruberry
fbshipit-source-id: 99ce2da9d1c4df3d9dd82c0a312c9570da5caf25
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57180
We now have a separate function for computing only the singular values.
The `compute_uv` argument is no longer needed, and it was decided in
offline discussion to remove it. This is a BC-breaking change, but our
linalg module is in beta, so we can do it without a deprecation
notice.
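A sketch of the replacement pattern, assuming the separate function referenced is `torch.linalg.svdvals`:
```python
import torch

A = torch.randn(5, 3)
S = torch.linalg.svdvals(A)      # singular values only, replacing compute_uv=False
U, S2, Vh = torch.linalg.svd(A)  # always computes U and Vh now
assert torch.allclose(S, S2)
```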
Test Plan: Imported from OSS
Reviewed By: ngimel
Differential Revision: D28142163
Pulled By: mruberry
fbshipit-source-id: 3fac1fcae414307ad5748c9d5ff50e0aa4e1b853
Summary:
As per discussion here https://github.com/pytorch/pytorch/pull/57127#discussion_r624948215
Note that we cannot remove the optional type from the `dim` parameter, because the default is to flatten the input tensor, which cannot easily be captured by a value other than `None`.
### BC Breaking Note
This PR changes the `ord` parameter of `torch.linalg.vector_norm` so that it no longer accepts `None` arguments. The new default of `ord=2` is equivalent to the previous default of `None`.
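A minimal sketch of the change:
```python
import torch

x = torch.randn(5)
# before this PR: torch.linalg.vector_norm(x, ord=None)
torch.linalg.vector_norm(x)          # ord now defaults to 2
torch.linalg.vector_norm(x, ord=2)   # equivalent, stated explicitly
```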
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57662
Reviewed By: albanD, mruberry
Differential Revision: D28228870
Pulled By: heitorschueroff
fbshipit-source-id: 040fd8055bbe013f64d3c8409bbb4b2c87c99d13
Summary:
My last PR missed the CUDA and distributed folders; this fixes that.
This change is autogenerated by `python tools/clang_tidy.py -s`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57235
Reviewed By: janeyx99
Differential Revision: D28084444
Pulled By: malfet
fbshipit-source-id: bf222f69ee90c7872c3cb0931e8cdb84f0cb3cda
Summary:
This is an automatic change generated by the following script:
```
#!/usr/bin/env python3
from subprocess import check_output, check_call
import os


def get_compiled_files_list():
    import json
    with open("build/compile_commands.json") as f:
        data = json.load(f)
    files = [os.path.relpath(node['file']) for node in data]
    for idx, fname in enumerate(files):
        if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'):
            files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')]
    return files


def run_clang_tidy(fname):
    check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname, "-s"])
    changes = check_output(["git", "ls-files", "-m"])
    if len(changes) == 0:
        return
    check_call(["git", "commit", "--all", "-m", f"NOLINT stubs for {fname}"])


def main():
    git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n")
    compiled_files = get_compiled_files_list()
    for idx, fname in enumerate(git_files):
        if fname not in compiled_files:
            continue
        if fname.startswith("caffe2/contrib/aten/"):
            continue
        print(f"[{idx}/{len(git_files)}] Processing {fname}")
        run_clang_tidy(fname)


if __name__ == "__main__":
    main()
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892
Reviewed By: H-Huang
Differential Revision: D27991944
Pulled By: malfet
fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179
Summary:
Match updated `Embedding` docs from https://github.com/pytorch/pytorch/pull/54026 as closely as possible. Additionally, update the C++ side `Embedding` docs, since those were missed in the previous PR.
There are 6 (!) places for docs:
1. Python module form in `sparse.py` - includes an additional line about newly constructed `Embedding`s / `EmbeddingBag`s
2. Python `from_pretrained()` in `sparse.py` (refers back to module docs)
3. Python functional form in `functional.py`
4. C++ module options - includes an additional line about newly constructed `Embedding`s / `EmbeddingBag`s
5. C++ `from_pretrained()` options
6. C++ functional options
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56065
Reviewed By: malfet
Differential Revision: D27908383
Pulled By: jbschlosser
fbshipit-source-id: c5891fed1c9d33b4b8cd63500a14c1a77d92cc78
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56181
Needed to change comparisons to be size_t vs. size_t.
Reviewed By: ezyang
Differential Revision: D27800849
fbshipit-source-id: 25f744128eb8750c382dc967a99af3c9f16247d9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55647
This adds [breakpad](https://github.com/google/breakpad), which comes with out-of-the-box utilities to register a signal handler that writes out a minidump on an unhandled exception. Right now this is gated behind a flag in `torch.utils`, but in the future it could be on by default. Size-wise this adds about 500 KB to `libtorch_cpu.so` (187275968 B to 187810016 B).
```bash
$ cat <<EOF > test.py
import torch
torch.utils.enable_minidump_collection()
# temporary util that just segfaults
torch._C._crash()
EOF
$ python test.py
Wrote minidump to /tmp/pytorch_crashes/6a829041-50e9-4247-ea992f99-a74cf47a.dmp
fish: “python test.py” terminated by signal SIGSEGV (Address boundary error)
$ minidump-2-core /tmp/pytorch_crashes/6a829041-50e9-4247-ea992f99-a74cf47a.dmp -o core.dmp
$ gdb python core.dmp
... commence debugging ...
```
Right now all exceptions that get passed up to Python don't trigger the signal handler (which by default only
handles [these](https://github.com/google/breakpad/blob/main/src/client/linux/handler/exception_handler.cc#L115)). It would be possible for PyTorch exceptions to explicitly write a minidump when passed up to Python (maybe only when the exception is unhandled or something).
Test Plan: Imported from OSS
Reviewed By: ailzhang
Differential Revision: D27679767
Pulled By: driazati
fbshipit-source-id: 1ab3b5160b6dc405f5097eb25acc644d533358d7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54812
Needed for quantization, since different attributes might refer to the same module instance
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D27408376
fbshipit-source-id: cada85c4a1772d3dd9502c3f6f9a56d690d527e7
Summary:
Reference: https://github.com/pytorch/pytorch/issues/50345
Changes:
* Add `i0e`
* Move some kernels from `UnaryOpsKernel.cu` to `UnarySpecialOpsKernel.cu` to decrease compilation time per file.
Time taken by the i0e_vs_scipy tests: around 6.33 s
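A small sketch of the identity the new function implements, i0e(x) = exp(-|x|) * i0(x):
```python
import torch

x = torch.tensor([0.0, 1.0, 5.0])
scaled = torch.special.i0e(x)
assert torch.allclose(scaled, torch.exp(-x.abs()) * torch.i0(x))
```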
<details>
<summary>Test Run Log</summary>
```
(pytorch-cuda-dev) kshiteej@qgpu1:~/Pytorch/pytorch_module_special$ pytest test/test_unary_ufuncs.py -k _i0e_vs
======================================================================= test session starts ========================================================================
platform linux -- Python 3.8.6, pytest-6.1.2, py-1.9.0, pluggy-0.13.1
rootdir: /home/kshiteej/Pytorch/pytorch_module_special, configfile: pytest.ini
plugins: hypothesis-5.38.1
collected 8843 items / 8833 deselected / 10 selected
test/test_unary_ufuncs.py ...sss.... [100%]
========================================================================= warnings summary =========================================================================
../../.conda/envs/pytorch-cuda-dev/lib/python3.8/site-packages/torch/backends/cudnn/__init__.py:73
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0e_vs_scipy_cuda_bfloat16
/home/kshiteej/.conda/envs/pytorch-cuda-dev/lib/python3.8/site-packages/torch/backends/cudnn/__init__.py:73: UserWarning: PyTorch was compiled without cuDNN/MIOpen support. To use cuDNN/MIOpen, rebuild PyTorch making sure the library is visible to the build system.
warnings.warn(
-- Docs: https://docs.pytest.org/en/stable/warnings.html
===================================================================== short test summary info ======================================================================
SKIPPED [3] test/test_unary_ufuncs.py:1182: not implemented: Could not run 'aten::_copy_from' with arguments from the 'Meta' backend. This could be because the operator doesn't exist for this backend, or was omitted during the selective/custom build process (if using custom build). If you are a Facebook employee using PyTorch on mobile, please visit https://fburl.com/ptmfixes for possible resolutions. 'aten::_copy_from' is only available for these backends: [BackendSelect, Named, InplaceOrView, AutogradOther, AutogradCPU, AutogradCUDA, AutogradXLA, UNKNOWN_TENSOR_TYPE_ID, AutogradMLC, AutogradNestedTensor, AutogradPrivateUse1, AutogradPrivateUse2, AutogradPrivateUse3, Tracer, Autocast, Batched, VmapMode].
BackendSelect: fallthrough registered at ../aten/src/ATen/core/BackendSelectFallbackKernel.cpp:3 [backend fallback]
Named: registered at ../aten/src/ATen/core/NamedRegistrations.cpp:7 [backend fallback]
InplaceOrView: fallthrough registered at ../aten/src/ATen/core/VariableFallbackKernel.cpp:56 [backend fallback]
AutogradOther: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradCPU: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradCUDA: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradXLA: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
UNKNOWN_TENSOR_TYPE_ID: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradMLC: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradNestedTensor: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradPrivateUse1: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradPrivateUse2: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
AutogradPrivateUse3: registered at ../torch/csrc/autograd/generated/VariableType_4.cpp:8761 [autograd kernel]
Tracer: registered at ../torch/csrc/autograd/generated/TraceType_4.cpp:9348 [kernel]
Autocast: fallthrough registered at ../aten/src/ATen/autocast_mode.cpp:250 [backend fallback]
Batched: registered at ../aten/src/ATen/BatchingRegistrations.cpp:1016 [backend fallback]
VmapMode: fallthrough registered at ../aten/src/ATen/VmapModeRegistrations.cpp:33 [backend fallback]
==================================================== 7 passed, 3 skipped, 8833 deselected, 2 warnings in 6.33s =====================================================
```
</details>
TODO:
* [x] Check rendered docs (https://11743402-65600975-gh.circle-artifacts.com/0/docs/special.html)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54409
Reviewed By: jbschlosser
Differential Revision: D27760472
Pulled By: mruberry
fbshipit-source-id: bdfbcaa798b00c51dc9513c34626246c8fc10548
Summary:
This PR adds a `padding_idx` parameter to `nn.EmbeddingBag` and `nn.functional.embedding_bag`. As with `nn.Embedding`'s `padding_idx` argument, if an embedding's index is equal to `padding_idx` it is ignored, so it is not included in the reduction.
This PR does not add support for `padding_idx` for quantized or ONNX `EmbeddingBag` for opset10/11 (opset9 is supported). In these cases, an error is thrown if `padding_idx` is provided.
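A minimal sketch of the new argument:
```python
import torch

bag = torch.nn.EmbeddingBag(num_embeddings=10, embedding_dim=3, mode="mean", padding_idx=0)
inp = torch.tensor([[1, 2, 0], [4, 0, 0]])  # entries equal to padding_idx are
out = bag(inp)                              # excluded from each bag's mean
```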
Fixes https://github.com/pytorch/pytorch/issues/3194
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49237
Reviewed By: walterddr, VitalyFedyunin
Differential Revision: D26948258
Pulled By: jbschlosser
fbshipit-source-id: 3ca672f7e768941f3261ab405fc7597c97ce3dfc
Summary:
Reference: https://github.com/pytorch/pytorch/issues/50345
Changes:
* Alias for sigmoid and logit
* Adds out variant for C++ API
* Updates docs to link back to `special` documentation
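A minimal sketch, assuming the aliases land as `torch.special.expit` and `torch.special.logit`:
```python
import torch

x = torch.randn(4, dtype=torch.float64)
assert torch.equal(torch.special.expit(x), torch.sigmoid(x))  # alias for sigmoid
p = torch.sigmoid(x)
assert torch.allclose(torch.special.logit(p), x)              # logit inverts sigmoid
```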
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54759
Reviewed By: mrshenli
Differential Revision: D27615208
Pulled By: mruberry
fbshipit-source-id: 8bba908d1bea246e4aa9dbadb6951339af353556
Summary:
This PR adds `torch.linalg.eig`, and `torch.linalg.eigvals` for NumPy compatibility.
MAGMA uses a hybrid CPU-GPU algorithm and doesn't have a GPU interface for the non-symmetric eigendecomposition. This forces us to transfer inputs living in GPU memory to the CPU before calling MAGMA, and then transfer the results back to the GPU. That is rather slow for smaller matrices, and MAGMA is faster than the CPU path only for matrices larger than 3000x3000.
Unfortunately, there is no cuSOLVER function for this operation.
Autograd support for `torch.linalg.eig` will be added in a follow-up PR.
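A minimal usage sketch:
```python
import torch

A = torch.randn(3, 3, dtype=torch.float64)
w, V = torch.linalg.eig(A)           # eigenvalues/eigenvectors, complex-valued in general
assert torch.allclose(A.to(V.dtype) @ V, V @ torch.diag(w))
w_only = torch.linalg.eigvals(A)     # eigenvalues only
```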
Ref https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52491
Reviewed By: anjali411
Differential Revision: D27563616
Pulled By: mruberry
fbshipit-source-id: b42bb98afcd2ed7625d30bdd71cfc74a7ea57bb5
Summary:
Converts loops of the form:
```
for(int64_t VAR=0;VAR<LIMIT;VAR++)
```
to the form
```
for(const auto VAR : c10::irange(LIMIT))
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55148
Test Plan: Sandcastle
Reviewed By: ngimel
Differential Revision: D27447811
fbshipit-source-id: 6311a094ec4a81a0b57383aaee0ba1b1dc2445c4
Summary:
The non-backwards-compatible change introduced in https://github.com/pytorch/pytorch/pull/53843 is tripping up a lot of code. Better to set it to False initially and then potentially flip it to True in a later version to give people time to adapt.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55169
Reviewed By: mruberry
Differential Revision: D27511150
Pulled By: jbschlosser
fbshipit-source-id: 1ac018557c0900b31995c29f04aea060a27bc525
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52859
This reverts commit 92a4ee1cf6.
Added support for bfloat16 on CUDA 11 and removed the fast path for empty input tensors that was affecting the autograd graph.
Test Plan: Imported from OSS
Reviewed By: H-Huang
Differential Revision: D27402390
Pulled By: heitorschueroff
fbshipit-source-id: 73c5ccf54f3da3d29eb63c9ed3601e2fe6951034
Summary:
**BC-breaking note**: This change throws errors for cases that used to silently pass. The old behavior can be obtained by setting `error_if_nonfinite=False`
Fixes https://github.com/pytorch/pytorch/issues/46849
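A sketch of the behavior this PR introduces:
```python
import torch

p = torch.nn.Parameter(torch.randn(3))
p.grad = torch.full((3,), float("nan"))
# a non-finite total norm now raises instead of silently clipping
try:
    torch.nn.utils.clip_grad_norm_([p], max_norm=1.0, error_if_nonfinite=True)
except RuntimeError as err:
    print(err)
torch.nn.utils.clip_grad_norm_([p], max_norm=1.0, error_if_nonfinite=False)  # old behavior
```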
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53843
Reviewed By: malfet
Differential Revision: D27291838
Pulled By: jbschlosser
fbshipit-source-id: 216d191b26e1b5919a44a3af5cde6f35baf825c4
Summary:
* Lowering NLLLoss/CrossEntropyLoss to ATen dispatch
* This allows the MLC device to override these ops
* Reduce code duplication between the Python and C++ APIs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53789
Reviewed By: ailzhang
Differential Revision: D27345793
Pulled By: albanD
fbshipit-source-id: 99c0d617ed5e7ee8f27f7a495a25ab4158d9aad6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45667
First part of #3867 (Pooling operators still to do)
This adds a `padding='same'` mode to the interface of `conv{n}d` and `nn.Conv{n}d`. This should match the behaviour of `tensorflow`. I couldn't find it explicitly documented, but through experimentation I found that `tensorflow` returns the shape `ceil(len/stride)` and always adds any extra asymmetric padding onto the right side of the input.
Since the `native_functions.yaml` schema doesn't seem to support strings or enums, I've moved the function interface into Python, and it now dispatches between the numerically padded `conv{n}d` and the `_conv{n}d_same` variant. Underscores because I couldn't see any way to avoid exporting a function into the `torch` namespace.
A note on asymmetric padding. The total padding required can be odd if both the kernel-length is even and the dilation is odd. mkldnn has native support for asymmetric padding, so there is no overhead there, but for other backends I resort to padding the input tensor by 1 on the right hand side to make the remaining padding symmetrical. In these cases, I use `TORCH_WARN_ONCE` to notify the user of the performance implications.
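A minimal sketch of the Python-side behavior (note that in the released implementation `padding='same'` is only supported for stride 1):
```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 3, 10)
w = torch.randn(8, 3, 4)            # even kernel length forces asymmetric padding
y = F.conv1d(x, w, padding="same")
assert y.shape[-1] == x.shape[-1]   # output length matches the input
```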
Test Plan: Imported from OSS
Reviewed By: ejguan
Differential Revision: D27170744
Pulled By: jbschlosser
fbshipit-source-id: b3d8a0380e0787ae781f2e5d8ee365a7bfd49f22
Summary:
This PR adds autograd support for `torch.orgqr`.
Since `torch.orgqr` is one of the few functions that expose LAPACK's naming, and all other linear algebra routines were renamed a long time ago, I also added a new function with a new name, and `torch.orgqr` is now an alias for it.
The new proposed name is `householder_product`. For a matrix `input` and a vector `tau`, LAPACK's orgqr operation takes the columns of `input` (called Householder vectors or elementary reflectors) and the scalars of `tau`, which together represent Householder matrices, and then computes the product of these matrices. See https://www.netlib.org/lapack/lug/node128.html.
Other linear algebra libraries that I'm aware of do not expose this LAPACK function, so there is some freedom in naming it. It is usually used internally only for QR decomposition, but it can be useful for deep learning tasks now that it supports differentiation.
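A sketch reconstructing Q from `geqrf` output, assuming the new function is exposed as `torch.linalg.householder_product`:
```python
import torch

A = torch.randn(5, 3, dtype=torch.float64)
a, tau = torch.geqrf(A)                        # packed Householder reflectors from LAPACK
Q = torch.linalg.householder_product(a, tau)   # product of the Householder matrices
assert torch.allclose(Q.T @ Q, torch.eye(3, dtype=torch.float64))  # orthonormal columns
```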
Resolves https://github.com/pytorch/pytorch/issues/50104
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52637
Reviewed By: agolynski
Differential Revision: D27114246
Pulled By: mruberry
fbshipit-source-id: 9ab51efe52aec7c137aa018c7bd486297e4111ce
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53583
`Scalar` takes 32 bytes because `c10::complex<double>`
requires 16-byte alignment. Passing Scalar by reference
shows about a 1% improvement in instruction count.
All the changes in this commit are codemoded except for
the following 4 files (which generate code for signatures):
```
tools/codegen/api/cpp.py
tools/codegen/api/native.py
tools/codegen/api/structured.py
caffe2/contrib/aten/gen_op.py
```
# Codemod
## Main Step
For the codemod part, here is the main command used:
```
fastmod --extensions h '([a-zA-Z_+]\([^)]*,?\s*)Scalar (\w+)' '${1}const Scalar& ${2}'
fastmod --extensions h '([a-zA-Z_+]\([^)]*,?\s*)optional<Scalar> (\w+)' '${1}const optional<Scalar>& ${2}'
fastmod --extensions cpp '([a-zA-Z_+]\([^)]*,?\s*)Scalar (\w+)' '${1}const Scalar& ${2}'
fastmod --extensions cpp '([a-zA-Z_+]\([^)]*,?\s*)optional<Scalar> (\w+)' '${1}const optional<Scalar>& ${2}'
```
As you can tell, it codemods both `Scalar` and `optional<Scalar>`. Apply these commands iteratively until reaching a fix-point (since one method signature might contain multiple `Scalar` parameters).
In retrospect, excluding `third_party` and `torch/csrc/jit` would have been a good idea. (I reverted them manually later; see https://github.com/pytorch/pytorch/pull/53479 as a reference.)
## Pre-Step
Prior to applying the main command, some `Scalar`s are written as `at::Scalar` or `c10::Scalar`, so I codemoded some of them in advance. Here is an incomplete list:
```
fastmod --extensions h '([a-zA-Z_+]\([^)]*,?\s*)at::Scalar (\w+)' '${1}const at::Scalar& ${2}'
fastmod --extensions cpp '([a-zA-Z_+]\([^)]*,?\s*)at::Scalar (\w+)' '${1}const at::Scalar& ${2}'
fastmod --extensions h '([a-zA-Z_+]\([^)]*,?\s*)c10::optional<Scalar> (\w+)' '${1}const c10::optional<Scalar>& ${2}'
fastmod --extensions cpp '([a-zA-Z_+]\([^)]*,?\s*)c10::optional<Scalar> (\w+)' '${1}const c10::optional<Scalar>& ${2}'
```
## Fixup
There are a couple of post-codemod fixups. For example, `const Scalar` will be codemoded into `const const Scalar&`, and `at::Scalar` will be codemoded into `at::const Scalar&` (if the pre-step is not done comprehensively). Here is an incomplete list:
```
fastmod --extensions cpp 'const const Scalar' 'const Scalar'
fastmod --extensions h 'const const c10::optional<Scalar>' 'const c10::optional<Scalar>'
fastmod --extensions cpp 'const const c10::optional<Scalar>' 'const c10::optional<Scalar>'
fastmod 'at::const Scalar&' 'const at::Scalar&'
```
## Supplementary
`cu` and `mm` files also need to be codemoded, for example:
```
fastmod --extensions cu 'at::const Scalar&' 'const at::Scalar&'
fastmod --extensions mm '([a-zA-Z_+]\([^)]*,?\s*)Scalar (\w+)' '${1}const Scalar& ${2}'
```
Function pointers are not covered by the main command and need separate ones. Here is an incomplete list:
```
# Cover case: using index_fill_fn = void(*)(TensorIterator & iter, int64_t dim, int64_t self_dim_size, int64_t self_dim_stride, Scalar source);
fastmod --extensions h '(void\s*\(\s*\*\s*\)\([^)]*,?\s*)Scalar (\w+)' '${1}const Scalar& ${2}'
# Cover case: using softplus_fn = void (*)(TensorIterator&, Scalar, Scalar);
fastmod --extensions h '(void\s*\(\s*\*\s*\)\([^)]*,?\s*)Scalar([, \)])' '${1}const Scalar&${2}'
fastmod --extensions cpp '(void\s*\(\s*\*\s*\)\([^)]*,?\s*)Scalar([, \)])' '${1}const Scalar&${2}'
fastmod --extensions h '(void\s*\(\s*\*\s*\)\([^)]*,?\s*)optional<Scalar>([, \)])' '${1}const optional<Scalar>&${2}'
```
Some corner cases need to be fixed manually.
ghstack-source-id: 123970306
Test Plan: Imported from OSS
Reviewed By: smessmer
Differential Revision: D26904445
fbshipit-source-id: 8d8a002af4b5125f153a32f03c6956be7ae5671d
Summary:
Fixes https://github.com/pytorch/pytorch/issues/44378 by providing a wider range of drivers similar to what SciPy is doing.
The supported CPU drivers are `gels, gelsy, gelsd, gelss`.
The CUDA interface implements only `gels`, and only for overdetermined systems.
The current state of this PR:
- [x] CPU interface
- [x] CUDA interface
- [x] CPU tests
- [x] CUDA tests
- [x] Memory-efficient batch-wise iteration with broadcasting which fixes https://github.com/pytorch/pytorch/issues/49252
- [x] docs
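A minimal usage sketch, assuming the function is exposed as `torch.linalg.lstsq`:
```python
import torch

A = torch.randn(5, 3, dtype=torch.float64)   # overdetermined: 5 equations, 3 unknowns
b = torch.randn(5, dtype=torch.float64)
result = torch.linalg.lstsq(A, b, driver="gelsd")
x = result.solution                          # least-squares solution of A x = b
```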
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49093
Reviewed By: albanD
Differential Revision: D26991788
Pulled By: mruberry
fbshipit-source-id: 8af9ada979240b255402f55210c0af1cba6a0a3c
Summary:
Fixes https://github.com/pytorch/pytorch/issues/50577
Learning rate schedulers had not yet been implemented for the C++ API.
This pull request introduces the learning rate scheduler base class and the StepLR subclass. Furthermore, it modifies the existing OptimizerOptions such that the learning rate scheduler can modify the learning rate.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52268
Reviewed By: mrshenli
Differential Revision: D26818387
Pulled By: glaringlee
fbshipit-source-id: 2b28024a8ea7081947c77374d6d643fdaa7174c1
Summary:
Context: https://github.com/pytorch/pytorch/pull/53299#discussion_r587882857
These are the only hand-written parts of this diff:
- the addition to `.github/workflows/lint.yml`
- the file endings changed in these four files (to appease FB-internal land-blocking lints):
- `GLOSSARY.md`
- `aten/src/ATen/core/op_registration/README.md`
- `scripts/README.md`
- `torch/csrc/jit/codegen/fuser/README.md`
The rest was generated by running this command (on macOS):
```
git grep -I -l ' $' -- . ':(exclude)**/contrib/**' ':(exclude)third_party' | xargs gsed -i 's/ *$//'
```
I looked over the auto-generated changes and didn't see anything that looked problematic.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53406
Test Plan:
This run (after adding the lint but before removing existing trailing spaces) failed:
- https://github.com/pytorch/pytorch/runs/2043032377
This run (on the tip of this PR) succeeded:
- https://github.com/pytorch/pytorch/runs/2043296348
Reviewed By: walterddr, seemethere
Differential Revision: D26856620
Pulled By: samestep
fbshipit-source-id: 3f0de7f7c2e4b0f1c089eac9b5085a58dd7e0d97
Summary:
Fixes https://github.com/pytorch/pytorch/issues/44378 by providing a wider range of drivers similar to what SciPy is doing.
The supported CPU drivers are `gels, gelsy, gelsd, gelss`.
The CUDA interface implements only `gels`, and only for overdetermined systems.
The current state of this PR:
- [x] CPU interface
- [x] CUDA interface
- [x] CPU tests
- [x] CUDA tests
- [x] Memory-efficient batch-wise iteration with broadcasting which fixes https://github.com/pytorch/pytorch/issues/49252
- [x] docs
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49093
Reviewed By: H-Huang
Differential Revision: D26723384
Pulled By: mruberry
fbshipit-source-id: c9866a95f14091955cf42de22f4ac9e2da009713
Summary:
- Lowers ReLU6 to ATen
- Changes Python and C++ to reflect the change
- Adds an entry in native_functions.yaml for the new function
- This is needed as we would like to intercept ReLU6 at a higher level with an XLA-approach codegen.
- The functional C++ tests should pass, but please let me know if more tests are required.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52723
Reviewed By: ailzhang
Differential Revision: D26641414
Pulled By: albanD
fbshipit-source-id: dacfc70a236c4313f95901524f5f021503f6a60f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/51807
Implemented torch.linalg.multi_dot similar to [numpy.linalg.multi_dot](https://numpy.org/doc/stable/reference/generated/numpy.linalg.multi_dot.html).
This function does not support broadcasting or batched inputs at the moment.
**NOTE**
numpy.linalg.multi_dot allows the first and last tensors to have more than 2 dimensions despite their docs stating these must be either 1D or 2D. This PR diverges from NumPy in that it enforces this restriction.
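A minimal usage sketch; multi_dot chooses the cheapest multiplication order automatically:
```python
import torch

A = torch.randn(10, 100, dtype=torch.float64)
B = torch.randn(100, 5, dtype=torch.float64)
C = torch.randn(5, 50, dtype=torch.float64)
out = torch.linalg.multi_dot([A, B, C])  # evaluates as (A @ B) @ C here: far fewer flops
assert torch.allclose(out, A @ B @ C)
```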
**TODO**
- [ ] Benchmark against NumPy
- [x] Add OpInfo testing
- [x] Remove unnecessary copy for out= argument
Test Plan: Imported from OSS
Reviewed By: nikithamalgifb
Differential Revision: D26375734
Pulled By: heitorschueroff
fbshipit-source-id: 839642692424c4b1783606c76dd5b34455368f0b
Summary:
Uses cmake's `configure_file()` macro to generate a new `torch/csrc/api/include/torch/version.h` header with `TORCH_VERSION_{MAJOR,MINOR,PATCH}` \#defines from an input file `torch/csrc/api/include/torch/version.h.in`.
For Bazel builds, this is accomplished with `header_template_rule()`.
For Buck builds, this is accomplished with `fb_native.genrule()`.
Fixes https://github.com/pytorch/pytorch/issues/44365
<img width="1229" alt="Screen Shot 2021-01-05 at 3 19 24 PM" src="https://user-images.githubusercontent.com/75754324/103809279-3fd80380-5027-11eb-9039-fd23922cebd5.png">
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50073
Reviewed By: glaringlee
Differential Revision: D25855877
Pulled By: jbschlosser
fbshipit-source-id: 6bb792718c97e2c2dbaa74b7b7b831a4f6938e49
Summary:
This PR adds `torch.linalg.slogdet`.
Changes compared to the original torch.slogdet:
- Complex input now works as in NumPy
- Added out= variant (allocates temporary and makes a copy for now)
- Updated `slogdet_backward` to work with complex input
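A minimal sketch, including the complex support mentioned above:
```python
import torch

A = torch.randn(3, 3, dtype=torch.complex128)
sign, logabsdet = torch.linalg.slogdet(A)   # sign is complex for complex input
assert torch.allclose(sign * torch.exp(logabsdet), torch.linalg.det(A))
```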
Ref. https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49194
Reviewed By: VitalyFedyunin
Differential Revision: D25916959
Pulled By: mruberry
fbshipit-source-id: cf9be8c5c044870200dcce38be48cd0d10e61a48
Summary:
This PR adds `torch.linalg.pinv`.
Changes compared to the original `torch.pinverse`:
* New kwarg "hermitian": with `hermitian=True` eigendecomposition is used instead of singular value decomposition.
* `rcond` argument can now be a `Tensor` of appropriate shape to apply matrix-wise clipping of singular values.
* Added `out=` variant (allocates temporary and makes a copy for now)
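A minimal usage sketch of the new kwarg:
```python
import torch

A = torch.randn(4, 3, dtype=torch.float64)
A_pinv = torch.linalg.pinv(A)
assert torch.allclose(A @ A_pinv @ A, A)        # Moore-Penrose property
S = A @ A.transpose(-2, -1)
S_pinv = torch.linalg.pinv(S, hermitian=True)   # eigendecomposition-based path
```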
Ref. https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48399
Reviewed By: zhangguanheng66
Differential Revision: D25869572
Pulled By: mruberry
fbshipit-source-id: 0f330a91d24ba4e4375f648a448b27594e00dead
Summary:
This PR adds `torch.linalg.inv` for NumPy compatibility.
`linalg_inv_out` uses in-place operations on the provided `result` tensor.
I modified `apply_inverse` to accept a tensor of ints instead of a std::vector; that way we can write a function similar to `linalg_inv_out` but without the error checks and device memory synchronization.
I fixed `lda` (the leading dimension parameter, which is max(1, n)) in many places to handle 0x0 matrices correctly.
Zero batch dimensions are also working and tested.
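A minimal sketch, including the batched support (the matrices are shifted by a multiple of the identity just to keep them well-conditioned):
```python
import torch

A = torch.randn(2, 3, 3, dtype=torch.float64) + 4 * torch.eye(3, dtype=torch.float64)
A_inv = torch.linalg.inv(A)                   # batched input
assert torch.allclose(A @ A_inv, torch.eye(3, dtype=torch.float64).expand(2, 3, 3))
```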
Ref https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48261
Reviewed By: gchanan
Differential Revision: D25849590
Pulled By: mruberry
fbshipit-source-id: cfee6f1daf7daccbe4612ec68f94db328f327651
Summary:
Adding `torch::cuda::synchronize()` to libtorch. Note that the implementation here adds a new method to the `CUDAHooksInterface`. An alternative that was suggested to me is to add a method to the `DeviceGuard` interface.
Fixes https://github.com/pytorch/pytorch/issues/47722
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50072
Reviewed By: H-Huang
Differential Revision: D25804342
Pulled By: jbschlosser
fbshipit-source-id: 45aa61d7c6fbfd3178caf2eb5ec053d6c01b5a43
Summary:
This PR adds `torch.linalg.inv` for NumPy compatibility.
`linalg_inv_out` uses in-place operations on the provided `result` tensor.
I modified `apply_inverse` to accept a tensor of ints instead of a std::vector; that way we can write a function similar to `linalg_inv_out` but without the error checks and device memory synchronization.
I fixed `lda` (the leading dimension parameter, which is max(1, n)) in many places to handle 0x0 matrices correctly.
Zero batch dimensions are also working and tested.
Ref https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48261
Reviewed By: ngimel
Differential Revision: D25690129
Pulled By: mruberry
fbshipit-source-id: edb2d03721f22168c42ded8458513cb23dfdc712
Summary:
This PR adds `torch.linalg.solve`.
`linalg_solve_out` uses in-place operations on the provided result tensor.
I modified `apply_solve` to accept a tensor of ints instead of a std::vector; that way we can write a function similar to `linalg_solve_out` but without the error checks and device memory synchronization.
In comparison to `torch.solve` this routine accepts 1-dimensional tensors and batches of 1-dim tensors for the right-hand-side term. `torch.solve` requires it to be at least 2-dimensional.
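A minimal sketch of the 1-D right-hand-side support:
```python
import torch

A = torch.randn(3, 3, dtype=torch.float64)
b = torch.randn(3, dtype=torch.float64)   # 1-D right-hand side, accepted unlike torch.solve
x = torch.linalg.solve(A, b)
assert torch.allclose(A @ x, b)
```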
Ref. https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48456
Reviewed By: izdeby
Differential Revision: D25562222
Pulled By: mruberry
fbshipit-source-id: a9355c029e2442c2e448b6309511919631f9e43b
Summary:
Fixes https://github.com/pytorch/pytorch/issues/46213
I haven't updated the documentation yet; I will add those changes soon. A few other things I didn't do, but I want to clarify whether I should:
1. I didn't expose projections in c++ API: torch/csrc/api/src/nn/modules/rnn.cpp. Let me know if this is desirable and I will add those changes.
2. I didn't expose projections in the "lstm_cell" function and "_thnn_differentiable_lstm_cell_backward" functions from aten/src/ATen/native/RNN.cpp. As far as I understand, they are not needed for nn.LSTM CPU execution. For lstm_cell, projections don't bring any real benefit, since if the cell is used separately they can easily be added in Python. For "_thnn_differentiable_lstm_cell_backward", I'm actually not sure where exactly that function is used, so I also disabled projections there for now. Please let me know if I should change that.
3. I added check that projections are not supported for quantized LSTMs to quantized_lstm_<data/input> functions. But I didn't add any checks to LSTMCell code. It seems that since I disabled projections in "lstm_cell" function, they should also not be available for quantized models through any other API than quantized_lstm_<data/input>. Please let me know if I'm not correct and I will add checks to other places.
4. Projections are not supported for CuDNN versions < 7.1.2. Should I add the check for CuDNN version and disable projections in that case? If so, what will be the best way to do that?
5. Currently I added projection weight as the last weight, so the layout is "w_ih, w_hh, b_ih, b_hh, w_hr". This breaks the assumption that biases come after weights and thus I had to add additional if-s in various places. Alternative way would be to have "w_ih, w_hh, w_hr, b_ih, b_hh" layout, in which case the assumption will be true. But in that case I will need to split the loop in get_parameters function from aten/src/ATen/native/cudnn/RNN.cpp. And in some cases, I will still need to add an "undefined" tensor in the 3rd position, because we get all 5 weights from CuDNN most of the time. So I'm not sure which way is better. Let me know if you think I should change to the weights-then-biases layout.
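A minimal usage sketch, assuming the feature is exposed via the `proj_size` argument of `nn.LSTM`:
```python
import torch

lstm = torch.nn.LSTM(input_size=10, hidden_size=20, proj_size=5, batch_first=True)
x = torch.randn(2, 7, 10)
out, (h, c) = lstm(x)
print(out.shape)  # torch.Size([2, 7, 5])  - outputs are projected to proj_size
print(c.shape)    # torch.Size([1, 2, 20]) - the cell state keeps hidden_size
```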
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47725
Reviewed By: zou3519
Differential Revision: D25449794
Pulled By: ngimel
fbshipit-source-id: fe6ce59e481d1f5fd861a8ff7fa13d1affcedb0c
Summary:
Ref https://github.com/pytorch/pytorch/issues/42175
This removes the 4 deprecated spectral functions: `torch.{fft,rfft,ifft,irfft}`. `torch.fft` is also now imported by default.
The actual `at::native` functions are still used in `torch.stft`, so they can't be fully removed yet, but they will be once https://github.com/pytorch/pytorch/issues/47601 has been merged.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48594
Reviewed By: heitorschueroff
Differential Revision: D25298929
Pulled By: mruberry
fbshipit-source-id: e36737fe8192fcd16f7e6310f8b49de478e63bf0
Summary:
This PR adds `torch.linalg.matrix_rank`.
Changes compared to the original `torch.matrix_rank`:
- input with the complex dtype is supported
- batched input is supported
- "symmetric" kwarg renamed to "hermitian"
Should I update the documentation for `torch.matrix_rank`?
For input with no elements (for example a 0×0 matrix), the current implementation diverges from NumPy. NumPy stumbles on an undefined max for such input; here I chose to return an appropriately sized tensor of zeros, which I think is mathematically the correct thing to do.
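A minimal sketch of the new behaviors:
```python
import torch

A = torch.randn(2, 4, 4, dtype=torch.float64)       # batched input is now supported
print(torch.linalg.matrix_rank(A))                  # one rank per matrix in the batch
S = A @ A.transpose(-2, -1)
print(torch.linalg.matrix_rank(S, hermitian=True))  # renamed kwarg, eigenvalue-based path
```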
Ref https://github.com/pytorch/pytorch/issues/42666.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48206
Reviewed By: albanD
Differential Revision: D25211965
Pulled By: mruberry
fbshipit-source-id: ae87227150ab2cffa07f37b4a3ab228788701837
Summary:
This PR adds `torch.linalg.eigh`, and `torch.linalg.eigvalsh` for NumPy compatibility.
The current `torch.symeig` uses (on CPU) a different LAPACK routine than NumPy (`syev` vs `syevd`). Even though it shouldn't matter in practice, `torch.linalg.eigh` uses `syevd` (as NumPy does).
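A minimal usage sketch:
```python
import torch

A = torch.randn(4, 4, dtype=torch.float64)
S = (A + A.T) / 2                        # symmetrize the input
w, V = torch.linalg.eigh(S)              # real eigenvalues in ascending order
assert torch.allclose(V @ torch.diag(w) @ V.T, S)
w_only = torch.linalg.eigvalsh(S)        # eigenvalues only
```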
Ref https://github.com/pytorch/pytorch/issues/42666
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45526
Reviewed By: gchanan
Differential Revision: D25022659
Pulled By: mruberry
fbshipit-source-id: 3676b77a121c4b5abdb712ad06702ac4944e900a
Summary:
Fix https://github.com/pytorch/pytorch/issues/44601
I added the bicubic grid sampler on both the CPU and CUDA side, but not yet in AVX2.
There is a [colab notebook](https://colab.research.google.com/drive/1mIh6TLLj5WWM_NcmKDRvY5Gltbb781oU?usp=sharing) showing some test results. The notebook uses bilinear mode for its tests, since I could only use the distributed version of PyTorch in it. You can just download it and change `mode_torch=bicubic` to show the results.
There is some duplicated code for getting and setting values, since the helper function used in bilinear first clips the coordinates beyond the boundary and then gets or sets the value, whereas in bicubic more points need to be considered. I could refactor that part after making sure the overall calculation is correct.
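A minimal sketch exercising the new mode; an identity sampling grid should reproduce the input, since the bicubic kernel interpolates exactly at grid nodes:
```python
import torch
import torch.nn.functional as F

inp = torch.randn(1, 1, 8, 8, dtype=torch.float64)
theta = torch.tensor([[[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]], dtype=torch.float64)
grid = F.affine_grid(theta, list(inp.shape), align_corners=True)   # identity grid
out = F.grid_sample(inp, grid, mode="bicubic", align_corners=True)
assert torch.allclose(out, inp, atol=1e-6)
```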
Thanks
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44780
Reviewed By: mrshenli
Differential Revision: D24681114
Pulled By: mruberry
fbshipit-source-id: d39c8715e2093a5a5906cb0ef040d62bde578567
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45164
This PR implements `fft2`, `ifft2`, `rfft2` and `irfft2`. These are the last functions required for `torch.fft` to match `numpy.fft`. If you look at either NumPy or SciPy you'll see that the 2-dimensional variants are identical to `*fftn` in every way, except for the default value of `axes`. In fact you can even use `fft2` to do general n-dimensional transforms.
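A sketch of that equivalence:
```python
import torch

x = torch.randn(4, 4, dtype=torch.complex128)
# fft2 is fftn with dim defaulting to the last two dimensions
assert torch.allclose(torch.fft.fft2(x), torch.fft.fftn(x, dim=(-2, -1)))
```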
Test Plan: Imported from OSS
Reviewed By: ngimel
Differential Revision: D24363639
Pulled By: mruberry
fbshipit-source-id: 95191b51a0f0b8e8e301b2c20672ed4304d02a57
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45377
This PR adds a C++ implementation of the TripletMarginWithDistanceLoss, for which the Python implementation was introduced in PR #43680. It's based on PR #44072, but I'm resubmitting this to unlink it from Phabricator.
Test Plan: Imported from OSS
Reviewed By: izdeby
Differential Revision: D24003973
fbshipit-source-id: 2d9ada7260a6f27425ff2fdbbf623dad0fb79405
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44433
Not entirely sure why, but changing the type of beta from `float` to `double` in autocast_mode.cpp and FunctionsManual.h fixes my compiler errors, failing instead at link time.
Also fixed some type errors and updated the function signature in a few more files, removing my usage of Scalar and making beta a double everywhere instead.
Test Plan: Imported from OSS
Reviewed By: mrshenli
Differential Revision: D23636720
Pulled By: bdhirsh
fbshipit-source-id: caea2a1f8dd72b3b5fd1d72dd886b2fcd690af6d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44550
Part of the `torch.fft` work (gh-42175).
This adds n-dimensional transforms: `fftn`, `ifftn`, `rfftn` and `irfftn`.
This is aiming for correctness first, with the implementation on top of the existing `_fft_with_size` restrictions. I plan to follow up later with a more efficient rewrite that makes `_fft_with_size` work with arbitrary numbers of dimensions.
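A minimal round-trip sketch:
```python
import torch

x = torch.randn(2, 3, 4, dtype=torch.complex128)
X = torch.fft.fftn(x)                        # transforms over all dimensions by default
assert torch.allclose(torch.fft.ifftn(X), x)
```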
Test Plan: Imported from OSS
Reviewed By: ngimel
Differential Revision: D23846032
Pulled By: mruberry
fbshipit-source-id: e6950aa8be438ec5cb95fb10bd7b8bc9ffb7d824
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44486
SmoothL1Loss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.
This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path for when target.requires_grad was True
3) modifies the SmoothL1Loss CriterionTests to verify that the target derivative is checked.
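A minimal sketch of the now-supported target derivative:
```python
import torch

inp = torch.randn(4, requires_grad=True)
target = torch.randn(4, requires_grad=True)
loss = torch.nn.functional.smooth_l1_loss(inp, target)
loss.backward()
print(target.grad)   # gradients now flow to the target as well
```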
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23630699
Pulled By: gchanan
fbshipit-source-id: 0f94d1a928002122d6b6875182867618e713a917
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44437
MSELoss had a completely different (and incorrect, see https://github.com/pytorch/pytorch/issues/43228) path when target.requires_grad was True.
This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path for when target.requires_grad was True
3) modifies the MSELoss CriterionTests to verify that the target derivative is checked.
TODO:
1) do we still need check_criterion_jacobian when we run grad/gradgrad checks?
2) ensure the Module tests check when target.requires_grad
3) do we actually test when reduction='none' and reduction='mean'?
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23612166
Pulled By: gchanan
fbshipit-source-id: 4f74d38d8a81063c74e002e07fbb7837b2172a10
Summary:
Fixes https://github.com/pytorch/pytorch/issues/43732.
Requires importing the fft namespace in the C++ API, just like the Python API does, to avoid clobbering the torch::fft function.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43749
Reviewed By: glaringlee
Differential Revision: D23391544
Pulled By: mruberry
fbshipit-source-id: d477d0b6d9a689d5c154ad6c31213a7d96fdf271
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43341
This removes the empty pretty_print(), since it overrides the implementation in the Module base class, which is not as designed here.
Test Plan: Imported from OSS
Reviewed By: pbelevich
Differential Revision: D23244616
Pulled By: glaringlee
fbshipit-source-id: 94b8dfd3697dfc450f53b3b4eee6e9c13cafba7b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43069
The transformer C++ implementation needs to put TransformerEncoderLayer/DecoderLayer and TransformerEncoder/TransformerDecoder in different headers, since TransformerEncoder/Decoder's options classes need TransformerEncoderLayer/DecoderLayer as an input parameter. The header files are split to avoid cyclic inclusion.
Test Plan: Imported from OSS
Reviewed By: yf225
Differential Revision: D23139437
Pulled By: glaringlee
fbshipit-source-id: 3c752ed7702ba18a9742e4d47d049e62d2813de0
Summary:
This PR:
- updates test_op_normalization.py, which verifies that aliases are correctly translated in the JIT
- adds torch.linalg.det as an alias for torch.det
- moves the torch.linalg.outer alias to torch.outer (to be consistent with NumPy)
The torch.linalg.outer alias was put in the linalg namespace erroneously as a placeholder, since it's a "linear algebra op" according to NumPy but is actually still in the main NumPy namespace.
The updates to test_op_normalization are necessary. Previously it was using method_tests to generate tests, and method_tests assumes test suites using it also use the device generic framework, which test_op_normalization did not. For example, some ops require decorators like `skipCPUIfNoLapack`, which only works in device generic test classes. Moving test_op_normalization to the device generic framework also lets these tests run on CPU and CUDA.
Continued reliance on method_tests() is excessive since the test suite is only interested in testing aliasing, and a simpler and more readable `AliasInfo` class is used for the required information. An example impedance mismatch between method_tests and the new tests was how to handle ops in namespaces like torch.linalg.det. In the future this information will likely be folded into a common 'OpInfo' registry in the test suite.
The actual tests performed are similar to what they were previously: a scripted and traced version of the op is run and the test verifies that both graphs do not contain the alias name and do contain the aliased name.
The guidance for adding an alias has been updated accordingly.
cc mattip
Note:
ngimel suggests:
- deprecating and then removing the `torch.ger` name
- reviewing the implementation of `torch.outer`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42802
Reviewed By: zou3519
Differential Revision: D23059883
Pulled By: mruberry
fbshipit-source-id: 11321c2a7fb283a6e7c0d8899849ad7476be42d1
Summary:
This PR adds the `torch.linalg` namespace as part of our continued effort to be more compatible with NumPy. The namespace is tested by adding a single function, `torch.linalg.outer`, and testing it in a new test suite, test_linalg.py. It follows the same pattern as https://github.com/pytorch/pytorch/pull/41911, which added the `torch.fft` namespace.
Future PRs will likely:
- add more functions to torch.linalg
- expand the testing done in test_linalg.py, including legacy functions, like torch.ger
- deprecate existing linalg functions outside of `torch.linalg` in preference to the new namespace
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42664
Reviewed By: ngimel
Differential Revision: D22991019
Pulled By: mruberry
fbshipit-source-id: 39258d9b116a916817b3588f160b141f956e5d0b
Summary:
This PR creates a new namespace, torch.fft (torch::fft), and puts a single function, fft, in it. This function is a simplified version of NumPy's [numpy.fft.fft](https://numpy.org/doc/1.18/reference/generated/numpy.fft.fft.html?highlight=fft#numpy.fft.fft) that accepts no optional arguments. It is intended to demonstrate how to add and document functions in the namespace, and is not intended to deprecate the existing torch.fft function.
Adding this namespace was complicated by the existence of the torch.fft function in Python. Creating a torch.fft Python module makes this name ambiguous: does it refer to a function or module? If the JIT didn't exist, a solution to this problem would have been to make torch.fft refer to a callable class that mimicked both the function and module. The JIT, however, cannot understand this pattern. As a workaround it's required to explicitly `import torch.fft` to access the torch.fft.fft function in Python:
```
import torch.fft
t = torch.randn(128, dtype=torch.cdouble)
torch.fft.fft(t)
```
See https://github.com/pytorch/pytorch/issues/42175 for future work. Another possible future PR is to get the JIT to understand torch.fft as a callable class so it need not be imported explicitly to be used.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41911
Reviewed By: glaringlee
Differential Revision: D22941894
Pulled By: mruberry
fbshipit-source-id: c8e0b44cbe90d21e998ca3832cf3a533f28dbe8d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42215
Specifically on https://github.com/pytorch/pytorch/pull/27477#discussion_r371402079
We would like to support include_last=True for other reduction types like mean and max as well. The current lack of support causes further code fragmentation in DPER (https://www.internalfb.com/intern/diff/D22794469/).
More details: https://www.internalfb.com/intern/diff/D22794469/?dest_fbid=309597093427021&transaction_id=631457624153457
ghstack-source-id: 108733009
Test Plan:
```
buck test mode/dev-nosan //caffe2/test:nn -- "test_EmbeddingBag_per_sample_weights_and_new_offsets_cpu"
```
```
(base) [jianyuhuang@devbig281.ftw3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ TORCH_SHOW_CPP_STACKTRACES=1 buck test mode/dev-nosan //caffe2/test:
nn -- "test_EmbeddingBag_per_sample_weights_and_new_offsets_cpu" --print-passing-details
Parsing buck files: finished in 1.2 sec
Building: finished in 5.5 sec (100%) 10130/10130 jobs, 2 updated
Total time: 6.7 sec
More details at https://www.internalfb.com/intern/buck/build/dbdc2063-69d8-45cb-9146-308a9e8505ef
First unknown argument: --print-passing-details.
Falling back to TestPilot classic.
Trace available for this run at /tmp/testpilot.20200728-195414.1422748.log
TestPilot test runner for Facebook. See https://fburl.com/testpilot for details.
Testpilot build revision cd2638f1f47250eac058b8c36561760027d16add fbpkg f88726c8ebde4ba288e1172a348c7f46 at Mon Jul 27 18:11:43 2020 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/887/t.par
Discovering tests
Running 1 test
Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/844425097242375
✓ caffe2/test:nn - test_EmbeddingBag_per_sample_weights_and_new_offsets_cpu (test_nn.TestNNDeviceTypeCPU) 0.162 1/1 (passed)
Test output:
> /data/users/jianyuhuang/fbsource/fbcode/buck-out/dev/gen/caffe2/test/nn#binary,link-tree/torch/_utils_internal.py:103: DeprecationWarning: This is a NOOP in python >= 3.7, its just too dangerous with how we write code at facebook. Instead we patch os.fork and multiprocessing which can raise exceptions if a deadlock would happen.
> threadSafeForkRegisterAtFork()
> /usr/local/fbcode/platform007/lib/python3.7/importlib/_bootstrap.py:219: ImportWarning: can't resolve package from __spec__ or __package__, falling back on __name__
and __path__
> return f(*args, **kwds)
> test_EmbeddingBag_per_sample_weights_and_new_offsets_cpu (test_nn.TestNNDeviceTypeCPU) ... Couldn't download test skip set, leaving all tests enabled...
> ok
>
> ----------------------------------------------------------------------
> Ran 1 test in 0.162s
>
> OK
Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/844425097242375
Summary (total time 5.54s):
PASS: 1
FAIL: 0
SKIP: 0
FATAL: 0
TIMEOUT: 0
OMIT: 0
Did _not_ run with tpx. See https://fburl.com/tpx for details.
```
Reviewed By: dzhulgakov
Differential Revision: D22801881
fbshipit-source-id: 80a624465727081bb9bf55c28419695a3d79c6e5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42037
This is to fix #41951
Test Plan: Imported from OSS
Reviewed By: yf225
Differential Revision: D22764717
Pulled By: glaringlee
fbshipit-source-id: e6da0aeb05a2356f52446e6d5fad391f2cd1cf6f
Summary:
xref gh-38010 and gh-38011.
After this PR, there should be only two warnings:
```
pytorch/docs/source/index.rst:65: WARNING: toctree contains reference to nonexisting \
document 'torchvision/index'
WARNING: autodoc: failed to import class 'tensorboard.writer.SummaryWriter' from module \
'torch.utils'; the following exception was raised:
No module named 'tensorboard'
```
If tensorboard and torchvision are prerequisites to building docs, they should be added to the `requirements.txt`.
As for breaking up quantization into smaller pieces: I split out the list of supported operations and the list of modules to separate documents. I think this makes the page flow better, makes it much "lighter" in terms of page cost, and also removes some warnings since the same class names appear in multiple sub-modules.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41321
Reviewed By: ngimel
Differential Revision: D22753099
Pulled By: mruberry
fbshipit-source-id: d504787fcf1104a0b6e3d1c12747ec53450841da
Summary:
Update the API for accessing grad in C++ to avoid unexpected thread-safety issues.
In particular, with the current API, a check like `t.grad().defined()` is not thread safe.
- This introduces `t.mutable_grad()` that should be used when getting a mutable version of the saved gradient. This function is **not** thread safe.
- The `Tensor& grad()` API is now removed. We could not do a deprecation cycle, as most of our call sites use non-const Tensors that pick the non-const overload. This would lead to most calls hitting the warning, which would be too verbose for all the users.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40887
Reviewed By: ezyang
Differential Revision: D22343932
Pulled By: albanD
fbshipit-source-id: d5eb909bb743bc20caaf2098196e18ca4110c5d2
Summary:
BCELoss currently uses different broadcasting semantics than numpy. Since previous versions of PyTorch have thrown a warning in these cases telling the user that input sizes should match, and since the CUDA and CPU results differ when sizes do not match, it makes sense to upgrade the size mismatch warning to an error.
We can consider supporting numpy broadcasting semantics in BCELoss in the future if needed.
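A sketch of the upgraded check (assuming it raises `ValueError` from the Python functional, as in current releases):
```python
import torch
import torch.nn.functional as F

inp = torch.rand(4, 1)
target = torch.rand(4, 4)   # sizes differ: previously a warning, now an error
try:
    F.binary_cross_entropy(inp, target)
except ValueError as err:
    print(err)
```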
Closes https://github.com/pytorch/pytorch/issues/40023
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41426
Reviewed By: zou3519
Differential Revision: D22540841
Pulled By: ezyang
fbshipit-source-id: 6c6d94c78fa0ae30ebe385d05a9e3501a42b3652
Summary:
This is a duplicate of https://github.com/pytorch/pytorch/pull/38362
"This PR completes Interpolate's deprecation process for recomputing the scales values, by updating the default value of the parameter recompute_scale_factor as planned for pytorch 1.6.0.
The warning message is also updated accordingly."
I'm recreating this PR as previous one is not being updated.
cc gchanan
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39453
Reviewed By: hl475
Differential Revision: D21955284
Pulled By: houseroad
fbshipit-source-id: 911585d39273a9f8de30d47e88f57562216968d8
Summary:
Remove the `-std=c++14` flag from `utils.cmake`, since the PyTorch C++ API can be invoked by any compiler compliant with the C++14 standard or later.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40510
Differential Revision: D22253313
Pulled By: malfet
fbshipit-source-id: ff731525868b251c27928fc98b0724080ead9be2
Summary:
Slightly modified Adam, following the Python implementation, and the `ProducesPyTorchValues` tests pass. I had a problem with another test, though (see commit c1a6241676ab84fc531c1c3a10f964aa5704092e): it seems that optimizing for two steps with the same optimizer vs. optimizing for two steps using freshly initialized objects will produce the same output.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40009
Differential Revision: D22096053
Pulled By: glaringlee
fbshipit-source-id: a31a8f5488cb37c53752ddf15436efabdba67dc4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35614
Python 2 has reached end-of-life and is no longer supported by PyTorch.
Now we can clean up a lot of cruft that we put in place to support it.
These changes were all done manually, and I skipped anything that seemed
like it would take more than a few seconds, so I think it makes sense to
review it manually as well.
Test Plan: CI
Differential Revision: D20842876
Pulled By: dreiss
fbshipit-source-id: 18abf0d324ed2185ec6d27c864e935d856dcc6ad
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30922
New c++14 feature we can use now
ghstack-source-id: 103767403
Test Plan: waitforsandcastle
Differential Revision: D18869644
fbshipit-source-id: 54541c8004b2116386668a31eb9b0410a603b7dc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37548
Moving RecordFunction from torch::autograd::profiler into at namespace
Test Plan:
CI
Imported from OSS
Differential Revision: D21315852
fbshipit-source-id: 4a4dbabf116c162f9aef0da8606590ec3f3847aa
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37704
If the input tensor cannot be chunked, run `parallel_apply` on fewer devices.
Modify the input tensor dimension in `DataParallelUsesAllAvailableCUDADevices_CUDA` to be chunkable by any number of available CUDA devices.
Test Plan: Run `test/cpp/api/parallel` on machine with 6 GPUs
Differential Revision: D21365416
fbshipit-source-id: 60fdfed4a0e6256b2c966c2ea3e8d0bfb298d9a8
Summary:
Allows creation of _NamedAnyModule_ directly from _AnyModule_, e.g.
```
auto a=torch::nn::AnyModule(torch::nn::Linear(1,2));
auto m=torch::nn::NamedAnyModule("fc", a);
```
Without the public constructor, it would be necessary to recast the AnyModule to its underlying type, then have the constructor cast it back to AnyModule.
With the public AnyModule constructor, it is possible to do
```
auto q=Sequential({m});
```
or
```
q->push_back(m.name, m.module());
```
(works in conjunction with PR https://github.com/pytorch/pytorch/issues/36720 which allowed adding _AnyModule_ directly)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36869
Differential Revision: D21110074
Pulled By: yf225
fbshipit-source-id: aaea02282b9024824785e54d8732c0a12c850977
Summary:
It was called twice, but the result of the first invocation was not used.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36453
Differential Revision: D20993535
Pulled By: yf225
fbshipit-source-id: 4d85207a936b846866424903d7622905f3fddd36
Summary:
Modified messages in the check of default options for the Adam optimizer.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36161
Differential Revision: D20920140
Pulled By: yf225
fbshipit-source-id: e697ef1741d4dd86f7f18dc0be2c3b4bd3894d8f
Summary:
This supersedes https://github.com/pytorch/pytorch/pull/35698.
`abs` is a C-style function that takes only an integral argument; `std::abs` is polymorphic and can be applied to both integral and floating-point types.
This PR also increases `kBatchSize` in the `test_optimizer_xor` function in `test/cpp/api/optim.cpp` to fix the `OptimTest.XORConvergence_LBFGS` failure under ASAN.
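For illustration, a minimal standalone sketch of the distinction (not taken from the PR itself):
```cpp
#include <cmath>
#include <iostream>

int main() {
  double x = -1.5;
  // Depending on which headers are included, unqualified abs(x) may bind to
  // the C int overload and silently truncate to 1; std::abs reliably selects
  // the double overload.
  std::cout << std::abs(x) << "\n";  // prints 1.5
}
```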
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35974
Test Plan: CI
Reviewed By: pbelevich
Differential Revision: D20853570
Pulled By: yf225
fbshipit-source-id: 6135588df2426c5b974e4e097b416955d1907bd4
Summary:
`abs` is a C-style function that takes only an integral argument; `std::abs` is polymorphic and can be applied to both integral and floating-point types.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35698
Test Plan: CI
Differential Revision: D20749588
Pulled By: malfet
fbshipit-source-id: b6640af67587650786366fe3907384bc8803069f
Summary:
1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer)
2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function.
3. Added a BC-compatibility serialization test for LBFGS
4. Removed mentions of parameters_ in optimizer.cpp and de-virtualized all functions
5. Made defaults_ an optional argument in all optimizers except SGD
**TODO**: add BC-breaking notes for this PR
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Differential Revision: D20678162
Pulled By: yf225
fbshipit-source-id: 74e062e42d86dc118f0fbaddd794e438b2eaf35a
Summary:
1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer)
2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function.
3. Added a BC-compatibility serialization test for LBFGS
4. Removed mentions of parameters_ in optimizer.cpp and de-virtualized all functions
5. Made defaults_ an optional argument in all optimizers except SGD
**TODO**: add BC-breaking notes for this PR
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957
Differential Revision: D20645945
Pulled By: yf225
fbshipit-source-id: 383588065bf1859b38f0ad0a25d93d41e153c96e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35163
This PR is BC-breaking in the following way:
Renaming:
- `torch::nn::functional::MultiLabelMarginLossFuncOptions` -> `torch::nn::functional::MultilabelMarginLossFuncOptions`
- `torch::nn::functional::MultiLabelSoftMarginLossFuncOptions` -> `torch::nn::functional::MultilabelSoftMarginLossFuncOptions`
Reason for renaming: to be consistent with the corresponding functional name after camel case to snake case conversion (e.g. the `multilabel_margin_loss` functional should use `MultilabelMarginLossFuncOptions` as options)
Test Plan: Imported from OSS
Differential Revision: D20582598
Pulled By: yf225
fbshipit-source-id: 0f5bdb8249d901b310875a14320449a2fdfa8ecd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35025
This PR fixes `F::interpolate` and `torch::nn::Upsample` implementation to match the Python API implementation.
**This PR is BC-breaking in the following way:**
There are changes to `UpsampleOptions` and `InterpolateFuncOptions`:
- `size` is changed from `std::vector<int64_t>` to `c10::optional<std::vector<int64_t>>`. If you want to pass a list of `int64_t` to this argument, you must pass it as `std::vector<int64_t>`.
- `scale_factor` is changed from `std::vector<double>` to `c10::optional<std::vector<double>>`. If you want to pass a list of `double` to this argument, you must pass it as `std::vector<double>`.
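A minimal usage sketch under the new signatures (input shape and target size are illustrative):
```cpp
namespace F = torch::nn::functional;

auto input = torch::randn({1, 3, 8, 8});
// `size` is now c10::optional<std::vector<int64_t>>, so the list must be
// passed explicitly as std::vector<int64_t>.
auto out = F::interpolate(
    input,
    F::InterpolateFuncOptions()
        .size(std::vector<int64_t>({16, 16}))
        .mode(torch::kNearest));
```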
**TODO**: cherry-pick this PR into v1.5 release branch.
Test Plan: Imported from OSS
Differential Revision: D20559892
Pulled By: yf225
fbshipit-source-id: ac18609e351a9f2931eaeced8966b9491b2995f7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35023
This PR fixes Conv and ConvTranspose implementation to match the Python API implementation.
**TODO**: cherry-pick this PR into v1.5 release branch.
Test Plan: Imported from OSS
Differential Revision: D20559889
Pulled By: yf225
fbshipit-source-id: 53783a7398ef968ec6d25b6f568fde44907417c5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35022
This PR fixes `AdaptiveAvgPool{2,3}d` and `AdaptiveMaxPool{2,3}d` implementation to match the Python API implementation. Particularly, `output_size` is changed to accept `c10::nullopt` in its elements, matching the Python API behavior.
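A minimal sketch of the new behavior (shapes are illustrative):
```cpp
// A c10::nullopt element keeps that input dimension's size unchanged,
// mirroring Python's `None`.
torch::nn::AdaptiveAvgPool2d pool(
    torch::nn::AdaptiveAvgPool2dOptions({7, c10::nullopt}));
auto y = pool(torch::randn({1, 3, 32, 32}));  // -> {1, 3, 7, 32}
```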
**TODO**: cherry-pick this PR into v1.5 release branch.
Test Plan: Imported from OSS
Differential Revision: D20559890
Pulled By: yf225
fbshipit-source-id: ccddbd278dd39165cf1dda11fc0e49387c76dbef
Summary:
1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer)
2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function.
3. Added a BC-compatibility serialization test for LBFGS
4. Removed mentions of parameters_ in optimizer.cpp and de-virtualized all functions
5. Made defaults_ an optional argument in all optimizers except SGD
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Differential Revision: D20518647
Pulled By: anjali411
fbshipit-source-id: 4760d1d29df1784e2d01e2a476d2a08e9df4ea1c
Summary:
Follow-ups after this PR:
* Remove `LossClosureOptimizer`, and merge `Optimizer` into `OptimizerBase` (and rename the merged class to Optimizer)
* Merge the LBFGS-specific serialize test function and the generic `test_serialize_optimizer` function, possibly by passing a bool `has_only_global_state` flag into the `test_serialize_optimizer` function to denote whether `size()` should be equal to 1 or 2?
* https://github.com/pytorch/pytorch/pull/34564#discussion_r393780303
* It seems that we don't have the equivalent `XORConvergence_LBFGS` test like the other optimizers, and it would be good to add one
* Remove mentions of `parameters_` in optimizer.cpp, de-virtualize all functions, and remove the `OptimizerBase(std::vector<Tensor> parameters)` constructor from `OptimizerBase`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34564
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Differential Revision: D20495701
Pulled By: anjali411
fbshipit-source-id: 6d35286d2decb6f7dff93d9d3e57515770666622
Summary:
This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API.
**BC-breaking changes:**
- Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API.
- RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API.
- RNN / LSTM / GRU layers now have a `forward_with_packed_input` method which accepts a `PackedSequence` as input and optionally a hidden state, matching the `forward(PackedSequence, ...)` variant in the Python API.
- RNN / LSTM / GRU layers no longer have these fields: `w_ih` / `w_hh` / `b_ih` / `b_hh`. Instead, to access the weights and biases of the gates, users should do e.g. `rnn->named_parameters()["weight_ih_l0"]`, which mirrors the Python API `rnn.weight_ih_l0`.
- In `RNNOptions`
- `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU`
- `layers` -> `num_layers`
- `with_bias` -> `bias`
- In `LSTMOptions`
- `layers` -> `num_layers`
- `with_bias` -> `bias`
- In `GRUOptions`
- `layers` -> `num_layers`
- `with_bias` -> `bias`
The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflect the revised API design.
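A minimal sketch of the new forward return types (dimensions are illustrative):
```cpp
torch::nn::LSTM lstm(torch::nn::LSTMOptions(10, 20).num_layers(2));
auto input = torch::randn({5, 3, 10});  // (seq_len, batch, input_size)

// forward() now returns std::tuple<Tensor, std::tuple<Tensor, Tensor>>.
auto result = lstm->forward(input);
torch::Tensor output = std::get<0>(result);
torch::Tensor h_n = std::get<0>(std::get<1>(result));
torch::Tensor c_n = std::get<1>(std::get<1>(result));
```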
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322
Differential Revision: D20458302
Pulled By: yf225
fbshipit-source-id: ffff2ae1ddb1c742c966956f6ad4d7fba03dc54d
Summary:
This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API.
**BC-breaking changes:**
- Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API.
- RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API.
- RNN / LSTM / GRU now have a `forward_with_packed_input` method which accepts a `PackedSequence` as input and optionally a hidden state, matching the `forward(PackedSequence, ...)` variant in the Python API.
- In `RNNOptions`
- `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU`
- `layers` -> `num_layers`
- `with_bias` -> `bias`
- In `LSTMOptions`
- `layers` -> `num_layers`
- `with_bias` -> `bias`
- In `GRUOptions`
- `layers` -> `num_layers`
- `with_bias` -> `bias`
The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflect the revised API design.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322
Differential Revision: D20311699
Pulled By: yf225
fbshipit-source-id: e2b60fc7bac64367a8434647d74c08568a7b28f7
Summary:
This PR adds `RNNCell` / `LSTMCell` / `GRUCell` layers to the C++ frontend, with implementations exactly matching the Python API equivalent.
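A minimal usage sketch (sizes are illustrative):
```cpp
torch::nn::LSTMCell cell(torch::nn::LSTMCellOptions(10, 20));
auto input = torch::randn({3, 10});  // (batch, input_size)
auto state = cell->forward(input);   // std::tuple<Tensor, Tensor> of (h, c)
torch::Tensor h = std::get<0>(state);
torch::Tensor c = std::get<1>(state);
```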
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34400
Differential Revision: D20316859
Pulled By: yf225
fbshipit-source-id: bb7cee092622334043c0d0fd0fcb4e75e707699c
Summary:
- `torch::nn::functional` functions must provide example for how to use the corresponding functional options
- `torch::nn::functional` functions must link to the corresponding functional options
- remove `TORCH_NN_FUNCTIONAL_USE_MODULE_OPTIONS` macro, and put `torch::nn::functional` options docs inside the functional namespace, right above functional declaration
- `torch::nn::functional` options docs should not link back to torch::nn layers. Instead, they should have links to `torch::nn::functional::xxx`
----
This PR is BC-breaking in the following way:
`TORCH_NN_FUNCTIONAL_USE_MODULE_OPTIONS` macro is removed, and user should explicitly write
```cpp
namespace functional {
using SomeFuncOptions = SomeModuleOptions;
} // namespace functional
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34688
Differential Revision: D20431251
Pulled By: yf225
fbshipit-source-id: 7d4f27dca3aad2a1e523690927d7afb261b9d308
Summary:
This PR is BC-breaking in the following way:
- The deprecated `torch::nn::BatchNorm` is removed in favor of `torch::nn::BatchNorm{1,2,3}d`
- The deprecated `torch::nn::FeatureDropout` is removed in favor of `torch::nn::Dropout{2,3}d`
- The deprecated `torch::nn::modules_ordered_dict` is removed. User should do `Sequential sequential({{"m1", MyModule(1)}, {"m2", MyModule(2)}})` instead.
- The deprecated `torch::nn::init::Nonlinearity` is removed, in favor of the following enums:
- `torch::kLinear`
- `torch::kConv1D`
- `torch::kConv2D`
- `torch::kConv3D`
- `torch::kConvTranspose1D`
- `torch::kConvTranspose2D`
- `torch::kConvTranspose3D`
- `torch::kSigmoid`
- `torch::kTanh`
- `torch::kReLU`
- `torch::kLeakyReLU`
- The deprecated `torch::nn::init::FanMode` is removed, in favor of the following enums:
- `torch::kFanIn`
- `torch::kFanOut`
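A minimal sketch of the enum-based replacement in use (the weight shape is illustrative):
```cpp
auto w = torch::empty({16, 8});
// The removed Nonlinearity / FanMode classes are replaced by plain enums:
torch::nn::init::kaiming_normal_(w, /*a=*/0, torch::kFanOut, torch::kReLU);
```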
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34508
Differential Revision: D20351601
Pulled By: yf225
fbshipit-source-id: cca0cd112f29a31bb023e348ca8f82780e42bea3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34515
Once upon a time we thought this was necessary. In reality it is not, so
removing it.
For backcompat, our public interface (defined in `api/`) still has
typedefs to the old `script::` names.
There was only one collision: `Pass` as a `Stmt` and `Pass` as a graph
transform. I renamed one of them.
Test Plan: Imported from OSS
Differential Revision: D20353503
Pulled By: suo
fbshipit-source-id: 48bb911ce75120a8c9e0c6fb65262ef775dfba93
Summary:
This PR updates C++ API torch::nn layer docs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34522
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Differential Revision: D20380832
Pulled By: yf225
fbshipit-source-id: ee99a838ec05c6ce2a23aa97555707e507d09958
Summary:
**This PR is BC-breaking in the following way:**
In RMSpropOptions:
1. learning_rate is renamed to lr.
**Test plan before 1.5 release:**
Test that in 1.5 we can load a C++ RMSprop optimizer that was serialized in 1.4, and their states are the same.
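A minimal sketch of the renamed option (parameter values are illustrative):
```cpp
std::vector<torch::Tensor> params = {torch::randn({2, 2}, torch::requires_grad())};
// `lr` replaces the old `learning_rate` accessor, matching the Python name.
torch::optim::RMSprop optimizer(params, torch::optim::RMSpropOptions(/*lr=*/0.01));
```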
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33450
Differential Revision: D20366623
Pulled By: anjali411
fbshipit-source-id: 83250be9b583a766927e0e22a4de8b0765379451
Summary:
cuDNN needs it, MIOpen doesn't. However, since it seems to be the PyTorch preference to not introduce ROCm-specific logic in the python layer, we need to add a C++ function to detect if rnn weight flattening is needed.
This PR will be needed to fix the rnn unit test errors arising for PR https://github.com/pytorch/pytorch/issues/33837.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34265
Differential Revision: D20345105
Pulled By: ezyang
fbshipit-source-id: a2588a6e2ac6f7d1edf2b7872bc6a879a7df96ec
Summary:
`ConcreteModuleTypeBuilder` used to keep parameters together with all others attributes in an `unordered_map` often leading to reordering them while building up the type. Parameter order is semantically meaningful, so we need to preserve it.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34131
Differential Revision: D20331542
Pulled By: suo
fbshipit-source-id: 5b860025f7902654d6099751d3fb14b12f6f5a67
Summary:
One example in the current docs for `torch::nn::ModuleList` doesn't compile, and this PR fixes it.
Fixes https://github.com/pytorch/pytorch/issues/32414.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34463
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Differential Revision: D20331120
Pulled By: yf225
fbshipit-source-id: 50bb078fe1a900c9114d5434e92dc40ee13b52bf
Summary:
Most of the function implementation and test code are translated from the Python version.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33652
Differential Revision: D20052211
Pulled By: yf225
fbshipit-source-id: ce6767db54364f91ef4f06674239a12278c2752a
Summary:
The same header `<torch/nn/functional/conv.h>` is included twice.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33656
Differential Revision: D20056913
Pulled By: yf225
fbshipit-source-id: b1563035c9821731b99c26eec130ff0b9cc627a7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33027
This PR allows default arguments in module's forward method to be skipped when module is used in `torch::nn::Sequential`, by introducing the `FORWARD_HAS_DEFAULT_ARGS` macro and requiring that all modules that have default arguments in its forward method must have a corresponding `FORWARD_HAS_DEFAULT_ARGS` macro call.
Fixes issue mentioned in https://github.com/pytorch/pytorch/issues/30931#issuecomment-564144468.
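A minimal sketch of the macro in use (the module `M` and its defaults are illustrative, not from the PR):
```cpp
struct MImpl : torch::nn::Module {
  torch::Tensor forward(const torch::Tensor& x, int64_t b = 2, double c = 3.0) {
    return x + b * c;
  }

 protected:
  // One {argument index, wrapped default} pair per defaulted forward() argument.
  FORWARD_HAS_DEFAULT_ARGS(
      {1, torch::nn::AnyValue(static_cast<int64_t>(2))},
      {2, torch::nn::AnyValue(3.0)})
};
TORCH_MODULE(M);

// With the macro in place, Sequential can call forward() with only `x`:
// torch::nn::Sequential seq(M());
// seq->forward(torch::ones({1}));
```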
Test Plan: Imported from OSS
Differential Revision: D19777815
Pulled By: yf225
fbshipit-source-id: 73282fcf63377530063e0092a9d84b6c139d2e32
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33026
This PR contains necessary changes to prepare for https://github.com/pytorch/pytorch/pull/33027. It exposes the following classes to public:
1. `torch::nn::AnyValue`, because if the user has optional arguments in their module's forward method, they must also use the `FORWARD_HAS_DEFAULT_ARGS` macro and pass in the default values for those optional arguments wrapped by `torch::nn::AnyValue`.
2. `torch::nn::AnyModuleHolder`, because `torch::nn::Module` needs to declare it as a friend class for it to be able to access `torch::nn::Module`'s protected methods such as `_forward_has_default_args` / `_forward_num_required_args` / `_forward_populate_default_args`.
Test Plan: Imported from OSS
Differential Revision: D19777814
Pulled By: yf225
fbshipit-source-id: 1c9d5aa24f0689154752c426a83ee98f64c9d02f
Summary:
Pull Request resolved: https://github.com/pytorch/glow/pull/4049
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27477
We would like to add the intra-op parallelization support for the EmbeddingBag operator.
This should bring speedup for the DLRM benchmark:
https://github.com/pytorch/pytorch/pull/24385
Benchmark code:
```
from __future__ import absolute_import, division, print_function, unicode_literals
import torch
import time
eb = torch.nn.EmbeddingBag(1000000, 64, mode='sum')
input = torch.LongTensor(1500).random_(0, 1000000)
offsets = torch.zeros(64, dtype=torch.int64)
niter = 10000
s = time.time()
for _ in range(niter):
    out = eb(input, offsets)
time_per_iter = (time.time() - s) / niter
print('time_per_iter', time_per_iter)
print('GB/s', (input.numel() * 64 * 4 + out.numel() * 4) / time_per_iter / 1e9)
```
The following results are single core on Skylake T6:

| Configuration | time_per_iter (s) | GB/s |
| --- | --- | --- |
| Before our change (with the original caffe2::EmbeddingLookup) | 6.313693523406982e-05 | 6.341517821789133 |
| After our change, using the EmbeddingLookupIdx API which takes offsets instead of lengths | 5.7627105712890626e-05 | 6.947841559053659 |
| With Intel's PR: https://github.com/pytorch/pytorch/pull/24385 | 7.393271923065185e-05 | 5.415518381664018 |
For multi-core performance, because Clang doesn't work with OMP, I can only see the single-core performance on SKL T6.
ghstack-source-id: 97124557
Test Plan:
With D16990830:
```
buck run mode/dev //caffe2/caffe2/perfkernels:embedding_bench
```
With D17750961:
```
buck run mode/opt //experimental/jianyuhuang/embeddingbag:eb
buck run mode/opt-lto //experimental/jianyuhuang/embeddingbag:eb
```
OSS test
```
python run_test.py -i nn -- TestNNDeviceTypeCPU.test_EmbeddingBag_per_sample_weights_and_new_offsets_cpu
```
Buck test
```
buck test mode/dev-nosan //caffe2/test:nn -- "test_EmbeddingBag_per_sample_weights_and_new_offsets_cpu"
OMP_NUM_THREADS=3 buck test mode/opt -c pytorch.parallel_backend=tbb //caffe2/test:nn -- "test_EmbeddingBag_per_sample_weights_and_new_offsets" --print-passing-details
```
Generate the AVX2 code for embedding_lookup_idx_avx2.cc:
```
python hp_emblookup_codegen.py --use-offsets
```
Differential Revision: D17768404
fbshipit-source-id: 8dcd15a62d75b737fa97e0eff17f347052675700
Summary:
7zip and cmake are part of the base image, so there is no need to re-install them. Removing the install step can make build/test more stable.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30897
Differential Revision: D19232961
Pulled By: mingbowan
fbshipit-source-id: fa3bbd1325839a2a977bf13fdbd97fda43793b8d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30918
This is a C++14 feature we can use now
ghstack-source-id: 95811482
Test Plan: waitforsandcastle
Differential Revision: D18869636
fbshipit-source-id: b5b3d78b61b6ceb2deda509131f8502e95b1d057
Summary:
Currently, both `Conv{1,2,3}dOptions` and `ConvTranspose{1,2,3}dOptions` are aliases of the `ConvOptions<{1,2,3}>` class, which causes confusion because the `ConvOptions` class has parameters such as `transposed` that shouldn't be exposed to the end user. (This has caused issues such as https://github.com/pytorch/pytorch/issues/30931.) This PR makes the following improvements:
1. Rename the original `torch::nn::ConvOptions<N>` class to `torch::nn::detail::ConvNdOptions<N>` class, to signify that it's an implementation detail and should not be used publicly.
2. Create new classes `torch::nn::ConvOptions<N>` and `torch::nn::ConvTransposeOptions<N>`, which have parameters that exactly match the constructor of `torch.nn.Conv{1,2,3}d` and `torch.nn.ConvTranspose{1,2,3}d` in Python API.
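A minimal sketch of the resulting public options (channel and kernel sizes are illustrative):
```cpp
// Options now mirror the Python constructors; `transposed` is no longer exposed.
torch::nn::Conv2d conv(
    torch::nn::Conv2dOptions(3, 16, /*kernel_size=*/3).stride(2).bias(false));
torch::nn::ConvTranspose2d deconv(
    torch::nn::ConvTranspose2dOptions(16, 3, 3).stride(2).output_padding(1));
```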
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31005
Differential Revision: D18898048
Pulled By: yf225
fbshipit-source-id: 7663d646304c8cb004ca7f4aa4e70d3612c7bc75
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30927
Classes that are used virtually (e.g. have virtual methods) must have a virtual destructor, or bad things happen.
ghstack-source-id: 95144736
Test Plan: waitforsandcastle
Differential Revision: D18870351
fbshipit-source-id: 333af4e95469fdd9103aa9ef17b40cbc4a343f82
Summary:
This PR adds docs for how we expose declarations in `at::` to `torch::`, to make the semantics more clear.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30760
Differential Revision: D18833081
Pulled By: yf225
fbshipit-source-id: eff4d8815c67f681ce3a930ce99771cf2e55dbd9
Summary:
This PR removes `namespace F = torch::nn::functional` from `torch/nn/modules/batchnorm.h`, so that people don't have to define `torch::nn::functional` as `F` if they don't want to.
Fixes https://github.com/pytorch/pytorch/issues/30682.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30684
Differential Revision: D18795717
Pulled By: yf225
fbshipit-source-id: c9feffbeb632cc6b4ce3e6c22c0a78533bab69ad
Summary:
This PR updates `torch::pickle_save` to use the new zipfile format introduced in #29232 and adds `torch::pickle_load` which can decode the zipfile format. Now that `torch.save/load` use this format as well (if the `_use_new_zipfile_serialization` flag is `True`), raw values saved in Python can be loaded in C++ and vice versa.
Fixes #20356
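A minimal round-trip sketch (the tensor value is illustrative):
```cpp
torch::IValue value(torch::ones({2, 2}));
std::vector<char> data = torch::pickle_save(value);  // zipfile-format bytes
torch::IValue loaded = torch::pickle_load(data);
torch::Tensor t = loaded.toTensor();
```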
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30108
Pulled By: driazati
Differential Revision: D18607087
fbshipit-source-id: 067cdd5b1cf9c30ddc7e2e5021a8cceee62d8a14
Summary:
The original design of `torch::nn::utils::clip_grad_norm_` / `clip_grad_value_` takes input by non-const reference, which prevents users from passing rvalue reference as input into the functions. This PR changes the functions to take input by value, which matches the Python version's semantics, and also adheres to the C++ API convention that if a function modifies its input in-place, it should take that input by value.
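A minimal sketch of the now-legal rvalue call (the model is illustrative):
```cpp
torch::nn::Linear model(10, 10);
auto loss = model(torch::randn({4, 10})).sum();
loss.backward();
// `model->parameters()` is an rvalue; pass-by-value now accepts it directly.
torch::nn::utils::clip_grad_norm_(model->parameters(), /*max_norm=*/1.0);
```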
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30216
Differential Revision: D18632543
Pulled By: yf225
fbshipit-source-id: 97a09d6467f982fe9c8120f483a9c07fcf13699e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28287
This PR eliminates the static distinction between
Tensor and Variable. Every Variable is a Tensor, no need to static_cast
or call the Variable constructor.
To do this, I need Tensor to have API parity with Variable. I have already
moved most of the methods I don't want in Tensor off Variable.
These implementations are all placed in Tensor.cpp.
One API difference is that all Variable methods now have const, so we no longer
have faux const-correctness (see https://github.com/zdevito/ATen/issues/27 for
back story)
This diff is BC breaking in a few ways:
- Because torch::autograd::Variable is now just an alias of at::Tensor, ADL for
`torch::autograd` functions no longer works, you have to explicitly qualify
them with `torch::autograd` (examples: `torch/nn/parallel/data_parallel.h`)
- Because Variable and Tensor are now the same type, code which assumes that
they are different types (e.g., for the purposes of templating, or enable_if checks)
will not work until you delete the (now) redundant overload/specialization.
(examples: `torch/nn/modules/container/any.h`, `torch/csrc/utils/pybind.h`)
Some other notes:
- I'm not sure what was going on with the old template implementation of `extract_vars`,
but I couldn't get the SFINAE version to work. Replacing it with an overloading-based version
made it work.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D18571426
Pulled By: ezyang
fbshipit-source-id: 2ea8151e5f1d8512cdebf1345399642e68b707b8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30146
This PR fixes naming for kl_div and binary_cross_entropy functional options, to be more consistent with the naming scheme of other functional options.
Test Plan: Imported from OSS
Differential Revision: D18618971
Pulled By: yf225
fbshipit-source-id: 2af62c1a0ace2cd0c36c2f1071639bf131d8fe61
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30146
This PR fixes naming for kl_div and binary_cross_entropy functional options, to be more consistent with the naming scheme of other functional options.
Test Plan: Imported from OSS
Differential Revision: D18612158
Pulled By: yf225
fbshipit-source-id: 8c403fa1c2a0a65734a3ec2387cc0937c46cab24
Summary:
Hi yf225,
I have a few doubts related to the implementation:
1) What tests do I have to write?
2) What does _load_state_from_dict do?
3) Do I need to override the reset() function? I can not see its utility.
4) InstanceNormOptions could be replaced with BatchNormOptions, but I find that
`track_running_status` is not defined; instead, `stateful` is defined.
InstanceNorm{1,2,3}d https://github.com/pytorch/pytorch/issues/25883
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28790
Differential Revision: D18588666
Pulled By: yf225
fbshipit-source-id: bb9b81f01f62c3fc8765fa0ba0716768087ee155
Summary:
Since torchvision is not using input_channels / output_channels / with_bias in ConvOptions anymore (https://github.com/pytorch/vision/pull/1576), we can remove the bridges now.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29838
Differential Revision: D18597943
Pulled By: yf225
fbshipit-source-id: 59101437f032f042574998eb90eaf0be09352364
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30112
Currently, we have torch::nn functionals that takes `input` as `Tensor&` in order to be able to in-place change `input`'s value. We likely shouldn't do this because it will prevent the following use case:
```cpp
F::elu(torch::tensor(1), F::ELUFuncOptions().inplace(true))
```
The solution is to change the type of `input` to `Tensor`, so that we can pass an rvalue into the functional.
Test Plan: Imported from OSS
Differential Revision: D18601580
Pulled By: yf225
fbshipit-source-id: 639a86eb62f6c986b0f20bf7e201983e83126e73
Summary:
Hi yf225 , I have added **NLLLoss and CrossEntropyLoss.**
Also, while using log_softmax in cross_entropy_loss, I am getting an error:
```
../caffe2/../torch/csrc/api/include/torch/nn/functional/loss.h:537:63: error: no matching function for call to ‘log_softmax(const at::Tensor&)’
const Tensor& log_softmax_input = torch::log_softmax(input);
aten/src/ATen/Functions.h:5551:22: note: candidate: at::Tensor at::log_softmax(const at::Tensor&, int64_t, c10::optional<c10::ScalarType>)
static inline Tensor log_softmax(const Tensor & self, int64_t dim, c10::optional<ScalarType> dtype) {
^~~~~~~~~~~
aten/src/ATen/Functions.h:5551:22: note: candidate expects 3 arguments, 1 provided
```
I think the other two parameters should be optional, as in the Python frontend (shown in the documentation at https://pytorch.org/docs/stable/nn.functional.html#torch.nn.functional.log_softmax). Otherwise, there were no errors in the build and the tests have passed.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29812
Differential Revision: D18548249
Pulled By: yf225
fbshipit-source-id: 2ab350abd2a6f498d4dba2345f51ad87471f3038
Summary:
Since torchvision is not using input_channels / output_channels / with_bias in ConvOptions anymore (https://github.com/pytorch/vision/pull/1576), we can remove the bridges now.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29838
Differential Revision: D18531481
Pulled By: yf225
fbshipit-source-id: e48d9e8cf110095f83d9ed18b9fec020ec725f3e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29839
std::to_string isn't reliably available on Android. Use c10::to_string
instead in some more files that we want to add to some Android builds.
Test Plan: CI
Reviewed By: linbinyu
Differential Revision: D18509295
fbshipit-source-id: 678af1abbea05777310499634ab01afbe21134d8
Summary:
This PR adds `reset_parameters` to the torch::nn modules whose Python version also has `reset_parameters` defined, so that there is better parity between Python and C++ version.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29832
Differential Revision: D18515939
Pulled By: yf225
fbshipit-source-id: 5aa23e5c7ce1026787c04ffeb6c7f167620dd491
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29673
Following https://github.com/pytorch/pytorch/pull/29364 and https://github.com/pytorch/pytorch/pull/29404, this PR makes `F::EmbeddingFuncOptions` and `F::EmbeddingBagFuncOptions` separate classes from `torch::nn::EmbeddingOptions` and `torch::nn::EmbeddingBagOptions`, so that it's easier to enforce that arguments such as `num_embeddings` and `embedding_dim` are required for `torch::nn::EmbeddingOptions` and `torch::nn::EmbeddingBagOptions`.
Test Plan: Imported from OSS
Differential Revision: D18462540
Pulled By: yf225
fbshipit-source-id: f2abf431e48675b0a9d7f6f398cdb90ff9037c35
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29632
This PR is BC-breaking in the following way:
Previously, C++ `torch::tensor` with a floating-point literal with no suffix (e.g. `torch::tensor(1.1)`) or a (nested) braced-init-list of floating-point literals with no suffix (e.g. `torch::tensor({{1.1, 2.2}})`) produces a tensor with dtype `at::kDouble`. After this PR, it produces a tensor with dtype `torch::get_default_dtype()`, matching Python `torch.tensor` behavior.
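A minimal sketch of the new default (values are illustrative):
```cpp
auto t = torch::tensor({{1.1, 2.2}});
// t.dtype() now equals torch::get_default_dtype() (kFloat unless changed),
// instead of always being at::kDouble.
```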
Test Plan: Imported from OSS
Differential Revision: D18465819
Pulled By: yf225
fbshipit-source-id: 6834fe50335c677bc3832f2a5e9cf8d1ede9f665
Summary:
This PR changes the implementation of C++ Conv{1,2,3}d layers to exactly match the Python version, and adds F::conv{1,2,3}d functionals. For more thorough testing, I will rely on the parity test mechanism which uses values from `common_nn.py` to generate the inputs and options that we are interested in testing.
This PR is BC-breaking in the following way:
In `Conv{1,2,3}dOptions`:
- `with_bias` is renamed to `bias`.
- `input_channels` is renamed to `in_channels`.
- `output_channels` is renamed to `out_channels`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28917
Differential Revision: D18471526
Pulled By: yf225
fbshipit-source-id: 7a33f60654ad93cc2e043245e7ff9e0ef9da15b3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29584
In Python, `float` dtype is always 64-bit (https://stackoverflow.com/a/8216110), and the C++ equivalent APIs should take `double` dtype to match the bit length.
Test Plan: Imported from OSS
Differential Revision: D18436616
Pulled By: yf225
fbshipit-source-id: ece510bba6f089ccada03af216f4805bbd03f5f2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29404
This PR makes all non-input arguments to functionals part of its options parameters, so that we won't break backward compatibility even if we add or reorder some of the non-input arguments to functionals in the future.
Test Plan: Imported from OSS
Differential Revision: D18378526
Pulled By: yf225
fbshipit-source-id: f5cf6bdfb844e75bf94fdee58c121e0955631b6e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29483
Somehow, these macros were not necessary!
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D18427851
Pulled By: ezyang
fbshipit-source-id: 86e1d75d98342461c9a5afa1c30c14346188f7cc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29364
Currently, we use `torch::nn::*Options` both as module options and functional options. However, this makes it very hard to manage the parameters in `torch::nn::*Options`, because a module's constructor can take a different set of arguments than the module's equivalent functional (e.g. `torch.nn.BatchNorm1d` takes `num_features, eps=1e-5, momentum=0.1, affine=True, track_running_stats=True`, while `F::batch_norm` takes `running_mean, running_var, weight=None, bias=None, training=False, momentum=0.1, eps=1e-5`).
This PR resolves the above problem by making `F::*FuncOptions` a different class from `torch::nn::*Options` when necessary (i.e. when a module's constructor takes a different set of arguments than the module's equivalent functional). In the rest of the cases where the module constructor takes the same set of arguments as the module's equivalent functional, `F::*FuncOptions` is an alias of `torch::nn::*Options`.
Also as part of this PR, we change all functional options to pass-by-value, to make the semantics consistent across all functionals.
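A minimal sketch of a separated functional options class in use (tensor shapes are illustrative):
```cpp
namespace F = torch::nn::functional;

auto input = torch::randn({4, 5});
auto running_mean = torch::zeros({5});
auto running_var = torch::ones({5});
// BatchNormFuncOptions carries the functional-specific arguments, separate
// from the module's BatchNormOptions.
auto out = F::batch_norm(
    input, running_mean, running_var,
    F::BatchNormFuncOptions().momentum(0.1).eps(1e-5).training(false));
```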
Test Plan: Imported from OSS
Differential Revision: D18376977
Pulled By: yf225
fbshipit-source-id: 8d9c240d93bfd5af0165b6884fdc912476b1d06b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29360
This PR adds functional overloads that take the full set of arguments (instead of just Options) for the following functionals:
- fold
- linear
- loss
- normalization
- padding
These new functionals live in the `torch::nn::functional::detail` namespace and are only meant to be called from the module forward methods (i.e. they are not public API). This is in preparation for the future change where we make module Options and functional Options two different classes, because constructing a new functional Options object on every forward call would be silly and bad for performance.
Test Plan: Imported from OSS
Differential Revision: D18376975
Pulled By: yf225
fbshipit-source-id: 233cd940834dc9d0b5d4b89339ab7082ec042c3c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29359
This PR adds functional overloads that take the full set of arguments (instead of just Options) for the following functionals:
- pixelshuffle
- pooling
- upsampling
- vision
These new functionals live in the `torch::nn::functional::detail` namespace and are only meant to be called from the module forward methods (i.e. they are not public API). This is in preparation for the future change where we make module Options and functional Options two different classes, because constructing a new functional Options object on every forward call would be silly and bad for performance.
Test Plan: Imported from OSS
Differential Revision: D18376978
Pulled By: yf225
fbshipit-source-id: 4ea8d359e7efde0d741eff79faad6b24b2a5d804
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29358
This PR adds functional overloads that take the full set of arguments (instead of just Options) for the following functionals:
- activation
- batchnorm
- distance
- embedding
These new functionals live in the `torch::nn::functional::detail` namespace and are only meant to be called from the module forward methods (i.e. they are not public API). This is in preparation for the future change where we make module Options and functional Options two different classes, because constructing a new functional Options object on every forward call would be silly and bad for performance.
Test Plan: Imported from OSS
Differential Revision: D18376976
Pulled By: yf225
fbshipit-source-id: 0b254dc6340b6d6ac08c9f95d2b1c02b791b2f38
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28828
This updates torch::script::Module to more closely match the behavior
of nn.Module. In particular, it implements the (optionally recursive)
iterators that retrieve submodules, parameters, and buffers and makes
their names match the python versions.
This also removes the individual accessors for Parameter, Module, Buffer, etc.
and replaces them with a single `attr` function which is equivalent to
writing `a.foo` in Python (`setattr` emulates `a.foo = v`).
As we build out the user-facing API for TorchScript values this will end
up matching how an attribute is accessed on general objects.
This PR preserves the Python bindings for script::Module by emulating the
old API at the binding level. A followup will clean up the usage to more
directly match the C++ API.
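A minimal sketch of the unified accessor (file and attribute names are placeholders):
```cpp
torch::jit::script::Module module = torch::jit::load("model.pt");
// `attr` replaces the per-kind accessors and mirrors `a.foo` in Python.
torch::Tensor w = module.attr("weight").toTensor();
module.setattr("weight", w * 2);  // mirrors `a.weight = v`
```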
Test Plan: Imported from OSS
Differential Revision: D18197611
Pulled By: zdevito
fbshipit-source-id: 7ee4dcbb258605d1c988314b05d938423f1ccee5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29066
This PR is BC-breaking in the following way:
Previously, C++ `torch::tensor` with an integer literal or a braced-init-list of
integer literals produces a tensor with dtype being the type of the integer literal(s). After this PR, it always produces a tensor of dtype `at::kLong` (aka. int64_t), matching Python `torch.tensor` behavior.
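A minimal sketch (values are illustrative):
```cpp
auto t = torch::tensor({1, 2, 3});
// t.dtype() is now always torch::kLong, regardless of the literals' C++ type.
```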
Test Plan: Imported from OSS
Differential Revision: D18307248
Pulled By: yf225
fbshipit-source-id: 7a8a2eefa113cbb238f23264843bdb3b77fec668
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29126
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D18300420
Pulled By: ezyang
fbshipit-source-id: d9b3ec75098cdb54624e4f98d4c66db1f4ff62bd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28620
All Tensors are Variables now, they just happen to have requires_grad=False. Tensors ALWAYS have `VariableTensorId` in their type set.
When constructing this patch, I had to make decisions about what I would fix in this patch, and what I would leave for follow up PRs. Here is the cleanup that happens in this patch:
- The `is_variable` property is removed from TensorOptions. I removed this immediately because unlike Tensor::is_variable, TensorOptions::is_variable doesn't respect our VariableTensorId thread-local state. This means that there were a bunch of places where TensorOptions::is_variable was false, which is obviously bogus in the world when tensor and variable are merged. Instead of keeping the method as a function that always returns true, I just opted to remove it entirely (it's not public API.) All places we set `is_variable` are deleted.
- Knock on effect: there is no longer a separate DeprecatedTypeProperties for the variable and non-variable versions of type.
- Knock on effect: instead of asserting on TensorOptions::is_variable, instead we just test `at::impl::variable_is_excluded()`
- There is now only one copy of the cuDNN RNN dropout cache, not two (I'm not sure why we had two to begin with)
Some cleanup that doesn't happen in this patch:
- Eliminating unnecessary uses of `make_variable`
- Eliminating `Tensor::is_variable`
The most subtle part of this patch is retaining tracing behavior: the fact that everything is a Variable means that more code gets routed to VariableType than before; this can change traces. I identified two places where we didn't appropriately turn off VariableType, mostly factory functions:
- `torch.tensor` must turn off VariableType before invoking `at::empty` to construct the tensor, as it subsequently does direct data access
- `tensor_slow` (invoked when you pass a Python scalar to a tensor argument) must turn off VariableType before calling `scalar_to_tensor` so the scalar gets traced as constant, rather than as a call to `scalar_to_tensor`.
Honestly, these are all giant hacks, and should be replaced with a more specialized guard that just toggles tracing.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Reviewed By: dreiss
Differential Revision: D18171156
Pulled By: ezyang
fbshipit-source-id: 5b6a045beba37492647e350190f495114e86504d
Summary:
Adds the C++ API `clip_grad_value_` to the `torch::nn::utils` module.
Also fixes the indentation level error in the original test/test_nn.py.
Issue: https://github.com/pytorch/pytorch/issues/25883
Reviewer: yf225
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28736
Differential Revision: D18263807
Pulled By: yf225
fbshipit-source-id: 29282450bd2099df16925e1d0edd3d933f6eeb9b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28523
New features:
1. Previously, `torch::tensor({true, false, true})` throws `"tensor_cpu" not implemented for 'Bool'`. After this PR, it produces the correct bool tensor, matching the Python API behavior.
2. Tensors with zero-size dimensions are now supported, e.g. `torch::tensor({{}, {}})` produces a tensor with sizes `{2, 0}`, matching the Python API behavior.
BC-breaking bug fixes:
1. Previously, `torch::tensor({{1}, {2}})` produces a tensor of sizes `{2}`. After this PR, it produces a tensor of sizes `{2, 1}`, matching the Python API behavior.
2. Fixed semantics of `torch::tensor(1.1)`: it now returns a 0-dim tensor instead of a 1-dim tensor, matching the Python API behavior.
3. Previously, when passed a non-dtype `TensorOptions` to the `torch::tensor` constructor, it always produces a tensor of dtype `float`. After this PR, it produces tensor of different dtypes based on the dtype of the braced-init-list, matching the behavior of the no-options case.
```cpp
// Previously:
torch::tensor({1, 2, 3}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> float
torch::tensor({{1, 2, 3}}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> float
torch::tensor({1., 2., 3.}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> float
torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> float
// Now:
torch::tensor({1, 2, 3}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> int
torch::tensor({{1, 2, 3}}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> int
torch::tensor({1., 2., 3.}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> double
torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/*non-dtype-options*/)).dtype() -> double
// As comparison, currently:
torch::tensor({1, 2, 3}).dtype() -> int
torch::tensor({{1, 2, 3}}).dtype() -> int
torch::tensor({1., 2., 3.}).dtype() -> double
torch::tensor({{1., 2., 3.}}).dtype() -> double
```
Notes:
1. From now on, the behavior of `at::tensor(scalar_value)` (which produces a 1-dim tensor) would be different from `torch::tensor(scalar_value)` (which produces a 0-dim tensor). I will fix the behavior of `at::tensor(scalar_value)` in a follow-up PR.
2. From now on, the behavior of `at::tensor({1, 2, 3}, torch::TensorOptions(/*non-dtype-options*/))` (which produces a `float` tensor) would be different from `torch::tensor({1, 2, 3}, torch::TensorOptions(/*non-dtype-options*/))` (which produces an `int` tensor). I will fix this behavior of the `at::tensor` constructor in a follow-up PR.
Context for the changes in this PR:
The motivation comes from fixing the "`torch::tensor({{1}, {2}})` gives tensor of wrong sizes" bug - in order to fix it, I have to move the handling of `at::ArrayRef` and `std::vector` into `InitListTensor` (see below on why we need to do this) and renamed `InitListTensor` to `TensorDataContainer`. After such changes, support for bool values comes out of the box without extra effort, and support for tensors with zero-size dimensions only requires adding a default constructor for `TensorDataContainer`, so I added those two in this PR.
For the semantic change of `torch::tensor(1.1)`, it's actually more effort to preserve the original wrong behavior (i.e. we need to check the sizes of the tensor converted from `TensorDataContainer` and reshape any scalar tensor to a 1-D tensor). I think preserving the original wrong behavior doesn't give us much value, and since the above changes naturally fix the problem, we should just start using the right behavior instead.
For the "constructor with non-dtype options behavior" fix, the code looks simpler and easier to reason about with the fix, so I included it in this PR.
--------
Why we need to move the handling of `at::ArrayRef` and `std::vector` into `TensorDataContainer`:
`torch::tensor({{1}, {2}})` can match this function overload:
`torch::tensor(at::ArrayRef<int> values)`, because `{1}` and `{2}` can be treated as
a list-initialization of an `int` value. However, this will produce a Tensor with sizes `{2}`,
but we actually want a Tensor with sizes `{2, 1}`. In order to avoid matching this function overload,
we removed the function overload and moved the ability to convert `at::ArrayRef<T>`
(and similarly `std::vector<T>`) into `TensorDataContainer`, and since for braced-init-list the
`TensorDataContainer(std::initializer_list<TensorDataContainer>)` constructor is always preferred over all other constructors, it will take the `std::initializer_list` path, and all is good.
Test Plan: Imported from OSS
Differential Revision: D18234625
Pulled By: yf225
fbshipit-source-id: 0f3f6912e82e2117d2103e31b74e7e97baaa8693
Summary:
Adds `torch::nn::functional::fold` support and updates `Fold::pretty_print` in the C++ API for more thorough Python parity.
Note: Small updates in source files to maintain consistency elsewhere.
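A minimal usage sketch of the new functional (shapes follow the fold contract: L = 3 * 4 blocks for a 4x5 output with a 2x2 kernel):
```cpp
namespace F = torch::nn::functional;

auto input = torch::randn({1, 12, 12});  // (N, C * prod(kernel_size), L)
// FoldFuncOptions(output_size, kernel_size)
auto out = F::fold(input, F::FoldFuncOptions({4, 5}, {2, 2}));  // -> {1, 3, 4, 5}
```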
Reviewer: yf225
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28732
Differential Revision: D18219955
Pulled By: yf225
fbshipit-source-id: fd2e9be8f17db77c1b1f384c0d2e16cc34858c0c
Summary:
Before, we would only give the key we are looking for (i.e. typically
just "No such serialized tensor 'weight'", no matter for which submodule
we were looking for a weight).
Now we error with "No such serialized tensor '0.conv1.weight'" or
similar.
The analogous information is added to missing module error messages.
I threw in a test, and it saved me already...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28499
Differential Revision: D18122442
Pulled By: yf225
fbshipit-source-id: a134b6d06ca33de984a11d6fea923244bcd9fb95
Summary:
Add torch::nn::BatchNorm1d function/module support for the C++ API.
torch::nn::BatchNorm{2,3}d will be added after this PR is merged.
Related Issue: https://github.com/pytorch/pytorch/issues/25883
Reviewer: yf225
I would like to discuss about below items.
* Necessity of `num_batches_tracked` in `BatchNormImplBase`
- `num_batches_tracked` is needed to calculate `momentum` when the `momentum` argument is not provided in the Python API. But in the C++ API, the `momentum` argument has a default value.
- `num_batches_tracked` is only used for counting `BatchNorm1d::forward()` calls. I think it is not necessary for the user anymore.
* The design of `BatchNorm{1,2,3}dOptions`
- We already have `BatchNormOptions`, used for the deprecated `BatchNorm` module. However, it is hard to reuse it for `BatchNorm{1,2,3}dOptions` because the arguments of the modules disagree.
* In this PR, I introduce `BatchNormOptionsv2` template class for the `BatchNorm{1,2,3}dOptions`. But I'm not sure this design is good or not.
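For reference, a minimal usage sketch assuming the options class ends up named `BatchNorm1dOptions` (the design question above was still open; values are illustrative):
```cpp
torch::nn::BatchNorm1d bn(
    torch::nn::BatchNorm1dOptions(100).momentum(0.1).affine(true));
auto y = bn(torch::randn({20, 100}));  // (batch, num_features)
```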
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28176
Differential Revision: D18196843
Pulled By: yf225
fbshipit-source-id: 667e2b5de4150d5776c41b9088c9e6c2ead24cd4
Summary:
I finally found a way to get the following API to work for constructing a list of named submodules for `Sequential`:
```cpp
Sequential sequential({
  {"m1", MyModule(1)},
  {"m2", MyModule(2)}
});
```
which was actually our original proposed design and much simpler than our current API:
```cpp
Sequential sequential(modules_ordered_dict({
  {"m1", MyModule(1)},
  {"m2", MyModule(2)}
}));
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28774
Differential Revision: D18174013
Pulled By: yf225
fbshipit-source-id: 3a18c2d36b6a65a07bee7346a7516780567c7774
Summary:
Based on the discussion in https://github.com/pytorch/pytorch/pull/28413#discussion_r338839489, putting anything that's not tagged as `public:` below a `TORCH_ARG` line would hide it under `private:`. To get around this problem, we should move the `mode_t` declaration to the top of the PadOptions declaration.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28760
Differential Revision: D18165117
Pulled By: yf225
fbshipit-source-id: cf39c0a893822264cd6a64cd887729afcd84dbd0
Summary:
This PR is BC-breaking in the following way:
Previously, we required the use of `std::string` to specify the mode for `EmbeddingBag`. After this PR, we use variant-based enums such as `torch::kSum` / `torch::kMean` / `torch::kMax` to specify the mode for `EmbeddingBag`.
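A minimal sketch of the enum-based mode (sizes and indices are illustrative):
```cpp
// torch::kSum replaces the old string mode.
torch::nn::EmbeddingBag bag(
    torch::nn::EmbeddingBagOptions(10, 3).mode(torch::kSum));
auto input = torch::tensor({1, 2, 4, 5, 4, 3, 2, 9});
auto offsets = torch::tensor({0, 4});
auto out = bag(input, offsets);  // -> {2, 3}
```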
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28330
Differential Revision: D18127116
Pulled By: yf225
fbshipit-source-id: 15cd86c764777f4d399587be92cda15b6ce8524b
Summary:
https://github.com/pytorch/pytorch/issues/25883
I put grid_sample in vision.h with affine_grid.
I have a question about the string arguments (interpolation mode, padding mode):
I reuse torch::native::detail::GridSamplerInterpolation in GridSampler.h instead of using strings.
This follows the way the reduction enums are used in loss functions.
I am not sure this is right.
yf225
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28354
Differential Revision: D18109333
Pulled By: yf225
fbshipit-source-id: 1bf972b671b107464f73b937bbe0de76fb259fbf
Summary:
Sequential does not like modules added to it taking Tensor&
(const Tensor& and Tensor are both OK).
Functional and others use Tensor when they want to potentially
change things in-place.
This changes ReLU and friends to also do that.
Unfortunately, this seems to be BC-breaking at the ABI level.
On the other hand, use of the ReLU module seems rare enough outside
Sequential (in particular, in C++ models the standard seems to be
to use torch::relu instead).
Is the BC breaking OK here? (yf225 or anyone else)
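A minimal sketch of the value-taking module composing with Sequential (layer sizes are illustrative):
```cpp
// ReLU's forward now takes Tensor (not Tensor&), so it composes with Sequential:
torch::nn::Sequential seq(
    torch::nn::Linear(4, 4),
    torch::nn::ReLU(torch::nn::ReLUOptions().inplace(true)));
auto y = seq->forward(torch::randn({2, 4}));
```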
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28501
Differential Revision: D18089978
Pulled By: yf225
fbshipit-source-id: ac9aba6dc2081117dece57cd8a15bafe14ec8f51
Summary:
This PR adds `MSELoss`, `KLDivLoss` and `BCELoss`. The tests for `BCELoss` fail with the following error:
```
unknown file: Failure
C++ exception with description "autograd_meta() INTERNAL ASSERT FAILED at /home/shahriar/Contrib/pytorch/c10/core/TensorImpl.h:533, please report a bug to PyTorch. set_requires_grad is not implemented for Tensor (set_requires_grad at /home/shahriar/Contrib/pytorch/c10/core/TensorImpl.h:533)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27156
Differential Revision: D17960323
Pulled By: yf225
fbshipit-source-id: 84b8431064f2f573679c03a8d7994e3e2f81a4d1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27260
This PR has the following changes:
- Slot class is removed. In all use cases except `lower_graph` we really
just needed the attribute name and thus having an extra layer of
abstraction through Slot only made the code harder to understand.
- get_parameters, get_attributes, get_modules, and get_slots now return
a list of <name, item> pairs instead of a list of Slots.
Differential Revision: D17728910
Test Plan: Imported from OSS
Pulled By: ZolotukhinM
fbshipit-source-id: 94781611752dd88e7fddfe8b8e0252d6ec32ba68
Summary:
Adds support for the Bilinear layer to the C++ frontend
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26082
Differential Revision: D17954148
Pulled By: yf225
fbshipit-source-id: 5e746bdea29b00e25969cd7a22044b8059b53687
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28039
Right now, torch::save() uses std::ostream, which results in unnecessary
data copies in practice. Similar for torch::load().
Adding a std::function<size_t(const void*, size_t)> as an output option,
parallel to the existing filename and std::ostream apis, gives users the
flexibility to emit directly to a backing store.
For a simple case of appending the output to a std::string, we observe
significant benchmark savings (on order of -50%), even with the
minor std::function<> dispatch overhead. The main reason is that
std::ostringstream effectively requires 2 extra copies of the data
beyond a simple string.append lambda.
We also provide a parallel api for the load(), though this one is
slightly more complex due to the need to do arbitrary position reads.
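A minimal sketch of the callback overload (appending to a std::string, as in the description above):
```cpp
std::string out;
torch::save(torch::ones({2, 2}), [&](const void* buf, size_t n) {
  out.append(static_cast<const char*>(buf), n);
  return n;
});
```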
Test Plan:
buck test mode/dev-nosan caffe2/test/...
(Basic serialization test in caffe2/test/cpp/api/serialize.cpp)
Benchmark in experimental/jeremyl/c2/SerializationBench.cpp, with D17823443
(1M time goes from 90ms -> 40ms, albeit with crc patch applied)
Differential Revision: D17939034
fbshipit-source-id: 344cce46f74b6438cb638a8cfbeccf4e1aa882d7
Summary:
Right now, torch::save() uses std::ostream, which results in unnecessary
data copies in practice. Similar for torch::load().
Adding a std::function<size_t(const void*, size_t)> as an output option,
parallel to the existing filename and std::ostream apis, gives users the
flexibility to emit directly to a backing store.
For a simple case of appending the output to a std::string, we observe
significant benchmark savings (on order of -50%), even with the
minor std::function<> dispatch overhead. The main reason is that
std::ostringstream effectively requires 2 extra copies of the data
beyond a simple string.append lambda.
We also provide a parallel api for the load(), though this one is
slightly more complex due to the need to do arbitrary position reads.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27586
Test Plan:
buck test mode/dev-nosan caffe2/test/...
(Basic serialization test in caffe2/test/cpp/api/serialize.cpp)
Benchmark in experimental/jeremyl/c2/SerializationBench.cpp, with D17823443
(1M time goes from 90ms -> 40ms, albeit with crc patch applied)
Differential Revision: D17822962
Pulled By: jjlilley
fbshipit-source-id: d344a7e59707f3b30d42280fbab78f87399e4d10
Summary:
`at::ArrayRef` / `torch::IntArrayRef` should be discouraged in user code, because users might not be aware of the fact that it doesn't own the underlying data, which already leads to memory access bugs when they try to write the following:
```cpp
auto expected_sizes = torch::IntArrayRef({2, 16, 6}); // The memory that represents `{2, 16, 6}` is released after this line
ASSERT_EQ(output.sizes(), expected_sizes); // `expected_sizes` is pointing to invalid memory region
```
This PR changes all usage of `at::ArrayRef` and `torch::IntArrayRef` to the corresponding `std::vector` version, so that users won't pick up the habit of using `ArrayRef` by looking at the test code.
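A minimal sketch of the owning replacement (assuming gtest's ASSERT_EQ and an `output` tensor from the surrounding test):
```cpp
std::vector<int64_t> expected_sizes = {2, 16, 6};  // owns its storage
ASSERT_EQ(output.sizes(), torch::IntArrayRef(expected_sizes));
```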
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27884
Differential Revision: D17921646
Pulled By: yf225
fbshipit-source-id: 461e79fc22b598aac230d36cc028085ce6cbe937
Summary:
In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `MultiLabelSoftMarginLoss` module and `multilabel_soft_margin_loss` functional.
It looks like there isn't a C++ ATen implementation of `multilabel_soft_margin_loss`, so I translated the Python version, which does not rely on a C/C++ backend either.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27669
Differential Revision: D17907608
Pulled By: yf225
fbshipit-source-id: ccb02951e009973c2adbe604593ce929f10c39eb
Summary:
Addresses https://github.com/pytorch/pytorch/issues/27048
PR Summary:
Files Added:
- torch/csrc/api/include/torch/nn/options/normalization.h
- torch/csrc/api/include/torch/nn/functional/normalization.h
Files Modified:
- test/cpp/api/functional.cpp
- torch/csrc/api/include/torch/nn/functional.h
---
yf225: I couldn't find a C++ equivalent of gradcheck(); is there such a function, or is it sufficient to call .backward() in the test body? I don't think any solutions are checked for the Python tests.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27280
Differential Revision: D17902109
Pulled By: yf225
fbshipit-source-id: 1bce1a88103d0f1848633fec90fde95ea8f3d1ed
Summary:
Hi yf225, I had to create a new branch to tackle a merge conflict, since I am working in the cloud due to some limitations on my PC and therefore don't have full command of the environment.
Also, I have incorporated the changes you had put up before here:
https://github.com/pytorch/pytorch/pull/27613
Also, it would be great if you could recommend some resources for working smoothly on GCP. :-D
Thank you
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27713
Differential Revision: D17899695
Pulled By: yf225
fbshipit-source-id: eb6643223148774a5cbbd093bdcc5623872e5bba
Summary:
The impl class of `torch::nn` layers must always subclass from `torch::nn::Cloneable`, otherwise `module->clone()` doesn't work on them. This PR fixes layers that don't conform to this rule.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27770
Differential Revision: D17893051
Pulled By: yf225
fbshipit-source-id: 37cdf8c09e22f0f164cbd0e8700965a1778ec4c1
Summary:
Fixes https://github.com/pytorch/pytorch/issues/27605: The C++ L-BFGS Optimizer will not work properly if there are one or more registered tensors with no grad in the model:
```
terminate called after throwing an instance of 'c10::Error'
what(): There were no tensor arguments to this function (e.g., you passed an empty list of Tensors), but no fallback function is registered for schema aten::view. This usually means that this function requires a non-empty list of Tensors. Available functions are [CUDATensorId, QuantizedCPUTensorId, VariableTensorId, CPUTensorId, MkldnnCPUTensorId] (lookup_ at /pytorch/aten/src/ATen/core/dispatch/DispatchTable.h:245)
```
Added guards of the form `if (!parameter.grad().defined()) { ... }` in `torch/csrc/api/src/optim/lbfgs.cpp`.
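A hedged sketch of the guard pattern (loop body illustrative, not the PR's exact code):
```cpp
// Skip parameters whose gradient was never populated instead of
// flattening an undefined tensor (which triggers the error above).
for (auto& parameter : parameters) {
  if (!parameter.grad().defined()) {
    continue;  // this tensor never received a gradient
  }
  auto flat_grad = parameter.grad().view(-1);  // safe: grad is defined here
  // ... accumulate flat_grad as before ...
}
```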
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27606
Differential Revision: D17866550
Pulled By: yf225
fbshipit-source-id: bcaf0bf75b93c57304856b03d8984c1617ebbfef
Summary:
Hi yf225, here is the C++ frontend API `MultiMarginLoss` implementation and tests for https://github.com/pytorch/pytorch/issues/27198. Could you review it and tell me if it is okay?
I am not entirely sure I used `c10::optional` correctly, but `options.weight()` resulted in a compilation error, so I went with `options.weight().value()` instead of `value_or()`, to follow the logic in `torch.nn._WeightedLoss.register_buffer` (where one can pass a `None` value).
Oh, and are the tests supposed to be skipped, or did I do something wrong? I ran `pytest test/test_cpp_api_parity.py -k Loss -v`, and the `L1Loss` test passed but the others were skipped...
Thank you for the review in any case!
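For what it's worth, a hedged sketch of one common way to handle an optional weight (helper name hypothetical; not necessarily how this PR does it):
```cpp
#include <torch/torch.h>

// Apply an optional per-class weight; an absent/undefined weight is a no-op,
// mirroring the Python _WeightedLoss behavior where weight may be None.
torch::Tensor apply_optional_weight(
    torch::Tensor loss,
    const c10::optional<torch::Tensor>& weight) {
  if (weight.has_value() && weight->defined()) {
    loss = loss * *weight;
  }
  return loss;
}
```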
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27424
Differential Revision: D17839963
Pulled By: yf225
fbshipit-source-id: f4b6012590cf22d56d42751c214df80cce717cb8
Summary:
Added more options to `EmbeddingOptions` and updated the `EmbeddingImpl` `reset` and `forward` functions. Also added `EmbeddingBag`.
-----
This PR is BC-breaking in the following way:
Previously, `EmbeddingOptions` supports `count` and `dimension` as options arguments. After this PR, they are renamed to `num_embeddings` and `embedding_dim` respectively.
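A usage sketch with the new option names (kept minimal; mirrors the Python constructor):
```cpp
#include <torch/torch.h>

// nn.Embedding(num_embeddings=10, embedding_dim=3) in Python becomes:
auto embedding = torch::nn::Embedding(
    torch::nn::EmbeddingOptions(/*num_embeddings=*/10, /*embedding_dim=*/3));
auto out = embedding->forward(torch::tensor({1, 2, 4}, torch::kLong));  // 3x3
```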
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26358
Differential Revision: D17714337
Pulled By: yf225
fbshipit-source-id: f9f969c68e4bece106b92f8e2e02ac39c8455fb7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26140
Per https://github.com/pytorch/pytorch/issues/25883, we want to work
towards C++/Python API parity. This diff adds `clip_grad_norm_` to the C++ API to improve parity.
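A hedged usage sketch, assuming the C++ function mirrors `torch.nn.utils.clip_grad_norm_`:
```cpp
#include <torch/torch.h>

// After backward(), clip the total gradient norm before the optimizer step.
auto model = torch::nn::Linear(8, 8);
// ... forward pass and loss.backward() ...
torch::nn::utils::clip_grad_norm_(model->parameters(), /*max_norm=*/1.0);
```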
ghstack-source-id: 91334333
Test Plan: Added a unit test
Differential Revision: D17312367
fbshipit-source-id: 753ba3a4d084d01f3cc8919da3108e67c809ad65
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27177
Add support for the `F::one_hot` C++ function.
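A hedged usage sketch, assuming the functional mirrors `torch::one_hot`:
```cpp
#include <torch/torch.h>
namespace F = torch::nn::functional;

// Indices -> one-hot rows; output is a {3, 3} tensor of kLong.
auto idx = torch::tensor({0, 2, 1}, torch::kLong);
auto onehot = F::one_hot(idx, /*num_classes=*/3);
```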
Test Plan:
Added 3 new tests to verify the API is working.
Imported from OSS
Differential Revision: D17697934
fbshipit-source-id: a8127fb87c00daa119bb92a5702bc4bbba48290d
Summary:
This PR adds temporary declarations for `torch::k{name}` enums so that we can submit a PR to rename the enum usage in torchvision. Then, after the changes to torchvision are done, we can remove the temporary declarations in https://github.com/pytorch/pytorch/pull/26837 to officially move over to using `c10::variant` for enums.
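For context, a sketch of the `c10::variant`-based enum style this migration moves toward (the alias name is illustrative):
```cpp
#include <c10/util/variant.h>
#include <torch/torch.h>

// An options field holds exactly one of several tag types; torch::kMean etc.
// are values of the corresponding torch::enumtype tag structs.
using reduction_t = c10::variant<
    torch::enumtype::kNone,
    torch::enumtype::kMean,
    torch::enumtype::kSum>;
reduction_t reduction = torch::kMean;
```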
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27051
Differential Revision: D17672220
Pulled By: yf225
fbshipit-source-id: 4ae77634e8c7efa3404698f7c1a69177cbb5dab3
Summary:
This PR fixes https://github.com/pytorch/pytorch/issues/24192 by including the private field `iteration_` in SGD optimizer serialization. Under the hood, `iteration_` is serialized into an `IValue`, then stored in a JIT module as an attribute.
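Conceptually (a hedged sketch, not the PR's exact code), the round trip looks like this:
```cpp
#include <ATen/core/ivalue.h>

// Serialization: box the counter as an IValue (the container JIT module
// attributes are stored in); deserialization unboxes it again.
int64_t iteration = 42;
c10::IValue boxed(iteration);      // int64 -> IValue attribute value
int64_t restored = boxed.toInt();  // IValue -> int64 on load
```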
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26906
Differential Revision: D17628359
Pulled By: yf225
fbshipit-source-id: beec1367459e973a1c9080dc86f502e4c7bc5ebd
Summary:
This PR makes the following improvements:
1. Add `forward_with_indices` method to all C++ MaxPool modules, to return the max indices along with the outputs (usage sketch after this list). (We can't make two `forward` methods that return different types based on input, because that will break the type deduction of `torch::detail::return_type_of_forward_t`)
2. Add `max_poolNd_with_indices` to `torch::nn::functional`, to be used when indices of the max values are needed. (We can't merge this with `torch::nn::functional::max_poolNd` because the return type of `max_poolNd` has to be defined statically).
3. Improve `pretty_print` of C++ MaxPoolNd and AvgPoolNd modules to match the Python `extra_repr`.
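A minimal usage sketch of item 1's `forward_with_indices` (illustrative):
```cpp
#include <torch/torch.h>

auto pool = torch::nn::MaxPool2d(torch::nn::MaxPool2dOptions(2));
auto x = torch::randn({1, 1, 4, 4});
torch::Tensor out, indices;
// Returns both the pooled values and the argmax locations.
std::tie(out, indices) = pool->forward_with_indices(x);
```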
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26521
Differential Revision: D17507358
Pulled By: yf225
fbshipit-source-id: b6c0e2b27b38378cdc0c75f4bfc797b3c6b17cd9
Summary:
This ensures that `F::cosine_similarity` and `F::pairwise_distance` can be used simply by including `torch/torch.h` and setting `namespace F = torch::nn::functional`.
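A hedged usage sketch (option type names may differ across versions):
```cpp
#include <torch/torch.h>
namespace F = torch::nn::functional;

auto a = torch::randn({8, 16});
auto b = torch::randn({8, 16});
auto sim  = F::cosine_similarity(a, b, F::CosineSimilarityFuncOptions().dim(1));
auto dist = F::pairwise_distance(a, b, F::PairwiseDistanceFuncOptions().p(2));
```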
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26559
Differential Revision: D17507421
Pulled By: yf225
fbshipit-source-id: f895dde3634d5c8ca66ee036903e327e5cdab6b1
Summary:
With this PR, we establish the following conventions:
1. Options in C++ module / optimizer constructors should always be `const SomeOptions&` type, not `SomeOptions` type.
2. The options constructor arg should always be named `options_`, not `options`, to not be confused with the module / optimizer's internal field `options`.
3. We never use `std::move` to assign `options_` to the module / optimizer's internal field `options` in the constructor definition. Instead, we simply use `options(options_)`.
Here is the reasoning:
We might be tempted to declare the constructor as `SomeModule(SomeOptions options_)` and have `options(std::move(options_))` in the member initialization list. However, this can be a dangerous design because the constructor might use `options_` to set values for other member fields in the member initialization list (e.g. 8317f75b79/torch/csrc/api/include/torch/optim/lbfgs.h (L30-L34)), and use-after-move can cause hard-to-debug problems.
Instead, we choose to explicitly use `const SomeOptions&` type for `options_`, and never use `std::move` to assign it to the internal `options` field. This way we have stronger guarantee on the validity of `options_` at any point in the constructor.
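A minimal sketch of the convention (types and names hypothetical):
```cpp
#include <torch/torch.h>

struct MyModuleOptions {
  int64_t size = 0;
};

// Options arrive as a const reference named options_ and are copied, never
// moved, so options_ stays valid for any later member initializers.
struct MyModuleImpl : torch::nn::Module {
  explicit MyModuleImpl(const MyModuleOptions& options_)
      : options(options_) {}  // copy; no std::move, no use-after-move risk
  MyModuleOptions options;
};
```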
Notable exceptions to the above conventions:
1. C++ Embedding module doesn't adhere to the conventions now, which will be fixed after https://github.com/pytorch/pytorch/pull/26358 is landed.
2. C++ dataloader and dataset classes likely need similar changes. We will do it when we start to work on dataloader/dataset parity.
Thanks ShahriarSS for discovering the options usage inconsistency! 🚀
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26483
Differential Revision: D17500451
Pulled By: yf225
fbshipit-source-id: 49361a3519e4ede933789db75731d40144f0b617
Summary:
The implementation of several modules in C++ frontend currently has calls to `options.name_`, which is bad practice because `options.name_` should be a private options field and we should use `options.name()` to access its value. This PR makes `options.name_` actually private and changes all callsites of `options.name_` to `options.name()`.
After this change, we can change all module options to have a map as the underlying data structure, and require that all options must be able to be stored in `c10::IValue`. These changes together would make serializing module options much easier.
Note that this PR is BC-breaking in the following way:
Previously, calling `options.name_` in C++ module implementation works because `options.name_` was a public field. After this PR, `options.name_` becomes private, and to get the value of `options.name_` we should call `options.name()`, and to set the value of `options.name_` we should call `options.name(new_value)`.
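A sketch of the accessor pattern this implies (field name illustrative):
```cpp
struct SampleOptions {
  // Setter: options.size(64) writes the private field and returns *this
  // so calls can be chained.
  SampleOptions& size(int64_t new_size) {
    size_ = new_size;
    return *this;
  }
  // Getter: options.size() reads the private field.
  int64_t size() const { return size_; }

 private:
  int64_t size_ = 0;  // private after this PR; direct access no longer compiles
};
```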
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26419
Differential Revision: D17481507
Pulled By: yf225
fbshipit-source-id: 93e4ed0e1d79ef57104ad748809d03e25da61ed3
Summary:
The Pickler previously had a distinction between tensors that would be inlined in a single pickle binary (matching the format of `torch.save()`) and tensors that are saved elsewhere with only a reference stored in the binary. This PR moves that distinction out to `torch::pickle_save` to match the eager Python interface.
The change can be seen in `register_prim_ops.cpp`, where the call to `jit::pickle` is now `torch::pickle_save`.
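A hedged usage sketch of the eager-style entry point:
```cpp
#include <torch/torch.h>

// Round-trip a tensor through the pickle format, analogous to
// torch.save()/torch.load() on a BytesIO in Python.
torch::Tensor t = torch::ones({2, 2});
std::vector<char> bytes = torch::pickle_save(t);            // IValue -> bytes
torch::Tensor back = torch::pickle_load(bytes).toTensor();  // bytes -> IValue
```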
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25502
Pulled By: driazati
Differential Revision: D17175215
fbshipit-source-id: 8c9a21327cc79eaf6a0e488ea99e305be52f82b1
Summary:
This PR aims to re-organize C++ API `torch::nn` folder structure in the following way:
- Every module in `torch/csrc/api/include/torch/nn/modules/` (except `any.h`, `named_any.h`, `modulelist.h`, `sequential.h`, `embedding.h`) has a strictly equivalent Python file in `torch/nn/modules/`. For example:
`torch/csrc/api/include/torch/nn/modules/pooling.h` -> `torch/nn/modules/pooling.py`
`torch/csrc/api/include/torch/nn/modules/conv.h` -> `torch/nn/modules/conv.py`
`torch/csrc/api/include/torch/nn/modules/batchnorm.h` -> `torch/nn/modules/batchnorm.py`
`torch/csrc/api/include/torch/nn/modules/sparse.h` -> `torch/nn/modules/sparse.py`
- Containers such as `any.h`, `named_any.h`, `modulelist.h`, `sequential.h` are moved into `torch/csrc/api/include/torch/nn/modules/container/`, because their implementations are too long to be combined into one file (like `torch/nn/modules/container.py` in the Python API)
- `embedding.h` is not renamed to `sparse.h` yet, because we have another work stream that works on API parity for Embedding and EmbeddingBag, and renaming the file would cause conflict. After the embedding API parity work is done, we will rename `embedding.h` to `sparse.h` to match the Python file name, and move the embedding options out to options/ folder.
- `torch/csrc/api/include/torch/nn/functional/` is added, and the folder structure mirrors that of `torch/csrc/api/include/torch/nn/modules/`. For example, `torch/csrc/api/include/torch/nn/functional/pooling.h` contains the functions for pooling, which are then used by the pooling modules in `torch/csrc/api/include/torch/nn/modules/pooling.h`.
- `torch/csrc/api/include/torch/nn/options/` is added, and the folder structure mirrors that of `torch/csrc/api/include/torch/nn/modules/`. For example, `torch/csrc/api/include/torch/nn/options/pooling.h` contains MaxPoolOptions, which is used by both MaxPool modules in `torch/csrc/api/include/torch/nn/modules/pooling.h`, and max_pool functions in `torch/csrc/api/include/torch/nn/functional/pooling.h`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26262
Differential Revision: D17422426
Pulled By: yf225
fbshipit-source-id: c413d2a374ba716dac81db31516619bbd879db7f
Summary:
This PR adds `unregister_module` to `nn::Module` and an `erase` function to `OrderedDict`.
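A minimal usage sketch of both additions:
```cpp
#include <torch/torch.h>

// Remove a registered child module by name.
auto model = std::make_shared<torch::nn::Module>();
model->register_module("fc", torch::nn::Linear(4, 4));
model->unregister_module("fc");  // "fc" is no longer a child of model

// Remove an OrderedDict entry by key.
torch::OrderedDict<std::string, int> dict;
dict.insert("a", 1);
dict.erase("a");
```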
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26088
Differential Revision: D17360058
Pulled By: yf225
fbshipit-source-id: f1f375b4751317da85b8da1458e092fe2405ceec