pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
yanbing-j	c7a7c2b62f	Enable Gelu fp32/bf16 in CPU path using Mkldnn implementation (#58525 ) Summary: Enable Gelu bf16/fp32 in CPU path using Mkldnn implementation. User doesn't need to_mkldnn() explicitly. New Gelu fp32 performs better than original one. Add Gelu backward for https://github.com/pytorch/pytorch/pull/53615. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58525 Reviewed By: ejguan Differential Revision: D29940369 Pulled By: ezyang fbshipit-source-id: df9598262ec50e5d7f6e96490562aa1b116948bf	2021-08-03 06:52:23 -07:00
Nikita Shulga	a9b0a921d5	Disable `avoid-non-const-global-variables` lint check (#62008 ) Summary: As GoogleTest `TEST` macro is non-compliant with it as well as `DEFINE_DISPATCH` All changes but the ones to `.clang-tidy` are generated using following script: ``` for i in `find . -type f -iname ".c" -or -iname "*.h"\|xargs grep cppcoreguidelines-avoid-non-const-global-variables\|cut -f1 -d:\|sort\|uniq`; do sed -i "/\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-non-const-global-variables)/d" $i; done ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/62008 Reviewed By: driazati, r-barnes Differential Revision: D29838584 Pulled By: malfet fbshipit-source-id: 1b2f8602c945bd4ce50a9bfdd204755556e31d13	2021-07-22 18:04:40 -07:00
Thomas J. Fan	c16f87949f	ENH Adds nn.ReflectionPad3d (#59791 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/27655 This PR adds a C++ and Python version of ReflectionPad3d with structured kernels. The implementation uses lambdas extensively to better share code from the backward and forward pass. Pull Request resolved: https://github.com/pytorch/pytorch/pull/59791 Reviewed By: gchanan Differential Revision: D29242015 Pulled By: jbschlosser fbshipit-source-id: 18e692d3b49b74082be09f373fc95fb7891e1b56	2021-06-21 10:53:14 -07:00
Adnios	09a8f22bf9	Add mish activation function (#58648 ) Summary: See issus: https://github.com/pytorch/pytorch/issues/58375 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58648 Reviewed By: gchanan Differential Revision: D28625390 Pulled By: jbschlosser fbshipit-source-id: 23ea2eb7d5b3dc89c6809ff6581b90ee742149f4	2021-05-25 10:36:21 -07:00
Nikita Shulga	3a66a1cb99	[clang-tidy] Exclude cppcoreguidelines-avoid-magic-numbers (#57841 ) Summary: Add cppcoreguidelines-avoid-magic-numbers exclusion to clang-tidy Remove existing nolint warnings using following script: ``` for file in `git ls-files \| grep -v \.py`; do gsed '/^ *\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-magic-numbers)/d' -i $file; done ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/57841 Reviewed By: samestep Differential Revision: D28295045 Pulled By: malfet fbshipit-source-id: 7c6e8d1213c9593f169ed3df6a916498f1a97163	2021-05-07 20:02:33 -07:00
Nikita Shulga	4cb534f92e	Make PyTorch code-base clang-tidy compliant (#56892 ) Summary: This is an automatic change generated by the following script: ``` #!/usr/bin/env python3 from subprocess import check_output, check_call import os def get_compiled_files_list(): import json with open("build/compile_commands.json") as f: data = json.load(f) files = [os.path.relpath(node['file']) for node in data] for idx, fname in enumerate(files): if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'): files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')] return files def run_clang_tidy(fname): check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname,"-s"]) changes = check_output(["git", "ls-files", "-m"]) if len(changes) == 0: return check_call(["git", "commit","--all", "-m", f"NOLINT stubs for {fname}"]) def main(): git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n") compiled_files = get_compiled_files_list() for idx, fname in enumerate(git_files): if fname not in compiled_files: continue if fname.startswith("caffe2/contrib/aten/"): continue print(f"[{idx}/{len(git_files)}] Processing {fname}") run_clang_tidy(fname) if __name__ == "__main__": main() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892 Reviewed By: H-Huang Differential Revision: D27991944 Pulled By: malfet fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179	2021-04-28 14:10:25 -07:00
Kurt Mohler	3fe4718d16	Add `padding_idx` argument to EmbeddingBag (#49237 ) Summary: This PR adds a `padding_idx` parameter to `nn.EmbeddingBag` and `nn.functional.embedding_bag`. As with `nn.Embedding`'s `padding_idx` argument, if an embedding's index is equal to `padding_idx` it is ignored, so it is not included in the reduction. This PR does not add support for `padding_idx` for quantized or ONNX `EmbeddingBag` for opset10/11 (opset9 is supported). In these cases, an error is thrown if `padding_idx` is provided. Fixes https://github.com/pytorch/pytorch/issues/3194 Pull Request resolved: https://github.com/pytorch/pytorch/pull/49237 Reviewed By: walterddr, VitalyFedyunin Differential Revision: D26948258 Pulled By: jbschlosser fbshipit-source-id: 3ca672f7e768941f3261ab405fc7597c97ce3dfc	2021-04-14 09:38:01 -07:00
Peter Bell	04e0cbf5a9	Add padding='same' mode to conv{1,2,3}d (#45667 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45667 First part of #3867 (Pooling operators still to do) This adds a `padding='same'` mode to the interface of `conv{n}d`and `nn.Conv{n}d`. This should match the behaviour of `tensorflow`. I couldn't find it explicitly documented but through experimentation I found `tensorflow` returns the shape `ceil(len/stride)` and always adds any extra asymmetric padding onto the right side of the input. Since the `native_functions.yaml` schema doesn't seem to support strings or enums, I've moved the function interface into python and it now dispatches between the numerically padded `conv{n}d` and the `_conv{n}d_same` variant. Underscores because I couldn't see any way to avoid exporting a function into the `torch` namespace. A note on asymmetric padding. The total padding required can be odd if both the kernel-length is even and the dilation is odd. mkldnn has native support for asymmetric padding, so there is no overhead there, but for other backends I resort to padding the input tensor by 1 on the right hand side to make the remaining padding symmetrical. In these cases, I use `TORCH_WARN_ONCE` to notify the user of the performance implications. Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D27170744 Pulled By: jbschlosser fbshipit-source-id: b3d8a0380e0787ae781f2e5d8ee365a7bfd49f22	2021-03-18 16:22:03 -07:00
Joel Schlosser	e86476f736	Huber loss (#50553 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/48595. ## Background This PR implements HuberLoss, which differs from SmoothL1Loss by a factor of beta. The current implementation does not share logic between the two. Feedback is welcome for the optimal way to minimize code duplication while remaining performant. I've done some early [benchmarking](https://pytorch.org/tutorials/recipes/recipes/benchmark.html#collecting-instruction-counts-with-callgrind) with Huber calling in to the Smooth L1 kernel and scaling afterwards; for the simple test case I used, instruction counts are as follows: ``` Huber loss calls dedicated Huber kernel: 2,795,300 Huber loss calls Smooth L1 kernel and scales afterwards: 4,523,612 ``` With these numbers, instruction counts are ~62% higher when using the pre-existing Smooth L1 kernel. Pull Request resolved: https://github.com/pytorch/pytorch/pull/50553 Test Plan: ``` python test/test_nn.py TestNN.test_HuberLoss python test/test_nn.py TestNN.test_HuberLoss_delta python test/test_nn.py TestNN.test_huber_loss_invalid_delta python test/test_nn.py TestNNDeviceTypeCPU.test_smooth_l1_loss_vs_huber_loss_cpu python test/test_nn.py TestNNDeviceTypeCUDA.test_smooth_l1_loss_vs_huber_loss_cuda python test/test_nn.py TestNNDeviceTypeCPU.test_invalid_reduction_strings_cpu python test/test_nn.py TestNNDeviceTypeCUDA.test_invalid_reduction_strings_cuda python test/test_nn.py TestNN.test_loss_equal_input_target_shape python test/test_nn.py TestNN.test_pointwise_loss_broadcast python test/test_overrides.py python test/test_jit.py TestJitGeneratedFunctional.test_nn_huber_loss python test/test_type_hints.py python test/test_cpp_api_parity.py build/bin/test_api ``` ## Documentation <img width="677" alt="Screen Shot 2021-01-14 at 4 25 08 PM" src="https://user-images.githubusercontent.com/75754324/104651224-5a445980-5685-11eb-884b-14ea517958c2.png"> <img width="677" alt="Screen Shot 2021-01-14 at 4 24 35 PM" src="https://user-images.githubusercontent.com/75754324/104651190-4e589780-5685-11eb-974d-8c63a89c050e.png"> <img width="661" alt="Screen Shot 2021-01-14 at 4 24 45 PM" src="https://user-images.githubusercontent.com/75754324/104651198-50225b00-5685-11eb-958e-136b36f6f8a8.png"> <img width="869" alt="Screen Shot 2021-01-14 at 4 25 27 PM" src="https://user-images.githubusercontent.com/75754324/104651208-53b5e200-5685-11eb-9fe4-5ff433aa13c5.png"> <img width="862" alt="Screen Shot 2021-01-14 at 4 25 48 PM" src="https://user-images.githubusercontent.com/75754324/104651209-53b5e200-5685-11eb-8051-b0cfddcb07d3.png"> Reviewed By: H-Huang Differential Revision: D26734071 Pulled By: jbschlosser fbshipit-source-id: c98c1b5f32a16f7a2a4e04bdce678080eceed5d5	2021-03-02 17:30:45 -08:00
Joel Schlosser	68d438c9da	Add PixelUnshuffle (#49334 ) Summary: Adds an implementation of `torch.nn.PixelUnshuffle` as the inverse operation of `torch.nn.PixelShuffle`. This addresses https://github.com/pytorch/pytorch/issues/2456 Pull Request resolved: https://github.com/pytorch/pytorch/pull/49334 Test Plan: ``` # Unit tests. python test/test_nn.py TestNN.test_pixel_shuffle_unshuffle # Module test. python test/test_nn.py TestNN.test_PixelUnshuffle # C++ API tests. build/bin/test_api # C++ / python parity tests. python test/test_cpp_api_parity.py # JIT test. python test/test_jit.py TestJitGeneratedFunctional.test_nn_pixel_unshuffle # Override tests. python test/test_overrides.py # Type hint tests. python test/test_type_hints.py ``` Screenshots of rendered docs: <img width="876" alt="Screen Shot 2020-12-18 at 12 19 05 PM" src="https://user-images.githubusercontent.com/75754324/102642255-6b07bb00-412b-11eb-88fa-e53e7e8ba720.png"> <img width="984" alt="Screen Shot 2020-12-18 at 12 19 26 PM" src="https://user-images.githubusercontent.com/75754324/102642276-70fd9c00-412b-11eb-8548-445082a2db02.png"> <img width="932" alt="Screen Shot 2020-12-18 at 12 19 34 PM" src="https://user-images.githubusercontent.com/75754324/102642704-19abfb80-412c-11eb-9546-95bdd1c3cf22.png"> <img width="876" alt="Screen Shot 2020-12-22 at 12 51 36 PM" src="https://user-images.githubusercontent.com/75754324/102918259-986aa680-4454-11eb-99e7-a0b4c8b3e283.png"> <img width="869" alt="Screen Shot 2020-12-22 at 12 51 44 PM" src="https://user-images.githubusercontent.com/75754324/102918274-9ef91e00-4454-11eb-94bb-91b58aff47d3.png"> Reviewed By: mruberry Differential Revision: D25401439 Pulled By: jbschlosser fbshipit-source-id: 209d92ce7295e51699e83616d0c62170a7ce75c8	2020-12-22 20:14:55 -08:00
Nikita Shulga	c05ee86edd	Fix return-type-is-always-copy warning (#47279 ) Summary: `std::vector<bool>` can not return values by reference, since they are stored as bit fields Pull Request resolved: https://github.com/pytorch/pytorch/pull/47279 Reviewed By: glaringlee Differential Revision: D24705188 Pulled By: malfet fbshipit-source-id: 96e71cc4b9881f92af3b4a508d397deab6d68174	2020-11-03 08:53:24 -08:00
Xinyu Li	c9bb990707	[c++] Distance-agnostic triplet margin loss (#45377 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45377 This PR adds a C++ implementation of the TripletMarginWithDistanceLoss, for which the Python implementation was introduced in PR #43680. It's based on PR #44072, but I'm resubmitting this to unlink it from Phabricator. Test Plan: Imported from OSS Reviewed By: izdeby Differential Revision: D24003973 fbshipit-source-id: 2d9ada7260a6f27425ff2fdbbf623dad0fb79405	2020-09-30 12:37:35 -07:00
Heitor Schueroff de Souza	3d8c144400	Implemented torch::nn::Unflatten in libtorch (#42613 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/42613 Test Plan: Imported from OSS Reviewed By: glaringlee Differential Revision: D23030302 Pulled By: heitorschueroff fbshipit-source-id: 954f1cdfcbd3a62a7f0e887fcf5995ef27222a87	2020-08-14 15:32:13 -07:00
lixinyu	5246bc4e87	register parameters correctly in c++ MultiheadAttention (#42037 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/42037 This is to fix #41951 Test Plan: Imported from OSS Reviewed By: yf225 Differential Revision: D22764717 Pulled By: glaringlee fbshipit-source-id: e6da0aeb05a2356f52446e6d5fad391f2cd1cf6f	2020-07-27 13:58:11 -07:00
Will Feng	bbec4520c6	Add inplace tests for several torch::nn modules / functionals (#35147 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35147 Test Plan: Imported from OSS Differential Revision: D20578217 Pulled By: yf225 fbshipit-source-id: b8bafa49ee94c7dfbbca6e100ee3d9df5b2b621c	2020-03-21 10:02:56 -07:00
Will Feng	a2557970f3	Fix F::interpolate and torch::nn::Upsample implementation (#35025 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35025 This PR fixes `F::interpolate` and `torch::nn::Upsample` implementation to match the Python API implementation. This PR is BC-breaking in the following way: There are changes to `UpsampleOptions` and `InterpolateFuncOptions`: - `size` is changed from `std::vector<int64_t>` to `c10::optional<std::vector<int64_t>>`. If you want to pass a list of `int64_t` to this argument, you must pass it as `std::vector<int64_t>`. - `scale_factor` is changed from `std::vector<double>` to `c10::optional<std::vector<double>>`. If you want to pass a list of `double` to this argument, you must pass it as `std::vector<double>`. TODO: cherry-pick this PR into v1.5 release branch. Test Plan: Imported from OSS Differential Revision: D20559892 Pulled By: yf225 fbshipit-source-id: ac18609e351a9f2931eaeced8966b9491b2995f7	2020-03-20 22:37:13 -07:00
Will Feng	d7462dcea6	Fix AdaptiveAvgPool{2,3}d and AdaptiveMaxPool{2,3}d implementation (#35022 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35022 This PR fixes `AdaptiveAvgPool{2,3}d` and `AdaptiveMaxPool{2,3}d` implementation to match the Python API implementation. Particularly, `output_size` is changed to accept `c10::nullopt` in its elements, matching the Python API behavior. TODO: cherry-pick this PR into v1.5 release branch. Test Plan: Imported from OSS Differential Revision: D20559890 Pulled By: yf225 fbshipit-source-id: ccddbd278dd39165cf1dda11fc0e49387c76dbef	2020-03-20 22:36:57 -07:00
Will Feng	d041d0784e	[C++ API] RNNCell / LSTMCell / GRUCell layers (#34400 ) Summary: This PR adds `RNNCell` / `LSTMCell` / `GRUCell` layers to the C++ frontend, with implementations exactly matching the Python API equivalent. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34400 Differential Revision: D20316859 Pulled By: yf225 fbshipit-source-id: bb7cee092622334043c0d0fd0fcb4e75e707699c	2020-03-13 21:52:24 -07:00
Will Feng	a54416d208	[C++ API] Remove deprecated torch::nn::BatchNorm / FeatureDropout / modules_ordered_dict and torch::nn::init::Nonlinearity / FanMode (#34508 ) Summary: This PR is BC-breaking in the following way: - The deprecated `torch::nn::BatchNorm` is removed in favor of `torch::nn::BatchNorm{1,2,3}d` - The deprecated `torch::nn::FeatureDropout` is removed in favor of `torch::nn::Dropout{2,3}d` - The deprecated `torch::nn::modules_ordered_dict` is removed. User should do `Sequential sequential({{"m1", MyModule(1)}, {"m2", MyModule(2)}})` instead. - The deprecated `torch::nn::init::Nonlinearity` is removed, in favor of the following enums: - `torch::kLinear` - `torch::kConv1D` - `torch::kConv2D` - `torch::kConv3D` - `torch::kConvTranspose1D` - `torch::kConvTranspose2D` - `torch::kConvTranspose3D` - `torch::kSigmoid` - `torch::kTanh` - `torch::kReLU` - `torch::kLeakyReLU` - The deprecated `torch::nn::init::FanMode` is removed, in favor of the following enums: - `torch::kFanIn` - `torch::kFanOut` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34508 Differential Revision: D20351601 Pulled By: yf225 fbshipit-source-id: cca0cd112f29a31bb023e348ca8f82780e42bea3	2020-03-12 10:09:58 -07:00
Mansoor	e95657b87e	[C++ API] AdaptiveLogSoftmaxWithLoss (#29076 ) Summary: Implemented AdaptiveLogSoftmaxWithLoss and some tests for modules. Reference https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/29076 Differential Revision: D20404588 Pulled By: yf225 fbshipit-source-id: edbadf432b8173cbcc6caf83c9c03dd92dc31a37	2020-03-12 09:53:58 -07:00
generatedunixname89002005287564	9482683065	Remove dead includes in caffe2/test Reviewed By: ezyang Differential Revision: D19273220 fbshipit-source-id: 3dfc3388914e60611c84472e3fc529f5b5e40534	2020-01-21 11:30:34 -08:00
Pavel Belevich	47766e648f	C++ API parity: MultiheadAttention Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27309 Test Plan: Imported from OSS Differential Revision: D17766736 Pulled By: pbelevich fbshipit-source-id: 7a5f2399f081945d31d4c13d7a8d248c387fc1a6	2019-12-18 10:13:29 -08:00
Pavel Belevich	f8e7f3fca4	C++ API parity: BCEWithLogitsLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28783 Test Plan: Imported from OSS Differential Revision: D18202435 Pulled By: pbelevich fbshipit-source-id: 011b028bbb2a091e98d3548616b99d7b4569c239	2019-11-20 06:46:38 -08:00
Will Feng	bb1d9b238d	torch::nn::FractionalMaxPool{2,3}d module and functional Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29933 Test Plan: Imported from OSS Differential Revision: D18548174 Pulled By: yf225 fbshipit-source-id: 070776db6e8b7ad94d9b7cbd82b3d6966f061a46	2019-11-19 17:24:07 -08:00
Divyansh Singhvi	ec52d911bd	InstanceNorm{1,2,3}d (#28790 ) Summary: Hi yf225, I have a few doubts related to implementation: 1) What tests do I have to write? 2) What does _load_state_from_dict does? 3) Do I need to override reset() function as I can not see it's utility? 4) InstanceNormOptions could be removed with BatchNormOptions, but I find that `track_running_status` is not defined instead `stateful` is defined. InstanceNorm{1,2,3}d https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28790 Differential Revision: D18588666 Pulled By: yf225 fbshipit-source-id: bb9b81f01f62c3fc8765fa0ba0716768087ee155	2019-11-19 16:57:01 -08:00
nuka137	a75b669b0f	C++ API: torch::nn::ConvTranspose{1,2,3}d (#29721 ) Summary: Add torch::nn::ConvTranspose{1,2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/29721 Differential Revision: D18588943 Pulled By: yf225 fbshipit-source-id: d4dbb091389367e70459399d5cda3778325c2120	2019-11-19 16:04:12 -08:00
Suyash458	e88d096321	C++/Python API Parity: add AlphaDropout (#28424 ) Summary: - add `AlphaDropoutImpl` to `modules/dropout.h` and `modules/dropout.cpp` - add `functional/dropout.h` containing the `alpha_dropout` function - include `functional/dropout.h` in `nn/functional.h` - add functional and module tests - related issue https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28424 Differential Revision: D18589162 Pulled By: yf225 fbshipit-source-id: c85734e02431a6c052515e26b11ca30ad7303644	2019-11-19 10:05:51 -08:00
Will Feng	689b4bea7b	torch::nn::GLU and F::glu (#29922 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29922 * #29920 [C++ API] torch::nn::GroupNorm and F::group_norm Test Plan: Imported from OSS Differential Revision: D18558818 Pulled By: yf225 fbshipit-source-id: ff80d634309fcb55f53db8dcf86eb9cf8161b37e	2019-11-16 21:03:38 -08:00
Will Feng	d5bf51b684	torch::nn::GroupNorm and F::group_norm Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29920 Test Plan: Imported from OSS Differential Revision: D18539314 Pulled By: yf225 fbshipit-source-id: dabbbaac31796fe7bfde02487737971bde699c1c	2019-11-16 19:22:11 -08:00
PyExtreme	e1d13f4f8b	C++ API parity: NLLLoss & CrossEntropyLoss (#29812 ) Summary: Hi yf225 , I have added NLLLoss and CrossEntropyLoss. ``` Also, while using log_softmax in cross_entropy_loss, I am getting an error ../caffe2/../torch/csrc/api/include/torch/nn/functional/loss.h:537:63: error: no matching function for call to log_softmax(const at::Tensor&)’ const Tensor& log_softmax_input = torch::log_softmax(input); aten/src/ATen/Functions.h:5551:22: note: candidate: at::Tensor at::log_softmax(const at::Tensor&, int64_t, c10::optional<c10::ScalarType>) static inline Tensor log_softmax(const Tensor & self, int64_t dim, c10::optional<ScalarType> dtype) { ^~~~~~~~~~~ aten/src/ATen/Functions.h:5551:22: note: candidate expects 3 arguments, 1 provided ``` I think the other two parameters should be optional as in python frontend(shown in documentation here at https://pytorch.org/docs/stable/nn.functional.html#torch.nn.functional.log_softmax ). Rest, there were no errors in build and tests have passed Pull Request resolved: https://github.com/pytorch/pytorch/pull/29812 Differential Revision: D18548249 Pulled By: yf225 fbshipit-source-id: 2ab350abd2a6f498d4dba2345f51ad87471f3038	2019-11-16 10:49:09 -08:00
Pavel Belevich	27afac2134	C++ API parity: Dropout, Dropout2d, Dropout3d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29761 Test Plan: Imported from OSS Differential Revision: D18530820 Pulled By: pbelevich fbshipit-source-id: 9d351561692f7de099d7c6aaf2ecb930b5c867e9	2019-11-15 20:32:06 -08:00
Will Feng	b37c235d86	C++/Python API parity for Conv{1,2,3}d layers, and add F::conv{1,2,3}d functionals (#28917 ) Summary: This PR changes the implementation of C++ Conv{1,2,3}d layers to exactly match the Python version, and add F::conv{1,2,3}d functionals. For more thorough testing, I will rely on the parity test mechanism which uses values from `common_nn.py` to generate the inputs and options that we are interested in testing. This PR is BC-breaking in the following way: In `Conv{1,2,3}dOptions`: - `with_bias` is renamed to `bias`. - `input_channels` is renamed to `in_channels`. - `output_channels` is renamed to `out_channels`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28917 Differential Revision: D18471526 Pulled By: yf225 fbshipit-source-id: 7a33f60654ad93cc2e043245e7ff9e0ef9da15b3	2019-11-13 12:53:31 -08:00
Will Feng	65bfcde05e	Use c10::variant-based enums for SmoothL1Loss module and functional Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29536 Test Plan: Imported from OSS Differential Revision: D18432272 Pulled By: yf225 fbshipit-source-id: fa355145962e93025b7de98b99b0a4fc82e8c871	2019-11-12 16:05:31 -08:00
Will Feng	9f879ef532	Make all non-input arguments to functionals part of its options (#29404 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29404 This PR makes all non-input arguments to functionals part of its options parameters, so that we won't break backward compatibility even if we add or reorder some of the non-input arguments to functionals in the future. Test Plan: Imported from OSS Differential Revision: D18378526 Pulled By: yf225 fbshipit-source-id: f5cf6bdfb844e75bf94fdee58c121e0955631b6e	2019-11-12 16:05:22 -08:00
eellison	e01fc56ecb	move type inference for arange into c++ (#27629 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/17662 I'm not sure if `arange` needs to be in python_arg_parser at all, given the schemas in native_functions.yaml. In any case this at least fixes the dytpe mismatch. In follow up PRs I will try to handle some of the other ops that do type inference at the python level, like randint. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27629 Differential Revision: D17885939 Pulled By: eellison fbshipit-source-id: f97a8bc722b7ab77de1c42a992e49a4a3175ad60	2019-11-11 11:26:21 -08:00
Will Feng	cb74ede59e	Pass F::FuncOptions instead of torch::nn::Options to functionals, and make F::FuncOptions a different class when necessary (#29364 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29364 Currently, we use `torch::nn::Options` both as module options and functional options. However, this makes it very hard to manage the parameters in `torch::nn::Options`, because a module's constructor can take a different set of arguments than the module's equivalent functional (e.g. `torch.nn.BatchNorm1d` takes `num_features, eps=1e-5, momentum=0.1, affine=True, track_running_stats=True`, while `F::batch_norm` takes `running_mean, running_var, weight=None, bias=None, training=False, momentum=0.1, eps=1e-5`). This PR resolves the above problem by making `F::FuncOptions` a different class from `torch::nn::Options` when necessary (i.e. when a module's constructor takes a different set of arguments than the module's equivalent functional). In the rest of the cases where the module constructor takes the same set of arguments as the module's equivalent functional, `F::FuncOptions` is an alias of `torch::nn::*Options`. Also as part of this PR, we change all functional options to pass-by-value, to make the semantics consistent across all functionals. Test Plan: Imported from OSS Differential Revision: D18376977 Pulled By: yf225 fbshipit-source-id: 8d9c240d93bfd5af0165b6884fdc912476b1d06b	2019-11-08 22:38:21 -08:00
lsrock1	6389c18709	C++ parity, nn::CrossMapLRN2d (#29039 ) Summary: yf225 https://github.com/pytorch/pytorch/issues/25883 re- pull request because of rebase mistake! Pull Request resolved: https://github.com/pytorch/pytorch/pull/29039 Differential Revision: D18326829 Pulled By: yf225 fbshipit-source-id: 5ed737f6275e4463efa4951d9b7f45c6f2723c82	2019-11-05 15:27:08 -08:00
Pavel Belevich	69f845cb77	C++ API parity: MarginRankingLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29000 Test Plan: Imported from OSS Differential Revision: D18271855 Pulled By: pbelevich fbshipit-source-id: cbafc7f059173306c83673d7be374c2d3700911f	2019-11-05 05:41:40 -08:00
Xiaomeng Yang	2460dced8f	Add torch.nn.GELU for GELU activation (#28944 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28944 Add torch.nn.GELU for GELU activation Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "GELU" Reviewed By: hl475, houseroad Differential Revision: D18240946 fbshipit-source-id: 6284b30def9bd4c12bf7fb2ed08b1b2f0310bb78	2019-11-03 21:55:05 -08:00
nuka137	a68c1e109e	C++ API: torch::nn::BatchNorm{2,3}d (#28936 ) Summary: Add torch::nn::BatchNorm{2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #28176 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28936 Differential Revision: D18274584 Pulled By: yf225 fbshipit-source-id: 3784eee9f8947f6c7c9f1699544a3d36a1a019b7	2019-11-01 17:50:33 -07:00
Pavel Belevich	4a94eaa60b	C++ API parity: PoissonNLLLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28755 Test Plan: Imported from OSS Differential Revision: D18202436 Pulled By: pbelevich fbshipit-source-id: a7a27d5f3cdbcbbd9bbbffa02b576609d5fdc9b3	2019-11-01 12:35:59 -07:00
Edward Yang	bbea34f283	Revert D18266918: C++ API: torch::nn::BatchNorm{2,3}d Test Plan: revert-hammer Differential Revision: D18266918 Original commit changeset: f432904c7298 fbshipit-source-id: 0e1c596b2e2f13b59082ff422c67ba025df4be07	2019-11-01 10:46:49 -07:00
nuka137	b7c5b3d398	C++ API: torch::nn::BatchNorm{2,3}d (#28936 ) Summary: Add torch::nn::BatchNorm{2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #28176 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28936 Differential Revision: D18266918 Pulled By: yf225 fbshipit-source-id: f432904c72985d52ec52cb992cceb372b6ff0244	2019-11-01 09:28:58 -07:00
Carlos Miranda	72b9bda9e5	Smooth L1 loss (#27661 ) Summary: In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `SmoothL1Loss` module and `smooth_l1_loss` functional. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27661 Differential Revision: D18002332 Pulled By: yf225 fbshipit-source-id: b382df8becb0de14986ec16ee0dc953d7b10e917	2019-10-31 23:41:35 -07:00
Will Feng	595209bddc	Fix bugs in torch::tensor constructor (#28523 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28523 New features: 1. Previously, `torch::tensor({true, false, true})` throws `"tensor_cpu" not implemented for 'Bool'`. After this PR, it produces the correct bool tensor, matching the Python API behavior. 2. Tensors with zero-size dimensions are now supported, e.g. `torch::tensor({{}, {}})` produces a tensor with sizes `{2, 0}`, matching the Python API behavior. BC-breaking bug fixes: 1. Previously, `torch::tensor({{1}, {2}})` produces a tensor of sizes `{2}`. After this PR, it produces a tensor of sizes `{2, 1}`, matching the Python API behavior. 2. Fixed semantics of `torch::tensor(1.1)`: it now returns a 0-dim tensor instead of a 1-dim tensor, matching the Python API behavior. 3. Previously, when passed a non-dtype `TensorOptions` to the `torch::tensor` constructor, it always produces a tensor of dtype `float`. After this PR, it produces tensor of different dtypes based on the dtype of the braced-init-list, matching the behavior of the no-options case. ```cpp // Previously: torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({{1, 2, 3}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({1., 2., 3.}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float // Now: torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/)).dtype() -> int torch::tensor({{1, 2, 3}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> int torch::tensor({1., 2., 3.}, torch::TensorOptions(/non-dtype-options/)).dtype() -> double torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> double // As comparison, currently: torch::tensor({1, 2, 3}).dtype() -> int torch::tensor({{1, 2, 3}}).dtype() -> int torch::tensor({1., 2., 3.}).dtype() -> double torch::tensor({{1., 2., 3.}}).dtype() -> double ``` Notes: 1. From now on, the behavior of `at::tensor(scalar_value)` (which produces a 1-dim tensor) would be different from `torch::tensor(scalar_value)` (which produces a 0-dim tensor). I will fix the behavior of `at::tensor(scalar_value)` in a follow-up PR. 2. From now on, the behavior of `at::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/))` (which produces a `float` tensor) would be different from `torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/))` (which produces a an `int` tensor). I will fix this behavior of `at::tensor` constructor in a follow-up PR. Context for the changes in this PR: The motivation comes from fixing the "`torch::tensor({{1}, {2}})` gives tensor of wrong sizes" bug - in order to fix it, I have to move the handling of `at::ArrayRef` and `std::vector` into `InitListTensor` (see below on why we need to do this) and renamed `InitListTensor` to `TensorDataContainer`. After such changes, support for bool values comes out of the box without extra effort, and support for tensors with zero-size dimensions only requires adding a default constructor for `TensorDataContainer`, so I added those two in this PR. For the semantic change of `torch::tensor(1.1)`, it's actually more effort to preserve the original wrong behavior (i.e. we need to check the sizes of the tensor converted from `TensorDataContainer` and reshape any scalar tensor to a 1-D tensor). I think preserving the original wrong behavior doesn't give us much value, and since the above changes naturally fix the problem, we should just start using the right behavior instead. For the "constructor with non-dtype options behavior" fix, the code looks simpler and easier to reason about with the fix, so I included it in this PR. -------- Why we need to move the handling of `at::ArrayRef` and `std::vector` into `TensorDataContainer`: `torch::tensor({{1}, {2}})` can match this function overload: `torch::tensor(at::ArrayRef<int> values)`, because `{1}` and `{2}` can be treated as a list-initialization of an `int` value. However, this will produce a Tensor with sizes `{2}`, but we actually want a Tensor with sizes `{2, 1}`. In order to avoid matching this function overload, we removed the function overload and moved the ability to convert `at::ArrayRef<T>` (and similarly `std::vector<T>`) into `TensorDataContainer`, and since for braced-init-list the `TensorDataContainer(std::initializer_list<TensorDataContainer>)` constructor is always preferred over all other constructors, it will take the `std::initializer_list` path, and all is good. Test Plan: Imported from OSS Differential Revision: D18234625 Pulled By: yf225 fbshipit-source-id: 0f3f6912e82e2117d2103e31b74e7e97baaa8693	2019-10-31 12:53:06 -07:00
Pavel Belevich	d6f1e49c4a	C++ API parity: CTCLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28654 Test Plan: Imported from OSS Differential Revision: D18202437 Pulled By: pbelevich fbshipit-source-id: a4b80a57e65da84f3988002a026c648fa52a0fde	2019-10-30 14:35:02 -07:00
jon-tow	1d3d9ec7d4	C++ API Parity: `functional::fold` and `Fold::pretty_print` (#28732 ) Summary: Adds `torch::nn::functional::fold` support and updates `Fold::pretty_print` in the C++ API for more thorough Python parity. Note: Small updates in source files to maintain consistency elsewhere. Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28732 Differential Revision: D18219955 Pulled By: yf225 fbshipit-source-id: fd2e9be8f17db77c1b1f384c0d2e16cc34858c0c	2019-10-30 11:37:39 -07:00
mansoorcheema	a465b033fd	Local response norm (#28759 ) Summary: Implemented LocalResponseNorm and some initial tests for modules and functional. Reference https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28759 Differential Revision: D18219745 Pulled By: yf225 fbshipit-source-id: e6aad568a8b1e81f54752decaefd4f9044029da9	2019-10-30 11:31:00 -07:00
mrsalehi	dfe7b25eaf	Add nn::Flatten to C++ Frontend (#28072 ) Summary: Adds torch::nn::Flatten module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28072 Differential Revision: D18202778 Pulled By: yf225 fbshipit-source-id: 43345dcbdf2f50d75746bf9a0ba293b84df275ab	2019-10-29 17:52:47 -07:00
nuka137	cbc234bceb	C++ API: torch::nn::BatchNorm1d (#28176 ) Summary: Add torch::nn::BatchNorm1d function/module support for the C++ API. torch::nn::BatchNorm{2,3}d will be added after this PR is merged. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 I would like to discuss about below items. * Necessity of `num_batches_tracked` in `BatchNormImplBase` * `num_batches_tracked` is needed to calculate `momentum` when we do not feed `momentum` argument in Python API. But in C++ API, `momentum` argument has a default value. * `num_batches_tracked` is only used for counting up `BatchNorm1d::foward()` call. I think it is no necessary for user anymore. * The design of `BatchNorm{1,2,3}dOptions` * We have already `BatchNormOptions` used for deprecated `BatchNorm` module. However, it is hard to use it for `BatchNorm{1,2,3}dOptions` because of the arguments disagreement of each modules. * In this PR, I introduce `BatchNormOptionsv2` template class for the `BatchNorm{1,2,3}dOptions`. But I'm not sure this design is good or not. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28176 Differential Revision: D18196843 Pulled By: yf225 fbshipit-source-id: 667e2b5de4150d5776c41b9088c9e6c2ead24cd4	2019-10-29 17:29:42 -07:00

1 2 3

141 Commits