pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Will Feng	05a7aaa742	Pass Tensor instead of Tensor& to torch::nn functionals that can change input in place (#30112 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30112 Currently, we have torch::nn functionals that takes `input` as `Tensor&` in order to be able to in-place change `input`'s value. We likely shouldn't do this because it will prevent the following use case: ```cpp F::elu(torch::tensor(1), F::ELUFuncOptions().inplace(true)) ``` The solution is to change the type of `input` to `Tensor`, so that we can pass an rvalue into the functional. Test Plan: Imported from OSS Differential Revision: D18601580 Pulled By: yf225 fbshipit-source-id: 639a86eb62f6c986b0f20bf7e201983e83126e73	2019-11-19 16:11:39 -08:00
nuka137	a75b669b0f	C++ API: torch::nn::ConvTranspose{1,2,3}d (#29721 ) Summary: Add torch::nn::ConvTranspose{1,2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/29721 Differential Revision: D18588943 Pulled By: yf225 fbshipit-source-id: d4dbb091389367e70459399d5cda3778325c2120	2019-11-19 16:04:12 -08:00
Suyash458	e88d096321	C++/Python API Parity: add AlphaDropout (#28424 ) Summary: - add `AlphaDropoutImpl` to `modules/dropout.h` and `modules/dropout.cpp` - add `functional/dropout.h` containing the `alpha_dropout` function - include `functional/dropout.h` in `nn/functional.h` - add functional and module tests - related issue https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28424 Differential Revision: D18589162 Pulled By: yf225 fbshipit-source-id: c85734e02431a6c052515e26b11ca30ad7303644	2019-11-19 10:05:51 -08:00
Will Feng	3bd0f476d4	Revert D18233037: C++ API parity: isfinite Test Plan: revert-hammer Differential Revision: D18233037 Original commit changeset: c76b9467bbc1 fbshipit-source-id: 97d2cfa9de767a8c3a0ca919f9d768e959fa484e	2019-11-18 20:26:19 -08:00
Pavel Belevich	8df5e10ee9	C++ API parity: isfinite Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28918 Test Plan: Imported from OSS Differential Revision: D18233037 Pulled By: pbelevich fbshipit-source-id: c76b9467bbc1fbb2c9bf49855895c98438b36c12	2019-11-18 19:06:57 -08:00
Will Feng	689b4bea7b	torch::nn::GLU and F::glu (#29922 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29922 * #29920 [C++ API] torch::nn::GroupNorm and F::group_norm Test Plan: Imported from OSS Differential Revision: D18558818 Pulled By: yf225 fbshipit-source-id: ff80d634309fcb55f53db8dcf86eb9cf8161b37e	2019-11-16 21:03:38 -08:00
Will Feng	d5bf51b684	torch::nn::GroupNorm and F::group_norm Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29920 Test Plan: Imported from OSS Differential Revision: D18539314 Pulled By: yf225 fbshipit-source-id: dabbbaac31796fe7bfde02487737971bde699c1c	2019-11-16 19:22:11 -08:00
PyExtreme	e1d13f4f8b	C++ API parity: NLLLoss & CrossEntropyLoss (#29812 ) Summary: Hi yf225 , I have added NLLLoss and CrossEntropyLoss. ``` Also, while using log_softmax in cross_entropy_loss, I am getting an error ../caffe2/../torch/csrc/api/include/torch/nn/functional/loss.h:537:63: error: no matching function for call to log_softmax(const at::Tensor&)’ const Tensor& log_softmax_input = torch::log_softmax(input); aten/src/ATen/Functions.h:5551:22: note: candidate: at::Tensor at::log_softmax(const at::Tensor&, int64_t, c10::optional<c10::ScalarType>) static inline Tensor log_softmax(const Tensor & self, int64_t dim, c10::optional<ScalarType> dtype) { ^~~~~~~~~~~ aten/src/ATen/Functions.h:5551:22: note: candidate expects 3 arguments, 1 provided ``` I think the other two parameters should be optional as in python frontend(shown in documentation here at https://pytorch.org/docs/stable/nn.functional.html#torch.nn.functional.log_softmax ). Rest, there were no errors in build and tests have passed Pull Request resolved: https://github.com/pytorch/pytorch/pull/29812 Differential Revision: D18548249 Pulled By: yf225 fbshipit-source-id: 2ab350abd2a6f498d4dba2345f51ad87471f3038	2019-11-16 10:49:09 -08:00
Pavel Belevich	27afac2134	C++ API parity: Dropout, Dropout2d, Dropout3d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29761 Test Plan: Imported from OSS Differential Revision: D18530820 Pulled By: pbelevich fbshipit-source-id: 9d351561692f7de099d7c6aaf2ecb930b5c867e9	2019-11-15 20:32:06 -08:00
Edward Yang	65bb34d885	Remove TensorImpl::is_variable, deprecate Tensor::is_variable (#29653 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29653 I didn't remove is_variable from Tensor for BC reasons, but I did remove as many uses as I could from the codebase. at::impl::variable_excluded_from_dispatch got moved to TensorBody.h so that it's more widely accessible. This diff is NOT semantics preserving. Here are the major differences: - In a number of native operator implementations, we tested that arguments are not variable. I replaced these with asserts that variable is excluded from dispatch. I actually don't think these asserts are really necessary now (they should certainly be true, but it's hard to get it wrong), but I've kept them for old time's sake. At least, they'll detect if you call these functions before you've processed variable (indicating a bug in your kernel.) - There are a number of places where we do a per-tensor test for being a variable, for better error reporting when someone commits Tensor/Variable confusion. Although these tests are substantively the same as the tests above, in these cases I decided to delete the test entirely. The reasoning is that in these cases, we didn't really care about dispatch (also, see above; I'm not too sure we really need the dispatch asserts), we cared about Tensor/Variable confusion. Since Tensor/Variable confusion is impossible now, we don't need the tests. One of the key factors which pushed me one way or another was whether or not a function was doing per-tensor validation; if I kept the assert in such functions, I'd repeatedly access the TLS. Even if we want to bring back the asserts, they would have to go somewhere else. Another similar idiom is the number of places we do !x.defined() \|\| x.is_variable(); I treated this equivalently. - nuclear_norm's computation of compute_uv is a bit weird, but I think it's OK to just delete the is_variable case (I suspect that it is always the case that self.is_variable(), but it doesn't really matter.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D18496168 Pulled By: ezyang fbshipit-source-id: 5a1ded931e0c10a6b758ba64a8380d34110e0c3e	2019-11-14 11:41:02 -08:00
Will Feng	a68c52494c	Use F::*FuncOptions for embedding/embeddingbag functionals (#29673 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29673 Following https://github.com/pytorch/pytorch/pull/29364 and https://github.com/pytorch/pytorch/pull/29404, this PR makes `F::EmbeddingFuncOptions` and `F::EmbeddingBagFuncOptions` separate classes from `torch::nn::EmbeddingOptions` and `torch::nn::EmbeddingBagOptions`, so that it's easier to enforce that arguments such as `num_embeddings` and `embedding_dim` are required for `torch::nn::EmbeddingOptions` and `torch::nn::EmbeddingBagOptions`. Test Plan: Imported from OSS Differential Revision: D18462540 Pulled By: yf225 fbshipit-source-id: f2abf431e48675b0a9d7f6f398cdb90ff9037c35	2019-11-13 18:47:22 -08:00
Will Feng	65f691f2c2	Add more tests for torch::arange Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29689 Test Plan: Imported from OSS Differential Revision: D18465818 Pulled By: yf225 fbshipit-source-id: 0cf0aaa7febcf4318abdaae7d17a43ab3acde017	2019-11-13 15:17:16 -08:00
Will Feng	2bcac59a30	Use default dtype for torch::tensor(floating_point_values) and torch::tensor(empty braced-init-list) when dtype is not specified (#29632 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29632 This PR is BC-breaking in the following way: Previously, C++ `torch::tensor` with a floating-point literal with no suffix (e.g. `torch::tensor(1.1)`) or a (nested) braced-init-list of floating-point literals with no suffix (e.g. `torch::tensor({{1.1, 2.2}})` produces a tensor with dtype `at::kDouble`. After this PR, it produces a tensor with dtype `torch::get_default_dtype()`, matching Python `torch.tensor` behavior. Test Plan: Imported from OSS Differential Revision: D18465819 Pulled By: yf225 fbshipit-source-id: 6834fe50335c677bc3832f2a5e9cf8d1ede9f665	2019-11-13 15:17:11 -08:00
Will Feng	b37c235d86	C++/Python API parity for Conv{1,2,3}d layers, and add F::conv{1,2,3}d functionals (#28917 ) Summary: This PR changes the implementation of C++ Conv{1,2,3}d layers to exactly match the Python version, and add F::conv{1,2,3}d functionals. For more thorough testing, I will rely on the parity test mechanism which uses values from `common_nn.py` to generate the inputs and options that we are interested in testing. This PR is BC-breaking in the following way: In `Conv{1,2,3}dOptions`: - `with_bias` is renamed to `bias`. - `input_channels` is renamed to `in_channels`. - `output_channels` is renamed to `out_channels`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28917 Differential Revision: D18471526 Pulled By: yf225 fbshipit-source-id: 7a33f60654ad93cc2e043245e7ff9e0ef9da15b3	2019-11-13 12:53:31 -08:00
Edward Yang	30092df15e	Rename getNonVariableDeprecatedTypeProperties to getDeprecatedTypeProperties (#29203 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29203 There is no more Variable/Tensor distinction, so fix the misleading name. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D18353505 Pulled By: ezyang fbshipit-source-id: dadc394d533ab7746f70bc186c6645441a784518	2019-11-13 07:43:32 -08:00
Will Feng	65bfcde05e	Use c10::variant-based enums for SmoothL1Loss module and functional Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29536 Test Plan: Imported from OSS Differential Revision: D18432272 Pulled By: yf225 fbshipit-source-id: fa355145962e93025b7de98b99b0a4fc82e8c871	2019-11-12 16:05:31 -08:00
Will Feng	57eab22c6a	Use c10::variant-based enums for F::grid_sample Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29535 Test Plan: Imported from OSS Differential Revision: D18432273 Pulled By: yf225 fbshipit-source-id: 11476f0431a9b544dfb62bc7a89bab84399f9b83	2019-11-12 16:05:26 -08:00
Will Feng	9f879ef532	Make all non-input arguments to functionals part of its options (#29404 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29404 This PR makes all non-input arguments to functionals part of its options parameters, so that we won't break backward compatibility even if we add or reorder some of the non-input arguments to functionals in the future. Test Plan: Imported from OSS Differential Revision: D18378526 Pulled By: yf225 fbshipit-source-id: f5cf6bdfb844e75bf94fdee58c121e0955631b6e	2019-11-12 16:05:22 -08:00
Anjali Chourdia	604fc9ec41	F::embedding, F::embedding_bag, moved Embedding and EmbeddingBag options to embedding.h in options Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28669 Differential Revision: D18377609 Pulled By: anjali411 fbshipit-source-id: 6a2c547368849ebd1a2f8828cfbe7252152b26a2	2019-11-11 11:51:26 -08:00
eellison	e01fc56ecb	move type inference for arange into c++ (#27629 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/17662 I'm not sure if `arange` needs to be in python_arg_parser at all, given the schemas in native_functions.yaml. In any case this at least fixes the dytpe mismatch. In follow up PRs I will try to handle some of the other ops that do type inference at the python level, like randint. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27629 Differential Revision: D17885939 Pulled By: eellison fbshipit-source-id: f97a8bc722b7ab77de1c42a992e49a4a3175ad60	2019-11-11 11:26:21 -08:00
Will Feng	cb74ede59e	Pass F::FuncOptions instead of torch::nn::Options to functionals, and make F::FuncOptions a different class when necessary (#29364 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29364 Currently, we use `torch::nn::Options` both as module options and functional options. However, this makes it very hard to manage the parameters in `torch::nn::Options`, because a module's constructor can take a different set of arguments than the module's equivalent functional (e.g. `torch.nn.BatchNorm1d` takes `num_features, eps=1e-5, momentum=0.1, affine=True, track_running_stats=True`, while `F::batch_norm` takes `running_mean, running_var, weight=None, bias=None, training=False, momentum=0.1, eps=1e-5`). This PR resolves the above problem by making `F::FuncOptions` a different class from `torch::nn::Options` when necessary (i.e. when a module's constructor takes a different set of arguments than the module's equivalent functional). In the rest of the cases where the module constructor takes the same set of arguments as the module's equivalent functional, `F::FuncOptions` is an alias of `torch::nn::*Options`. Also as part of this PR, we change all functional options to pass-by-value, to make the semantics consistent across all functionals. Test Plan: Imported from OSS Differential Revision: D18376977 Pulled By: yf225 fbshipit-source-id: 8d9c240d93bfd5af0165b6884fdc912476b1d06b	2019-11-08 22:38:21 -08:00
Edward Yang	4e21157e01	Revert "Revert D18171156: Merge Tensor and Variable." (#29299 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29299 This reverts commit `9c43b16df9`, but also with the changes from D18348622. Comments there: thpp-compatibility is used by admarket/adreview/service:adreviewservice and libtorch is too big for the service to deal with. thpp-compatibility doesn't support autograd, so we hack around dispatching variables by using AutoNonVariableTypeMode everywhere we call into ATen, so we never attempt to call into Variable stubs. If you get it wrong, you'll get an error like: ``` what(): Could not run 'aten::empty' with arguments from the 'VariableTensorId' backend. 'aten::empty' is only available for these backends: [SparseCPUTensorId, CPUTensorId, MkldnnCPUTensorId]. (lookup_ at caffe2/aten/src/ATen/core/dispatch/DispatchTable.h:298) ``` Test Plan: Imported from OSS ``` buck test //thpp-compatibility/... buck build mode/opt-clang admarket/adreview/service:adreviewservice ``` adreviewservice canary: https://our.intern.facebook.com/intern/ads/canary/422290029716387895 (comparing against parent comment due to current breakage) ==> experiment store https://our.intern.facebook.com/intern/experiment_store/experiment/43990006/ adfinder canary: https://our.intern.facebook.com/intern/ads/canary/422268535840333934 adindexer canary: https://our.intern.facebook.com/intern/ads/canary/422268550559034675 adreview second canary: https://our.intern.facebook.com/intern/ads/canary/422307863515591925 canary without thpp-compat fixups https://our.intern.facebook.com/intern/ads/canary/422308951649168772 Reviewed By: dreiss Differential Revision: D18353504 Pulled By: ezyang fbshipit-source-id: 65feaba39fa07bb66762810909aeb38868668a30	2019-11-08 09:11:20 -08:00
Zachary DeVito	796363147f	Implement more of of the nn.Module API (#28828 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28828 This updates torch::script::Module to more closely match the behavior of nn.Module. In particular, it implements the (optionally recurisive) iterators that retrieve submodules, parameters, and buffers and makes their names match the python versions. This also removes the individual accessors for Parameter, Module, Buffer, etc. and replaces them with a single `attr` function which is equivalent to writing `a.foo` in Python (`setattr` emulates `a.foo = v`). As we build out the user-facing API for TorchScript values this will end up matching how an attribute is accessed on general objects. This PR preservers the python bindings for script::Module by emulating the old API at the binding level. A followup will clean up the usage to more directly match the C++ API. Test Plan: Imported from OSS Differential Revision: D18197611 Pulled By: zdevito fbshipit-source-id: 7ee4dcbb258605d1c988314b05d938423f1ccee5	2019-11-06 22:58:25 -08:00
Edward Yang	9c43b16df9	Revert D18171156: Merge Tensor and Variable. Test Plan: revert-hammer Differential Revision: D18171156 Original commit changeset: 5b6a045beba3 fbshipit-source-id: f5581d902c2305018ea49f8473592be2a465560b	2019-11-06 10:57:00 -08:00
lsrock1	6389c18709	C++ parity, nn::CrossMapLRN2d (#29039 ) Summary: yf225 https://github.com/pytorch/pytorch/issues/25883 re- pull request because of rebase mistake! Pull Request resolved: https://github.com/pytorch/pytorch/pull/29039 Differential Revision: D18326829 Pulled By: yf225 fbshipit-source-id: 5ed737f6275e4463efa4951d9b7f45c6f2723c82	2019-11-05 15:27:08 -08:00
Pavel Belevich	69f845cb77	C++ API parity: MarginRankingLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29000 Test Plan: Imported from OSS Differential Revision: D18271855 Pulled By: pbelevich fbshipit-source-id: cbafc7f059173306c83673d7be374c2d3700911f	2019-11-05 05:41:40 -08:00
Will Feng	026fd36c71	Use at::kLong for torch::tensor(integer_value) when dtype is not specified (#29066 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29066 This PR is BC-breaking in the following way: Previously, C++ `torch::tensor` with an integer literal or a braced-init-list of integer literals produces a tensor with dtype being the type of the integer literal(s). After this PR, it always produces a tensor of dtype `at::kLong` (aka. int64_t), matching Python `torch.tensor` behavior. Test Plan: Imported from OSS Differential Revision: D18307248 Pulled By: yf225 fbshipit-source-id: 7a8a2eefa113cbb238f23264843bdb3b77fec668	2019-11-04 21:39:10 -08:00
Edward Yang	25261a4776	Merge Tensor and Variable. (#28620 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28620 All Tensors are Variables now, they just happen to have requires_grad=False. Tensors ALWAYS have `VariableTensorId` in their type set. When constructing this patch, I had to make decisions about what I would fix in this patch, and what I would leave for follow up PRs. Here is the cleanup that happens in this patch: - The `is_variable` property is removed from TensorOptions. I removed this immediately because unlike Tensor::is_variable, TensorOptions::is_variable doesn't respect our VariableTensorId thread-local state. This means that there were a bunch of places where TensorOptions::is_variable was false, which is obviously bogus in the world when tensor and variable are merged. Instead of keeping the method as a function that always returns true, I just opted to remove it entirely (it's not public API.) All places we set `is_variable` are deleted. - Knock on effect: there is no longer a separate DeprecatedTypeProperties for the variable and non-variable versions of type. - Knock on effect: instead of asserting on TensorOptions::is_variable, instead we just test `at::impl::variable_is_excluded()` - There is now only one copy of the cuDNN RNN dropout cache, not two (I'm not sure why we had two to begin with) Some cleanup that doesn't happen in this patch: - Eliminating unnecessary uses of `make_variable` - Eliminating `Tensor::is_variable` The most subtle part of this patch is retaining tracing behavior: the fact that everything is a Variable means that more code gets routed to VariableType than before; this can change traces. I identified two places where we didn't appropriately turn off VariableType, mostly factory functions: - `torch.tensor` must turn off VariableType before invoking `at::empty` to construct the tensor, as it subsequently does direct data access - `tensor_slow` (invoked when you pass a Python scalar to a tensor argument) must turn off VariableType before calling `scalar_to_tensor` so the scalar gets traced as constant, rather than as a call to `scalar_to_tensor`. Honestly, these are all giant hacks, and should be replaced with a more specialized guard that just toggles tracing. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: dreiss Differential Revision: D18171156 Pulled By: ezyang fbshipit-source-id: 5b6a045beba37492647e350190f495114e86504d	2019-11-04 14:59:57 -08:00
Xiaomeng Yang	2460dced8f	Add torch.nn.GELU for GELU activation (#28944 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28944 Add torch.nn.GELU for GELU activation Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "GELU" Reviewed By: hl475, houseroad Differential Revision: D18240946 fbshipit-source-id: 6284b30def9bd4c12bf7fb2ed08b1b2f0310bb78	2019-11-03 21:55:05 -08:00
nuka137	a68c1e109e	C++ API: torch::nn::BatchNorm{2,3}d (#28936 ) Summary: Add torch::nn::BatchNorm{2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #28176 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28936 Differential Revision: D18274584 Pulled By: yf225 fbshipit-source-id: 3784eee9f8947f6c7c9f1699544a3d36a1a019b7	2019-11-01 17:50:33 -07:00
Pavel Belevich	4a94eaa60b	C++ API parity: PoissonNLLLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28755 Test Plan: Imported from OSS Differential Revision: D18202436 Pulled By: pbelevich fbshipit-source-id: a7a27d5f3cdbcbbd9bbbffa02b576609d5fdc9b3	2019-11-01 12:35:59 -07:00
Edward Yang	bbea34f283	Revert D18266918: C++ API: torch::nn::BatchNorm{2,3}d Test Plan: revert-hammer Differential Revision: D18266918 Original commit changeset: f432904c7298 fbshipit-source-id: 0e1c596b2e2f13b59082ff422c67ba025df4be07	2019-11-01 10:46:49 -07:00
nuka137	b7c5b3d398	C++ API: torch::nn::BatchNorm{2,3}d (#28936 ) Summary: Add torch::nn::BatchNorm{2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #28176 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28936 Differential Revision: D18266918 Pulled By: yf225 fbshipit-source-id: f432904c72985d52ec52cb992cceb372b6ff0244	2019-11-01 09:28:58 -07:00
Carlos Miranda	72b9bda9e5	Smooth L1 loss (#27661 ) Summary: In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `SmoothL1Loss` module and `smooth_l1_loss` functional. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27661 Differential Revision: D18002332 Pulled By: yf225 fbshipit-source-id: b382df8becb0de14986ec16ee0dc953d7b10e917	2019-10-31 23:41:35 -07:00
jokerkeny	aa30176c68	Add C++ API clip_grad_value_ for nn:utils (#28736 ) Summary: Adds C++ API clip_grad_value_ for torch::nn:utils module. Also, fix the for indent level error in the original test/test_nn.py. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28736 Differential Revision: D18263807 Pulled By: yf225 fbshipit-source-id: 29282450bd2099df16925e1d0edd3d933f6eeb9b	2019-10-31 19:11:54 -07:00
Will Feng	595209bddc	Fix bugs in torch::tensor constructor (#28523 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28523 New features: 1. Previously, `torch::tensor({true, false, true})` throws `"tensor_cpu" not implemented for 'Bool'`. After this PR, it produces the correct bool tensor, matching the Python API behavior. 2. Tensors with zero-size dimensions are now supported, e.g. `torch::tensor({{}, {}})` produces a tensor with sizes `{2, 0}`, matching the Python API behavior. BC-breaking bug fixes: 1. Previously, `torch::tensor({{1}, {2}})` produces a tensor of sizes `{2}`. After this PR, it produces a tensor of sizes `{2, 1}`, matching the Python API behavior. 2. Fixed semantics of `torch::tensor(1.1)`: it now returns a 0-dim tensor instead of a 1-dim tensor, matching the Python API behavior. 3. Previously, when passed a non-dtype `TensorOptions` to the `torch::tensor` constructor, it always produces a tensor of dtype `float`. After this PR, it produces tensor of different dtypes based on the dtype of the braced-init-list, matching the behavior of the no-options case. ```cpp // Previously: torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({{1, 2, 3}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({1., 2., 3.}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float // Now: torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/)).dtype() -> int torch::tensor({{1, 2, 3}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> int torch::tensor({1., 2., 3.}, torch::TensorOptions(/non-dtype-options/)).dtype() -> double torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> double // As comparison, currently: torch::tensor({1, 2, 3}).dtype() -> int torch::tensor({{1, 2, 3}}).dtype() -> int torch::tensor({1., 2., 3.}).dtype() -> double torch::tensor({{1., 2., 3.}}).dtype() -> double ``` Notes: 1. From now on, the behavior of `at::tensor(scalar_value)` (which produces a 1-dim tensor) would be different from `torch::tensor(scalar_value)` (which produces a 0-dim tensor). I will fix the behavior of `at::tensor(scalar_value)` in a follow-up PR. 2. From now on, the behavior of `at::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/))` (which produces a `float` tensor) would be different from `torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/))` (which produces a an `int` tensor). I will fix this behavior of `at::tensor` constructor in a follow-up PR. Context for the changes in this PR: The motivation comes from fixing the "`torch::tensor({{1}, {2}})` gives tensor of wrong sizes" bug - in order to fix it, I have to move the handling of `at::ArrayRef` and `std::vector` into `InitListTensor` (see below on why we need to do this) and renamed `InitListTensor` to `TensorDataContainer`. After such changes, support for bool values comes out of the box without extra effort, and support for tensors with zero-size dimensions only requires adding a default constructor for `TensorDataContainer`, so I added those two in this PR. For the semantic change of `torch::tensor(1.1)`, it's actually more effort to preserve the original wrong behavior (i.e. we need to check the sizes of the tensor converted from `TensorDataContainer` and reshape any scalar tensor to a 1-D tensor). I think preserving the original wrong behavior doesn't give us much value, and since the above changes naturally fix the problem, we should just start using the right behavior instead. For the "constructor with non-dtype options behavior" fix, the code looks simpler and easier to reason about with the fix, so I included it in this PR. -------- Why we need to move the handling of `at::ArrayRef` and `std::vector` into `TensorDataContainer`: `torch::tensor({{1}, {2}})` can match this function overload: `torch::tensor(at::ArrayRef<int> values)`, because `{1}` and `{2}` can be treated as a list-initialization of an `int` value. However, this will produce a Tensor with sizes `{2}`, but we actually want a Tensor with sizes `{2, 1}`. In order to avoid matching this function overload, we removed the function overload and moved the ability to convert `at::ArrayRef<T>` (and similarly `std::vector<T>`) into `TensorDataContainer`, and since for braced-init-list the `TensorDataContainer(std::initializer_list<TensorDataContainer>)` constructor is always preferred over all other constructors, it will take the `std::initializer_list` path, and all is good. Test Plan: Imported from OSS Differential Revision: D18234625 Pulled By: yf225 fbshipit-source-id: 0f3f6912e82e2117d2103e31b74e7e97baaa8693	2019-10-31 12:53:06 -07:00
Pavel Belevich	d6f1e49c4a	C++ API parity: CTCLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28654 Test Plan: Imported from OSS Differential Revision: D18202437 Pulled By: pbelevich fbshipit-source-id: a4b80a57e65da84f3988002a026c648fa52a0fde	2019-10-30 14:35:02 -07:00
jon-tow	1d3d9ec7d4	C++ API Parity: `functional::fold` and `Fold::pretty_print` (#28732 ) Summary: Adds `torch::nn::functional::fold` support and updates `Fold::pretty_print` in the C++ API for more thorough Python parity. Note: Small updates in source files to maintain consistency elsewhere. Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28732 Differential Revision: D18219955 Pulled By: yf225 fbshipit-source-id: fd2e9be8f17db77c1b1f384c0d2e16cc34858c0c	2019-10-30 11:37:39 -07:00
mansoorcheema	a465b033fd	Local response norm (#28759 ) Summary: Implemented LocalResponseNorm and some initial tests for modules and functional. Reference https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28759 Differential Revision: D18219745 Pulled By: yf225 fbshipit-source-id: e6aad568a8b1e81f54752decaefd4f9044029da9	2019-10-30 11:31:00 -07:00
Thomas Viehmann	2526f97464	Include hierarchy information in C++ API loading error messages (#28499 ) Summary: Before, we would only give the key we are looking for (i.e. typically just "No such serialized tensor 'weight'", no matter for which submodule we were looking for a weight. Now we error with "No such serialized tensor '0.conv1.weight'" or similar. The analogous information is added to missing module error messages. I threw in a test, and it saved me already... Pull Request resolved: https://github.com/pytorch/pytorch/pull/28499 Differential Revision: D18122442 Pulled By: yf225 fbshipit-source-id: a134b6d06ca33de984a11d6fea923244bcd9fb95	2019-10-30 08:41:37 -07:00
mrsalehi	dfe7b25eaf	Add nn::Flatten to C++ Frontend (#28072 ) Summary: Adds torch::nn::Flatten module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28072 Differential Revision: D18202778 Pulled By: yf225 fbshipit-source-id: 43345dcbdf2f50d75746bf9a0ba293b84df275ab	2019-10-29 17:52:47 -07:00
nuka137	cbc234bceb	C++ API: torch::nn::BatchNorm1d (#28176 ) Summary: Add torch::nn::BatchNorm1d function/module support for the C++ API. torch::nn::BatchNorm{2,3}d will be added after this PR is merged. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 I would like to discuss about below items. * Necessity of `num_batches_tracked` in `BatchNormImplBase` * `num_batches_tracked` is needed to calculate `momentum` when we do not feed `momentum` argument in Python API. But in C++ API, `momentum` argument has a default value. * `num_batches_tracked` is only used for counting up `BatchNorm1d::foward()` call. I think it is no necessary for user anymore. * The design of `BatchNorm{1,2,3}dOptions` * We have already `BatchNormOptions` used for deprecated `BatchNorm` module. However, it is hard to use it for `BatchNorm{1,2,3}dOptions` because of the arguments disagreement of each modules. * In this PR, I introduce `BatchNormOptionsv2` template class for the `BatchNorm{1,2,3}dOptions`. But I'm not sure this design is good or not. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28176 Differential Revision: D18196843 Pulled By: yf225 fbshipit-source-id: 667e2b5de4150d5776c41b9088c9e6c2ead24cd4	2019-10-29 17:29:42 -07:00
Will Feng	e33b4b6761	Use c10::variant-based enums for Reduction Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27942 Test Plan: Imported from OSS Differential Revision: D18202857 Pulled By: yf225 fbshipit-source-id: 0303ce2508e3b7665c6a91ae270a7d0ef0e45900	2019-10-29 14:15:48 -07:00
jon-tow	52dd587123	C++ API parity: Upsample (#28413 ) Summary: Adds `interpolate` functional and `Upsample` module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28413 Differential Revision: D18165014 Pulled By: yf225 fbshipit-source-id: ecae2f432a301b1f4afa7c038b2d104cbad139f2	2019-10-28 21:34:44 -07:00
Will Feng	5804e54c81	Deprecate torch::nn::modules_ordered_dict API (#28774 ) Summary: I finally found a way to get the following API to work for constructing a list of named submodules for `Sequential`: ```cpp Sequential sequential({ {"m1", MyModule(1)}, {"m2", MyModule(2)} })` ``` which was actually our original proposed design and much simpler than our current API: ```cpp Sequential sequential(modules_ordered_dict({ {"m1", MyModule(1)}, {"m2", MyModule(2)} })); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/28774 Differential Revision: D18174013 Pulled By: yf225 fbshipit-source-id: 3a18c2d36b6a65a07bee7346a7516780567c7774	2019-10-28 13:01:13 -07:00
nuka137	648749b203	C++ API: torch::nn::LPPool2d (#28492 ) Summary: Add torch::nn::LPPool2d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #27800 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28492 Differential Revision: D18109401 Pulled By: yf225 fbshipit-source-id: 5cedecb895d9d44c2167cdb3f6f758f3426b3497	2019-10-28 12:28:25 -07:00
anjali411	dc17a2ecc5	Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28433 Differential Revision: D18138240 Pulled By: anjali411 fbshipit-source-id: 314e5902f103be1feb4cacde47c90204b3d353cc	2019-10-25 11:44:28 -07:00
Will Feng	d04973beda	Use c10::variant-based enums for EmbeddingBag mode (#28330 ) Summary: This PR is BC-breaking in the following way: Previous, we require the use of `std::string` to specify the mode for `EmbeddingBag`. After this PR, we use variant-based enums such as `torch::kSum` / `torch::kMean` / `torch::kMax` to specify the mode for `EmbeddingBag`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28330 Differential Revision: D18127116 Pulled By: yf225 fbshipit-source-id: 15cd86c764777f4d399587be92cda15b6ce8524b	2019-10-24 17:47:42 -07:00
lsrock1	e885ce6130	C++ parity, grid_sample functional (#28354 ) Summary: https://github.com/pytorch/pytorch/issues/25883 I put grid_sample in vision.h with affine grid. I have a question in string argument(interpolation mode, padding mode) I reuse torch::native::detail::GridSamplerInterpolation in GridSampler.h instead of using string. It follows the way that uses reduction enum in loss functions. I am not sure this is right. yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28354 Differential Revision: D18109333 Pulled By: yf225 fbshipit-source-id: 1bf972b671b107464f73b937bbe0de76fb259fbf	2019-10-24 15:14:37 -07:00
Will Feng	92b39434a2	C++ nn::ConstantPad{1,2,3}d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28541 Test Plan: Imported from OSS Differential Revision: D18115607 Pulled By: yf225 fbshipit-source-id: 736df791ddc3cd30ad9af89eacfb4a0c6b53f2cd	2019-10-24 15:10:27 -07:00

1 2 3 4 5 ...

371 Commits