pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Thomas Viehmann	b5a1be02a0	Add RAII DetectAnomalyGuard (#47164 ) Summary: This is a followup to the C++ anomaly detection mode, implementing the guard. Pull Request resolved: https://github.com/pytorch/pytorch/pull/47164 Reviewed By: mruberry Differential Revision: D24682574 Pulled By: albanD fbshipit-source-id: b2224a56bf6eca0b90b8e10ec049cbcd5af9d108	2020-11-02 15:07:59 -08:00
Jeffrey Wan	f5073b0c5a	Add `inputs` argument to `autograd.backward()` (#46855 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/46373 As noted in https://github.com/pytorch/pytorch/issues/46373, there needs to be a flag passed into the engine that indicates whether it was executed through the backward api or grad api. Tentatively named the flag `accumulate_grad` since functionally, backward api accumulates grad into .grad while grad api captures the grad and returns it. Moving changes not necessary to the python api (cpp, torchscript) to a new PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/46855 Reviewed By: ngimel Differential Revision: D24649054 Pulled By: soulitzer fbshipit-source-id: 6925d5a67d583eeb781fc7cfaec807c410e1fc65	2020-11-02 14:32:38 -08:00
Thomas Viehmann	a81572cdc5	Add anomaly mode for C++ (#46981 ) Summary: This adds anomaly mode for C++. The backtrace isn't perfect yet, but it's a start. Pull Request resolved: https://github.com/pytorch/pytorch/pull/46981 Reviewed By: IvanKobzarev Differential Revision: D24631957 Pulled By: albanD fbshipit-source-id: 4b91e205e7e51f4cf0fbc651da5013a00a3b2497	2020-10-30 15:18:07 -07:00
Xinyu Li	c9bb990707	[c++] Distance-agnostic triplet margin loss (#45377 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45377 This PR adds a C++ implementation of the TripletMarginWithDistanceLoss, for which the Python implementation was introduced in PR #43680. It's based on PR #44072, but I'm resubmitting this to unlink it from Phabricator. Test Plan: Imported from OSS Reviewed By: izdeby Differential Revision: D24003973 fbshipit-source-id: 2d9ada7260a6f27425ff2fdbbf623dad0fb79405	2020-09-30 12:37:35 -07:00
Brian Hirsh	439930c81b	adding a beta parameter to the smooth_l1 loss fn (#44433 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44433 Not entirely sure why, but changing the type of beta from `float` to `double in autocast_mode.cpp and FunctionsManual.h fixes my compiler errors, failing instead at link time fixing some type errors, updated fn signature in a few more files removing my usage of Scalar, making beta a double everywhere instead Test Plan: Imported from OSS Reviewed By: mrshenli Differential Revision: D23636720 Pulled By: bdhirsh fbshipit-source-id: caea2a1f8dd72b3b5fd1d72dd886b2fcd690af6d	2020-09-25 16:36:28 -07:00
Peter Bell	da7863f46b	Add one dimensional FFTs to torch.fft namespace (#43011 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43011 Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D23751850 Pulled By: mruberry fbshipit-source-id: 8dc5fec75102d8809eeb85a3d347ba1b5de45b33	2020-09-19 23:32:22 -07:00
lixinyu	77cc7d1ecd	C++ APIs Transformer NN Module Top Layer (#44333 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44333 Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D23584010 Pulled By: glaringlee fbshipit-source-id: 990026e3f1b5ae276776e344ea981386cb7528fe	2020-09-11 08:25:27 -07:00
generatedunixname89002005287564@sandcastle1415.cln1.facebook.com	1dd658f28f	[Codemod][GleanFbcode] Remove dead includes in caffe2/test (#43953 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43953 Reviewed By: malfet Differential Revision: D23445556 fbshipit-source-id: 89cd6833aa06f35c5d3c99d698abb08cd61ae4ab	2020-09-01 21:48:28 -07:00
Vinod Kumar S	13c7c6227e	Python/C++ API Parity: TransformerDecoder (#42886 ) Summary: Fixes #{[37756](https://github.com/pytorch/pytorch/issues/37756)} Pull Request resolved: https://github.com/pytorch/pytorch/pull/42886 Reviewed By: zhangguanheng66 Differential Revision: D23385631 Pulled By: glaringlee fbshipit-source-id: 610a2fabb4c25b2dfd37b33287215bb8872d653d	2020-08-28 20:13:53 -07:00
Mike Ruberry	f4695203c2	Fixes fft function calls for C++ API (#43749 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/43732. Requires importing the fft namespace in the C++ API, just like the Python API does, to avoid clobbering torch::fft the function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/43749 Reviewed By: glaringlee Differential Revision: D23391544 Pulled By: mruberry fbshipit-source-id: d477d0b6d9a689d5c154ad6c31213a7d96fdf271	2020-08-28 12:41:30 -07:00
lixinyu	48e08f884e	C++ APIs TransformerEncoder (#43187 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43187 Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D23182770 Pulled By: glaringlee fbshipit-source-id: 968846138d4b1c391a74277216111dba8b72d683	2020-08-27 01:31:46 -07:00
lixinyu	e32d014f46	remove empty override pretty_print (#43341 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43341 This is to remove the empty pretty_print() since it overrides the impl within Module base which is not as designed here. Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D23244616 Pulled By: glaringlee fbshipit-source-id: 94b8dfd3697dfc450f53b3b4eee6e9c13cafba7b	2020-08-20 18:48:29 -07:00
lixinyu	269fdb5bb2	prepare to split transformer header file (#43069 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43069 The transformer c++ impl need to put TransformerEncoderLayer/DecoderLayer and TransformerEncoder/TransformerDecoder in different header since TransformerEncoder/Decoder's options class need TransformerEncoderLayer/DecoderLayer as input parameter. Split header files to avoid cycle includsion. Test Plan: Imported from OSS Reviewed By: yf225 Differential Revision: D23139437 Pulled By: glaringlee fbshipit-source-id: 3c752ed7702ba18a9742e4d47d049e62d2813de0	2020-08-17 07:54:05 -07:00
Heitor Schueroff de Souza	3d8c144400	Implemented torch::nn::Unflatten in libtorch (#42613 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/42613 Test Plan: Imported from OSS Reviewed By: glaringlee Differential Revision: D23030302 Pulled By: heitorschueroff fbshipit-source-id: 954f1cdfcbd3a62a7f0e887fcf5995ef27222a87	2020-08-14 15:32:13 -07:00
Vinod Kumar S	830423b80b	Python/C++ API Parity: TransformerDecoderLayer (#42717 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/37756 Pull Request resolved: https://github.com/pytorch/pytorch/pull/42717 Reviewed By: zhangguanheng66 Differential Revision: D23095841 Pulled By: glaringlee fbshipit-source-id: 327a5a23c9a3cca05e422666a6d7d802a7e8c468	2020-08-13 20:31:13 -07:00
Heitor Schueroff de Souza	ffc3da35f4	Don't materialize output grads (#41821 ) Summary: Added a new option in AutogradContext to tell autograd to not materialize output grad tensors, that is, don't expand undefined/None tensors into tensors full of zeros before passing them as input to the backward function. This PR is the second part that closes https://github.com/pytorch/pytorch/issues/41359. The first PR is https://github.com/pytorch/pytorch/pull/41490. Pull Request resolved: https://github.com/pytorch/pytorch/pull/41821 Reviewed By: albanD Differential Revision: D22693163 Pulled By: heitorschueroff fbshipit-source-id: a8d060405a17ab1280a8506a06a2bbd85cb86461	2020-08-11 04:27:07 -07:00
lixinyu	98de150381	C++ API TransformerEncoderLayer (#42633 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/42633 Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D22994332 Pulled By: glaringlee fbshipit-source-id: 873abdf887d135fb05bde560d695e2e8c992c946	2020-08-07 11:49:42 -07:00
Mike Ruberry	ccfce9d4a9	Adds fft namespace (#41911 ) Summary: This PR creates a new namespace, torch.fft (torch::fft) and puts a single function, fft, in it. This function is analogous to is a simplified version of NumPy's [numpy.fft.fft](https://numpy.org/doc/1.18/reference/generated/numpy.fft.fft.html?highlight=fft#numpy.fft.fft) that accepts no optional arguments. It is intended to demonstrate how to add and document functions in the namespace, and is not intended to deprecate the existing torch.fft function. Adding this namespace was complicated by the existence of the torch.fft function in Python. Creating a torch.fft Python module makes this name ambiguous: does it refer to a function or module? If the JIT didn't exist, a solution to this problem would have been to make torch.fft refer to a callable class that mimicked both the function and module. The JIT, however, cannot understand this pattern. As a workaround it's required to explicitly `import torch.fft` to access the torch.fft.fft function in Python: ``` import torch.fft t = torch.randn(128, dtype=torch.cdouble) torch.fft.fft(t) ``` See https://github.com/pytorch/pytorch/issues/42175 for future work. Another possible future PR is to get the JIT to understand torch.fft as a callable class so it need not be imported explicitly to be used. Pull Request resolved: https://github.com/pytorch/pytorch/pull/41911 Reviewed By: glaringlee Differential Revision: D22941894 Pulled By: mruberry fbshipit-source-id: c8e0b44cbe90d21e998ca3832cf3a533f28dbe8d	2020-08-06 00:20:50 -07:00
Kurt Mohler	df7c059428	Throw error if `torch.set_deterministic(True)` is called with nondeterministic CuBLAS config (#41377 ) Summary: For CUDA >= 10.2, the `CUBLAS_WORKSPACE_CONFIG` environment variable must be set to either `:4096:8` or `:16:8` to ensure deterministic CUDA stream usage. This PR adds some logic inside `torch.set_deterministic()` to raise an error if this environment variable is not set properly and CUDA >= 10.2. Issue https://github.com/pytorch/pytorch/issues/15359 Pull Request resolved: https://github.com/pytorch/pytorch/pull/41377 Reviewed By: malfet Differential Revision: D22758459 Pulled By: ezyang fbshipit-source-id: 4b96f1e9abf85d94ba79140fd927bbd0c05c4522	2020-08-05 12:42:24 -07:00
Yujun Zhao	0444bac940	Add test to cross function Summary: function `cross_kernel_scalar` is not covered in `Aten/native/cpu/CrossKernel.cpp`, add tests to cover it Test Plan: 1. Test locally to check new lines are covered 2. CI https://pxl.cl/1fZjG Reviewed By: malfet Differential Revision: D22834122 fbshipit-source-id: 0d50f3a3e6aee52cb6fdee2b9f5883f542c7b6e2	2020-07-29 22:48:52 -07:00
Yujun Zhao	9ea7476d9c	Add test to lerp function (#42266 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/42266 function `lerp_kernel_scalar` and `lerp_kernel_tensor` are not covered in `Aten/native/cpu/LerpKernel.cpp`, add tests to cover them Test Plan: 1. Test locally to check new lines are covered 2. CI https://pxl.cl/1fXPd Reviewed By: malfet Differential Revision: D22832164 fbshipit-source-id: b1eaabbf8bfa08b4dedc1a468abfdfb619a50e3c	2020-07-29 22:47:37 -07:00
lixinyu	5246bc4e87	register parameters correctly in c++ MultiheadAttention (#42037 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/42037 This is to fix #41951 Test Plan: Imported from OSS Reviewed By: yf225 Differential Revision: D22764717 Pulled By: glaringlee fbshipit-source-id: e6da0aeb05a2356f52446e6d5fad391f2cd1cf6f	2020-07-27 13:58:11 -07:00
Heitor Schueroff de Souza	cf811d2fb3	retain undefined tensors in backward pass (#41490 ) Summary: Leave undefined tensors / None returned from custom backward functions as undefined/None instead of creating a tensor full of zeros. This change improves performance in some cases. This is BC-Breaking: Custom backward functions that return None will now see it potentially being propagated all the way up to AccumulateGrad nodes. Potential impact is that .grad field of leaf tensors as well as the result of autograd.grad may be undefined/None where it used to be a tensor full of zeros. Also, autograd.grad may raise an error, if so, consider using allow_unused=True ([see doc](https://pytorch.org/docs/stable/autograd.html?highlight=autograd%20grad#torch.autograd.grad)) if it applies to your case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/41490 Reviewed By: albanD Differential Revision: D22578241 Pulled By: heitorschueroff fbshipit-source-id: f4966f4cb520069294f8c5c1691eeea799cc0abe	2020-07-17 12:42:50 -07:00
albanD	45c5bac870	[WIP] Fix cpp grad accessor API (#40887 ) Summary: Update the API to access grad in cpp to avoid unexpected thread safety issues. In particular, with the current API, a check like `t.grad().defined()` is not thread safe. - This introduces `t.mutable_grad()` that should be used when getting a mutable version of the saved gradient. This function is not thread safe. - The `Tensor& grad()` API is now removed. We could not do a deprecation cycle as most of our call side use non-const Tensors that use the non-const overload. This would lead to most calls hitting the warning. This would be too verbose for all the users. Pull Request resolved: https://github.com/pytorch/pytorch/pull/40887 Reviewed By: ezyang Differential Revision: D22343932 Pulled By: albanD fbshipit-source-id: d5eb909bb743bc20caaf2098196e18ca4110c5d2	2020-07-16 09:11:12 -07:00
yyn19951228	98df9781a7	Impl for ParameterList (#41259 ) Summary: This is a new PR for https://github.com/pytorch/pytorch/issues/40850, https://github.com/pytorch/pytorch/issues/40987 and https://github.com/pytorch/pytorch/issues/41206(I unintentionally closed), as I have some issues for rebates for that one. Very sorry about that. And I have fixed the tests failed in that PR. This diff contains the implementation of C++ API for ParameterList from https://github.com/pytorch/pytorch/issues/25883. Refer to the Python API: `bc9e8af218/torch/nn/modules/container.py (L376)` Not sure about some naming difference between C++ API and Python API, like `append`, should it be called `push_back` Pull Request resolved: https://github.com/pytorch/pytorch/pull/41259 Test Plan: Add unit tests in this diff Differential Revision: D22495780 Pulled By: glaringlee fbshipit-source-id: 79ea3592db640f35477d445ecdaeafbdad814bec	2020-07-12 20:50:31 -07:00
Sebastian Messmer	9daba76ba1	Change to.dtype_layout to c10-full (#41169 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/41169 - ghstack-source-id: 107537240 Test Plan: waitforsandcastle Differential Revision: D22289257 fbshipit-source-id: ed3cc06327951fa886eb3b8f1c8bcc014ae2bc41	2020-07-10 16:04:34 -07:00
yyn19951228	4121d34036	Python/C++ API Parity: Add impl and tests for ParameterDict (#40654 ) Summary: This diff contains the implementation of C++ api for ParameterDict from https://github.com/pytorch/pytorch/issues/25883, refer to https://github.com/pytorch/pytorch/issues/36904 and https://github.com/pytorch/pytorch/issues/28652 Pull Request resolved: https://github.com/pytorch/pytorch/pull/40654 Test Plan: Add unit test in this diff Differential Revision: D22273265 Pulled By: glaringlee fbshipit-source-id: 9134a92c95eacdd53d5b24470d5f7edbeb40a488	2020-06-29 08:50:44 -07:00
Peter Bell	3dcc329746	Use tree-based sum for floats to avoid numerical instability (#39516 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/38716, fixes https://github.com/pytorch/pytorch/issues/37234 This algorithm does the summation along a single axis with multiple "levels" of accumulator, each of which is designed to hold the sum of an order of magnitude more values than the previous. e.g. if there are 2^16 elements, the first level will hold the sum of 2^4 elements, and so on in increasing powers of 2: 2^4, 2^8, 2^12 and finally 2^16. This limits the differences in magnitude of the partial results being added together, and so we don't lose accuracy as the axis length increases. WIP to write a vectorized version. Pull Request resolved: https://github.com/pytorch/pytorch/pull/39516 Reviewed By: ezyang Differential Revision: D22106251 Pulled By: ngimel fbshipit-source-id: b56de4773292439dbda62b91f44ff37715850ae9	2020-06-24 17:06:38 -07:00
Peter Bell	16f276cef9	Add C++-only `int dim` overloads to `std`-related operations (#40451 ) Summary: Fixes gh-40287 The `int -> bool` conversion takes higher precedence than `int -> IntArrayRef`. So, calling `std(0)` in C++ would select the `std(unbiased=False)` overload instead. Pull Request resolved: https://github.com/pytorch/pytorch/pull/40451 Differential Revision: D22217926 Pulled By: ezyang fbshipit-source-id: 7520792fab5ab6665bddd03b6f57444c6c729af4	2020-06-24 16:56:55 -07:00
Mike Ruberry	cb26661fe4	Throws runtime error when torch.full would infer a float dtype from a bool or integral fill value (#40364 ) Summary: BC-breaking NOTE: In PyTorch 1.6 bool and integral fill values given to torch.full must set the dtype our out keyword arguments. In prior versions of PyTorch these fill values would return float tensors by default, but in PyTorch 1.7 they will return a bool or long tensor, respectively. The documentation for torch.full has been updated to reflect this. PR NOTE: This PR causes torch.full to throw a runtime error when it would have inferred a float dtype by being given a boolean or integer value. A versioned symbol for torch.full is added to preserve the behavior of already serialized Torchscript programs. Existing tests for this behavior being deprecated have been updated to reflect it now being unsupported, and a couple new tests have been added to validate the versioned symbol behavior. The documentation of torch.full has also been updated to reflect this change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/40364 Differential Revision: D22176640 Pulled By: mruberry fbshipit-source-id: b20158ebbcb4f6bf269d05a688bcf4f6c853a965	2020-06-23 23:27:22 -07:00
Xiang Gao	954a59a2f5	Add at::tensor(complex) and torch::tensor(complex) overload (#39793 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39793 Differential Revision: D22067181 Pulled By: anjali411 fbshipit-source-id: 3cec1289a8aa3a9cc6bd1fcdb2974f858f75f7bd	2020-06-18 16:20:27 -07:00
Sotiris Lamprinidis	41f2dbde31	Add `AdamW` to C++ frontend (#40009 ) Summary: Slightly modified Adam, following the python implementation, and the `ProducesPyTorchValues` tests pass. I had a problem with another test though (see commit c1a6241676ab84fc531c1c3a10f964aa5704092e), it seems that optimizing for two steps with the same optimizer vs optimizing for two steps using freshly initialized objects will produce the same output. Pull Request resolved: https://github.com/pytorch/pytorch/pull/40009 Differential Revision: D22096053 Pulled By: glaringlee fbshipit-source-id: a31a8f5488cb37c53752ddf15436efabdba67dc4	2020-06-18 15:28:12 -07:00
Kurt Mohler	124cdf2290	Add experimental deterministic flag (#38683 ) Summary: Adds `torch.experimental.deterministic` flag to enforce deterministic algorithms across all of pytorch. Adds `torch.experimental.deterministic_error_level` to allow users to choose between error/warning/silent if determinism for an operation is not available. Adds `torch.experimental.alert_not_deterministic()` which should be called within operations that are not deterministic. Offers both Python and ATen interfaces Issue https://github.com/pytorch/pytorch/issues/15359 Pull Request resolved: https://github.com/pytorch/pytorch/pull/38683 Differential Revision: D21998093 Pulled By: ezyang fbshipit-source-id: 23aabbddd20f6199d846f97764ff24d728163737	2020-06-12 08:44:06 -07:00
Nikita Shulga	c6e9e9359f	[Codemod][GleanFbcode] Remove dead includes in caffe2/test (#39023 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39023 Reviewed By: orionr Differential Revision: D21702529 fbshipit-source-id: 6945bba95609102409850b105a8a091e33b8acc9	2020-05-27 14:07:26 -07:00
Jeremy Lilley	468a9d448e	[aten] Pass std::function<> to thread_pool by value, instead of const ref. (#37681 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37681 By passing by value, we can std::move, and avoid unnecessarily copying args that are part of any std::function/lambda state (e.g. in the jit interpreter, there is a std::vector<> stack passed in the InterpreterContinuation) This makes the api also consistent with e.g. folly and best practices. Added a minor at::launch() benchmark to test/cpp/, the difference is mostly noticeable when copying the std::function<> internal args is non-trivial. Benchmarks pre/post (min over ~5 runs) NoData: 5.81 us -> 5.63 us (-3.2%) WithData(0): 6.67 us -> 5.88 us (-11.8%) WithData(4): 6.98 us -> 6.51 us (-6.7%) WithData(256): 9.44 us -> 7.89 (-16.5%) ghstack-source-id: 103322321 Test Plan: - perf: buck run mode/opt caffe2/test/cpp/api:parallel_benchmark pre/post - correctness buck test mode/dev-nosan caffe2/test/... Reviewed By: dzhulgakov Differential Revision: D21355148 fbshipit-source-id: 3567e730845106f1991091e4a892d093e00571c3	2020-05-05 08:41:38 -07:00
Nikita Shulga	c0ff085775	[PyTorch] Modify `data_parallel` to work with small tensors (#37704 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37704 If input tensor can not be chunked, run `parallel_apply` on fewer devices Modfy input tensor dimention in `DataParallelUsesAllAvailableCUDADevices_CUDA` to be chunkable by any number of available CUDA devices Test Plan: Run `test/cpp/api/parallel` on machine with 6 GPUs Differential Revision: D21365416 fbshipit-source-id: 60fdfed4a0e6256b2c966c2ea3e8d0bfb298d9a8	2020-05-04 11:06:42 -07:00
Mike Ruberry	b64fc3c4b5	Changes warnings generated in cpp to show point of Python origination (#36052 ) Summary: Today in PyTorch, warnings triggered in C++ are printed to Python users like this: `../aten/src/ATen/native/BinaryOps.cpp:81: UserWarning: Integer division of tensors using div or / is deprecated, and in a future release div will perform true division as in Python 3. Use true_divide or floor_divide (// in Python) instead.` This may be unhelpful to Python users, who have complained it's difficult to relate these messages back to their programs. After this PR, warnings that go through the PyWarningHandler and allow it to add context print like this: ``` test/test_torch.py:16463: UserWarning: Integer division of tensors using div or / is deprecated, and in a future release div will perform true division as in Python 3. Use true_divide or floor_divide (// in Python) instead. (Triggered internally at ../aten/src/ATen/native/BinaryOps.cpp:81.) cpu_result = getattr(cpu_tensor, op_str)(*cpu_args) ``` This relates the warning back to the user's program. The information about the cpp file and line number is preserved in the body of the warning message. Some warnings, like those generated in the JIT, already account for a user's Python context, and so they specify that they should be printed verbatim and are unaffected by this change. Warnings originating in Python and warnings that go through c10's warning handler, which prints to cerr, are also unaffected. A test is added to test_torch.py for this behavior. The test relies on uint8 indexing being deprecated and its warning originating from its current header file, which is an unfortunate dependency. We could implement a `torch.warn` function, instead. Pull Request resolved: https://github.com/pytorch/pytorch/pull/36052 Differential Revision: D20887740 Pulled By: mruberry fbshipit-source-id: d3515c6658a387acb7fccaf83f23dbb452f02847	2020-04-25 21:18:58 -07:00
anjali411	6e92579883	Added autograd support for C->C functions and enabled requires_grad=True for complex (#36932 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36932 Differential Revision: D21181230 Pulled By: anjali411 fbshipit-source-id: 295f2cd1e2b9918a8b2cb88cab0536b2407dc455	2020-04-24 12:30:49 -07:00
Dmytro Dzhulgakov	50a1850d8d	[pytorch] Route default warning sync to LOG(WARNING) - second try (#36984 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36984 Follow LOG(WARNING) format for c++ side warnings in order to play well with larger services, especially when using glog. I need to hook up into GLOG internals a bit in order to override FILE/LINE without having to change the whole thing to be macros, but it seems to be stable between glog versions. Note, this also changes caffe2_log_level to warning by default - I think it's a much better default when compiling without glog (or maybe even have info). With glog output, stderr capture doesn't work any more in tests. That's why we instead use c10-level warnings capture. Test Plan: Run unittest in both glog and non-glog build mode: glog: ``` W0416 12:06:49.778215 3311666 exception_test.cpp:23] Warning: I'm a warning (function TestBody) ``` no-glog: ``` [W exception_test.cpp:23] Warning: I'm a warning (function TestBody) ``` Reviewed By: ilia-cher Differential Revision: D21151351 fbshipit-source-id: fa926d9e480db5ff696990dad3d80f79ef79f24a	2020-04-23 01:08:00 -07:00
Wanchao Liang	6d4c509168	[autograd] lower MAX_DEPTH limit according to TSAN limit (#36745 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36745 As we hold a mutex for our custom C++ Node, when calling reentrant backward from custom C++ function, we will cocurrently holding many mutexes up to MAX_DEPTH. TSAN only allow 65 mutexes at once, otherwise it will complain. This PR lower the limit according to TSAN. TSAN Reference: https://github.com/google/sanitizers/issues/950 Test Plan: Imported from OSS Differential Revision: D21072604 Pulled By: wanchaol fbshipit-source-id: 99cd1acab41a203d834fa4947f4e6f0ffd2e70f2	2020-04-16 20:43:20 -07:00
Michael Ranieri	3567b881a5	make sure dispatch test works on windows (#36729 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36729 setenv not available on windows Test Plan: CI green in ovrsource Reviewed By: stepancheg Differential Revision: D21067835 fbshipit-source-id: ddbc3285ef88f123dc6a200b661c48cfafc6bf00	2020-04-16 11:36:56 -07:00
Will Feng (FAIAR)	5fab1bf3e4	Use `std::abs` instead of `abs` in lbfgs.cpp (#35974 ) Summary: This supersedes https://github.com/pytorch/pytorch/pull/35698. `abs` is a C-style function that takes only integral argument `std::abs` is polymorphic and can be applied to both integral and floating point types This PR also increases `kBatchSize` in `test_optimizer_xor` function in `test/cpp/api/optim.cpp` to fix `OptimTest.XORConvergence_LBFGS` failure under ASAN. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35974 Test Plan: CI Reviewed By: pbelevich Differential Revision: D20853570 Pulled By: yf225 fbshipit-source-id: 6135588df2426c5b974e4e097b416955d1907bd4	2020-04-04 09:37:21 -07:00
Ashkan Aliabadi	b7f4b6a6de	Support for XNNPACK max pooling operator. (#35354 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35354 Differential Revision: D20821862 Test Plan: Imported from OSS Pulled By: AshkanAliabadi fbshipit-source-id: 156fb8db85ab194919f68fd99599f08f2647b695	2020-04-03 22:53:15 -07:00
Ilia Cherniavskii	a604041a11	Back out "[pytorch][PR] indexing: throw exception for masks with dtype=uint8" (#36013 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36013 Original commit changeset: f4ebaabf427d Test Plan: CI Differential Revision: D20853694 fbshipit-source-id: 93deb43f67a385ddfd6853fef6f1dc6de408ec37	2020-04-03 21:40:02 -07:00
Pavel Belevich	4b64dffcb6	Move uniform_() to DistributionTemplates(Migrate uniform_ from TH to ATen) (#35580 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35580 `uniform_kernel_cpu` is based on https://github.com/pytorch/pytorch/pull/30954 Test Plan: Imported from OSS Differential Revision: D20820221 Pulled By: pbelevich fbshipit-source-id: 13f9fc8fc75b0e9fb48021f2ac08dcb38212a53f	2020-04-03 16:37:44 -07:00
Wojciech Baranowski	2f84a07b58	indexing: throw exception for masks with dtype=uint8 (#34418 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/33751 Pull Request resolved: https://github.com/pytorch/pytorch/pull/34418 Differential Revision: D20776164 Pulled By: ngimel fbshipit-source-id: f4ebaabf427d7967f2f317235562f91c8f9216f0	2020-03-31 20:51:56 -07:00
Nikita Shulga	b9adbb5002	Fix/relax CMake linter rules (#35574 ) Summary: Ignore mixed upper-case/lower-case style for now Fix space between function and its arguments violation Pull Request resolved: https://github.com/pytorch/pytorch/pull/35574 Test Plan: CI Differential Revision: D20712969 Pulled By: malfet fbshipit-source-id: 0012d430aed916b4518599a0b535e82d15721f78	2020-03-27 16:52:33 -07:00
anjali411	5371fdb1a0	[C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer (#34957 ) Summary: 1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer) 2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function. 3. BC-compatibility serialization test for LBFGS 4. Removed mentions of parameters_ in optimizer.cpp, de-virtualize all functions 5. Made defaults_ optional argument in all optimizers except SGD TODO: add BC-breaking notes for this PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20678162 Pulled By: yf225 fbshipit-source-id: 74e062e42d86dc118f0fbaddd794e438b2eaf35a	2020-03-26 19:53:02 -07:00
Edward Yang	843fd740fb	Revert D20645945: [pytorch][PR] [C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer Test Plan: revert-hammer Differential Revision: D20645945 Original commit changeset: 383588065bf1 fbshipit-source-id: 6d7bc5676de64e329d9862889f32033c76b4009c	2020-03-26 06:40:34 -07:00
anjali411	efbd6b8533	[C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer (#34957 ) Summary: 1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer) 2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function. 3. BC-compatibility serialization test for LBFGS 4. Removed mentions of parameters_ in optimizer.cpp, de-virtualize all functions 5. Made defaults_ optional argument in all optimizers except SGD TODO: add BC-breaking notes for this PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957 Differential Revision: D20645945 Pulled By: yf225 fbshipit-source-id: 383588065bf1859b38f0ad0a25d93d41e153c96e	2020-03-25 18:26:02 -07:00
Will Feng	cfc0ff1691	Renaming: MultiLabelMarginLossFuncOptions -> MultilabelMarginLossFuncOptions, MultiLabelSoftMarginLossFuncOptions -> MultilabelSoftMarginLossFuncOptions (#35163 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35163 This PR is BC-breaking in the following way: Renaming: - `torch::nn::functional::MultiLabelMarginLossFuncOptions` -> `torch::nn::functional::MultilabelMarginLossFuncOptions` - `torch::nn::functional::MultiLabelSoftMarginLossFuncOptions` -> `torch::nn::functional::MultilabelSoftMarginLossFuncOptions` Reason for renaming: to be consistent with the corresponding functional name after camel case to snake case conversion (e.g. the `multilabel_margin_loss` functional should use `MultilabelMarginLossFuncOptions` as options) Test Plan: Imported from OSS Differential Revision: D20582598 Pulled By: yf225 fbshipit-source-id: 0f5bdb8249d901b310875a14320449a2fdfa8ecd	2020-03-21 18:34:46 -07:00
Will Feng	bbec4520c6	Add inplace tests for several torch::nn modules / functionals (#35147 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35147 Test Plan: Imported from OSS Differential Revision: D20578217 Pulled By: yf225 fbshipit-source-id: b8bafa49ee94c7dfbbca6e100ee3d9df5b2b621c	2020-03-21 10:02:56 -07:00
Will Feng	a2557970f3	Fix F::interpolate and torch::nn::Upsample implementation (#35025 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35025 This PR fixes `F::interpolate` and `torch::nn::Upsample` implementation to match the Python API implementation. This PR is BC-breaking in the following way: There are changes to `UpsampleOptions` and `InterpolateFuncOptions`: - `size` is changed from `std::vector<int64_t>` to `c10::optional<std::vector<int64_t>>`. If you want to pass a list of `int64_t` to this argument, you must pass it as `std::vector<int64_t>`. - `scale_factor` is changed from `std::vector<double>` to `c10::optional<std::vector<double>>`. If you want to pass a list of `double` to this argument, you must pass it as `std::vector<double>`. TODO: cherry-pick this PR into v1.5 release branch. Test Plan: Imported from OSS Differential Revision: D20559892 Pulled By: yf225 fbshipit-source-id: ac18609e351a9f2931eaeced8966b9491b2995f7	2020-03-20 22:37:13 -07:00
Will Feng	d7462dcea6	Fix AdaptiveAvgPool{2,3}d and AdaptiveMaxPool{2,3}d implementation (#35022 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35022 This PR fixes `AdaptiveAvgPool{2,3}d` and `AdaptiveMaxPool{2,3}d` implementation to match the Python API implementation. Particularly, `output_size` is changed to accept `c10::nullopt` in its elements, matching the Python API behavior. TODO: cherry-pick this PR into v1.5 release branch. Test Plan: Imported from OSS Differential Revision: D20559890 Pulled By: yf225 fbshipit-source-id: ccddbd278dd39165cf1dda11fc0e49387c76dbef	2020-03-20 22:36:57 -07:00
anjali411	781f590f33	[C++ API Parity] Add xor_convergence test for lbfgs (#35001 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35001 Differential Revision: D20548983 Pulled By: anjali411 fbshipit-source-id: 1f858635d0680c0109d1ef348b7df4d3844fe0a6	2020-03-20 06:57:24 -07:00
Edward Yang	7c06b86e42	Revert D20518647: [pytorch][PR] [C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer Test Plan: revert-hammer Differential Revision: D20518647 Original commit changeset: 4760d1d29df1 fbshipit-source-id: b84f1a06c2de27e147716279223a6844ef89f760	2020-03-19 07:53:43 -07:00
Natalia Gimelshein	be82e554fe	Revert D20524479: [pytorch][PR] [C++ API Parity] Add xor_convergence test for lbfgs Test Plan: revert-hammer Differential Revision: D20524479 Original commit changeset: 3413779676ab fbshipit-source-id: ef8007ed6c184bc8b8751eb713aac2a891260048	2020-03-18 21:56:17 -07:00
anjali411	b8e043abca	[C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer (#34957 ) Summary: 1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer) 2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function. 3. BC-compatibility serialization test for LBFGS 4. Removed mentions of parameters_ in optimizer.cpp, de-virtualize all functions 5. Made defaults_ optional argument in all optimizers except SGD Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20518647 Pulled By: anjali411 fbshipit-source-id: 4760d1d29df1784e2d01e2a476d2a08e9df4ea1c	2020-03-18 17:28:57 -07:00
anjali411	4521477f83	[C++ API Parity] Add xor_convergence test for lbfgs (#35001 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35001 Differential Revision: D20524479 Pulled By: anjali411 fbshipit-source-id: 3413779676ab95c1ee82298f95d3441a89873107	2020-03-18 17:06:53 -07:00
anjali411	d7e4a379a0	[C++ API Parity] LBFGS optimizer step() update and added closure to the Optimizer step() function (#34564 ) Summary: Follow-ups after this PR: * Remove `LossClosureOptimizer`, and merge `Optimizer` into `OptimizerBase` (and rename the merged class to Optimizer) * Merge the LBFGS-specific serialize test function and the generic `test_serialize_optimizer` function, possibly by passing a bool `has_only_global_state` flag into the `test_serialize_optimizer` function to denote whether `size()` should be equal to 1 or 2? * https://github.com/pytorch/pytorch/pull/34564#discussion_r393780303 * It seems that we don't have the equivalent `XORConvergence_LBFGS` test like the other optimizers, and it would be good to add one * Remove mentions of `parameters_` in optimizer.cpp, de-virtualize all functions, and remove the `OptimizerBase(std::vector<Tensor> parameters)` constructor from `OptimizerBase` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34564 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20495701 Pulled By: anjali411 fbshipit-source-id: 6d35286d2decb6f7dff93d9d3e57515770666622	2020-03-17 22:27:24 -07:00
anjali411	762be86e63	[C++ API Parity] [Optimizers] added closure to optimizers (#34790 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34790 Differential Revision: D20468361 Pulled By: anjali411 fbshipit-source-id: 1c6115d735b211dc2bedf002d58931cb32cf657a	2020-03-16 07:51:44 -07:00
Will Feng	bdd7dbfd4b	[C++ API] RNN / GRU / LSTM layer refactoring (#34322 ) Summary: This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API. BC-breaking changes: - Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API. - RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API. - RNN / LSTM / GRU layers now have `forward_with_packed_input` method which accepts `PackedSequence` as input and optionally hidden state, matching the `forward(PackedSequence, ...)` variant in Python API. - RNN / LSTM / GRU layers no longer have these fields: `w_ih` / `w_hh` / `b_ih` / `b_hh`. Instead, to access the weights and biases of the gates, users should do e.g. `rnn->named_parameters()["weight_ih_l0"]`, which mirrors the Python API `rnn.weight_ih_l0`. - In `RNNOptions` - `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `LSTMOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `GRUOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflected the revised API design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322 Differential Revision: D20458302 Pulled By: yf225 fbshipit-source-id: ffff2ae1ddb1c742c966956f6ad4d7fba03dc54d	2020-03-15 17:48:29 -07:00
Will Feng	6c555e1508	Revert D20311699: [pytorch][PR] [C++ API] RNN / GRU / LSTM layer refactoring Test Plan: revert-hammer Differential Revision: D20311699 Original commit changeset: e2b60fc7bac6 fbshipit-source-id: 72f4a762189490998d6b716857eeac053a11742d	2020-03-14 16:18:48 -07:00
Will Feng	e23a9dc140	[C++ API] RNN / GRU / LSTM layer refactoring (#34322 ) Summary: This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API. BC-breaking changes: - Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API. - RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API. - RNN / LSTM / GRU now has `forward_with_packed_input` method which accepts `PackedSequence` as input and optionally hidden state, matching the `forward(PackedSequence, ...)` variant in Python API. - In `RNNOptions` - `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `LSTMOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `GRUOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflected the revised API design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322 Differential Revision: D20311699 Pulled By: yf225 fbshipit-source-id: e2b60fc7bac64367a8434647d74c08568a7b28f7	2020-03-14 12:09:04 -07:00
Will Feng	d041d0784e	[C++ API] RNNCell / LSTMCell / GRUCell layers (#34400 ) Summary: This PR adds `RNNCell` / `LSTMCell` / `GRUCell` layers to the C++ frontend, with implementations exactly matching the Python API equivalent. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34400 Differential Revision: D20316859 Pulled By: yf225 fbshipit-source-id: bb7cee092622334043c0d0fd0fcb4e75e707699c	2020-03-13 21:52:24 -07:00
Will Feng	a54416d208	[C++ API] Remove deprecated torch::nn::BatchNorm / FeatureDropout / modules_ordered_dict and torch::nn::init::Nonlinearity / FanMode (#34508 ) Summary: This PR is BC-breaking in the following way: - The deprecated `torch::nn::BatchNorm` is removed in favor of `torch::nn::BatchNorm{1,2,3}d` - The deprecated `torch::nn::FeatureDropout` is removed in favor of `torch::nn::Dropout{2,3}d` - The deprecated `torch::nn::modules_ordered_dict` is removed. User should do `Sequential sequential({{"m1", MyModule(1)}, {"m2", MyModule(2)}})` instead. - The deprecated `torch::nn::init::Nonlinearity` is removed, in favor of the following enums: - `torch::kLinear` - `torch::kConv1D` - `torch::kConv2D` - `torch::kConv3D` - `torch::kConvTranspose1D` - `torch::kConvTranspose2D` - `torch::kConvTranspose3D` - `torch::kSigmoid` - `torch::kTanh` - `torch::kReLU` - `torch::kLeakyReLU` - The deprecated `torch::nn::init::FanMode` is removed, in favor of the following enums: - `torch::kFanIn` - `torch::kFanOut` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34508 Differential Revision: D20351601 Pulled By: yf225 fbshipit-source-id: cca0cd112f29a31bb023e348ca8f82780e42bea3	2020-03-12 10:09:58 -07:00
Mansoor	e95657b87e	[C++ API] AdaptiveLogSoftmaxWithLoss (#29076 ) Summary: Implemented AdaptiveLogSoftmaxWithLoss and some tests for modules. Reference https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/29076 Differential Revision: D20404588 Pulled By: yf225 fbshipit-source-id: edbadf432b8173cbcc6caf83c9c03dd92dc31a37	2020-03-12 09:53:58 -07:00
Lingyi Liu	09296c34a4	Add the build for runtime dispatch for AVX, AVX2 instruction set (#26125 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26125 We already had some optimization implementation using AVX2 for improve the quantized kernel performance. In this diff, we want to enable the runtime dispatch. Test Plan: Sandcastle build and test Also test with a python binary calling into vectorized op. torch.__config__.show() PyTorch built with: - GCC 4.2 - clang 8.0.20181009 - Intel(R) Math Kernel Library Version 2017.0.3 Product Build 20170413 for Intel(R) 64 architecture applications - Intel(R) MKL-DNN v0.18.1 (Git Hash N/A) - OpenMP 1 - CPU capability usage: AVX2 - Build settings: Reviewed By: jamesr66a Differential Revision: D17337251 fbshipit-source-id: 8e22d10011a12a4eaf54cea3485353eb1811d828	2020-03-10 15:32:57 -07:00
anjali411	2d24005d18	[C++ API Parity] rmsprop optimizer update (#33450 ) Summary: This PR is BC-breaking in the following way: In RMSpropOptions: 1. learning_rate is renamed to lr. Test plan before 1.5 release: Test that in 1.5 we can load a C++ RMSprop optimizer that was serialized in 1.4, and their states are the same. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33450 Differential Revision: D20366623 Pulled By: anjali411 fbshipit-source-id: 83250be9b583a766927e0e22a4de8b0765379451	2020-03-10 13:30:56 -07:00
Will Feng	baeb359e7a	Remove `using namespace torch::autograd` from header files (#34423 ) Summary: This PR prevents leaking symbols from `torch::autograd` namespace to the root namespace. Fixes https://github.com/pytorch/pytorch/issues/34371. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34423 Differential Revision: D20338404 Pulled By: yf225 fbshipit-source-id: e7ff3348193667a0cee5d38f9a003ae36cc704ca	2020-03-09 10:31:21 -07:00
Will Feng	739d4609c3	[C++ API] Fix ModuleList compile error: error: 'begin' was not declared in this scope (#34463 ) Summary: One example in the current docs for `torch::nn::ModuleList` doesn't compile, and this PR fixes it. Fixes https://github.com/pytorch/pytorch/issues/32414. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34463 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20331120 Pulled By: yf225 fbshipit-source-id: 50bb078fe1a900c9114d5434e92dc40ee13b52bf	2020-03-09 08:15:50 -07:00
Will Feng	415595ace4	[C++ API] Remove init-list form of at::indexing::Slice (#34255 ) Summary: The init-list form of `at::indexing::Slice` (i.e. `tensor.index({{1, None, 2}, ...})` instead of `tensor.index({Slice(1, None, 2), ...})`) in C++ API can be easily confused with the list-form indexing in Python API (e.g. `tensor[[1, 3, 2], ...]`), which is not good from readability perspective. This PR removes the init-list form of `at::indexing::Slice` to make the API less confusing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34255 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20290166 Pulled By: yf225 fbshipit-source-id: abbcbeca0b179219e5e1f196a33ef8aec87ebb76	2020-03-06 05:51:53 -08:00
meganset	b8fd88319a	C++ make torch::nn::Sequential push_back(AnyModule) methods public (#34208 ) Summary: Issue https://github.com/pytorch/pytorch/issues/33192 Moves Sequential::push_back methods with AnyModule from private -> public Allows adding an existing AnyModule via something like: ``` torch::nn::Sequential q; auto a=torch::nn::AnyModule(torch::nn::Linear(1,2)); q->push_back(a); q->push_back("fc",a); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34208 Differential Revision: D20300278 Pulled By: yf225 fbshipit-source-id: 4525319bb7fb6667e43a006c9f446a2193781005	2020-03-06 05:47:14 -08:00
anjali411	76035f050b	[C++ API Parity] Adam: updated step and class design (#33730 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33730 Differential Revision: D20292073 Pulled By: anjali411 fbshipit-source-id: a7b4a70f29027ab355aebb91873ea55d5cb51783	2020-03-05 19:15:24 -08:00
Wanchao Liang	f909b5535e	[autograd] fix allow_unused checking for C++ API (#34035 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34035 Bug for the conditon check in https://github.com/pytorch/pytorch/pull/24342, realized we don't have tests in either python or cpp to catch this, so added testes for both python and cpp. Thanks hczhu on capturing it! Test Plan: Imported from OSS Differential Revision: D20198837 Pulled By: wanchaol fbshipit-source-id: 33846a14c0a8e7aac2e8328189d10c38a0d7e6ee	2020-03-02 17:57:15 -08:00
Will Feng	1494005cfd	C++ tensor indexing: more indexing tests (#30427 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30427 Test Plan: Imported from OSS Differential Revision: D18695899 Pulled By: yf225 fbshipit-source-id: 74455fe52ef922556fabe65aefca9ec93fe2346d	2020-02-28 22:07:41 -08:00
Will Feng	5c33d98b0d	Add assert_tensor_equal and assert_tensor_not_equal to test/cpp/api/support.h (#30426 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30426 This PR adds `assert_tensor_equal` and `assert_tensor_not_equal` to `test/cpp/api/support.h`, as better functions for testing whether two tensors are equal / not equal. Test Plan: Imported from OSS Differential Revision: D18695900 Pulled By: yf225 fbshipit-source-id: c19b9bc4c4e84d9f444015023649d27618fcbdf5	2020-02-26 13:25:25 -08:00
Will Feng	0dded4026e	[C++ API] Add PackedSequence / pack_padded_sequence / pad_packed_sequence / pack_sequence (#33652 ) Summary: Most of the function implementation and test code are translated from the Python version. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33652 Differential Revision: D20052211 Pulled By: yf225 fbshipit-source-id: ce6767db54364f91ef4f06674239a12278c2752a	2020-02-25 12:53:41 -08:00
Will Feng	36919278cc	C++ tensor multi-dim indexing: add index() and index_put_() overloads, simple indexing tests, merge with Python indexing path (#32841 ) Summary: This PR adds the following items: - 1st item: `ArrayRef<TensorIndex>` and `std::initializer_list<TensorIndex>` overloads for `Tensor::index` and `Tensor::index_put_`, to be used specifically for multi-dim indexing purpose. Design rationale: * C++ `Tensor::index` and `Tensor::index_put_` are both existing tensor APIs, and they currently (before this PR) only accept a list of tensors (i.e. `ArrayRef<Tensor>`) as indices. If we change their signatures to also accept non-tensors as indices (i.e. `ArrayRef<TensorIndex>`, and `TensorIndex` is convertible from `Tensor` / `Slice` / `None` / `Ellipsis`), it would slow down the original code path (since now it has to go through more steps), which is undesirable. To get around this problem, the proposed solution is to keep the original `ArrayRef<Tensor>` overload, and add `ArrayRef<TensorIndex>` and `std::initializer_list<TensorIndex>` overloads to `Tensor::index` and `Tensor::index_put_`. This way, the original code path won’t be affected, and the tensor multi-dim indexing API is only used when the user explicitly pass an `ArrayRef<TensorIndex>` or a braced-init-list of `TensorIndex`-convertible types to `Tensor::index` and `Tensor::index_put_` . Note that the above proposed solution would still affect perf for the user’s original `Tensor::index` or `Tensor::index_put_` call sites that use a braced-init-list of tensors as input, e.g. `tensor.index({...})` or `tensor.index_put_({...}, value)`, since now such function calls would take the multi-dim indexing path instead of the original advanced indexing path. However, there are only two instances of this in our codebase (one in ATen cpp test, one in a C++ API nn init function), and they can be easily changed to explicitly use `ArrayRef<Tensor>` as input (I changed them in this PR). For external user’s code, since this is part of the C++ frontend which is still considered experimental, we will only talk about this change in the release note, and ask users to switch to using `ArrayRef<Tensor>` explicitly if they want to keep using the original advanced indexing code path. - 2nd item: Mechanisms for parsing `ArrayRef<TensorIndex>` indices and performing indexing operations (mirroring the functions in `torch/csrc/autograd/python_variable_indexing.cpp`). - 3rd item: Simple tests to demonstrate that the `Tensor::index()` and `Tensor::index_put_()` APIs work. I will add more tests after the first few PRs are reviewed. - 4th item: Merge Python/C++ indexing code paths, for code simplicity. I tested locally and found that there is no perf regression resulting from the merge. I will get more concrete numbers for common use cases when we settle on the overall design. This PR supersedes https://github.com/pytorch/pytorch/pull/30425. Pull Request resolved: https://github.com/pytorch/pytorch/pull/32841 Differential Revision: D19919692 Pulled By: yf225 fbshipit-source-id: 7467e64f97fc0e407624809dd183c95ea16b1482	2020-02-24 22:04:00 -08:00
Suyash458	47e90d774e	C++/Python API Parity: add pad_sequence (#32387 ) Summary: - add `pad_sequence` and tests - related issue https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/32387 Differential Revision: D20025421 Pulled By: yf225 fbshipit-source-id: caa9ae2114bece8db387a3a1610f24a3e06b1324	2020-02-21 13:16:09 -08:00
nicolov	e77abb9a5b	Normalize reward-to-go in C++ actor-critic (#33550 ) Summary: Comparing to the [Python implementation](https://github.com/pytorch/examples/blob/master/reinforcement_learning/actor_critic.py), it seems like the tensor of normalized reward-to-go is computed but never used. Even if it's just an integration test, this PR switches to the normalized version for better convergence. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33550 Differential Revision: D20024393 Pulled By: yf225 fbshipit-source-id: ebcf0fee14ff39f65f6744278fb0cbf1fc92b919	2020-02-21 09:19:39 -08:00
Will Feng	a203dc2e6d	[C++ API] Allow skipping default arguments in module's forward method when module is used in Sequential (#33027 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33027 This PR allows default arguments in module's forward method to be skipped when module is used in `torch::nn::Sequential`, by introducing the `FORWARD_HAS_DEFAULT_ARGS` macro and requiring that all modules that have default arguments in its forward method must have a corresponding `FORWARD_HAS_DEFAULT_ARGS` macro call. Fixes issue mentioned in https://github.com/pytorch/pytorch/issues/30931#issuecomment-564144468. Test Plan: Imported from OSS Differential Revision: D19777815 Pulled By: yf225 fbshipit-source-id: 73282fcf63377530063e0092a9d84b6c139d2e32	2020-02-17 20:38:02 -08:00
Will Feng	4724964810	[C++ API] Expose AnyValue and AnyModuleHolder classes (#33026 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33026 This PR contains necessary changes to prepare for https://github.com/pytorch/pytorch/pull/33027. It exposes the following classes to public: 1. `torch::nn::AnyValue`, because if the user has optional arguments in their module's forward method, they must also use the `FORWARD_HAS_DEFAULT_ARGS` macro and pass in the default values for those optional arguments wrapped by `torch::nn::AnyValue`. 2. `torch::nn::AnyModuleHolder`, because `torch::nn::Module` needs to declare it as a friend class for it to be able to access `torch::nn::Module`'s protected methods such as `_forward_has_default_args` / `_forward_num_required_args` / `_forward_populate_default_args`. Test Plan: Imported from OSS Differential Revision: D19777814 Pulled By: yf225 fbshipit-source-id: 1c9d5aa24f0689154752c426a83ee98f64c9d02f	2020-02-17 20:35:22 -08:00
Will Feng	5d7f42847c	Add at::Tensor::retain_grad API (#33349 ) Summary: This PR adds `at::Tensor::retain_grad`, and its implementation mirrors the Python `torch.Tensor.retain_grad` API: `c6271c63f2/torch/tensor.py (L292-L315)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/33349 Differential Revision: D19944524 Pulled By: yf225 fbshipit-source-id: e61d5d761996b6d1b860c04c4b4650c1a49a6a8c	2020-02-17 20:03:48 -08:00
anjali411	91744907d4	SGD: updated step and class design (#32592 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32592 Differential Revision: D19868154 Pulled By: anjali411 fbshipit-source-id: ce888efc68b1531d97e8b0abf2b146198e012d2f	2020-02-12 18:38:55 -08:00
albanD	3655975565	Add allow_rebase_history flag and fix codegen functions for multiple views (#32790 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32790 Same as https://github.com/pytorch/pytorch/pull/31990 but without the first commit in the stack that is problematic for a lot of people. Test Plan: Imported from OSS Differential Revision: D19814116 Pulled By: albanD fbshipit-source-id: d104911a5b098a5807b4bc08b69803ebd4f69fa6	2020-02-11 07:16:02 -08:00
albanD	e1c53a5c86	Fix version counter bump in cpp Function (#33068 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33068 The version counter is already tracked if we use pytorch's functions but not if the user unpack the Tensor and modifies it by hand or with a third party library. Test Plan: Imported from OSS Differential Revision: D19791564 Pulled By: albanD fbshipit-source-id: a73c0f73d8fd0c0e5bf838f14bed54fa66937840	2020-02-10 07:22:29 -08:00
Pavel Belevich	d141465713	Fix torch::allclose to handle std::numeric_limits<T>::lowest() for integral types (#32978 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32978 Fixes #32946 Test Plan: Imported from OSS Differential Revision: D19726013 Pulled By: pbelevich fbshipit-source-id: ada4aeabc8e39016d24f1a40f02fb7c56f069cd3	2020-02-04 19:06:52 -08:00
Will Feng	b564eaf7a8	Bug fixes: torch::tensor(floating-point values) -> default dtype, and torch::tensor(integer values) ->at::kLong (#32367 ) Summary: Some of the `torch::tensor` behavior is updated to better match Python API. Fixes https://github.com/pytorch/pytorch/issues/32234. This PR is BC-breaking in the following way: - `torch::tensor({1.0f, 2.0f})`: float -> default dtype - `torch::tensor(at::ArrayRef<int>({1, 2, 3}))`: int -> at::kLong - `torch::tensor(std::vector<int>({1, 2, 3}))`: int -> at::kLong - `torch::tensor(at::ArrayRef<float>({1.f, 2.f, 3.f}))`: float -> default dtype - `torch::tensor(std::vector<float>({1.f, 2.f, 3.f}))`: float -> default dtype - `torch::tensor(at::ArrayRef<double>({1., 2., 3.}))`: double -> default dtype - `torch::tensor(std::vector<double>({1., 2., 3.}))`: double -> default dtype Pull Request resolved: https://github.com/pytorch/pytorch/pull/32367 Differential Revision: D19498484 Pulled By: yf225 fbshipit-source-id: 19c8dc2a56476266153cff4c404e7f84d309eb12	2020-02-01 15:00:07 -08:00
Alban Desmaison	db8ce7ea2d	Back out "Make autogen functions correct for multiple outputs and views" (#32681 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32681 Original commit changeset: a2b41c2d231e Test Plan: fb and oss tests Reviewed By: hudeven Differential Revision: D19591864 fbshipit-source-id: 7068b5563e37bc9a5d415fd535c73fd9d71fe131	2020-01-27 19:54:34 -08:00
Charles Hofer	5fd037ce44	Fix MagmaInitializesCorrectly_CUDA by using an invertible matrix (#32547 ) Summary: This test case had been using the tensor ``` 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 ``` which is not an invertible tensor and causes the test case to fail, even if magma gets initialized just fine. This change uses a tensor that is invertible, and the inverse doesn't include any elements that are close to zero to avoid floating point rounding errors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/32547 Differential Revision: D19572316 Pulled By: ngimel fbshipit-source-id: 1baf3f8601b2ba69fdd6678d7a3d86772d01edbe	2020-01-25 20:00:54 -08:00
albanD	3ab30753e9	Make autogen functions correct for multiple outputs and views (#31990 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31990 This PR does three things: - Add a new `allow_rebase_history` flag to the differentiable views. If set, trying to rebase their history will raise an error. - Make sure that the codegen functions verify this flag before doing inplace operations so that they fail before doing the inplace modification. - Make sure the codegen functions set this flag properly when we don't support rebasing the history of the output. The codegen change can be found [here](`4bf180caa0`). Test Plan: Imported from OSS Differential Revision: D19409649 Pulled By: albanD fbshipit-source-id: a2b41c2d231e952ecfe162bdb6bad620ac595703	2020-01-24 14:32:28 -08:00
anjali411	be6ffac1b6	Adagrad optimizer - updated step function, added param_groups, state to optimizers Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29335 Differential Revision: D19449382 Pulled By: anjali411 fbshipit-source-id: ee238801ed9cdf15a80f2ce31cc4aab8ba582aea	2020-01-21 14:41:12 -08:00
generatedunixname89002005287564	9482683065	Remove dead includes in caffe2/test Reviewed By: ezyang Differential Revision: D19273220 fbshipit-source-id: 3dfc3388914e60611c84472e3fc529f5b5e40534	2020-01-21 11:30:34 -08:00
Brian Wignall	f326045b37	Fix typos, via a Levenshtein-type corrector (#31523 ) Summary: Should be non-semantic. Uses https://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines to find likely typos, with https://github.com/bwignall/typochecker to help automate the checking. Uses an updated version of the tool used in https://github.com/pytorch/pytorch/pull/30606 . Pull Request resolved: https://github.com/pytorch/pytorch/pull/31523 Differential Revision: D19216749 Pulled By: mrshenli fbshipit-source-id: 7fd489cb9a77cd7e4950c1046f925d57524960ea	2020-01-17 16:03:19 -08:00
Will Feng	2bd179147a	Fix typo in config script to re-enable libtorch build and test in macOS CI (#32072 ) Summary: Currently, libtorch build and test are not running in macOS CI. This PR fixes the issue. Test Plan: Check that libtorch build and test are running again in macOS CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/32072 Differential Revision: D19391909 Pulled By: yf225 fbshipit-source-id: 1ab345b099869f78e1124f1a8bd185fa51371b6a	2020-01-14 16:23:57 -08:00
Will Feng	b6cee03e29	C++ tensor indexing: add Slice / TensorIndex (#30424 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30424 `at::indexing::TensorIndex` is used for converting C++ tensor indices such as `{None, "...", Ellipsis, 0, true, {1, None, 2}, torch::tensor({1, 2})}` into its equivalent `std::vector<TensorIndex>`, so that further tensor indexing operations can be performed using the supplied indices. Test Plan: Imported from OSS Differential Revision: D18695902 Pulled By: yf225 fbshipit-source-id: d73e14a411cdbec815866b02e75ffd71a9186e89	2020-01-10 17:53:41 -08:00
TH3CHARLie	1296e2d55e	C++ API parity: isinf (#31099 ) Summary: fixes https://github.com/pytorch/pytorch/issues/31021, port the legacy binding method of `isinf` to C++ therefore support JIT Pull Request resolved: https://github.com/pytorch/pytorch/pull/31099 Differential Revision: D19314733 Pulled By: yf225 fbshipit-source-id: 5725c51d19c33b4fddd0fc9e7034078580bd534e	2020-01-09 13:16:13 -08:00
Jeremy Lilley	114562cf93	For torch::from_blob() add clue when memory is non-owned. (#31222 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31222 - When constructing torch::from_blob() in the case where the deleter is a nop, switch to using a nullptr context in the DataPtr (with a nop deleter) - No real extra memory/cpu requirements here, actually saves a minor alloc. Why? Trying to get a signal that a Tensor might contain non-owned memory from torch::from_blob(), by detecting the nullptr context. ghstack-source-id: 96336078 Test Plan: buck test mode/dev caffe2/test/cpp/api/... buck test mode/dev-nosan caffe2/test/... Differential Revision: D18992119 fbshipit-source-id: 4eea642f82d0858b57fdfc6995364a760c10567d	2020-01-07 13:12:30 -08:00
Jerry Zhang	ebe69236d1	Expose class constant through `attr` and `setattr` in object (#29219 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29219 We added class constant in previous PRs, this PR allows access to class constant in the object API Test Plan: build/bin/test_jit python test/test_jit.py Imported from OSS Differential Revision: D18846851 fbshipit-source-id: 888a6517d5f747d1f8ced283c0c2c30b2f6c72c6	2020-01-04 11:09:35 -08:00
Jerry Zhang	1bb6c51421	Fix getAttribute (#31011 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31011 `getAttribute` is supposed to throw when there the attribute is not found rather than return a `nullptr`. Test Plan: . Imported from OSS Differential Revision: D18898417 fbshipit-source-id: 0fe7d824b978ad19bb5ef094d3aa560e9fc57f87	2019-12-18 19:27:39 -08:00
Pavel Belevich	47766e648f	C++ API parity: MultiheadAttention Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27309 Test Plan: Imported from OSS Differential Revision: D17766736 Pulled By: pbelevich fbshipit-source-id: 7a5f2399f081945d31d4c13d7a8d248c387fc1a6	2019-12-18 10:13:29 -08:00
Nathan Goldbaum	f531815526	Deprecate tensor.type() (#30281 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/29161. I looked a bit at the code changes related to this and think I have all of the use cases of `DeprecatedTypeProperties` covered in the message, but suggestions from someone with more context on this would be very much appreciated :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/30281 Differential Revision: D18830818 Pulled By: ezyang fbshipit-source-id: 1a7fcee15354ae09e6644577e7fa33bd26acfe20	2019-12-05 10:55:34 -08:00
Will Feng	18ec4632b3	Exclude undefined tensors in the result of Module::parameters() / named_paramters() / buffers() / named_buffers() (#30626 ) Summary: PR https://github.com/pytorch/pytorch/pull/30523 attempted to fix https://github.com/pytorch/pytorch/issues/30508 and https://github.com/pytorch/pytorch/issues/30462, but the fix wasn't complete. This PR makes the following improvements: 1. Fixes https://github.com/pytorch/pytorch/issues/30508 and https://github.com/pytorch/pytorch/issues/30462 properly by excluding undefined tensors in the result of `Module::parameters()` / `named_parameters()` / `buffers()` / `named_buffers()`, which mirrors the Python API behavior. 2. Audits all use sites of `Module::parameters_` / `buffers_` and change them to `Module::named_parameters(/recurse=/false)` / `named_buffers(/recurse=/false)` when appropriate, so that use sites of module parameters / buffers never need to worry about undefined tensors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30626 Differential Revision: D18777507 Pulled By: yf225 fbshipit-source-id: 55b64b69779e1186342efd3c44857f416334ed6b	2019-12-02 21:59:58 -08:00
Brian Wignall	e7fe64f6a6	Fix typos (#30606 ) Summary: Should be non-semantic. Uses https://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines to find likely typos. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30606 Differential Revision: D18763028 Pulled By: mrshenli fbshipit-source-id: 896515a2156d062653408852e6c04b429fc5955c	2019-12-02 20:17:42 -08:00
Will Feng	7ac8efa689	Skip undefined tensors when moving torch::nn module to a different device (#30523 ) Summary: This fixes high-pri issues such as https://github.com/pytorch/pytorch/issues/30508 and https://github.com/pytorch/pytorch/issues/30462. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30523 Differential Revision: D18732904 Pulled By: yf225 fbshipit-source-id: fe5a7a43838000f5803bd9c01ecfba0c3f02df5d	2019-11-27 21:21:02 -08:00
Will Feng	3ba1456aee	Fix clip_grad_norm_ / clip_grad_value_ to take input by value instead of by non-const ref (#30216 ) Summary: The original design of `torch::nn::utils::clip_grad_norm_` / `clip_grad_value_` takes input by non-const reference, which prevents users from passing rvalue reference as input into the functions. This PR changes the functions to take input by value, which matches the Python version's semantics, and also adheres to the C++ API convention that if a function modifies its input in-place, it should take that input by value. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30216 Differential Revision: D18632543 Pulled By: yf225 fbshipit-source-id: 97a09d6467f982fe9c8120f483a9c07fcf13699e	2019-11-21 10:07:00 -08:00
lsrock1	0a77c090d5	C++ parity, convert_parameters (#29267 ) Summary: yf225 https://github.com/pytorch/pytorch/issues/25883 update parameters_to_vector and vector_to_parameters check please! Pull Request resolved: https://github.com/pytorch/pytorch/pull/29267 Differential Revision: D18628571 Pulled By: yf225 fbshipit-source-id: 03783e6b0f8183dd97ae48f3da4acb1d07083555	2019-11-20 19:59:11 -08:00
Will Feng	5cbdbddc12	Add test for F::max_unpool3d, and update parity table Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30171 Differential Revision: D18620503 Pulled By: yf225 fbshipit-source-id: 52adf9a6c0238b5cdb2e11e03807fb7dd73880bf	2019-11-20 12:42:24 -08:00
Will Feng	a460c856dd	Fix naming for kl_div and binary_cross_entropy functional options (#30146 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30146 This PR fixes naming for kl_div and binary_cross_entropy functional options, to be more consistent with the naming scheme of other functional options. Test Plan: Imported from OSS Differential Revision: D18618971 Pulled By: yf225 fbshipit-source-id: 2af62c1a0ace2cd0c36c2f1071639bf131d8fe61	2019-11-20 12:23:50 -08:00
Pavel Belevich	f8e7f3fca4	C++ API parity: BCEWithLogitsLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28783 Test Plan: Imported from OSS Differential Revision: D18202435 Pulled By: pbelevich fbshipit-source-id: 011b028bbb2a091e98d3548616b99d7b4569c239	2019-11-20 06:46:38 -08:00
Pavel Belevich	cc81769e10	C++ API parity: isfinite Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30083 Test Plan: Imported from OSS Differential Revision: D18594723 Pulled By: pbelevich fbshipit-source-id: 5970e0aa6ef8994e9c4a741784fd053383aaceb7	2019-11-19 20:00:05 -08:00
Will Feng	bb1d9b238d	torch::nn::FractionalMaxPool{2,3}d module and functional Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29933 Test Plan: Imported from OSS Differential Revision: D18548174 Pulled By: yf225 fbshipit-source-id: 070776db6e8b7ad94d9b7cbd82b3d6966f061a46	2019-11-19 17:24:07 -08:00
Divyansh Singhvi	ec52d911bd	InstanceNorm{1,2,3}d (#28790 ) Summary: Hi yf225, I have a few doubts related to implementation: 1) What tests do I have to write? 2) What does _load_state_from_dict does? 3) Do I need to override reset() function as I can not see it's utility? 4) InstanceNormOptions could be removed with BatchNormOptions, but I find that `track_running_status` is not defined instead `stateful` is defined. InstanceNorm{1,2,3}d https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28790 Differential Revision: D18588666 Pulled By: yf225 fbshipit-source-id: bb9b81f01f62c3fc8765fa0ba0716768087ee155	2019-11-19 16:57:01 -08:00
Will Feng	05a7aaa742	Pass Tensor instead of Tensor& to torch::nn functionals that can change input in place (#30112 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30112 Currently, we have torch::nn functionals that takes `input` as `Tensor&` in order to be able to in-place change `input`'s value. We likely shouldn't do this because it will prevent the following use case: ```cpp F::elu(torch::tensor(1), F::ELUFuncOptions().inplace(true)) ``` The solution is to change the type of `input` to `Tensor`, so that we can pass an rvalue into the functional. Test Plan: Imported from OSS Differential Revision: D18601580 Pulled By: yf225 fbshipit-source-id: 639a86eb62f6c986b0f20bf7e201983e83126e73	2019-11-19 16:11:39 -08:00
nuka137	a75b669b0f	C++ API: torch::nn::ConvTranspose{1,2,3}d (#29721 ) Summary: Add torch::nn::ConvTranspose{1,2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/29721 Differential Revision: D18588943 Pulled By: yf225 fbshipit-source-id: d4dbb091389367e70459399d5cda3778325c2120	2019-11-19 16:04:12 -08:00
Suyash458	e88d096321	C++/Python API Parity: add AlphaDropout (#28424 ) Summary: - add `AlphaDropoutImpl` to `modules/dropout.h` and `modules/dropout.cpp` - add `functional/dropout.h` containing the `alpha_dropout` function - include `functional/dropout.h` in `nn/functional.h` - add functional and module tests - related issue https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28424 Differential Revision: D18589162 Pulled By: yf225 fbshipit-source-id: c85734e02431a6c052515e26b11ca30ad7303644	2019-11-19 10:05:51 -08:00
Will Feng	3bd0f476d4	Revert D18233037: C++ API parity: isfinite Test Plan: revert-hammer Differential Revision: D18233037 Original commit changeset: c76b9467bbc1 fbshipit-source-id: 97d2cfa9de767a8c3a0ca919f9d768e959fa484e	2019-11-18 20:26:19 -08:00
Pavel Belevich	8df5e10ee9	C++ API parity: isfinite Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28918 Test Plan: Imported from OSS Differential Revision: D18233037 Pulled By: pbelevich fbshipit-source-id: c76b9467bbc1fbb2c9bf49855895c98438b36c12	2019-11-18 19:06:57 -08:00
Will Feng	689b4bea7b	torch::nn::GLU and F::glu (#29922 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29922 * #29920 [C++ API] torch::nn::GroupNorm and F::group_norm Test Plan: Imported from OSS Differential Revision: D18558818 Pulled By: yf225 fbshipit-source-id: ff80d634309fcb55f53db8dcf86eb9cf8161b37e	2019-11-16 21:03:38 -08:00
Will Feng	d5bf51b684	torch::nn::GroupNorm and F::group_norm Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29920 Test Plan: Imported from OSS Differential Revision: D18539314 Pulled By: yf225 fbshipit-source-id: dabbbaac31796fe7bfde02487737971bde699c1c	2019-11-16 19:22:11 -08:00
PyExtreme	e1d13f4f8b	C++ API parity: NLLLoss & CrossEntropyLoss (#29812 ) Summary: Hi yf225 , I have added NLLLoss and CrossEntropyLoss. ``` Also, while using log_softmax in cross_entropy_loss, I am getting an error ../caffe2/../torch/csrc/api/include/torch/nn/functional/loss.h:537:63: error: no matching function for call to log_softmax(const at::Tensor&)’ const Tensor& log_softmax_input = torch::log_softmax(input); aten/src/ATen/Functions.h:5551:22: note: candidate: at::Tensor at::log_softmax(const at::Tensor&, int64_t, c10::optional<c10::ScalarType>) static inline Tensor log_softmax(const Tensor & self, int64_t dim, c10::optional<ScalarType> dtype) { ^~~~~~~~~~~ aten/src/ATen/Functions.h:5551:22: note: candidate expects 3 arguments, 1 provided ``` I think the other two parameters should be optional as in python frontend(shown in documentation here at https://pytorch.org/docs/stable/nn.functional.html#torch.nn.functional.log_softmax ). Rest, there were no errors in build and tests have passed Pull Request resolved: https://github.com/pytorch/pytorch/pull/29812 Differential Revision: D18548249 Pulled By: yf225 fbshipit-source-id: 2ab350abd2a6f498d4dba2345f51ad87471f3038	2019-11-16 10:49:09 -08:00
Pavel Belevich	27afac2134	C++ API parity: Dropout, Dropout2d, Dropout3d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29761 Test Plan: Imported from OSS Differential Revision: D18530820 Pulled By: pbelevich fbshipit-source-id: 9d351561692f7de099d7c6aaf2ecb930b5c867e9	2019-11-15 20:32:06 -08:00
Edward Yang	65bb34d885	Remove TensorImpl::is_variable, deprecate Tensor::is_variable (#29653 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29653 I didn't remove is_variable from Tensor for BC reasons, but I did remove as many uses as I could from the codebase. at::impl::variable_excluded_from_dispatch got moved to TensorBody.h so that it's more widely accessible. This diff is NOT semantics preserving. Here are the major differences: - In a number of native operator implementations, we tested that arguments are not variable. I replaced these with asserts that variable is excluded from dispatch. I actually don't think these asserts are really necessary now (they should certainly be true, but it's hard to get it wrong), but I've kept them for old time's sake. At least, they'll detect if you call these functions before you've processed variable (indicating a bug in your kernel.) - There are a number of places where we do a per-tensor test for being a variable, for better error reporting when someone commits Tensor/Variable confusion. Although these tests are substantively the same as the tests above, in these cases I decided to delete the test entirely. The reasoning is that in these cases, we didn't really care about dispatch (also, see above; I'm not too sure we really need the dispatch asserts), we cared about Tensor/Variable confusion. Since Tensor/Variable confusion is impossible now, we don't need the tests. One of the key factors which pushed me one way or another was whether or not a function was doing per-tensor validation; if I kept the assert in such functions, I'd repeatedly access the TLS. Even if we want to bring back the asserts, they would have to go somewhere else. Another similar idiom is the number of places we do !x.defined() \|\| x.is_variable(); I treated this equivalently. - nuclear_norm's computation of compute_uv is a bit weird, but I think it's OK to just delete the is_variable case (I suspect that it is always the case that self.is_variable(), but it doesn't really matter.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D18496168 Pulled By: ezyang fbshipit-source-id: 5a1ded931e0c10a6b758ba64a8380d34110e0c3e	2019-11-14 11:41:02 -08:00
Will Feng	a68c52494c	Use F::*FuncOptions for embedding/embeddingbag functionals (#29673 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29673 Following https://github.com/pytorch/pytorch/pull/29364 and https://github.com/pytorch/pytorch/pull/29404, this PR makes `F::EmbeddingFuncOptions` and `F::EmbeddingBagFuncOptions` separate classes from `torch::nn::EmbeddingOptions` and `torch::nn::EmbeddingBagOptions`, so that it's easier to enforce that arguments such as `num_embeddings` and `embedding_dim` are required for `torch::nn::EmbeddingOptions` and `torch::nn::EmbeddingBagOptions`. Test Plan: Imported from OSS Differential Revision: D18462540 Pulled By: yf225 fbshipit-source-id: f2abf431e48675b0a9d7f6f398cdb90ff9037c35	2019-11-13 18:47:22 -08:00
Will Feng	65f691f2c2	Add more tests for torch::arange Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29689 Test Plan: Imported from OSS Differential Revision: D18465818 Pulled By: yf225 fbshipit-source-id: 0cf0aaa7febcf4318abdaae7d17a43ab3acde017	2019-11-13 15:17:16 -08:00
Will Feng	2bcac59a30	Use default dtype for torch::tensor(floating_point_values) and torch::tensor(empty braced-init-list) when dtype is not specified (#29632 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29632 This PR is BC-breaking in the following way: Previously, C++ `torch::tensor` with a floating-point literal with no suffix (e.g. `torch::tensor(1.1)`) or a (nested) braced-init-list of floating-point literals with no suffix (e.g. `torch::tensor({{1.1, 2.2}})` produces a tensor with dtype `at::kDouble`. After this PR, it produces a tensor with dtype `torch::get_default_dtype()`, matching Python `torch.tensor` behavior. Test Plan: Imported from OSS Differential Revision: D18465819 Pulled By: yf225 fbshipit-source-id: 6834fe50335c677bc3832f2a5e9cf8d1ede9f665	2019-11-13 15:17:11 -08:00
Will Feng	b37c235d86	C++/Python API parity for Conv{1,2,3}d layers, and add F::conv{1,2,3}d functionals (#28917 ) Summary: This PR changes the implementation of C++ Conv{1,2,3}d layers to exactly match the Python version, and add F::conv{1,2,3}d functionals. For more thorough testing, I will rely on the parity test mechanism which uses values from `common_nn.py` to generate the inputs and options that we are interested in testing. This PR is BC-breaking in the following way: In `Conv{1,2,3}dOptions`: - `with_bias` is renamed to `bias`. - `input_channels` is renamed to `in_channels`. - `output_channels` is renamed to `out_channels`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28917 Differential Revision: D18471526 Pulled By: yf225 fbshipit-source-id: 7a33f60654ad93cc2e043245e7ff9e0ef9da15b3	2019-11-13 12:53:31 -08:00
Edward Yang	30092df15e	Rename getNonVariableDeprecatedTypeProperties to getDeprecatedTypeProperties (#29203 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29203 There is no more Variable/Tensor distinction, so fix the misleading name. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D18353505 Pulled By: ezyang fbshipit-source-id: dadc394d533ab7746f70bc186c6645441a784518	2019-11-13 07:43:32 -08:00
Will Feng	65bfcde05e	Use c10::variant-based enums for SmoothL1Loss module and functional Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29536 Test Plan: Imported from OSS Differential Revision: D18432272 Pulled By: yf225 fbshipit-source-id: fa355145962e93025b7de98b99b0a4fc82e8c871	2019-11-12 16:05:31 -08:00
Will Feng	57eab22c6a	Use c10::variant-based enums for F::grid_sample Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29535 Test Plan: Imported from OSS Differential Revision: D18432273 Pulled By: yf225 fbshipit-source-id: 11476f0431a9b544dfb62bc7a89bab84399f9b83	2019-11-12 16:05:26 -08:00
Will Feng	9f879ef532	Make all non-input arguments to functionals part of its options (#29404 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29404 This PR makes all non-input arguments to functionals part of its options parameters, so that we won't break backward compatibility even if we add or reorder some of the non-input arguments to functionals in the future. Test Plan: Imported from OSS Differential Revision: D18378526 Pulled By: yf225 fbshipit-source-id: f5cf6bdfb844e75bf94fdee58c121e0955631b6e	2019-11-12 16:05:22 -08:00
Anjali Chourdia	604fc9ec41	F::embedding, F::embedding_bag, moved Embedding and EmbeddingBag options to embedding.h in options Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28669 Differential Revision: D18377609 Pulled By: anjali411 fbshipit-source-id: 6a2c547368849ebd1a2f8828cfbe7252152b26a2	2019-11-11 11:51:26 -08:00
eellison	e01fc56ecb	move type inference for arange into c++ (#27629 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/17662 I'm not sure if `arange` needs to be in python_arg_parser at all, given the schemas in native_functions.yaml. In any case this at least fixes the dytpe mismatch. In follow up PRs I will try to handle some of the other ops that do type inference at the python level, like randint. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27629 Differential Revision: D17885939 Pulled By: eellison fbshipit-source-id: f97a8bc722b7ab77de1c42a992e49a4a3175ad60	2019-11-11 11:26:21 -08:00
Will Feng	cb74ede59e	Pass F::FuncOptions instead of torch::nn::Options to functionals, and make F::FuncOptions a different class when necessary (#29364 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29364 Currently, we use `torch::nn::Options` both as module options and functional options. However, this makes it very hard to manage the parameters in `torch::nn::Options`, because a module's constructor can take a different set of arguments than the module's equivalent functional (e.g. `torch.nn.BatchNorm1d` takes `num_features, eps=1e-5, momentum=0.1, affine=True, track_running_stats=True`, while `F::batch_norm` takes `running_mean, running_var, weight=None, bias=None, training=False, momentum=0.1, eps=1e-5`). This PR resolves the above problem by making `F::FuncOptions` a different class from `torch::nn::Options` when necessary (i.e. when a module's constructor takes a different set of arguments than the module's equivalent functional). In the rest of the cases where the module constructor takes the same set of arguments as the module's equivalent functional, `F::FuncOptions` is an alias of `torch::nn::*Options`. Also as part of this PR, we change all functional options to pass-by-value, to make the semantics consistent across all functionals. Test Plan: Imported from OSS Differential Revision: D18376977 Pulled By: yf225 fbshipit-source-id: 8d9c240d93bfd5af0165b6884fdc912476b1d06b	2019-11-08 22:38:21 -08:00
Edward Yang	4e21157e01	Revert "Revert D18171156: Merge Tensor and Variable." (#29299 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29299 This reverts commit `9c43b16df9`, but also with the changes from D18348622. Comments there: thpp-compatibility is used by admarket/adreview/service:adreviewservice and libtorch is too big for the service to deal with. thpp-compatibility doesn't support autograd, so we hack around dispatching variables by using AutoNonVariableTypeMode everywhere we call into ATen, so we never attempt to call into Variable stubs. If you get it wrong, you'll get an error like: ``` what(): Could not run 'aten::empty' with arguments from the 'VariableTensorId' backend. 'aten::empty' is only available for these backends: [SparseCPUTensorId, CPUTensorId, MkldnnCPUTensorId]. (lookup_ at caffe2/aten/src/ATen/core/dispatch/DispatchTable.h:298) ``` Test Plan: Imported from OSS ``` buck test //thpp-compatibility/... buck build mode/opt-clang admarket/adreview/service:adreviewservice ``` adreviewservice canary: https://our.intern.facebook.com/intern/ads/canary/422290029716387895 (comparing against parent comment due to current breakage) ==> experiment store https://our.intern.facebook.com/intern/experiment_store/experiment/43990006/ adfinder canary: https://our.intern.facebook.com/intern/ads/canary/422268535840333934 adindexer canary: https://our.intern.facebook.com/intern/ads/canary/422268550559034675 adreview second canary: https://our.intern.facebook.com/intern/ads/canary/422307863515591925 canary without thpp-compat fixups https://our.intern.facebook.com/intern/ads/canary/422308951649168772 Reviewed By: dreiss Differential Revision: D18353504 Pulled By: ezyang fbshipit-source-id: 65feaba39fa07bb66762810909aeb38868668a30	2019-11-08 09:11:20 -08:00
Zachary DeVito	796363147f	Implement more of of the nn.Module API (#28828 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28828 This updates torch::script::Module to more closely match the behavior of nn.Module. In particular, it implements the (optionally recurisive) iterators that retrieve submodules, parameters, and buffers and makes their names match the python versions. This also removes the individual accessors for Parameter, Module, Buffer, etc. and replaces them with a single `attr` function which is equivalent to writing `a.foo` in Python (`setattr` emulates `a.foo = v`). As we build out the user-facing API for TorchScript values this will end up matching how an attribute is accessed on general objects. This PR preservers the python bindings for script::Module by emulating the old API at the binding level. A followup will clean up the usage to more directly match the C++ API. Test Plan: Imported from OSS Differential Revision: D18197611 Pulled By: zdevito fbshipit-source-id: 7ee4dcbb258605d1c988314b05d938423f1ccee5	2019-11-06 22:58:25 -08:00
Edward Yang	9c43b16df9	Revert D18171156: Merge Tensor and Variable. Test Plan: revert-hammer Differential Revision: D18171156 Original commit changeset: 5b6a045beba3 fbshipit-source-id: f5581d902c2305018ea49f8473592be2a465560b	2019-11-06 10:57:00 -08:00
lsrock1	6389c18709	C++ parity, nn::CrossMapLRN2d (#29039 ) Summary: yf225 https://github.com/pytorch/pytorch/issues/25883 re- pull request because of rebase mistake! Pull Request resolved: https://github.com/pytorch/pytorch/pull/29039 Differential Revision: D18326829 Pulled By: yf225 fbshipit-source-id: 5ed737f6275e4463efa4951d9b7f45c6f2723c82	2019-11-05 15:27:08 -08:00
Pavel Belevich	69f845cb77	C++ API parity: MarginRankingLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29000 Test Plan: Imported from OSS Differential Revision: D18271855 Pulled By: pbelevich fbshipit-source-id: cbafc7f059173306c83673d7be374c2d3700911f	2019-11-05 05:41:40 -08:00
Will Feng	026fd36c71	Use at::kLong for torch::tensor(integer_value) when dtype is not specified (#29066 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29066 This PR is BC-breaking in the following way: Previously, C++ `torch::tensor` with an integer literal or a braced-init-list of integer literals produces a tensor with dtype being the type of the integer literal(s). After this PR, it always produces a tensor of dtype `at::kLong` (aka. int64_t), matching Python `torch.tensor` behavior. Test Plan: Imported from OSS Differential Revision: D18307248 Pulled By: yf225 fbshipit-source-id: 7a8a2eefa113cbb238f23264843bdb3b77fec668	2019-11-04 21:39:10 -08:00
Edward Yang	25261a4776	Merge Tensor and Variable. (#28620 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28620 All Tensors are Variables now, they just happen to have requires_grad=False. Tensors ALWAYS have `VariableTensorId` in their type set. When constructing this patch, I had to make decisions about what I would fix in this patch, and what I would leave for follow up PRs. Here is the cleanup that happens in this patch: - The `is_variable` property is removed from TensorOptions. I removed this immediately because unlike Tensor::is_variable, TensorOptions::is_variable doesn't respect our VariableTensorId thread-local state. This means that there were a bunch of places where TensorOptions::is_variable was false, which is obviously bogus in the world when tensor and variable are merged. Instead of keeping the method as a function that always returns true, I just opted to remove it entirely (it's not public API.) All places we set `is_variable` are deleted. - Knock on effect: there is no longer a separate DeprecatedTypeProperties for the variable and non-variable versions of type. - Knock on effect: instead of asserting on TensorOptions::is_variable, instead we just test `at::impl::variable_is_excluded()` - There is now only one copy of the cuDNN RNN dropout cache, not two (I'm not sure why we had two to begin with) Some cleanup that doesn't happen in this patch: - Eliminating unnecessary uses of `make_variable` - Eliminating `Tensor::is_variable` The most subtle part of this patch is retaining tracing behavior: the fact that everything is a Variable means that more code gets routed to VariableType than before; this can change traces. I identified two places where we didn't appropriately turn off VariableType, mostly factory functions: - `torch.tensor` must turn off VariableType before invoking `at::empty` to construct the tensor, as it subsequently does direct data access - `tensor_slow` (invoked when you pass a Python scalar to a tensor argument) must turn off VariableType before calling `scalar_to_tensor` so the scalar gets traced as constant, rather than as a call to `scalar_to_tensor`. Honestly, these are all giant hacks, and should be replaced with a more specialized guard that just toggles tracing. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: dreiss Differential Revision: D18171156 Pulled By: ezyang fbshipit-source-id: 5b6a045beba37492647e350190f495114e86504d	2019-11-04 14:59:57 -08:00
Xiaomeng Yang	2460dced8f	Add torch.nn.GELU for GELU activation (#28944 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28944 Add torch.nn.GELU for GELU activation Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "GELU" Reviewed By: hl475, houseroad Differential Revision: D18240946 fbshipit-source-id: 6284b30def9bd4c12bf7fb2ed08b1b2f0310bb78	2019-11-03 21:55:05 -08:00
nuka137	a68c1e109e	C++ API: torch::nn::BatchNorm{2,3}d (#28936 ) Summary: Add torch::nn::BatchNorm{2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #28176 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28936 Differential Revision: D18274584 Pulled By: yf225 fbshipit-source-id: 3784eee9f8947f6c7c9f1699544a3d36a1a019b7	2019-11-01 17:50:33 -07:00
Pavel Belevich	4a94eaa60b	C++ API parity: PoissonNLLLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28755 Test Plan: Imported from OSS Differential Revision: D18202436 Pulled By: pbelevich fbshipit-source-id: a7a27d5f3cdbcbbd9bbbffa02b576609d5fdc9b3	2019-11-01 12:35:59 -07:00
Edward Yang	bbea34f283	Revert D18266918: C++ API: torch::nn::BatchNorm{2,3}d Test Plan: revert-hammer Differential Revision: D18266918 Original commit changeset: f432904c7298 fbshipit-source-id: 0e1c596b2e2f13b59082ff422c67ba025df4be07	2019-11-01 10:46:49 -07:00
nuka137	b7c5b3d398	C++ API: torch::nn::BatchNorm{2,3}d (#28936 ) Summary: Add torch::nn::BatchNorm{2,3}d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #28176 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28936 Differential Revision: D18266918 Pulled By: yf225 fbshipit-source-id: f432904c72985d52ec52cb992cceb372b6ff0244	2019-11-01 09:28:58 -07:00
Carlos Miranda	72b9bda9e5	Smooth L1 loss (#27661 ) Summary: In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `SmoothL1Loss` module and `smooth_l1_loss` functional. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27661 Differential Revision: D18002332 Pulled By: yf225 fbshipit-source-id: b382df8becb0de14986ec16ee0dc953d7b10e917	2019-10-31 23:41:35 -07:00
jokerkeny	aa30176c68	Add C++ API clip_grad_value_ for nn:utils (#28736 ) Summary: Adds C++ API clip_grad_value_ for torch::nn:utils module. Also, fix the for indent level error in the original test/test_nn.py. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28736 Differential Revision: D18263807 Pulled By: yf225 fbshipit-source-id: 29282450bd2099df16925e1d0edd3d933f6eeb9b	2019-10-31 19:11:54 -07:00
Will Feng	595209bddc	Fix bugs in torch::tensor constructor (#28523 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28523 New features: 1. Previously, `torch::tensor({true, false, true})` throws `"tensor_cpu" not implemented for 'Bool'`. After this PR, it produces the correct bool tensor, matching the Python API behavior. 2. Tensors with zero-size dimensions are now supported, e.g. `torch::tensor({{}, {}})` produces a tensor with sizes `{2, 0}`, matching the Python API behavior. BC-breaking bug fixes: 1. Previously, `torch::tensor({{1}, {2}})` produces a tensor of sizes `{2}`. After this PR, it produces a tensor of sizes `{2, 1}`, matching the Python API behavior. 2. Fixed semantics of `torch::tensor(1.1)`: it now returns a 0-dim tensor instead of a 1-dim tensor, matching the Python API behavior. 3. Previously, when passed a non-dtype `TensorOptions` to the `torch::tensor` constructor, it always produces a tensor of dtype `float`. After this PR, it produces tensor of different dtypes based on the dtype of the braced-init-list, matching the behavior of the no-options case. ```cpp // Previously: torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({{1, 2, 3}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({1., 2., 3.}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> float // Now: torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/)).dtype() -> int torch::tensor({{1, 2, 3}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> int torch::tensor({1., 2., 3.}, torch::TensorOptions(/non-dtype-options/)).dtype() -> double torch::tensor({{1., 2., 3.}}, torch::TensorOptions(/non-dtype-options/)).dtype() -> double // As comparison, currently: torch::tensor({1, 2, 3}).dtype() -> int torch::tensor({{1, 2, 3}}).dtype() -> int torch::tensor({1., 2., 3.}).dtype() -> double torch::tensor({{1., 2., 3.}}).dtype() -> double ``` Notes: 1. From now on, the behavior of `at::tensor(scalar_value)` (which produces a 1-dim tensor) would be different from `torch::tensor(scalar_value)` (which produces a 0-dim tensor). I will fix the behavior of `at::tensor(scalar_value)` in a follow-up PR. 2. From now on, the behavior of `at::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/))` (which produces a `float` tensor) would be different from `torch::tensor({1, 2, 3}, torch::TensorOptions(/non-dtype-options/))` (which produces a an `int` tensor). I will fix this behavior of `at::tensor` constructor in a follow-up PR. Context for the changes in this PR: The motivation comes from fixing the "`torch::tensor({{1}, {2}})` gives tensor of wrong sizes" bug - in order to fix it, I have to move the handling of `at::ArrayRef` and `std::vector` into `InitListTensor` (see below on why we need to do this) and renamed `InitListTensor` to `TensorDataContainer`. After such changes, support for bool values comes out of the box without extra effort, and support for tensors with zero-size dimensions only requires adding a default constructor for `TensorDataContainer`, so I added those two in this PR. For the semantic change of `torch::tensor(1.1)`, it's actually more effort to preserve the original wrong behavior (i.e. we need to check the sizes of the tensor converted from `TensorDataContainer` and reshape any scalar tensor to a 1-D tensor). I think preserving the original wrong behavior doesn't give us much value, and since the above changes naturally fix the problem, we should just start using the right behavior instead. For the "constructor with non-dtype options behavior" fix, the code looks simpler and easier to reason about with the fix, so I included it in this PR. -------- Why we need to move the handling of `at::ArrayRef` and `std::vector` into `TensorDataContainer`: `torch::tensor({{1}, {2}})` can match this function overload: `torch::tensor(at::ArrayRef<int> values)`, because `{1}` and `{2}` can be treated as a list-initialization of an `int` value. However, this will produce a Tensor with sizes `{2}`, but we actually want a Tensor with sizes `{2, 1}`. In order to avoid matching this function overload, we removed the function overload and moved the ability to convert `at::ArrayRef<T>` (and similarly `std::vector<T>`) into `TensorDataContainer`, and since for braced-init-list the `TensorDataContainer(std::initializer_list<TensorDataContainer>)` constructor is always preferred over all other constructors, it will take the `std::initializer_list` path, and all is good. Test Plan: Imported from OSS Differential Revision: D18234625 Pulled By: yf225 fbshipit-source-id: 0f3f6912e82e2117d2103e31b74e7e97baaa8693	2019-10-31 12:53:06 -07:00
Pavel Belevich	d6f1e49c4a	C++ API parity: CTCLoss Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28654 Test Plan: Imported from OSS Differential Revision: D18202437 Pulled By: pbelevich fbshipit-source-id: a4b80a57e65da84f3988002a026c648fa52a0fde	2019-10-30 14:35:02 -07:00
jon-tow	1d3d9ec7d4	C++ API Parity: `functional::fold` and `Fold::pretty_print` (#28732 ) Summary: Adds `torch::nn::functional::fold` support and updates `Fold::pretty_print` in the C++ API for more thorough Python parity. Note: Small updates in source files to maintain consistency elsewhere. Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28732 Differential Revision: D18219955 Pulled By: yf225 fbshipit-source-id: fd2e9be8f17db77c1b1f384c0d2e16cc34858c0c	2019-10-30 11:37:39 -07:00
mansoorcheema	a465b033fd	Local response norm (#28759 ) Summary: Implemented LocalResponseNorm and some initial tests for modules and functional. Reference https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28759 Differential Revision: D18219745 Pulled By: yf225 fbshipit-source-id: e6aad568a8b1e81f54752decaefd4f9044029da9	2019-10-30 11:31:00 -07:00
Thomas Viehmann	2526f97464	Include hierarchy information in C++ API loading error messages (#28499 ) Summary: Before, we would only give the key we are looking for (i.e. typically just "No such serialized tensor 'weight'", no matter for which submodule we were looking for a weight. Now we error with "No such serialized tensor '0.conv1.weight'" or similar. The analogous information is added to missing module error messages. I threw in a test, and it saved me already... Pull Request resolved: https://github.com/pytorch/pytorch/pull/28499 Differential Revision: D18122442 Pulled By: yf225 fbshipit-source-id: a134b6d06ca33de984a11d6fea923244bcd9fb95	2019-10-30 08:41:37 -07:00
mrsalehi	dfe7b25eaf	Add nn::Flatten to C++ Frontend (#28072 ) Summary: Adds torch::nn::Flatten module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28072 Differential Revision: D18202778 Pulled By: yf225 fbshipit-source-id: 43345dcbdf2f50d75746bf9a0ba293b84df275ab	2019-10-29 17:52:47 -07:00
nuka137	cbc234bceb	C++ API: torch::nn::BatchNorm1d (#28176 ) Summary: Add torch::nn::BatchNorm1d function/module support for the C++ API. torch::nn::BatchNorm{2,3}d will be added after this PR is merged. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 I would like to discuss about below items. * Necessity of `num_batches_tracked` in `BatchNormImplBase` * `num_batches_tracked` is needed to calculate `momentum` when we do not feed `momentum` argument in Python API. But in C++ API, `momentum` argument has a default value. * `num_batches_tracked` is only used for counting up `BatchNorm1d::foward()` call. I think it is no necessary for user anymore. * The design of `BatchNorm{1,2,3}dOptions` * We have already `BatchNormOptions` used for deprecated `BatchNorm` module. However, it is hard to use it for `BatchNorm{1,2,3}dOptions` because of the arguments disagreement of each modules. * In this PR, I introduce `BatchNormOptionsv2` template class for the `BatchNorm{1,2,3}dOptions`. But I'm not sure this design is good or not. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28176 Differential Revision: D18196843 Pulled By: yf225 fbshipit-source-id: 667e2b5de4150d5776c41b9088c9e6c2ead24cd4	2019-10-29 17:29:42 -07:00
Will Feng	e33b4b6761	Use c10::variant-based enums for Reduction Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27942 Test Plan: Imported from OSS Differential Revision: D18202857 Pulled By: yf225 fbshipit-source-id: 0303ce2508e3b7665c6a91ae270a7d0ef0e45900	2019-10-29 14:15:48 -07:00
jon-tow	52dd587123	C++ API parity: Upsample (#28413 ) Summary: Adds `interpolate` functional and `Upsample` module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28413 Differential Revision: D18165014 Pulled By: yf225 fbshipit-source-id: ecae2f432a301b1f4afa7c038b2d104cbad139f2	2019-10-28 21:34:44 -07:00
Will Feng	5804e54c81	Deprecate torch::nn::modules_ordered_dict API (#28774 ) Summary: I finally found a way to get the following API to work for constructing a list of named submodules for `Sequential`: ```cpp Sequential sequential({ {"m1", MyModule(1)}, {"m2", MyModule(2)} })` ``` which was actually our original proposed design and much simpler than our current API: ```cpp Sequential sequential(modules_ordered_dict({ {"m1", MyModule(1)}, {"m2", MyModule(2)} })); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/28774 Differential Revision: D18174013 Pulled By: yf225 fbshipit-source-id: 3a18c2d36b6a65a07bee7346a7516780567c7774	2019-10-28 13:01:13 -07:00
nuka137	648749b203	C++ API: torch::nn::LPPool2d (#28492 ) Summary: Add torch::nn::LPPool2d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 #27800 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28492 Differential Revision: D18109401 Pulled By: yf225 fbshipit-source-id: 5cedecb895d9d44c2167cdb3f6f758f3426b3497	2019-10-28 12:28:25 -07:00
anjali411	dc17a2ecc5	Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28433 Differential Revision: D18138240 Pulled By: anjali411 fbshipit-source-id: 314e5902f103be1feb4cacde47c90204b3d353cc	2019-10-25 11:44:28 -07:00
Will Feng	d04973beda	Use c10::variant-based enums for EmbeddingBag mode (#28330 ) Summary: This PR is BC-breaking in the following way: Previous, we require the use of `std::string` to specify the mode for `EmbeddingBag`. After this PR, we use variant-based enums such as `torch::kSum` / `torch::kMean` / `torch::kMax` to specify the mode for `EmbeddingBag`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/28330 Differential Revision: D18127116 Pulled By: yf225 fbshipit-source-id: 15cd86c764777f4d399587be92cda15b6ce8524b	2019-10-24 17:47:42 -07:00
lsrock1	e885ce6130	C++ parity, grid_sample functional (#28354 ) Summary: https://github.com/pytorch/pytorch/issues/25883 I put grid_sample in vision.h with affine grid. I have a question in string argument(interpolation mode, padding mode) I reuse torch::native::detail::GridSamplerInterpolation in GridSampler.h instead of using string. It follows the way that uses reduction enum in loss functions. I am not sure this is right. yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28354 Differential Revision: D18109333 Pulled By: yf225 fbshipit-source-id: 1bf972b671b107464f73b937bbe0de76fb259fbf	2019-10-24 15:14:37 -07:00
Will Feng	92b39434a2	C++ nn::ConstantPad{1,2,3}d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28541 Test Plan: Imported from OSS Differential Revision: D18115607 Pulled By: yf225 fbshipit-source-id: 736df791ddc3cd30ad9af89eacfb4a0c6b53f2cd	2019-10-24 15:10:27 -07:00
Will Feng	7f9941c4ea	C++ nn::ZeroPad2d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28540 Test Plan: Imported from OSS Differential Revision: D18115610 Pulled By: yf225 fbshipit-source-id: ced7c0917f4712838e753cd2e9fc4fa79fd5d310	2019-10-24 14:23:57 -07:00
Pavel Belevich	46f96d1538	C++ API parity: at::Tensor::requires_grad_ Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26332 Test Plan: Imported from OSS Differential Revision: D17427575 Pulled By: pbelevich fbshipit-source-id: 5500169a4fa0ef9cc2a7272e13b6e2d89df09260	2019-10-24 13:24:18 -07:00
Will Feng	303527d733	C++ nn::ReplicationPad{1,2,3}d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28539 Test Plan: Imported from OSS Differential Revision: D18115609 Pulled By: yf225 fbshipit-source-id: 15f4ab6a114279bb06bf62f1265b62aa12f8700f	2019-10-24 12:49:41 -07:00
Will Feng	78375c02b8	C++ nn::ReflectionPad1d and nn::ReflectionPad2d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28538 Test Plan: Imported from OSS Differential Revision: D18115608 Pulled By: yf225 fbshipit-source-id: 3a48d8c11721f013076db2965f5f75b71662c78e	2019-10-24 12:02:51 -07:00
Pavel Belevich	dd277e9086	C++ API parity: Linear Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27382 Test Plan: Imported from OSS Differential Revision: D17766735 Pulled By: pbelevich fbshipit-source-id: c7a66daeb17550eb9a5d26944427723d4ebdc6c8	2019-10-24 07:11:51 -07:00
Thomas Viehmann	09ad464d68	Change activation modules in C++ from using Tensor& to Tensor (#28501 ) Summary: Sequential does not like modules added to it to take Tensor& (const Tensor& and Tensor are both OK). Functional and others use Tensor when they want to potentially change things in-place. This changes ReLU and friends to also do that. Unfortunately, this seems to be BC breaking on the ABI level. On the other hand, use of the module ReLU seems rare enough outside Sequential (in particular in C++ models, the standard seems to be to use torch::relu instead). is the BC breaking OK here? (yf225 or anyone else) Pull Request resolved: https://github.com/pytorch/pytorch/pull/28501 Differential Revision: D18089978 Pulled By: yf225 fbshipit-source-id: ac9aba6dc2081117dece57cd8a15bafe14ec8f51	2019-10-23 13:42:22 -07:00
Anjali Chourdia	7b59174882	torch::nn::LayerNorm Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28032 Differential Revision: D18047371 Pulled By: anjali411 fbshipit-source-id: fb61aea52d6622a67ec1d84950e17e85686461ae	2019-10-22 12:50:22 -07:00
Will Feng	079b3cc02c	Add C++ nn::functional pad Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26601 Test Plan: Imported from OSS Differential Revision: D17517468 Pulled By: yf225 fbshipit-source-id: 9ee8b93b88a60f91f2ae78c242f9eaa246b3293c	2019-10-21 22:20:38 -07:00
nuka137	9ea42f8d7c	C++ API: torch::nn::LPPool1d (#27800 ) Summary: Add torch::nn::LPPool1d module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27800 Differential Revision: D18045040 Pulled By: yf225 fbshipit-source-id: e61fefe9efec3423f7a93dd1e946f3e380122927	2019-10-21 15:33:51 -07:00
Will Feng	eb4bb00a9c	Use c10::variant-based enums for Nonlinearity and FanMode Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27933 Test Plan: Imported from OSS Differential Revision: D18009044 Pulled By: yf225 fbshipit-source-id: e88229ee30badf7a699f62af61d1e88debc0dc7d	2019-10-18 17:48:34 -07:00
Carlos Miranda	a1e14a6626	PixelShuffle module and functional (#28140 ) Summary: Added `PixelShuffle` module and functional https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28140 Differential Revision: D18008474 Pulled By: yf225 fbshipit-source-id: f482495bb56998701c79a61ef065a121bf5a5154	2019-10-18 15:54:14 -07:00
naresh	bd6f9e1d6c	torch.nn.functional.gumbel_softmax #27078 (#28121 ) Summary: Comments: * Grad check from `848d1ba13a/test/test_nn.py (L8898)` not added * Double data type as seen in `848d1ba13a/test/test_nn.py (L8916)` not tested Issue: https://github.com/pytorch/pytorch/issues/27078 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28121 Differential Revision: D18008515 Pulled By: yf225 fbshipit-source-id: 9363fe9430df0f2bfd337cc788b11ac93adaa360	2019-10-18 09:41:40 -07:00
Shahriar	91a260cef9	Adding MSELoss, KLDivLoss and BCELoss to C++ front-end (#27156 ) Summary: This PR adds ```MSELoss```, ```KLDivLoss``` and ```BCELoss```. The tests for ```BCELoss``` fail with the following error: ``` unknown file: Failure C++ exception with description "autograd_meta() INTERNAL ASSERT FAILED at /home/shahriar/Contrib/pytorch/c10/core/TensorImpl.h:533, please report a bug to PyTorch. set_requires_grad is not implemented for Tensor (set_requires_grad at /home/shahriar/Contrib/pytorch/c10/core/TensorImpl.h:533) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/27156 Differential Revision: D17960323 Pulled By: yf225 fbshipit-source-id: 84b8431064f2f573679c03a8d7994e3e2f81a4d1	2019-10-17 22:07:01 -07:00
Will Feng	aad5071206	Use torch::variant for enums in C++ API Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26837 Test Plan: Imported from OSS Differential Revision: D17579438 Pulled By: yf225 fbshipit-source-id: 9ac59df28a317fdb3be2cc02c65962ad99117127	2019-10-16 22:40:57 -07:00
Ilia Cherniavskii	19956b200d	Relax set_num_threads restriction in parallel native case (#27947 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27947 Don't throw exception if the requested size is the same as the currently used one Test Plan: ATEN_THREADING=NATIVE python setup.py develop --cmake Imported from OSS Differential Revision: D17919416 fbshipit-source-id: 411f7c9bd6a46e7a003b43a200c2ce3b76453a2e	2019-10-16 21:53:36 -07:00
Carlos Miranda	7d277b0670	Multi Label Margin loss (#27659 ) Summary: In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `MultiLabelMarginLoss` module and `multilabel_margin_loss` functional. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27659 Differential Revision: D17931905 Pulled By: yf225 fbshipit-source-id: 3642f75c79843dda55ac38de9f6f970f3e237847	2019-10-16 15:44:38 -07:00
Carlos Miranda	9540f6c3fe	Soft Margin loss (#27660 ) Summary: In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `SoftMarginLoss` module and `soft_margin_loss` functional. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27660 Differential Revision: D17958325 Pulled By: yf225 fbshipit-source-id: c14422765e6e1fdabf6c9687080e6d5ff490d300	2019-10-16 12:04:08 -07:00
Zachary DeVito	5136ed0e44	Remove attempToRecoverType (#26767 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26767 Now that we have tagged ivalues, we can accurately recover the type with `ivalue.type()`. This reomoves the other half-implemented pathways that were created because we didn't have tags. Test Plan: Imported from OSS Differential Revision: D17561191 Pulled By: zdevito fbshipit-source-id: 26aaa134099e75659a230d8a5a34a86dc39a3c5c	2019-10-16 11:07:13 -07:00
Moksh Jain	f38beff800	Add nn.Bilinear to C++ Frontend (#26082 ) Summary: Adds support for the Bilinear layer to the C++ frontend Pull Request resolved: https://github.com/pytorch/pytorch/pull/26082 Differential Revision: D17954148 Pulled By: yf225 fbshipit-source-id: 5e746bdea29b00e25969cd7a22044b8059b53687	2019-10-16 09:54:01 -07:00
Jeremy Lilley	2e0294cb39	Make JIT Serialization support arbitrary std::function<> IO (#28039 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28039 Right now, torch::save() uses std::ostream, which results in unnecessary data copies in practice. Similar for torch::load(). Adding a std::function<size_t(const void*, size_t)> as an output option, parallel to the existing filename and std::ostream apis, gives users the flexibility to emit directly to a backing store. For a simple case of appending the output to a std::string, we observe significant benchmark savings (on order of -50%), even with the minor std::function<> dispatch overhead. The main reason is that std::ostringstream effectively requires 2 extra copies of the data beyond a simple string.append lambda. We also provide a parallel api for the load(), though this one is slightly more complex due to the need to do arbitrary position reads. Test Plan: buck test mode/dev-nosan caffe2/test/... (Basic serialization test in caffe2/test/cpp/api/serialize.cpp) Benchmark in experimental/jeremyl/c2/SerializationBench.cpp, with D17823443 (1M time goes from 90ms -> 40ms, albeit with crc patch applied) Differential Revision: D17939034 fbshipit-source-id: 344cce46f74b6438cb638a8cfbeccf4e1aa882d7	2019-10-15 22:12:04 -07:00
Will Feng	964d3d8b38	Revert D17822962: [pytorch][PR] Make JIT Serialization support arbitrary std::function<> IO Test Plan: revert-hammer Differential Revision: D17822962 Original commit changeset: d344a7e59707 fbshipit-source-id: ba153a2110faf91d103bd0f8dea4e9613bd6b0da	2019-10-15 13:55:11 -07:00
Jeremy Lilley	cbe5ab1109	Make JIT Serialization support arbitrary std::function<> IO (#27586 ) Summary: Right now, torch::save() uses std::ostream, which results in unnecessary data copies in practice. Similar for torch::load(). Adding a std::function<size_t(const void*, size_t)> as an output option, parallel to the existing filename and std::ostream apis, gives users the flexibility to emit directly to a backing store. For a simple case of appending the output to a std::string, we observe significant benchmark savings (on order of -50%), even with the minor std::function<> dispatch overhead. The main reason is that std::ostringstream effectively requires 2 extra copies of the data beyond a simple string.append lambda. We also provide a parallel api for the load(), though this one is slightly more complex due to the need to do arbitrary position reads. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27586 Test Plan: buck test mode/dev-nosan caffe2/test/... (Basic serialization test in caffe2/test/cpp/api/serialize.cpp) Benchmark in experimental/jeremyl/c2/SerializationBench.cpp, with D17823443 (1M time goes from 90ms -> 40ms, albeit with crc patch applied) Differential Revision: D17822962 Pulled By: jjlilley fbshipit-source-id: d344a7e59707f3b30d42280fbab78f87399e4d10	2019-10-15 12:39:58 -07:00
Divyansh Singhvi	3397d41b8a	Wrapping namespace Reduction in namespace at (#26606 ) (#27422 ) Summary: 1) Wrapped namespace `Reduction` in namespace `at` 2) Prefixed `at::` wherever `Reduction::` is used Pull Request resolved: https://github.com/pytorch/pytorch/pull/27422 Differential Revision: D17913759 Pulled By: yf225 fbshipit-source-id: 8f00ca01cad2e7f673d316b128abf59c026e216c	2019-10-15 11:05:40 -07:00
Will Feng	801b6cd0bd	Allow passing undefined Tensor to Module::register_parameter (#27948 ) Summary: C++ API `Module::register_parameter` should accept undefined Tensor as parameter, which is equivalent to `module.register_parameter("param", None)` in Python API. This unblocks https://github.com/pytorch/pytorch/pull/26082 and https://github.com/pytorch/pytorch/pull/27156. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27948 Differential Revision: D17931739 Pulled By: yf225 fbshipit-source-id: 21bdfc88e66e3dc39f3caf608a6a3de48c510fa9	2019-10-15 10:10:42 -07:00
Will Feng	11172c19be	codemod at::ArrayRef and torch::IntArrayRef to std::vector in C++ API tests (#27884 ) Summary: `at::ArrayRef` / `torch::IntArrayRef` should be discouraged in user code, because users might not be aware of the fact that it doesn't own the underlying data, which already leads to memory access bugs when they try to write the following: ```cpp auto expected_sizes = torch::IntArrayRef({2, 16, 6}); // The memory that represents `{2, 16, 6}` is released after this line ASSERT_EQ(output.sizes(), expected_sizes); // `expected_sizes` is pointing to invalid memory region ``` This PR changes all usage of `at::ArrayRef` and `torch::IntArrayRef` to the corresponding `std::vector` version, so that users won't pick up the habit of using `ArrayRef` by looking at the test code. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27884 Differential Revision: D17921646 Pulled By: yf225 fbshipit-source-id: 461e79fc22b598aac230d36cc028085ce6cbe937	2019-10-14 18:00:30 -07:00
Carlos Miranda	2cae3928b0	Multi-Label Soft Margin loss (#27669 ) Summary: In accordance with https://github.com/pytorch/pytorch/issues/25883, I added the `MultiLabelSoftMarginLoss` module and `multilabel_soft_margin_loss` functional. It looks like there isn't a C++ ATen implementation of `multilabel_soft_margin_loss`, so I translated the python version, which does not rely on a C/C++ backend either. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27669 Differential Revision: D17907608 Pulled By: yf225 fbshipit-source-id: ccb02951e009973c2adbe604593ce929f10c39eb	2019-10-14 13:29:45 -07:00
jon-tow	0003771423	C++ API parity: Unfold (#27809 ) Summary: Adds `unfold` functional and module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27809 Differential Revision: D17901792 Pulled By: yf225 fbshipit-source-id: ff58a1866bf240f37ebc589463c60593b8931f51	2019-10-14 13:21:59 -07:00
Gaurav Tamba	cc5c34a0d0	Add nn::functional::normalize() to C++ Frontend (#27280 ) Summary: Addresses https://github.com/pytorch/pytorch/issues/27048 PR Summary: Files Added: _torch/csrc/api/include/torch/nn/options/normalization.h torch/csrc/api/include/torch/nn/functional/normalization.h_ Files Modified: _test/cpp/api/functional.cpp torch/csrc/api/include/torch/nn/functional.h_ --- yf225 : I couldn't find a C++ equivalent of gradcheck(), is there such a function or is it sufficient to call .backward() in the test body? I don't think any solutions are checked for the Python tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27280 Differential Revision: D17902109 Pulled By: yf225 fbshipit-source-id: 1bce1a88103d0f1848633fec90fde95ea8f3d1ed	2019-10-14 08:39:02 -07:00
nuka137	07d4374239	C++ API: torch::nn::Softmax2d (#27509 ) Summary: Add torch::nn::Softmax2d module support for the C++ API. Softmax2d only supports module in Python API, so this PR adds only module support as well. This PR is WIP because it uses the function in https://github.com/pytorch/pytorch/issues/27446 . After https://github.com/pytorch/pytorch/issues/27446 is merged, I will remove WIP. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27509 Differential Revision: D17899715 Pulled By: yf225 fbshipit-source-id: bd891bc995f5a92bf4f5405f8bf07d1bd5de2479	2019-10-13 11:00:56 -07:00
PyExtreme	52528c041a	- TripletMarginLoss (#27713 ) Summary: Hi yf225 , I had to create a new branch to tackle merge conflict since I am using cloud due to some limitations on my PC. Therefore, I don't have enough command there. Also, I have incorporated the changes you have put before here https://github.com/pytorch/pytorch/pull/27613 Also, it would be great if you could recommend me some resources to work smmothly on GCP..:-D Thank you Pull Request resolved: https://github.com/pytorch/pytorch/pull/27713 Differential Revision: D17899695 Pulled By: yf225 fbshipit-source-id: eb6643223148774a5cbbd093bdcc5623872e5bba	2019-10-13 10:57:37 -07:00
Pavel Belevich	446a79b959	C++ API parity: Threshold Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27538 Test Plan: Imported from OSS Differential Revision: D17835415 Pulled By: pbelevich fbshipit-source-id: 2a887704655be79ee458081c46a7eea31eca51dc	2019-10-13 09:38:31 -07:00
Pavel Belevich	cbdd55c669	C++ API parity: Tanhshrink Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27537 Test Plan: Imported from OSS Differential Revision: D17835409 Pulled By: pbelevich fbshipit-source-id: ad4120cfe01ea2508bf3ce1054022a2da649ac74	2019-10-13 08:12:13 -07:00
Pavel Belevich	2750ea25b2	C++ API parity: Tanh Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27536 Test Plan: Imported from OSS Differential Revision: D17835411 Pulled By: pbelevich fbshipit-source-id: c8984aec2f4bae48ff901fafc8c53a4122192ac5	2019-10-13 06:34:18 -07:00
Pavel Belevich	96aafc3cdc	C++ API parity: Softsign Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27535 Test Plan: Imported from OSS Differential Revision: D17835408 Pulled By: pbelevich fbshipit-source-id: 8548deab91f6fe0f7285fdd919c25129ed042181	2019-10-12 08:30:10 -07:00
Pavel Belevich	fcb6dd079e	C++ API parity: Softshrink Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27534 Test Plan: Imported from OSS Differential Revision: D17835404 Pulled By: pbelevich fbshipit-source-id: 7b9f3d3ea793f82840496912f248b0c48bb7463e	2019-10-12 06:36:20 -07:00
nuka137	abaa44122d	C++ API: torch::nn::Softmin (#27459 ) Summary: Add torch::nn::Softmin module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27459 Differential Revision: D17892852 Pulled By: yf225 fbshipit-source-id: db15b06e8ad33947e7d65995df700f5e90c3b6a8	2019-10-11 23:03:55 -07:00
Pavel Belevich	c79d3a4a98	C++ API parity: Softplus Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27489 Test Plan: Imported from OSS Differential Revision: D17835410 Pulled By: pbelevich fbshipit-source-id: 51a8c4ab2ff4b860c96eda1ed8f073017b8cf9ae	2019-10-11 09:00:32 -07:00
Pavel Belevich	9d448099fd	C++ API parity: Sigmoid Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27488 Test Plan: Imported from OSS Differential Revision: D17835405 Pulled By: pbelevich fbshipit-source-id: 78e13047a2a1f2776c59e778db7ba120716e93d3	2019-10-11 07:45:31 -07:00
Pavel Belevich	795c913636	C++ API parity: CELU Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27487 Test Plan: Imported from OSS Differential Revision: D17835406 Pulled By: pbelevich fbshipit-source-id: a8282ae65d8996efcc8b8d846cfa637c3f89eda6	2019-10-11 06:23:57 -07:00
Pavel Belevich	6294a9a877	C++ API parity: RReLU Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27437 Test Plan: Imported from OSS Differential Revision: D17835413 Pulled By: pbelevich fbshipit-source-id: 5d943fdac4fd2633e7f7ca13db1a7fed5636ca50	2019-10-10 19:14:48 -07:00
Pavel Belevich	352092ca95	C++ API parity: ReLU6 Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27436 Test Plan: Imported from OSS Differential Revision: D17835414 Pulled By: pbelevich fbshipit-source-id: 77e743d2f6b71fb3ba5643f9d676f2bb8f236cfa	2019-10-10 17:12:17 -07:00
nuka137	6711969dd8	C++ API: torch::nn::LogSoftmax (#27462 ) Summary: Add torch::nn::LogSoftmax module and functional support for the C++ API. Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27462 Differential Revision: D17867121 Pulled By: yf225 fbshipit-source-id: dae8ac981c1c6ccdef013cd2d886ad4a043f6243	2019-10-10 16:18:15 -07:00
Pavel Belevich	8515650c2b	C++ API parity: ReLU Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27435 Test Plan: Imported from OSS Differential Revision: D17835407 Pulled By: pbelevich fbshipit-source-id: b8ee86c7a76674bc88d8e995424dad22d3caab59	2019-10-10 13:34:38 -07:00
Will Feng	e8087a3060	Change C++ API test files to only include torch/torch.h (#27067 ) Summary: One of the purposes of the C++ API tests in `test/cpp/api/` should be to check that including `torch/torch.h` is a sufficient prerequisite for using all C++ frontend features. This PR change ensures that. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27067 Differential Revision: D17856815 Pulled By: yf225 fbshipit-source-id: 49c057bd807b003e4a00f6ba73131d763a0f277a	2019-10-10 09:46:29 -07:00
jon-tow	f3df6b8ede	Add C++ torch::nn::functional::affine_grid (#27263 ) Summary: Adds`torch::nn::functional::affine_grid` functional support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883, https://github.com/pytorch/pytorch/issues/27196 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27263 Differential Revision: D17802350 Pulled By: yf225 fbshipit-source-id: e823ee53da4a4cc6a1650d2dfc09b0ef6a74e249	2019-10-09 23:17:49 -07:00
Pavel Belevich	1fec1441a1	C++ API parity: PReLU Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27429 Test Plan: Imported from OSS Differential Revision: D17835412 Pulled By: pbelevich fbshipit-source-id: e678d5920dad1293bb0ba3de28e2da3087d19bde	2019-10-09 16:31:54 -07:00
Carlos Miranda	3246fddfd6	Implement C++ API torch::nn::MultiMarginLoss. (#27424 ) Summary: Hi yf225 , here is the C++ frontend API MultiMarginLoss implementation and tests https://github.com/pytorch/pytorch/issues/27198. Could you review it and tell me if it is okay? I am not entirely sure I used `c10::optional` correctly, but `options.weight()` resulted in a compilation error, so I went with `options.weight().value()` instead of `value_or()` to follow the logic in `torch.nn._WeightedLoss.register_buffer` (where one can pass a `None` value). Oh, and are the tests supposed to be skipped or did I do something wrong? I ran `pytest test/test_cpp_api_parity.py -k Loss -v` , and the `L1Loss` test passed but the others were skipped... Thank you for the review in any case! Pull Request resolved: https://github.com/pytorch/pytorch/pull/27424 Differential Revision: D17839963 Pulled By: yf225 fbshipit-source-id: f4b6012590cf22d56d42751c214df80cce717cb8	2019-10-09 14:44:41 -07:00
jon-tow	0fed4756d0	C++ API parity: SELU (#27434 ) Summary: Adds `SELU` functional and module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27434 Differential Revision: D17782762 Pulled By: yf225 fbshipit-source-id: 96c7ce84b9baf9e219a63e631929b8997ba6f3f0	2019-10-09 14:39:28 -07:00
nuka137	28a1806cbc	C++ API: torch::nn::Softmax (#27446 ) Summary: Add torch::nn::Softmax module support for the C++ API Related Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27446 Differential Revision: D17839546 Pulled By: yf225 fbshipit-source-id: 7c7fb55111b261614de7c3a75fa1019fbde93c67	2019-10-09 14:19:47 -07:00
Anjali Chourdia	a37be201c1	Implement torch.nn.Embedding / EmbeddingBag in PyTorch C++ API (#26358 ) Summary: added more variables to EmbeddingOptions and updated EmbeddingImpl reset, forward functions. Also added EmbeddingBag. ----- This PR is BC-breaking in the following way: Previously, `EmbeddingOptions` supports `count` and `dimension` as options arguments. After this PR, they are renamed to `num_embeddings` and `embedding_dim` respectively. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26358 Differential Revision: D17714337 Pulled By: yf225 fbshipit-source-id: f9f969c68e4bece106b92f8e2e02ac39c8455fb7	2019-10-08 22:13:39 -07:00
Jonathan Tow	3b5d40c339	Add C++ torch::nn::CosineEmbeddingLoss (#27345 ) Summary: Adds `torch::nn::CosineEmbeddingLoss` module and functional support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27345 Differential Revision: D17801402 Pulled By: yf225 fbshipit-source-id: 0eabe80d7d36397e6667b331c3fa2f56d7a15962	2019-10-08 10:52:05 -07:00
Pavel Belevich	2cc1e69cc9	C++ API parity: LogSigmoid Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27060 Test Plan: Imported from OSS Differential Revision: D17682404 Pulled By: pbelevich fbshipit-source-id: d60d64cd4caf1f56a2e05c516f91321d46ec9624	2019-10-05 06:18:25 -07:00
Pavel Belevich	8b61a220c0	C++ API parity: LeakyReLU Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27059 Test Plan: Imported from OSS Differential Revision: D17682407 Pulled By: pbelevich fbshipit-source-id: 2a4f42e9438799ba8de7282ac7a6fd3ff97ee048	2019-10-04 14:18:03 -07:00
Rohan Varma	badb08d577	Add clip_grad_norm_ to c++ api (#26140 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26140 Per https://github.com/pytorch/pytorch/issues/25883, we want to work towards C++/Python API parity. This diff adds clip_grad_norm_ to the c++ API to improve parity. ghstack-source-id: 91334333 ghstack-source-id: 91334333 Test Plan: Added a unit test Differential Revision: D17312367 fbshipit-source-id: 753ba3a4d084d01f3cc8919da3108e67c809ad65	2019-10-04 13:50:36 -07:00
Pavel Belevich	192ca9730f	C++ API parity: Hardtanh Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27038 Test Plan: Imported from OSS Differential Revision: D17682405 Pulled By: pbelevich fbshipit-source-id: f65e76696e0041c3518f56da94f2e3b800305234	2019-10-04 12:53:33 -07:00
Pavel Belevich	05df6b67c6	C++ API parity: TensorTest.BackwardNonScalarOutputs Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27314 Test Plan: Imported from OSS Differential Revision: D17746371 Pulled By: pbelevich fbshipit-source-id: 246fae22a60ed9a6d7b9843239b4b3391cc9dc3e	2019-10-03 15:36:35 -07:00
Zino Benaissa	803f7bfaac	Implement C++ API version of torch.nn.functional.one_hot (#27081 ) (#27177 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27177 Add support for F::one_hot C++ function. Test Plan: Added 3 new tests to verify API is working Imported from OSS Differential Revision: D17697934 fbshipit-source-id: a8127fb87c00daa119bb92a5702bc4bbba48290d	2019-10-02 17:28:39 -07:00
Pavel Belevich	515e3b85da	C++ API parity: Hardshrink Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27035 Test Plan: Imported from OSS Differential Revision: D17682403 Pulled By: pbelevich fbshipit-source-id: 186377fe577abfdd53acc95751a7ed845b51af95	2019-10-02 08:30:20 -07:00
Edward Yang	33db4e02cb	Separate libtorch tests from libtorch build. (#26927 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26927 When we build a "normal" copy of PyTorch, we internally build a copy of libtorch. If we want to test libtorch: we have a choice: test against the regular PyTorch build, or test against the libtorch only build. All of our libtorch tests require Python-side PyTorch to run. So it makes more sense to test the regular PyTorch build. There is probably still utility in making sure that it is still possible to build libtorch only, but in that case we should endeavour to run tests that ONLY require libtorch build, and not Python side stuff. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D17695384 Pulled By: ezyang fbshipit-source-id: 02522a8be0f5944f2b6255a8f1281e53ce2dcc6f	2019-10-02 08:04:52 -07:00
Pavel Belevich	c864454a8f	C++ API parity: ELU Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27028 Test Plan: Imported from OSS Differential Revision: D17682406 Pulled By: pbelevich fbshipit-source-id: 9c313237cb93b9870c6fcf8d01b3dbe4af4c6f2a	2019-10-02 07:12:08 -07:00
Pavel Belevich	5005f7bce7	C++ API parity: MaxUnpool3d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27027 Test Plan: Imported from OSS Differential Revision: D17682402 Pulled By: pbelevich fbshipit-source-id: 2008ce405176c174cdba88b4f25cd77a82bb13ea	2019-10-02 05:40:42 -07:00
Pavel Belevich	5cac738713	C++ API parity: MaxUnpool2d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26915 Test Plan: Imported from OSS Differential Revision: D17627826 Pulled By: pbelevich fbshipit-source-id: 04a5a7e7d19b1610cafaaa0bd329d4d228ab4be5	2019-10-01 19:29:15 -07:00
Pavel Belevich	d125a83f98	C++ API parity: MaxUnpool1d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26896 Test Plan: Imported from OSS Differential Revision: D17627825 Pulled By: pbelevich fbshipit-source-id: 369d0080412467d0259eb5e692a0778c71b12343	2019-10-01 14:53:40 -07:00
jon-tow	18eea8269a	Add C++ torch::nn::functional::pdist (#27122 ) Summary: Adds `torch::nn::functional::pdist` module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883, https://github.com/pytorch/pytorch/issues/27082 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27122 Differential Revision: D17685823 Pulled By: yf225 fbshipit-source-id: f8ceb09635385ef2e16a002e5fc255be8eb2ebf4	2019-10-01 07:05:25 -07:00
jon-tow	209dc4c4ba	Add C++ torch::nn::HingeEmbeddingLoss (#27101 ) Summary: Adds `torch::nn::HingeEmbeddingLoss` module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27101 Differential Revision: D17680489 Pulled By: yf225 fbshipit-source-id: 1f8f41775a9e1272a98232c8f899418b2b907eca	2019-09-30 19:29:24 -07:00
Will Feng	27d4b34ea6	Add temporary torch::k{name} enum declarations (#27051 ) Summary: This PR adds temporary declarations for `torch::k{name}` enums, so that we can submit a PR to rename the enum usage in torchvision. And then, after the changes to torchvision is done, we can remove the temporary declarations in https://github.com/pytorch/pytorch/pull/26837 to officially move over to using `c10::variant` for enums. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27051 Differential Revision: D17672220 Pulled By: yf225 fbshipit-source-id: 4ae77634e8c7efa3404698f7c1a69177cbb5dab3	2019-09-30 13:38:29 -07:00
Pavel Belevich	1a3997e0b8	C++ API parity: AdaptiveAvgPool3d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26819 Test Plan: Imported from OSS Differential Revision: D17627829 Pulled By: pbelevich fbshipit-source-id: be4d803c7d4ba2c59e54d154eeebc63794465191	2019-09-28 22:32:21 -07:00
Pavel Belevich	a31fd5ea68	C++ API parity: AdaptiveAvgPool2d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26818 Test Plan: Imported from OSS Differential Revision: D17627822 Pulled By: pbelevich fbshipit-source-id: 0e1dea1c3ff2650dbc7902ce704ac6b47588d0bb	2019-09-28 10:45:03 -07:00
Pavel Belevich	7d58060f49	C++ API parity: AdaptiveAvgPool1d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26808 Test Plan: Imported from OSS Differential Revision: D17627827 Pulled By: pbelevich fbshipit-source-id: 13ad1d0414e7b62f4fc2f6573332bb2c07b16b53	2019-09-28 10:23:31 -07:00
Pavel Belevich	5aa01fd89a	C++ API parity: AdaptiveMaxPool3d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26775 Test Plan: Imported from OSS Differential Revision: D17627824 Pulled By: pbelevich fbshipit-source-id: c4ae077ea5575c5d1df795e74a0dcb74a695ad06	2019-09-27 15:31:37 -07:00
Pavel Belevich	bb7a415bcc	C++ API parity: AdaptiveMaxPool2d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26772 Test Plan: Imported from OSS Differential Revision: D17627823 Pulled By: pbelevich fbshipit-source-id: 195f1edabbbbe245de3568beb0c7925eb347118a	2019-09-27 12:41:38 -07:00
Will Feng	2f1932fc5c	Fix issues in torch::tensor constructor (#26890 ) Summary: This PR contains the following: 1. Fix ambiguous overload problem when `torch::tensor({{1, 2}})` is used: ``` ../test/cpp/api/tensor.cpp: In member function ‘virtual void TensorTest_MultidimTensorCtor_Test::TestBody()’: ../test/cpp/api/tensor.cpp:202:41: error: call of overloaded ‘tensor(<brace-enclosed initializer list>)’ is ambiguous auto tensor = torch::tensor({{1, 2}}); ^ In file included from ../caffe2/../torch/csrc/api/include/torch/types.h:7:0, from ../caffe2/../torch/csrc/api/include/torch/detail/static.h:4, from ../caffe2/../torch/csrc/api/include/torch/nn/pimpl.h:4, from ../caffe2/../torch/csrc/api/include/torch/nn/module.h:3, from ../caffe2/../torch/csrc/api/include/torch/nn/cloneable.h:3, from ../test/cpp/api/support.h:7, from ../test/cpp/api/tensor.cpp:2: ../torch/csrc/autograd/generated/variable_factories.h:177:644: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<unsigned char>) ../torch/csrc/autograd/generated/variable_factories.h:177:1603: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<signed char>) ../torch/csrc/autograd/generated/variable_factories.h:177:2562: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<short int>) ../torch/csrc/autograd/generated/variable_factories.h:177:3507: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<int>) ../torch/csrc/autograd/generated/variable_factories.h:177:4450: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<long int>) ../torch/csrc/autograd/generated/variable_factories.h:177:5404: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<float>) ../torch/csrc/autograd/generated/variable_factories.h:177:6354: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<double>) ../torch/csrc/autograd/generated/variable_factories.h:177:7630: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<bool>) ../torch/csrc/autograd/generated/variable_factories.h:177:9224: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<c10::Half>) ../torch/csrc/autograd/generated/variable_factories.h:177:10838: note: candidate: at::Tensor torch::tensor(c10::ArrayRef<c10::BFloat16>) In file included from ../caffe2/../torch/csrc/api/include/torch/types.h:7:0, from ../caffe2/../torch/csrc/api/include/torch/detail/static.h:4, from ../caffe2/../torch/csrc/api/include/torch/nn/pimpl.h:4, from ../caffe2/../torch/csrc/api/include/torch/nn/module.h:3, from ../caffe2/../torch/csrc/api/include/torch/nn/cloneable.h:3, from ../test/cpp/api/support.h:7, from ../test/cpp/api/tensor.cpp:2: ../torch/csrc/autograd/generated/variable_factories.h:193:19: note: candidate: at::Tensor torch::tensor(torch::detail::InitListTensor) inline at::Tensor tensor(detail::InitListTensor list_init_tensor) { ^ ``` After this PR, the multidim tensor constructor `torch::tensor(...)` should be ready for general use. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26890 Differential Revision: D17632608 Pulled By: yf225 fbshipit-source-id: 2e653d4ad85729d052328a124004d64994bec782	2019-09-27 12:07:50 -07:00
Will Feng	3acbcb96d4	Include `iteration_` in SGD optimizer serialization (#26906 ) Summary: This PR fixes https://github.com/pytorch/pytorch/issues/24192 by including the private field `iteration_` in SGD optimizer serialization. Under the hood, `iteration_` is serialized into an `IValue`, then stored in a JIT module as an attribute. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26906 Differential Revision: D17628359 Pulled By: yf225 fbshipit-source-id: beec1367459e973a1c9080dc86f502e4c7bc5ebd	2019-09-27 09:37:20 -07:00
Pavel Belevich	0a393f6ef5	C++ API parity: AdaptiveMaxPool1d Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26755 Test Plan: Imported from OSS Differential Revision: D17627828 Pulled By: pbelevich fbshipit-source-id: f898a4d2c269b98eb5905291914caa25bca87ce0	2019-09-27 09:10:39 -07:00
Pavel Belevich	77bfe61ff4	C++ API parity: TensorTest.Data fix Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26920 Test Plan: Imported from OSS Differential Revision: D17614135 Pulled By: pbelevich fbshipit-source-id: 96d70a5e7724338d2829bf006696c2d0ac1025a6	2019-09-26 16:51:24 -07:00
Will Feng	b5d15315d8	Improve C++ maxpool and avgpool (#26521 ) Summary: This PR makes the following improvements: 1. Add `forward_with_indices` method to all C++ MaxPool modules, to return the max indices along with the outputs. (We can't make two `forward` methods that return different types based on input, because that will break the type deduction of `torch::detail::return_type_of_forward_t`) 2. Add `max_poolNd_with_indices` to `torch::nn::functional`, to be used when indices of the max values are needed. (We can't merge this with `torch::nn::functional::max_poolNd` because the return type of `max_poolNd` has to be defined statically). 3. Improve `pretty_print` of C++ MaxPoolNd and AvgPoolNd modules to match the Python `extra_repr`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26521 Differential Revision: D17507358 Pulled By: yf225 fbshipit-source-id: b6c0e2b27b38378cdc0c75f4bfc797b3c6b17cd9	2019-09-25 13:52:58 -07:00
Will Feng	d4dc844ec3	Add comments for multidim tensor factory limitations, and rename ListInitTensor for better clarity (#26756 ) Summary: This PR includes the following improvements: 1. Add comments for limitations of the multidim tensor factory function `torch::tensor(...)`, noting the fact that `torch::tensor({})` and mixed data type such as `torch::tensor({{bool, 2.0}})` are not supported at the moment. (I will also update https://pytorch.org/cppdocs/notes/tensor_creation.html to include usage examples for the multidim tensor factory function `torch::tensor(...)`) 2. Rename `ListInitTensor` to `InitListTensor`, for better naming consistency. This addresses reviews in https://github.com/pytorch/pytorch/pull/26210. I will work on a separate PR to move the factory function to `at::`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26756 Differential Revision: D17560136 Pulled By: yf225 fbshipit-source-id: eb8b45226e999784da48f75cc8953a998582df99	2019-09-24 19:21:23 -07:00
jon-tow	5e5b9a9321	Add C++ nn::Identity (#26713 ) Summary: Summary: Adds `torch::nn::Identity` module support for the C++ API. Issue: https://github.com/pytorch/pytorch/issues/25883 Reviewer: yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/26713 Differential Revision: D17550982 Pulled By: yf225 fbshipit-source-id: f24483846e82d5d276d77a1a0c50884f3bc05112	2019-09-24 16:29:49 -07:00
Will Feng	3cae3021e5	Add tests for C++ functional cosine_similarity and pairwise_distance, and clean up functional test code (#26559 ) Summary: This ensures that `F::cosine_similarity` and `F::pairwise_distance` can be used simply by including `torch/torch.h` and set `namespace F = torch::nn::functional`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26559 Differential Revision: D17507421 Pulled By: yf225 fbshipit-source-id: f895dde3634d5c8ca66ee036903e327e5cdab6b1	2019-09-24 09:10:42 -07:00
Pavel Belevich	450504cd95	C++ API parity: at::Tensor::set_data Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26647 Test Plan: Imported from OSS Differential Revision: D17542604 Pulled By: pbelevich fbshipit-source-id: 37d5d67ebdb9348b5561d983f9bd26d310210983	2019-09-24 04:51:22 -07:00
Pavel Belevich	6b25562489	C++ API parity: at::Tensor::detach Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26251 Test Plan: Imported from OSS Differential Revision: D17427578 Pulled By: pbelevich fbshipit-source-id: c3d23a8c2da4148b86e7760ba5023eb38f7835af	2019-09-22 06:10:48 -07:00
Pavel Belevich	d117842e56	C++ API parity: at::Tensor::version Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26561 Test Plan: Imported from OSS Differential Revision: D17507167 Pulled By: pbelevich fbshipit-source-id: 167890c7b745acc9cb9ce4185f1d8c1745aaecc2	2019-09-21 08:37:46 -07:00
Will Feng	da8fbe5bf0	Minor improvement to C++ nn::Distance tests (#26539 ) Summary: C++ `nn::Distance` tests can take advantage of the newly released multi-dimensional tensor constructor https://github.com/pytorch/pytorch/pull/26210 to simplify the tensor constructions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26539 Differential Revision: D17501041 Pulled By: yf225 fbshipit-source-id: 21d5f95ab3ec02227115c823c581218cee2ce458	2019-09-20 12:40:52 -07:00
Edward Yang	a5bcde97af	Revert D17427577: C++ API parity: at::Tensor::version Test Plan: revert-hammer Differential Revision: D17427577 Original commit changeset: e9b3e76ca44d fbshipit-source-id: a5bbae208ba33a31f90ab5c9b199f232de0c6d1b	2019-09-20 11:19:43 -07:00
Pavel Belevich	198521978b	C++ API parity: at::Tensor::version Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26217 Test Plan: Imported from OSS Differential Revision: D17427577 Pulled By: pbelevich fbshipit-source-id: e9b3e76ca44df883e3038b688dd7b930752d93a2	2019-09-20 11:02:41 -07:00
jon-tow	872ca919a9	Distance module (#26424 ) Summary: Adds `Distance` module parity. https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/26424 Differential Revision: D17487314 Pulled By: yf225 fbshipit-source-id: c7d124cb4afb08a4733e7212af0bb276bf32d172	2019-09-20 07:28:49 -07:00

... 3 4 5 6 7 ...

685 Commits