# C++ Frontend Tests

In this folder live the tests for PyTorch's C++ Frontend. They use the GoogleTest test framework.
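As a point of reference, a test in this folder is an ordinary GoogleTest case that exercises the C++ frontend directly. The example below is hypothetical (the suite name and assertions are made up) but follows that style.

```cpp
#include <gtest/gtest.h>
#include <torch/torch.h>

// Build a small C++ frontend module and assert on the shape of its output.
TEST(ExampleTestSuite, LinearProducesExpectedShape) {
  torch::nn::Linear linear(/*in_features=*/4, /*out_features=*/2);
  auto output = linear(torch::ones({3, 4}));
  ASSERT_EQ(output.size(0), 3);
  ASSERT_EQ(output.size(1), 2);
}
```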
## CUDA Tests

To make a test runnable only on platforms with CUDA, you should suffix your test with `_CUDA`, e.g.

```cpp
TEST(MyTestSuite, MyTestCase_CUDA) { }
```
To make it runnable only on platforms with at least two CUDA devices, suffix it with `_MultiCUDA` instead of `_CUDA`, e.g.

```cpp
TEST(MyTestSuite, MyTestCase_MultiCUDA) { }
```
There is logic in `main.cpp` that detects the availability and number of CUDA devices and supplies the appropriate negative filters to GoogleTest.
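The sketch below shows one way such filtering could be written. It is an illustrative example rather than the actual `main.cpp`, and it assumes `torch::cuda::is_available()` and `torch::cuda::device_count()` from `<torch/cuda.h>` plus GoogleTest's `--gtest_filter` syntax.

```cpp
#include <gtest/gtest.h>
#include <torch/cuda.h>

#include <string>

// Append negative test-name patterns to the current --gtest_filter value.
static void exclude_tests(const std::string& patterns) {
  std::string filter = ::testing::GTEST_FLAG(filter);
  // gtest filters look like "POSITIVE[-NEGATIVE]": start the negative
  // section if there is none yet, otherwise extend it.
  filter += (filter.find('-') == std::string::npos) ? "-" : ":";
  filter += patterns;
  ::testing::GTEST_FLAG(filter) = filter;
}

int main(int argc, char* argv[]) {
  ::testing::InitGoogleTest(&argc, argv);
  if (!torch::cuda::is_available()) {
    exclude_tests("*_CUDA:*_MultiCUDA"); // no GPU: skip all CUDA tests
  } else if (torch::cuda::device_count() < 2) {
    exclude_tests("*_MultiCUDA"); // one GPU: skip multi-device tests
  }
  return RUN_ALL_TESTS();
}
```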
## Integration Tests

Integration tests use the MNIST dataset. You must download it by running the following command from the PyTorch root folder:

```sh
$ python tools/download_mnist.py -d test/cpp/api/mnist
```
The required paths will be referenced as `test/cpp/api/mnist/...` in the test code, so you must run the integration tests from the PyTorch root folder.
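For context, the C++ frontend ships a built-in MNIST dataset that reads exactly such a root directory; the hedged sketch below (not code from this test suite) shows why the relative path only resolves when the binary runs from the PyTorch root folder.

```cpp
#include <torch/torch.h>

#include <iostream>

int main() {
  // Load the raw MNIST files from the directory populated by download_mnist.py.
  // The relative path assumes the working directory is the PyTorch root folder.
  auto dataset = torch::data::datasets::MNIST("test/cpp/api/mnist")
                     .map(torch::data::transforms::Stack<>());
  auto loader =
      torch::data::make_data_loader(std::move(dataset), /*batch_size=*/64);
  for (auto& batch : *loader) {
    std::cout << "batch data shape: " << batch.data.sizes() << std::endl;
    break; // reading one batch is enough to verify the paths resolve
  }
}
```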