Summary:
Just a small thing that's a little annoying; currently `TORCH_INTERNAL_ASSERT` generates messages like this:
```
unknown file: Failure
C++ exception with description "g_outputs.count(out) > 0INTERNAL ASSERT FAILED at "../torch/csrc/jit/passes/memory_planning.cpp":481, please report a bug to PyTorch. 22
Exception raised from getManagedLiveRangesFromMemEvents at ../torch/csrc/jit/passes/memory_planning.cpp:481 (most recent call first):
```
i.e. with no space between the `#cond` checked and `INTERNAL ASSERT FAILED...`
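For illustration, a minimal self-contained sketch (hypothetical macro, not the actual c10 implementation) of how stringizing the condition and concatenating it directly with the banner literal produces the run-together output:
```
#include <cstdio>

// The condition is stringized with #cond and concatenated directly with the
// banner literal, so nothing separates them in the output.
#define DEMO_ASSERT(cond)                               \
  if (!(cond)) {                                        \
    std::printf("%sINTERNAL ASSERT FAILED at %s:%d\n",  \
                #cond, __FILE__, __LINE__);             \
  }

int main() {
  DEMO_ASSERT(1 + 1 == 3);  // prints: 1 + 1 == 3INTERNAL ASSERT FAILED at ...
}
```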
Re the `STRIP_ERROR_MESSAGES` branch: I'm not sure whether the differences are intentional or accidental, i.e. we don't pass `__FILE__` and `__VA_ARGS__` to `torchCheckFail`, whereas we do to `torchInternalAssertFail`. I can reconcile the two in this PR if it's simply an omission.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64553
Reviewed By: albanD
Differential Revision: D34482923
Pulled By: swolchok
fbshipit-source-id: 40b648dadb5ceea5d9038efd60771d98aa7f0b46
(cherry picked from commit 8d6f1168b2242dfc0d911cce39f0863260e384b1)
Summary:
This PR fixes https://github.com/pytorch/pytorch/issues/64785 by introducing a `torch.LinAlgError` for reporting errors caused by bad values in linear algebra routines, which should allow users to easily catch failures caused by numerical issues.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/68571
Reviewed By: malfet
Differential Revision: D33254087
Pulled By: albanD
fbshipit-source-id: 94b59000fdb6a9765e397158e526d1f815f18f0f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/69041
`TH_CONCAT_{N}` is still being used by THP, so I've moved that into its own header, but all the compiled code is gone.
Test Plan: Imported from OSS
Reviewed By: anjali411
Differential Revision: D32872477
Pulled By: ngimel
fbshipit-source-id: 06c82d8f96dbcee0715be407c61dfc7d7e8be47a
Summary:
Fixes https://github.com/pytorch/pytorch/issues/50209
This adds a new warning handler that stores all warnings in a shared
queue, which can be "replayed" at a later time and, crucially, on
another thread. Then, I use this inside the autograd engine to ensure
that warnings are processed by the handler registered on the main
thread.
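A minimal sketch of the idea, with stand-in types rather than the actual c10 handler API:
```
#include <iostream>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

// Worker threads enqueue warnings instead of handling them immediately;
// the main thread later "replays" them through its own handler.
class DelayedWarningHandler {
 public:
  void process(std::string msg) {  // safe to call from any thread
    std::lock_guard<std::mutex> lock(mutex_);
    warnings_.push_back(std::move(msg));
  }
  void replay() {  // called later, on the main thread
    std::lock_guard<std::mutex> lock(mutex_);
    for (const auto& msg : warnings_) {
      std::cerr << "Warning: " << msg << '\n';  // stand-in for the real handler
    }
    warnings_.clear();
  }

 private:
  std::mutex mutex_;
  std::vector<std::string> warnings_;
};
```
Draining the queue on the main thread is what lets the warnings reach the Python warning handler registered there.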
For testing, I also add an operator that always warns in the backward
pass and test that the warning is a normal Python warning.
cc ezyang albanD zou3519 gqchen pearu nikitaved soulitzer Lezcano Varal7
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66235
Reviewed By: ejguan
Differential Revision: D31505413
Pulled By: albanD
fbshipit-source-id: 1a7f60b038f55c20591c0748b9e86735b3fec2f9
Summary:
Fixes https://github.com/pytorch/pytorch/issues/4661
- Adds the warning in the engine's `execute` function so it can be triggered through both the C++ and Python codepaths
- Adds an RAII guard version of `c10::Warning::set_warnAlways` and replaces all prior usages of `set_warnAlways` with the new one
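A minimal sketch of the RAII pattern, using a stand-in atomic flag in place of the real `c10::Warning` state:
```
#include <atomic>

std::atomic<bool> warn_always{false};  // stand-in for c10::Warning's flag

// Saves the current value on construction and restores it on scope exit,
// so "warn always" can't accidentally leak out of a test.
struct WarnAlwaysGuard {
  explicit WarnAlwaysGuard(bool enabled = true)
      : prev_(warn_always.exchange(enabled)) {}
  ~WarnAlwaysGuard() {
    warn_always.store(prev_);
  }

 private:
  bool prev_;
};
```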
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59412
Reviewed By: jbschlosser
Differential Revision: D28969294
Pulled By: soulitzer
fbshipit-source-id: b03369c926a3be18ce1cf363b39edd82a14245f0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57645
Second round of autoformatting changes since the first pass became too large.
ghstack-source-id: 128199695
Test Plan: CI
Reviewed By: zertosh
Differential Revision: D28131430
fbshipit-source-id: 24b03e38b087f31e8cac2404bebcd401c55b6cab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54032
Add a `const char*` overload to c10::Warning::warn so that we can avoid wrapping plain C string literals in std::string.
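A hedged sketch of the overload idea with simplified signatures (not the exact c10 ones):
```
#include <iostream>
#include <string>

namespace demo {
// Existing overload: a string literal argument forces a std::string to be
// constructed (and possibly heap-allocated) at every call site.
void warn(const std::string& msg) { std::cerr << "Warning: " << msg << '\n'; }
// New overload: a plain C string literal passes through unchanged.
void warn(const char* msg) { std::cerr << "Warning: " << msg << '\n'; }
}  // namespace demo

int main() {
  demo::warn("deprecated");  // binds to the const char* overload, no allocation
}
```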
ghstack-source-id: 125544720
Test Plan: Buildsizebot some iOS apps?
Reviewed By: ezyang
Differential Revision: D27061983
fbshipit-source-id: dc11150c911a4317a8edac75e50c5ba43511ff24
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53377
My underlying goal is to make the test suite ignore NotImplementedError without failing when bringing up a backend (meta) that doesn't have very many functions implemented.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Reviewed By: mrshenli
Differential Revision: D26850766
Pulled By: ezyang
fbshipit-source-id: ffbdecd22b06b5ac23e1997723a6e2a71dfcd14a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52226
This brings TORCH_INTERNAL_ASSERT to parity with TORCH_CHECK in terms of optimization for 0 or 1 arguments.
ghstack-source-id: 121877054
Test Plan:
Compare generated assembly for
```
#include <c10/util/Exception.h>
#include <cstdlib>

void f(bool b) {
  TORCH_INTERNAL_ASSERT(b, "message");
}
void g(bool b) {
  TORCH_INTERNAL_ASSERT(b);
}
void h(bool b) {
  TORCH_INTERNAL_ASSERT(b, "message", random());
}
```
before/after this diff.
Before: P174916324
After: P174916411
Before, f and g called out to outlined lambdas to build
std::strings. After, they load string constants and call
torchInternalAssertFail. Similarly, h calls random() and c10::detail::_str_wrapper() inline and then calls out to torchInternalAssertFail. As with D26380783 (efbb854ed8), I hope to solve the problem of outlining the random & _str_wrapper calls separately.
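For reference, a hedged sketch of the outlined-failure-path pattern described above (hypothetical names; `__builtin_expect` and the `noinline` attribute assume GCC/Clang):
```
#include <cstdio>
#include <cstdlib>

// Cold path: outlined, never inlined, and noreturn, so the hot path reduces
// to a compare, a branch, and a call with constant arguments.
[[noreturn]] __attribute__((noinline)) void demoAssertFail(
    const char* cond, const char* file, int line, const char* msg) {
  std::fprintf(stderr, "%s INTERNAL ASSERT FAILED at %s:%d: %s\n",
               cond, file, line, msg);
  std::abort();
}

#define DEMO_INTERNAL_ASSERT(cond, msg)                 \
  do {                                                  \
    if (__builtin_expect(!(cond), 0)) {                 \
      demoAssertFail(#cond, __FILE__, __LINE__, (msg)); \
    }                                                   \
  } while (0)

void f(bool b) {
  DEMO_INTERNAL_ASSERT(b, "message");  // no std::string built on the hot path
}
```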
Profile AdIndexer benchmark & verify that toTensor() is still inlined (it calls TORCH_INTERNAL_ASSERT with an integer argument, like `h` above).
Reviewed By: bhosmer
Differential Revision: D26410575
fbshipit-source-id: f82ffec8d302c9a51f7a82c65bc698fab01e1765
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52221
The previous code forced a `std::string` to be created even when the default message or a user-provided string literal message was used. Now it's not forced and we don't need an outlined lambda in those cases either.
ghstack-source-id: 121877056
Test Plan:
Compare assembly for
```
#include <c10/util/Exception.h>
#include <cstdlib>

void f(bool b) {
  TORCH_CHECK(b, "message");
}
void g(bool b) {
  TORCH_CHECK(b);
}
void h(bool b) {
  TORCH_CHECK(b, "message", random());
}
```
before/after in fbcode optimized build.
Before: P174696735
After: P174696840
For `f()` and `g()`, we go from a call to an outlined lambda that did a bunch of `std::string` creation to a load of a string constant before calling `torchCheckFail`. This is a clear improvement.
For `h()`, results are mixed: we save a bunch of *extra* string goop in the outlined lambda and instead call `c10::detail::_str_wrapper` directly. This is good for overall size. However, we no longer outline the call to `random()`, which is less than ideal. I hope to recover the ability to fully outline the `random()` call in future diffs; this is just thorny enough that I don't want to cram even more into one diff.
Added automated test to make sure `TORCH_CHECK` and `TORCH_INTERNAL_ASSERT` only evaluate their arguments once.
Profiled AdIndexer mergenet benchmark in perf to check that `IValue::toTensor` is still getting inlined.
Reviewed By: bhosmer
Differential Revision: D26380783
fbshipit-source-id: 288860772423994ac739a8f33e2c09f718e8dd38
Summary:
Toward fixing https://github.com/pytorch/pytorch/issues/47624
~Step 1: add `TORCH_WARN_MAYBE` which can either warn once or every time in c++, and add a c++ function to toggle the value.
Step 2 will be to expose this to python for tests. Should I continue in this PR or should we take a different approach: add the python level exposure without changing any c++ code and then over a series of PRs change each call site to use the new macro and change the tests to make sure it is being checked?~
Step 1: add a Python and C++ toggle to convert TORCH_WARN_ONCE into TORCH_WARN so the warnings can be caught in tests
Step 2: add a Python-level decorator to use this toggle in tests
Step 3: (in future PRs): use the decorator to catch the warnings instead of `maybeWarnsRegex`
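A minimal sketch of the Step 1 toggle (hypothetical names, not the real macros):
```
#include <atomic>
#include <iostream>

std::atomic<bool> warn_always_for_tests{false};  // flipped by the Python toggle

#define DEMO_WARN(msg) (std::cerr << "Warning: " << (msg) << '\n')

#define DEMO_WARN_ONCE(msg)                             \
  do {                                                  \
    if (warn_always_for_tests.load()) {                 \
      DEMO_WARN(msg); /* test mode: warn every time */  \
    } else {                                            \
      static const bool once_ = (DEMO_WARN(msg), true); \
      (void)once_;                                      \
    }                                                   \
  } while (0)
```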
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48560
Reviewed By: ngimel
Differential Revision: D26171175
Pulled By: mruberry
fbshipit-source-id: d83c18f131d282474a24c50f70a6eee82687158f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/51389
This should reduce code size when STRIP_ERROR_MESSAGES is defined by allowing callers of TORCH_CHECK to avoid creating `std::string`s.
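A hedged sketch of the approach (hypothetical macro names; the `##__VA_ARGS__` trailing-comma removal is a GCC/Clang extension):
```
#include <sstream>
#include <stdexcept>
#include <string>

template <typename... Args>
std::string demo_str(const Args&... args) {  // stand-in for c10::str
  std::ostringstream oss;
  (oss << ... << args);
  return oss.str();
}

[[noreturn]] inline void demoCheckFail(const char* msg) {
  throw std::runtime_error(msg);
}
[[noreturn]] inline void demoCheckFail(const std::string& msg) {
  throw std::runtime_error(msg);
}

#ifdef STRIP_ERROR_MESSAGES
// Stripped build: the user-supplied arguments are discarded at the macro
// level, so call sites emit no string-building code at all.
#define DEMO_CHECK(cond, ...) \
  if (!(cond)) demoCheckFail("CHECK FAILED: " #cond)
#else
#define DEMO_CHECK(cond, ...) \
  if (!(cond)) demoCheckFail(demo_str("CHECK FAILED: " #cond ": ", ##__VA_ARGS__))
#endif
```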
ghstack-source-id: 120692772
Test Plan: Measure code size of STRIP_ERROR_MESSAGES builds
Reviewed By: ezyang
Differential Revision: D25891476
fbshipit-source-id: 34eef5af7464da6534989443859e2765887c243c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49263
Now it is smaller and calls out to an out-of-line function in case of failure.
ghstack-source-id: 118480531
Test Plan:
1) Inspect perf profile of an internal benchmark: much less time is spent in (for example) `c10::impl::getDeviceImpl`, which calls TORCH_CHECK and should be inlined
2) Internal benchmarks
Reviewed By: smessmer
Differential Revision: D25481308
fbshipit-source-id: 0121ada779ca2518ca717f75920420957b3bb1aa
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45952
Pull Request resolved: https://github.com/pytorch/glow/pull/4967
When glow compilation meets a nonrecoverable fatal error (hardware is busted), we would like to throw a special exception other than the normal caffe2::EnforceNotMet so that we can signal the upper-layer application to handle it differently.
Test Plan: Manually inject an error, add LOG(FATAL) in the special exception path, and wait for the application to fatal.
Reviewed By: ipiszy
Differential Revision: D24156792
fbshipit-source-id: 4ae21bb0d36c89eac331fc52dd4682826b3ea180
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37101
Fixes #36954.
The basic concept is to streamline the process of rethrowing
c10::Error with extra error information. This is in a few
steps:
- I completely remodeled the Error data type and the internal
invariants. Instead of manually adding in newlines, the
message stack formatting process is responsible for inserting
newlines and spacing as necessary. Call sites are then
modified to respect the new API model.
- TORCH_RETHROW macro is added, which adds context to an error
message and then rethrows it.
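A usage sketch of the rethrow-with-context idea; `add_context` comes from this PR's remodeled `c10::Error`, while the surrounding wrapper code is illustrative:
```
#include <c10/util/Exception.h>

void compute() {
  TORCH_CHECK(false, "This is an error");  // throws c10::Error
}

void wrapper() {
  try {
    compute();
  } catch (c10::Error& e) {
    // Roughly what the macro does: append a context line to the error's
    // message stack, then rethrow the same exception object.
    e.add_context("This is context 1");
    throw;
  }
}
```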
New internal assert failure looks like:
```
0 INTERNAL ASSERT FAILED at ../c10/test/util/exception_test.cpp:64, please report a bug to PyTorch.
Exception raised from TestBody at ../c10/test/util/exception_test.cpp:64 (most recent call first):
frame #0: <unknown function> + 0x6aab9 (0x7ff611d3aab9 in /data/users/ezyang/pytorch-tmp/build/lib/libc10.so)
frame #1: ...
```
Error message with context looks like:
```
This is an error
This is context 1
This is context 2
```
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D21202891
Pulled By: ezyang
fbshipit-source-id: 361cadd16bc52e5886dba08e79277771ada76169
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37094
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D21202892
Pulled By: ezyang
fbshipit-source-id: d59e6bffabd90cc734056bdce2cd1fe63262fab8
Summary:
Today in PyTorch, warnings triggered in C++ are printed to Python users like this:
`../aten/src/ATen/native/BinaryOps.cpp:81: UserWarning: Integer division of tensors using div or / is deprecated, and in a future release div will perform true division as in Python 3. Use true_divide or floor_divide (// in Python) instead.`
This may be unhelpful to Python users, who have complained it's difficult to relate these messages back to their programs. After this PR, warnings that go through the PyWarningHandler and allow it to add context print like this:
```
test/test_torch.py:16463: UserWarning: Integer division of tensors using div or / is deprecated, and in a future release div will perform true division as in Python 3. Use true_divide or floor_divide (// in Python) instead. (Triggered internally at ../aten/src/ATen/native/BinaryOps.cpp:81.)
cpu_result = getattr(cpu_tensor, op_str)(*cpu_args)
```
This relates the warning back to the user's program. The information about the cpp file and line number is preserved in the body of the warning message.
Some warnings, like those generated in the JIT, already account for a user's Python context, and so they specify that they should be printed verbatim and are unaffected by this change. Warnings originating in Python and warnings that go through c10's warning handler, which prints to cerr, are also unaffected.
A test is added to test_torch.py for this behavior. The test relies on uint8 indexing being deprecated and its warning originating from its current header file, which is an unfortunate dependency. We could implement a `torch.warn` function instead.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36052
Differential Revision: D20887740
Pulled By: mruberry
fbshipit-source-id: d3515c6658a387acb7fccaf83f23dbb452f02847
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36631
Summary of changes
1. Moved random transformation functions to DistributionHelper.h (`uniform_int_from_to_distribution`, `uniform_int_full_range_distribution`, `uniform_int_distribution`) to avoid code duplication between the default CPU and CUDA RNGs and custom RNG extensions
2. Made GeneratorImpl fields protected instead of private
3. Introduced `TORCH_CHECK_IF_NOT_ON_CUDA`, which does the same as `TORCH_CHECK` when not compiling for a CUDA/ROCm device (see the sketch after this list)
4. To test multiple RNG extensions I had to move ops registration to the method `registerOps()`, expose it to Python, and call it in `setUp(self)`
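A hedged sketch of that macro (simplified; not the exact upstream definition):
```
#include <c10/util/Exception.h>

// Under nvcc/hipcc device compilation the check compiles away entirely;
// otherwise it forwards to TORCH_CHECK.
#if defined(__CUDACC__) || defined(__HIPCC__)
#define DEMO_CHECK_IF_NOT_ON_CUDA(cond, ...)
#else
#define DEMO_CHECK_IF_NOT_ON_CUDA(cond, ...) TORCH_CHECK(cond, __VA_ARGS__)
#endif
```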
Test Plan: Imported from OSS
Differential Revision: D21229202
Pulled By: pbelevich
fbshipit-source-id: 6aa3280f2fc3324cf3e748388b5087e3a1e49f23
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37023
Optimize binary size of assert macros through two ideas:
- Concatenate string literals with `__FILE__` and `__LINE__` into one literal at compile time (sketched below), instead of keeping them in separate literals and combining them with `c10::str`.
- Optimize binary size of `c10::str` for some scenarios, especially the one where it is called with an empty parameter list, which is a common call pattern in assert macros.
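The first idea in a nutshell (hypothetical macro names): string literals concatenate at compile time and `__LINE__` can be stringized, so the whole location prefix fuses into a single literal with no runtime `c10::str` call.
```
#define DEMO_STRINGIZE_IMPL(x) #x
#define DEMO_STRINGIZE(x) DEMO_STRINGIZE_IMPL(x)

// Expands to one literal such as "demo.cpp:42: assert failed: "
#define DEMO_LOCATION_PREFIX \
  __FILE__ ":" DEMO_STRINGIZE(__LINE__) ": assert failed: "
```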
In server oss builds, this PR reduces binary size from 118.05 MB to 117.05 MB
ghstack-source-id: 102607237
Test Plan: Run oss server build (python setup.py install) and check size of libtorch_cpu.so reducing from 118.05MB to 117.05MB
Differential Revision: D20719400
fbshipit-source-id: 5c61f4195b947f06aafb8f0c8e255de3366e1ff2
Summary:
Fixes incorrect usages of symbol annotations including:
1. Exporting or importing a function/class in an anonymous namespace.
2. Exporting or importing a function/class implementation in a header file. However, by removing the symbol annotations, they are now local symbols. If they need to remain global, I can move the implementations to the source file.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35364
Differential Revision: D20670031
Pulled By: ezyang
fbshipit-source-id: cd8018dee703e2424482c27fe9608e040d8105b8
Summary:
AT_CHECK has been deprecated and provides no more features than
TORCH_CHECK
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34846
Differential Revision: D20481339
Pulled By: mrshenli
fbshipit-source-id: 1777e769a069a78e03118270294e5e273d516ca7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34623
The bandaid of "AT_WARN" keeps introducing new warnings. Let's get rid
of it entirely.
Closes #34502
Test Plan: Imported from OSS
Differential Revision: D20420112
Pulled By: albanD
fbshipit-source-id: 7160c113cb4deb2d2f50a375356f423fe5e86f50
Summary:
This PR adds the following items:
- **1st item**: `ArrayRef<TensorIndex>` and `std::initializer_list<TensorIndex>` overloads for `Tensor::index` and `Tensor::index_put_`, to be used specifically for multi-dim indexing purpose.
Design rationale:
* C++ `Tensor::index` and `Tensor::index_put_` are both existing tensor APIs, and they currently (before this PR) only accept a list of tensors (i.e. `ArrayRef<Tensor>`) as indices. If we change their signatures to also accept non-tensors as indices (i.e. `ArrayRef<TensorIndex>`, and `TensorIndex` is convertible from `Tensor` / `Slice` / `None` / `Ellipsis`), it would slow down the original code path (since now it has to go through more steps), which is undesirable.
To get around this problem, the proposed solution is to keep the original `ArrayRef<Tensor>` overload, and add `ArrayRef<TensorIndex>` and `std::initializer_list<TensorIndex>` overloads to `Tensor::index` and `Tensor::index_put_`. This way, the original code path won’t be affected, and the tensor multi-dim indexing API is only used when the user explicitly pass an `ArrayRef<TensorIndex>` or a braced-init-list of `TensorIndex`-convertible types to `Tensor::index` and `Tensor::index_put_` .
Note that the above proposed solution would still affect perf for the user’s original `Tensor::index` or `Tensor::index_put_` call sites that use a braced-init-list of tensors as input, e.g. `tensor.index({...})` or `tensor.index_put_({...}, value)`, since now such function calls would take the multi-dim indexing path instead of the original advanced indexing path. However, there are only two instances of this in our codebase (one in an ATen cpp test, one in a C++ API nn init function), and they can be easily changed to explicitly use `ArrayRef<Tensor>` as input (I changed them in this PR). For external users’ code, since this is part of the C++ frontend which is still considered experimental, we will only talk about this change in the release note, and ask users to switch to using `ArrayRef<Tensor>` explicitly if they want to keep using the original advanced indexing code path. (A usage sketch of the new overloads follows this list.)
- **2nd item**: Mechanisms for parsing `ArrayRef<TensorIndex>` indices and performing indexing operations (mirroring the functions in `torch/csrc/autograd/python_variable_indexing.cpp`).
- **3rd item**: Simple tests to demonstrate that the `Tensor::index()` and `Tensor::index_put_()` APIs work. I will add more tests after the first few PRs are reviewed.
- **4th item**: Merge Python/C++ indexing code paths, for code simplicity. I tested locally and found that there is no perf regression resulting from the merge. I will get more concrete numbers for common use cases when we settle on the overall design.
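For reference, a usage sketch of the 1st item's overloads, written with the `torch::indexing` names from the C++ frontend (treat exact spellings at the time of this PR as approximate):
```
#include <torch/torch.h>
using namespace torch::indexing;

void demo() {
  auto t = torch::rand({4, 5, 6});
  // Roughly the C++ analogue of Python's `t[1, 0:3, ...]`:
  auto s = t.index({1, Slice(0, 3), Ellipsis});
  // ...and of `t[1, 0:3, ...] = 0.5`:
  t.index_put_({1, Slice(0, 3), Ellipsis}, 0.5);
}
```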
This PR supersedes https://github.com/pytorch/pytorch/pull/30425.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32841
Differential Revision: D19919692
Pulled By: yf225
fbshipit-source-id: 7467e64f97fc0e407624809dd183c95ea16b1482
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33251
Somehow this was preventing `c10::Error` exceptions from ever being thrown on Windows when `defined(NDEBUG) == false`. Kinda scary.
Test Plan: sandcastle green, made sure `intrusive_ptr_test.cpp` (givenStackObject_whenReclaimed_thenCrashes) passed inside ovrsource using `mode/win/dev-debug`
Reviewed By: malfet
Differential Revision: D19865667
fbshipit-source-id: c32d5752025c043e57d16c6d14a94b069bed0bc3
Summary:
There are cases when we want to recover from CUDA OOM; for example, some cuDNN algorithms use huge workspaces, and we want to recover from OOM by picking a different algorithm. In such cases, there is no reason to catch all errors.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33056
Differential Revision: D19795359
Pulled By: ezyang
fbshipit-source-id: a34e23bf6d172dc0257389251dafef5b38d27d2b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31917
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D19301480
Pulled By: ezyang
fbshipit-source-id: fcce8868733965b9fbd326b4ec273135759df377
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31575
We need a new exception class specifically for the enforce_finite operator, because we need to map it to a specific Python exception, ExitException, rather than the RuntimeError type that all c10::Errors get mapped to by default. This diff includes:
- Define c10::EnforceFiniteNotMet
- API CAFFE_ENFORCE_FINITE to throw c10::EnforceFiniteNotMet
- Map from c10::EnforceFiniteNotMet to python ExitException
- Apply CAFFE_ENFORCE_FINITE in caffe2 op
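A simplified sketch of the mechanism, with stand-in types (the real class derives from `c10::Error`, and the mapping to ExitException happens in the Python bindings):
```
#include <cmath>
#include <stdexcept>

// Stand-in exception type; a dedicated class is what lets the binding layer
// translate it to a distinct Python exception.
struct EnforceFiniteNotMetDemo : std::runtime_error {
  using std::runtime_error::runtime_error;
};

#define CAFFE_ENFORCE_FINITE_DEMO(x)                        \
  do {                                                      \
    if (!std::isfinite(x)) {                                \
      throw EnforceFiniteNotMetDemo("value is not finite"); \
    }                                                       \
  } while (0)
```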
Test Plan:
- integration test pass: https://fburl.com/fblearner/xwkzbqyo
- integration test with D19213617: https://fburl.com/fblearner/479y4jrj Generate error message as desired
- Example:
- Original error message f157597803
- Updated error message (with D19213617 to generate the error): f158571327
Reviewed By: zheng-xq
Differential Revision: D19206240
fbshipit-source-id: bd256862801d5957a26b76d738edf4e531f03827
Summary:
7zip and cmake are part of the base image, so there's no need to re-install them. Removing the install step can make build/test more stable.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30897
Differential Revision: D19232961
Pulled By: mingbowan
fbshipit-source-id: fa3bbd1325839a2a977bf13fdbd97fda43793b8d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30918
This is a C++14 feature we can use now
ghstack-source-id: 95811482
Test Plan: waitforsandcastle
Differential Revision: D18869636
fbshipit-source-id: b5b3d78b61b6ceb2deda509131f8502e95b1d057
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30111
Add a flag to strip C10 error messages. To ensure there's no size regression, add the same flag to the existing caffe2 and pytorch builds
Test Plan: size bot check
Reviewed By: dreiss
Differential Revision: D18577969
fbshipit-source-id: 84ac57b11ec5c29e831d619260024a0a4a6fdcd0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29926
Add a macro to enable full error messages for mobile
Test Plan: buck build -c project.ignore= //xplat/experimental/pytorch/predictor:predictorAndroid#android-armv7
Reviewed By: dreiss
Differential Revision: D18521937
fbshipit-source-id: 99673b60a03da249236dc916bab3dff88d24bc25
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26289
It's not possible to refer to values of local variables otherwise.
ghstack-source-id: 90160797
Test Plan: The code compiles.
Differential Revision: D17397702
fbshipit-source-id: 49c74c44c88f197264603e4978e3d60bf199f6ac
Summary:
Improve handling of mixed-type tensor operations.
This PR affects the arithmetic (add, sub, mul, and div) operators implemented via TensorIterator (so dense but not sparse tensor ops).
For these operators, we will now promote to reasonable types where possible, following the rules defined in https://github.com/pytorch/pytorch/issues/9515, and error in cases where the cast would require floating point -> integral or non-boolean to boolean downcasts.
The details of the promotion rules are described here:
https://github.com/nairbv/pytorch/blob/promote_types_strict/docs/source/tensor_attributes.rst
Some specific backwards incompatible examples:
* now `int_tensor * float` will result in a float tensor, whereas previously the floating point operand was first cast to an int. Previously `torch.tensor(10) * 1.9` => `tensor(10)` because the 1.9 was downcast to `1`. Now the result will be the more intuitive `tensor(19)`
* Now `int_tensor *= float` will error, since the floating point result of this operation can't be cast into the in-place integral type result.
See more examples/detail in the original issue (https://github.com/pytorch/pytorch/issues/9515), in the above linked tensor_attributes.rst doc, or in the test_type_promotion.py tests added in this PR:
https://github.com/nairbv/pytorch/blob/promote_types_strict/test/test_type_promotion.py
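For the C++ frontend, a quick sketch mirroring the Python examples above (illustrative; behavior as described in this PR):
```
#include <torch/torch.h>

void demo() {
  auto t = torch::tensor(10);  // integral (kLong) tensor
  auto r = t * 1.9;            // promotes: floating-point tensor holding 19
  // t *= 1.9;                 // errors: the floating-point result can't be
  //                           // cast back into the in-place integral tensor
}
```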
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22273
Reviewed By: gchanan
Differential Revision: D16582230
Pulled By: nairbv
fbshipit-source-id: 4029cca891908cdbf4253e4513c617bba7306cb3