This PR ...
Makes the following testing changes:
- Updates stride testing in test_python_reference_consistency to only check strides of dimensions with length > 1
- Creates reference inputs for reshape
- Creates reference inputs for chunk
- Extends the sample inputs for unsqueeze
- Extends the sample inputs for stack -- test_conj_view and test_neg_view are now xfailed (tracked in https://github.com/pytorch/pytorch/issues/77046)
Makes the following architecture changes:
- Adds the refs.special (sub)module
- Adds the refs.nn.functional (sub)module
Adds the following prims:
- expand_dims
- view_of
- rev
- clone
Adds the following references:
- flatten
- squeeze
- unsqueeze
- special.i0e
- special.i1e
- logical_or
- logical_and
- isclose
- flip
- stack
- nn.functional.elu
- chunk
- clone
- narrow
Identifies the following bugs in PyTorch today:
- https://github.com/pytorch/pytorch/issues/77054
- https://github.com/pytorch/pytorch/issues/77055
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77043
Approved by: https://github.com/ngimel
Fixes https://github.com/pytorch/pytorch/issues/75464. Adds a context manager that will throw if the ops in the context are not fused.
The API is:
```
with torch.jit.strict_fusion():
...
```
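For instance, a minimal usage sketch inside a scripted function (op choices and device are illustrative; whether the check passes depends on the active fuser and warm-up behavior):
```
import torch

@torch.jit.script
def fused_add_relu(x, y):
    with torch.jit.strict_fusion():
        # both ops are expected to land in one fusion group;
        # strict_fusion raises if anything in this block is left unfused
        return torch.relu(x + y)

x = torch.rand(8, device='cuda')
y = torch.rand(8, device='cuda')
out = fused_add_relu(x, y)
```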
A few TODOs:
[+] Figure out how to compose with autodiff - right now it will run on autodiff as well
[+] Support all of the nvfuser operators that are added in guarding
[+] Figure out what to do with control flow that isn't taken (right now it will just error); this is probably a source of the original issue :/
[+] (After those are figured out) add to docs
Pull Request resolved: https://github.com/pytorch/pytorch/pull/75777
Approved by: https://github.com/davidberard98
Previously, JIT OpInfos would only run the traced function once. This is a problem for NNC and NVFuser, where the fused implementation only runs on the second invocation.
This caches the traced function and calls the cached implementation, so that subsequent calls actually perform fusion and use the fused implementation.
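As a rough illustration of the behavior being tested (the function and shapes are illustrative, not the actual OpInfo machinery):
```
import torch

def f(x):
    return (x + 1).relu()

traced = torch.jit.trace(f, (torch.rand(4),))
traced(torch.rand(4))  # first call: profiling run, no fusion yet
traced(torch.rand(4))  # second call: the fused implementation is exercised
```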
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76000
Approved by: https://github.com/eellison
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73875
Previously we had a few settings:
- getExecutor - which toggled between Profiling Executor and Legacy
- getGraphOptimize - if true, overrides PE/Legacy to run with simple executor (no optimizations)
and then...
- getProfilingMode - which would set PE to 0 specializations.
The last mode is redundant with getGraphOptimize; we should just remove it and use getGraphOptimize in these cases. Keeping it also leads to potentially invalid combinations of logic - what does it mean if getProfilingMode is true but getExecutor is set to false? That combination leads to a bug in specialize_autograd_zero, see: https://github.com/pytorch/pytorch/blob/master/torch%2Fcsrc%2Fjit%2Fpasses%2Fspecialize_autogradzero.cpp#L93.
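For reference, the Python-level bindings for the surviving settings look roughly like this (a sketch; binding names are the ones used in the JIT test utilities and may vary by version):
```
import torch

# getExecutor: choose the profiling executor over the legacy executor
torch._C._jit_set_profiling_executor(True)
# getGraphOptimize: when False, fall back to the simple executor (no optimizations)
torch._C._set_graph_executor_optimize(True)
print(torch._C._get_graph_executor_optimize())
```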
The tests here are failing but get fixed with the PR above it, so I'll squash for landing.
Test Plan: Imported from OSS
Reviewed By: cpuhrsch
Differential Revision: D34938130
Pulled By: eellison
fbshipit-source-id: 1a9c0ae7f6d1cfddc2ed3499a5af611053ae5e1b
(cherry picked from commit cf69ce3d155ba7d334022c42fb2cee54bb088c23)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73762
TestCase.setUp() controls slowTest behavior, so tests need to call super().setUp() to prevent fast tests from running in the slow-test CI jobs.
Example: https://github.com/pytorch/pytorch/runs/5413135014?check_suite_focus=true: despite PYTORCH_TEST_SKIP_FAST=1, TestTEFuserStatic tests are still running
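A hedged sketch of the pattern the fix enforces (the class body is illustrative):
```
from torch.testing._internal.common_utils import TestCase

class TestTEFuserStatic(TestCase):
    def setUp(self):
        super().setUp()  # applies the slowTest / PYTORCH_TEST_SKIP_FAST gating
        # ... fuser-specific setup goes here ...
```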
Test Plan: Imported from OSS
Reviewed By: mruberry
Differential Revision: D34628769
Pulled By: davidberard98
fbshipit-source-id: 84311ec1db2ac60fcafb7b77f377e9ae2ef792e3
(cherry picked from commit 67fdba7fb9b73ce2b9119f4c4bc84e5b38041e21)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72478
`aten::_autocast_to_reduced_precision` and `aten::_autocast_to_full_precision` are essentially just `aten::to` operations, so they can be fused the same way `aten::to` is fused.
Test Plan: Imported from OSS
Reviewed By: bdhirsh
Differential Revision: D34057522
Pulled By: davidberard98
fbshipit-source-id: f3b53641415702a4ac56460587801b9c76d81b3c
(cherry picked from commit 838ce5542e)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70465
These tests check to ensure that
(a) the result after nnc fusion (of a single op) is the same as the
unfused op
(b) for certain ops where fusion is expected to occur, ensure that
fusion does actually occur
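A hedged sketch of what such a check might look like (the helper and its usage are illustrative, not the actual test code):
```
import torch

def check_single_op_fusion(op, example_input):
    scripted = torch.jit.script(op)
    for _ in range(2):            # warm up so the profiling executor can fuse
        fused_out = scripted(example_input)
    # (a) fused result matches the unfused op
    torch.testing.assert_close(fused_out, op(example_input))
    # (b) fusion actually occurred: a TensorExpr group appears in the optimized graph
    graph = torch.jit.last_executed_optimized_graph()
    assert any(n.kind() == 'prim::TensorExprGroup' for n in graph.nodes())
```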
Test Plan: Imported from OSS
Reviewed By: wenleix
Differential Revision: D33595240
Pulled By: davidberard98
fbshipit-source-id: e2e17a921bc30c313e92e8e5bbc6c1b5fcd14bc1
(cherry picked from commit b1ba221acc)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72266
Within the kernel, we may manipulate `Value *` in `OptimizeCat`, which would invalidate the input `Value *` -> Stride mapping.
Fix for https://github.com/pytorch/pytorch/issues/72173
Test Plan: Imported from OSS
Reviewed By: dagitses, davidberard98
Differential Revision: D33986306
Pulled By: eellison
fbshipit-source-id: dc33cd2b545e49e90d1e46b9fcf1e6dbb4b829db
(cherry picked from commit 5e4555968a)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72032
This contains a few channels-last changes from benchmarking:
- don't permute back to channels last in the dynamic-shape CPU case; perf is not good, and use cases for it are exotic atm
- remove the conditional one handling in permuting a channels-last symbolic tensor on CUDA; it's not needed in the permutation case, as tests show
- remove logic in torch/csrc/jit/tensorexpr/loopnest.cpp preventing inlining; the condition it checks is always valid given valid construction of the IR
I can split this up as needed.
Test Plan: Imported from OSS
Reviewed By: navahgar
Differential Revision: D33864652
Pulled By: eellison
fbshipit-source-id: f16674fb02dfff22670d8a2f856c5a317fd15717
(cherry picked from commit a9a0697839)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71651
The only tests that regress do so because chunk is NYI; the other tests that I touched were passing only because `assertAllFused` wasn't working correctly. That, and we're no longer compiling conv/matmul with dynamic shapes.
Test Plan: Imported from OSS
Reviewed By: navahgar
Differential Revision: D33801500
Pulled By: eellison
fbshipit-source-id: 074118ab4a975b7db876a4fcdfb9483afb879e79
(cherry picked from commit abaa7948c1)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71650
Refactors PE so there is a current fusion strategy, set as a vector of e.g. [(STATIC, 2), (DYNAMIC, 10)], which means: fuse two static invocations, then fuse 10 dynamic ones, then stop specializing.
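This strategy is surfaced to Python roughly as follows; a hedged sketch assuming the torch.jit.set_fusion_strategy binding that exposes it:
```
import torch

# fuse 2 static specializations, then 10 dynamic ones, then stop specializing
torch.jit.set_fusion_strategy([("STATIC", 2), ("DYNAMIC", 10)])
```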
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D33801501
Pulled By: eellison
fbshipit-source-id: ebc7ac3c57e35a3b9bb15ab751f0aa1d25cc9bd5
(cherry picked from commit 8dd89088d3)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71642
A missing comma was causing implicit string concatenation in a list of strings.
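An illustrative example of the bug class (not the actual list from the change):
```
ops = [
    "aten::add",
    "aten::mul"    # missing trailing comma ...
    "aten::relu",  # ... so Python concatenates these into "aten::mulaten::relu"
]
```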
Test Plan: Imported from OSS
Reviewed By: ZolotukhinM
Differential Revision: D33713185
Pulled By: davidberard98
fbshipit-source-id: a2458629d78202713a5bb2f8c720ff9b81939c31
(cherry picked from commit b077598f1d)
Summary:
The block and thread extent calculations in `cuda_codegen` should be using `int64_t` instead of `int`. The updated test, `test_dynamic_shapes`, fails without this change.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71428
Reviewed By: samdow
Differential Revision: D33640374
Pulled By: navahgar
fbshipit-source-id: 64c340ad2a9a1fa1fe066cf1c5dfc3b546b7be6d
(cherry picked from commit 6ea546ce11)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70464
Add handling of strided input tensors to dynamic fusion. This is done with the same set of input striding specializations as https://github.com/pytorch/pytorch/pull/60684/:
```
S_ONE, // STRIDE_ONE: packed
S_CONT, // STRIDE_CONTIGUOUS: stride[i + 1] * sizes[i + 1]
S_TRAN_CONT, // STRIDE_TRANSPOSED_CONTIGUOUS: stride[i-1] * sizes[i-1]
S_AS_ARG, // STRIDE_AS_ARG: stride passed in as runtime value
```
and two additional specializations for (a) contiguous tensors and (b) channels-last tensors. Channels-last is a common case and we should optimize for it. Additionally, tensors natively store whether they are contiguous/channels-last contiguous, which makes it faster to check whether tensors follow this pattern.
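A hedged illustration of checking for these two fast-path layouts (shapes are illustrative):
```
import torch

x = torch.rand(10, 11, 12, 13)
x.is_contiguous()                                     # the contiguous specialization
y = x.to(memory_format=torch.channels_last)
y.is_contiguous(memory_format=torch.channels_last)    # the channels-last specialization
```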
Output striding will be done in a follow up.
The striding is stored on both the TensorGroup node and on the guard node. The striding descriptors are stored as a vector of strings on the node for debugability and to make use of storing ivalues as attributes on nodes.
As an example:
```
%8 : Double(10, 11, 12, 13, strides=[1716, 1, 143, 11], requires_grad=0, device=cpu) = prim::TensorExprGroup_0[symbolic_shape_inputs=[-37, -36, -35, -34], striding_inputs_desc=[["TENSOR_CONT_CHANNELS_LAST"]](%x, %24, %23, %22, %21)
```
Test Plan: Imported from OSS
Reviewed By: navahgar
Differential Revision: D33458649
Pulled By: eellison
fbshipit-source-id: c42616d3c683d70f6258180d23d3841a31a6030d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70463
Fix for https://github.com/pytorch/pytorch/issues/52940
When we call inlining on a fallback function, insert the runtime optimized version of its graph.
Test Plan: Imported from OSS
Reviewed By: jbschlosser, davidberard98
Differential Revision: D33458651
Pulled By: eellison
fbshipit-source-id: fd7e5e2b5273a1677014ba1a766538c3ee9cad76
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70410
Trying again after #70174 was reverted. Earlier, the env variable was read into a static var in C++, causing state to be retained and causing test failures. The static qualifier is removed in this PR.
Test Plan: Imported from OSS
Reviewed By: ZolotukhinM
Differential Revision: D33321435
fbshipit-source-id: 6d108eb00cac9150a142ccc3c9a65a1867dd7de4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67368
This PR adds an additional test variant for the tensor conversion
functions (bfloat16, char, long, ...) that tests channels_last. This is
because some backends (mostly just functorch right now) don't have
channels last handling and may want to test that separately from the
more general case of these operations.
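A hedged sketch of what the channels_last variant exercises (the shape and conversion op are illustrative):
```
import torch

x = torch.rand(2, 3, 4, 5).to(memory_format=torch.channels_last)
# the conversion method is applied to a channels_last input; values must match
# the result on a contiguous copy regardless of memory format
torch.testing.assert_close(x.double(), x.contiguous().double())
```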
Test Plan: - wait for tests
Reviewed By: mruberry
Differential Revision: D31972959
Pulled By: zou3519
fbshipit-source-id: 68fea46908b2cdfeb0607908898bb8f9ef25b264
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66990
NNC fusion groups currently show up as "TensorExpr" in the profiler,
which is true but not super useful since it obscures what's actually happening
in the fusion group. This change will log them as `fused_XXX` where XXX is a
(length-limited) series of ops describing the subgraph, for instance
`fused_mul_add` to represent a group containing `aten::mul`, `aten::add`.
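For example, profiling a warmed-up scripted function would now report the fusion group under such a name (a hedged sketch; whether fusion actually triggers depends on the device and fuser settings):
```
import torch

@torch.jit.script
def f(x, y):
    return x * y + 1.0

f(torch.rand(8), torch.rand(8))  # warm-up runs so the fuser can specialize
f(torch.rand(8), torch.rand(8))
with torch.autograd.profiler.profile() as prof:
    f(torch.rand(8), torch.rand(8))
print(prof.key_averages().table())
```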
Test Plan: New unit test to check the output of autograd profiler.
Reviewed By: dzhulgakov
Differential Revision: D31762087
fbshipit-source-id: 3fadbdc67b054faa01aa42e5b6ea2c4a6bc3481f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64282
OpInfos for:
- Tensor.bfloat16, Tensor.bool, Tensor.byte, Tensor.char
- Tensor.double, Tensor.float, Tensor.half, Tensor.int
- Tensor.short, Tensor.long
None of these are supported by TorchScript. Also, the OpInfo autograd
test runner assumes that the operation is not allowed to change the
dtype of the argument, so only Tensor.double has
`supports_autograd=True` (in theory Tensor.bfloat16, Tensor.float,
Tensor.half should be differentiable).
Test Plan: - run tests
Reviewed By: dagitses
Differential Revision: D31452627
Pulled By: zou3519
fbshipit-source-id: b7f272e558558412c47aefe947af7f060dfb45c5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65014
ghstack-source-id: 138656948
Test Plan:
```
(pytorch) [maxren@devvm3115.atn0 ~/pytorch] python3 test/test_jit.py TestPeephole
CUDA not available, skipping tests
monkeytype is not installed. Skipping tests for Profile-Directed Typing
........s......................
----------------------------------------------------------------------
Ran 31 tests in 0.393s
OK (skipped=1)
(pytorch) [maxren@devvm3115.atn0 ~/pytorch] python3 test/test_jit.py TestPeephole.test_normalized_rsub
CUDA not available, skipping tests
monkeytype is not installed. Skipping tests for Profile-Directed Typing
.
----------------------------------------------------------------------
Ran 1 test in 0.015s
OK
```
Reviewed By: eellison
Differential Revision: D30941389
fbshipit-source-id: 03f0416d99090845c9bfb1e5fcf771d5f1d7a050
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64589
Adds a softplus operator lowering for NNC, enabling elementwise fusion for it as well.
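A hedged sketch of the kind of elementwise chain this lowering allows NNC to fuse (whether fusion occurs depends on fuser settings and device):
```
import torch
import torch.nn.functional as F

@torch.jit.script
def f(x):
    return F.softplus(x) * 2.0 + 1.0

f(torch.rand(1024))  # warm-up; later calls can use the fused kernel
f(torch.rand(1024))
```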
Test Plan: Added a test in test_jit_fuser.py
Reviewed By: bertmaher
Differential Revision: D30736449
fbshipit-source-id: 6c5fc3bceb5cef2322ecd4449f827e4af018ea93
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63516
How to review: check that the generated inputs are a good representation of the op semantics; that should be sufficient for correctness. As a bonus, you can also double-check the op size semantics by going to https://codebrowser.bddppq.com/pytorch/pytorch/, typing in native::{op_name}, and looking at the op implementation.
Test Plan: Imported from OSS
Reviewed By: driazati
Differential Revision: D30738143
Pulled By: eellison
fbshipit-source-id: c7cd01cb2c8a13cb2664415f3d98aedec19a8e07