pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Michael Lazos	f302a0d380	Re-enable SGD (#117434 ) Re-enables the SGD optimizer now that compile times are more reasonable. [Benchmark run](https://github.com/pytorch/pytorch/actions/runs/7511073761) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117434 Approved by: https://github.com/anijain2305, https://github.com/janeyx99	2024-01-19 04:28:50 +00:00
ydwu4	c317bf2c2b	[HigherOrderOp][BE] factor out merge_graph_inputs (#116912 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/116912 Approved by: https://github.com/zou3519 ghstack dependencies: #116721, #116823	2024-01-19 00:35:26 +00:00
ydwu4	eba5d5485d	[dynamo] make ConstantSource propagate through built-in ops for TensorVariable (#117704 ) Fixes #117685. This PR only makes ConstantSource perserved for built-in ops when we find all the inputs are either constant tensors or python constants. It doesn't fundamentally solve the problem of preserving ConstantSource information through all operators that's potentially can be constant folded. For the following code in the issue: ``` class Bob(torch.nn.Module): def __init__(self, p, val) -> None: super().__init__() self.p = p self.y = torch.nn.Parameter(torch.tensor(val)) def forward(self, x: torch.Tensor) -> torch.Tensor: # This only looks dynamic but it's actually a constant value if get_y(self.y) < self.p: return torch.cat([x,x]) else: return x ``` The graph exported looks like following: ```python class GraphModule(torch.nn.Module): def forward(self, x): arg0: "f32[s0, s1]"; arg0, = fx_pytree.tree_flatten_spec(([x], {}), self._in_spec) l_x_ = arg0 # File: /home/yidi/local/pytorch/test/dynamo/test_export.py:1498 in forward, code: return torch.cat([x, x]) cat = torch.cat([l_x_, l_x_]); l_x_ = None return pytree.tree_unflatten([cat], self._out_spec) ``` Test Plan: Added a new test for the given repro. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117704 Approved by: https://github.com/jansel, https://github.com/anijain2305	2024-01-18 20:18:34 +00:00
Wanchao Liang	4720109d7f	[dynamo] add common methods to DistributedVariable (#117590 ) This PR refactors the distributed related variables to use DistributedVariable for common methods, so that things like `python_type` works for all distributed variables. Maybe we can add `as_python_constant` to the DistributedVariable too? I didn't add in this PR but if that make sense I can update. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117590 Approved by: https://github.com/voznesenskym	2024-01-18 17:32:31 +00:00
Animesh Jain	6e4e81a9ef	[dynamo] Extend LazyVariableTracker to tuples (#117426 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117426 Approved by: https://github.com/lezcano, https://github.com/jansel	2024-01-18 15:51:28 +00:00
Francisco Massa	8bf788c390	[SAC][Dynamo] Add support for functools.partial in CheckpointHigherOrderVariable (#117657 ) # Context In some cases, we might want to build the `context_fn` with runtime-defined policies. One way of implementing this is to make `context_fn` be a partial, which holds the information that we want to pass. One concrete example is the [automatic policy selection from `xformers`](`ad986981b1/xformers/checkpoint.py (L185)`). # The problem The previous implementation wouldn't work with partials because `FunctoolsPartialVariable` doesn't have a `fn` attribute. This PR addresses this case, but ideally we could get this solved in a more general fashion, as callable classes and `NestedUserFunctionVariable` are not supported by this PR. # Tests I've added a basic test that mimics the tests around it. The tests could probably be simplified, but I've decided to keep changes to a minimum. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117657 Approved by: https://github.com/yf225	2024-01-18 11:59:23 +00:00
PyTorch MergeBot	b0084be114	Revert "Re-enable SGD (#117434 )" This reverts commit `e7fac72be7`. Reverted https://github.com/pytorch/pytorch/pull/117434 on behalf of https://github.com/lezcano due to breaks test_profiler.py when run with dynamo ([comment](https://github.com/pytorch/pytorch/pull/117434#issuecomment-1898311961))	2024-01-18 11:37:36 +00:00
lezcano	4ba5318d3f	[dynamo] Add DictView variable tracker (#108420 ) This also starts a comparison pattern where we don't ask variables what's their type, but what are their capabilities. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108420 Approved by: https://github.com/jansel ghstack dependencies: #112252, #117630, #110524	2024-01-18 09:37:33 +00:00
lezcano	f4df0f061c	Implement set in terms of dict (#110524 ) This allows to heavily simplify the implementation of set, which was "quite unique". Now we represent a set a as a dict where all its values are None. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110524 Approved by: https://github.com/jansel ghstack dependencies: #112252, #117630	2024-01-18 09:36:41 +00:00
lezcano	bc85eb948f	Break on unsupported keys for dicts / elements for sets (#117630 ) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/117630 Approved by: https://github.com/jansel ghstack dependencies: #112252	2024-01-18 09:35:46 +00:00
lezcano	4512a95371	[easy]Remove specialized value (#112252 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112252 Approved by: https://github.com/jansel	2024-01-18 09:34:50 +00:00
Yue Dong	57ca455471	[dynamo] Add hasattr support for TupleVariable (#117694 ) Summary: This change adds support hasattr support for TupleVariable in dynamo. This fix is part of: https://github.com/pytorch/pytorch/issues/117670 Test Plan: Unit test and CI Differential Revision: D52850665 Pull Request resolved: https://github.com/pytorch/pytorch/pull/117694 Approved by: https://github.com/yanboliang	2024-01-18 07:47:43 +00:00
Michael Lazos	e7fac72be7	Re-enable SGD (#117434 ) Re-enables the SGD optimizer now that compile times are more reasonable. [Benchmark run](https://github.com/pytorch/pytorch/actions/runs/7511073761) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117434 Approved by: https://github.com/anijain2305, https://github.com/janeyx99	2024-01-18 06:47:15 +00:00
Animesh Jain	8841d26046	[dynamo] LazyVariable - redirect __str__ to the realized variable __str__ (#117583 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117583 Approved by: https://github.com/lezcano, https://github.com/jansel	2024-01-17 21:12:12 +00:00
Yanbo Liang	735715e6d3	[Dynamo] Make profiler function will be ignored warn only once (#117585 ) Fix #111632 #111622 accidentally reverted #111921, we should bring it back. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117585 Approved by: https://github.com/williamwen42, https://github.com/mlazos, https://github.com/msaroufim	2024-01-17 04:05:45 +00:00
Oguz Ulgen	28bb31e4a5	[Dynamo] Trace autograd.function in dynamo when inputs require grad (#116358 ) (#116897 ) For training graphs (when inputs require grad), previously, we would speculate the forward and backward graph to determine if there are any graph breaks, side effect and etc but would not actually use these speculated graphs. We would just insert a call function node on the graph and later rely on autograd's tracing. This approach does not work for more generalized graphs like graphs that include user defined triton kernels because autograd is not able to do the higher order function conversation. This PR speculates the forward and backward functions and emits them in a HOF that later gets used via templating mechanism. While working on this PR, I have exposed some bugs in the current tracing due to trampoline functions losing the source information resulting in incorrect graphs being produced. I have fixed these source information bugs and killed the trampolines. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116897 Approved by: https://github.com/Skylion007, https://github.com/jansel, https://github.com/voznesenskym	2024-01-16 03:57:13 +00:00
Yanbo Liang	dd2cff1591	[Dynamo] Use isinstance rather than istype when check if python module type (#117022 ) This is to fix a issue from Meta internal use case, where third-party ```DictConfig``` has bug on [```__eq__```](`fd730509ef/omegaconf/dictconfig.py (L596)`) and it triggers Dynamo error because we are using ```obj in [x, y]``` check. Then I found we can use ```isinstance``` to cover all and removing these special cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117022 Approved by: https://github.com/ckluk2, https://github.com/jansel	2024-01-15 23:25:30 +00:00
Sun, Jiayi	d9b265adaf	modify the conditions as PythonModuleVariable (#116856 ) ## Motivation The current code of `value in [torch.backends.cudnn, torch.ops]` requires `value` to have the implementation of `__eq__`. If the value is a custom object and does not implement `__eq__`, dynamo will throw error. For example, ConvolutionOpContext, the custom 'torch._C.ScriptClass' object registered in IPEX, dynamo will throw the following error: torch._dynamo.exc.InternalTorchDynamoError: '__eq__' is not implemented for __torch__.torch.classes.ipex_prepack.ConvolutionOpContext I think this is a common issue, To avoid this issue, the PR replaces the current code `value in [torch.backends.cudnn, torch.ops]`with `isinstance(value, (torch.backends.cudnn.CudnnModule, torch._ops._Ops)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/116856 Approved by: https://github.com/jansel	2024-01-15 11:10:57 +00:00
Aaron Gokaslan	62496ffd0d	[dynamo][easy]: Add support for `operator.truth` (#117463 ) * This is an old builtin function equivalent to the bool constructor. it is easy enough to add support for. * I also realized the tests were in the wrong class (the one reserved for testing default args) so I moved them. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117463 Approved by: https://github.com/jansel	2024-01-14 19:08:31 +00:00
Aaron Gokaslan	bf27dd6df9	Add dynamo support for operator.abs (#117442 ) A test case for operator.abs and allows for constant folding with it. Partially applies to #116396 Pull Request resolved: https://github.com/pytorch/pytorch/pull/117442 Approved by: https://github.com/jansel, https://github.com/malfet	2024-01-13 21:38:55 +00:00
voznesenskym	f008efa8e7	Reconstruct streams via global registration, temporary impl to unblock FSDP (#117386 ) This is a placeholder implementation for reconstructing streams via global storage to unblock FSDP, pending proper stream support design This PR does a few things: 1) fixes registration for devices with indices. We were only supporting "cuda", we now support "cuda:k" interfaces where k is # of gpu 2) Changes the stream objects in dynamo to take devices as device types, instead of strings, and updates the string based device APIs to gracefully take device types. 3) Introduces a reconstruct-by-global (using existing cleanup hook structures) to streams as a placeholder impl for now Pull Request resolved: https://github.com/pytorch/pytorch/pull/117386 Approved by: https://github.com/jansel	2024-01-13 07:03:33 +00:00
Yidi Wu	2bc7da1ab7	[HigherOrderOp] change signature of map_impl (#117161 ) Summary: X-link: https://github.com/pytorch/executorch/pull/1580 This PR changes the schema of map_impl from map_impl(f, num_mapped, *operands) to map_impl(f, mapped_args: Tuple, moperands: Tuple). This is to prepare for turning on dynamo for eager mode map, where we want to get rid of the num_mapped scalar. Test Plan: Existing tests. Differential Revision: D52495413 Pull Request resolved: https://github.com/pytorch/pytorch/pull/117161 Approved by: https://github.com/angelayi, https://github.com/tugsbayasgalan	2024-01-13 02:50:46 +00:00
Animesh Jain	e54b40e5eb	[dynamo] GetItemSource - restrict the supported index Source to be GlobalWeakRefSource (#117138 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117138 Approved by: https://github.com/jansel, https://github.com/mlazos	2024-01-12 18:21:14 +00:00
PyTorch MergeBot	ac0bed01df	Revert "[dynamo] GetItemSource - restrict the supported index Source to be GlobalWeakRefSource (#117138 )" This reverts commit `c278a1b39c`. Reverted https://github.com/pytorch/pytorch/pull/117138 on behalf of https://github.com/zou3519 due to Broke jobs on main, I'm not sure why ([comment](https://github.com/pytorch/pytorch/pull/117138#issuecomment-1888290068))	2024-01-12 01:55:49 +00:00
Animesh Jain	c278a1b39c	[dynamo] GetItemSource - restrict the supported index Source to be GlobalWeakRefSource (#117138 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117138 Approved by: https://github.com/jansel	2024-01-11 23:26:25 +00:00
vfdev-5	7005a4bcb6	[dynamo] Added dyn shapes support for math trigo ops: sin(h), cos(h), tan(h) ... (#114866 ) Description: - Added dynamic shapes support for math trigo ops: sin(h), cos(h), tan(h) ... ```python import math import torch def func(x, a, b): c = 0 c = c + math.sqrt(a) c = c + math.cos(a) c = c + math.cosh(a) c = c + math.sin(a) c = c + math.sinh(a) c = c + math.tan(a) c = c + math.tanh(a) c = c + math.asin(b) c = c + math.acos(b) c = c + math.atan(a) y = x + c return y cfunc = torch.compile(func, dynamic=True, fullgraph=True) device = "cpu" # or "cuda" x = torch.tensor([0, 1, 2, 3], dtype=torch.float32, device=device) a = 12 b = 1 out = cfunc(x, a, b) expected = func(x, a, b) torch.testing.assert_close(out, expected) ``` and the graph `TORCH_LOGS=+graph_code python check_math_ops.py`: <details> <summary> graph code </summary> ``` [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] TRACED GRAPH [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] ===== __compiled_fn_0 ===== [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] <eval_with_key>.0 class GraphModule(torch.nn.Module): [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] def forward(self, L_a_ : torch.SymInt, s1 : torch.SymInt, L_x_ : torch.Tensor): [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] l_a_ = L_a_ [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] l_x_ = L_x_ [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:57, code: c = c + math.sqrt(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_sqrt = torch.sym_sqrt(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add = 0 + sym_sqrt; sym_sqrt = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:58, code: c = c + math.cos(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_cos = torch.sym_cos(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_1 = add + sym_cos; add = sym_cos = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:59, code: c = c + math.cosh(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_cosh = torch.sym_cosh(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_2 = add_1 + sym_cosh; add_1 = sym_cosh = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:60, code: c = c + math.sin(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_sin = torch.sym_sin(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_3 = add_2 + sym_sin; add_2 = sym_sin = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:61, code: c = c + math.sinh(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_sinh = torch.sym_sinh(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_4 = add_3 + sym_sinh; add_3 = sym_sinh = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:62, code: c = c + math.tan(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_tan = torch.sym_tan(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_5 = add_4 + sym_tan; add_4 = sym_tan = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:63, code: c = c + math.tanh(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_tanh = torch.sym_tanh(l_a_) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_6 = add_5 + sym_tanh; add_5 = sym_tanh = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:64, code: c = c + math.asin(b) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_7 = add_6 + 1.5707963267948966; add_6 = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:65, code: c = c + math.acos(b) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_8 = add_7 + 0.0; add_7 = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:66, code: c = c + math.atan(a) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] sym_atan = torch.sym_atan(l_a_); l_a_ = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_9 = add_8 + sym_atan; add_8 = sym_atan = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:67, code: y = x + c [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] y = l_x_ + add_9; l_x_ = add_9 = None [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] return (y,) [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-30 22:16:10,654] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] ``` </details> Generated code with `TORCH_LOGS=+output_code python check_math_ops.py`: <details> <summary> C++ code </summary> ``` [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] cpp_fused_add_0 = async_compile.cpp(''' [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] #include "/tmp/torchinductor_root/2l/c2ljzlm4sosod7u6lyrroqdba6hmfcyijrric6p4t3fhbcmw6osp.h" [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] extern "C" void kernel(const float* in_ptr0, [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] float* out_ptr0, [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] const long ks0, [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] const long ks1) [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] { [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] { [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] #pragma GCC ivdep [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] for(long x0=static_cast<long>(0L); x0<static_cast<long>(ks0); x0+=static_cast<long>(1L)) [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] { [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] auto tmp0 = in_ptr0[static_cast<long>(x0)]; [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] auto tmp1 = c10::convert<float>(1.57079632679490 + (std::sqrt(ks1)) + (std::atan(ks1)) + (std::cos(ks1)) + (std::cosh(ks1)) + (std::sin(ks1)) + (std::sinh(ks1)) + (std::tan(ks1)) + (std::tanh(ks1))); [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] auto tmp2 = decltype(tmp0)(tmp0 + tmp1); [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] out_ptr0[static_cast<long>(x0)] = tmp2; [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] } [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] } [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] } [2023-11-30 22:19:09,709] [0/0] torch._inductor.graph.__output_code: [DEBUG] ''') ``` </details> <details> <summary> Triton code </summary> ``` [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] @pointwise( [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] size_hints=[4], [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] filename=__file__, [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] triton_meta={'signature': {0: 'fp32', 1: 'fp32', 2: 'i32', 3: 'i32'}, 'device': 0, 'device_type': 'cuda', 'constants': {}, 'configs': [instance_descriptor(divisible_by_16=(0, 1), equal_to_1=(), i ds_of_folded_args=(), divisible_by_8=())]}, [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] inductor_meta={'autotune_hints': set(), 'kernel_name': 'triton_poi_fused_add_0', 'mutated_arg_names': []}, [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] min_elem_per_thread=0 [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] ) [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] @triton.jit [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] def triton_(in_ptr0, out_ptr0, ks0, xnumel, XBLOCK : tl.constexpr): [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] xoffset = tl.program_id(0) * XBLOCK [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] xindex = xoffset + tl.arange(0, XBLOCK)[:] [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] xmask = xindex < xnumel [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] x0 = xindex [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] tmp0 = tl.load(in_ptr0 + (x0), xmask) [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] tmp1 = 1.57079632679490 + (tl.math.sqrt(ks0.to(tl.float32))) + (tl.math.atan((ks0).to(tl.float32))) + (tl.math.cos((ks0).to(tl.float32))) + (tl.math.cosh((ks0).to(tl.float32))) + (tl.math.sin((ks0) .to(tl.float32))) + (tl.math.sinh((ks0).to(tl.float32))) + (tl.math.tan((ks0).to(tl.float32))) + (tl.math.tanh((ks0).to(tl.float32))) [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] tmp2 = tmp1.to(tl.float32) [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] tmp3 = tmp0 + tmp2 [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] tl.store(out_ptr0 + (x0), tmp3, xmask) [2023-11-30 22:20:00,383] [0/0] torch._inductor.graph.__output_code: [DEBUG] ''') ``` </details> Pull Request resolved: https://github.com/pytorch/pytorch/pull/114866 Approved by: https://github.com/peterbell10	2024-01-11 11:52:28 +00:00
Michael Lazos	558cc69641	Fix torch function kwarg dispatch (#117083 ) Previously, kwargs were incorrectly dispatched by passing them as the true kwargs to the torch function call. To fix, the kwargs of the original torch op need to be stored in a dictionary and passed as an argument to the torch function implementation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/117083 Approved by: https://github.com/drisspg	2024-01-10 10:55:10 +00:00
Guilherme Leobas	4f3d698cac	Impl. call_hasattr for BaseUserFunctionVariable (#116049 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/116049 Approved by: https://github.com/zou3519	2024-01-09 22:58:58 +00:00
Aaron Gokaslan	1dd4813328	[BE][dynamo]: Add operator is and is not tests to dynamo tests (#116397 ) Adds an operator that was unit not tested in our test suite - improves coverage. Inspired by looking into https://github.com/pytorch/pytorch/pull/116397 after @XuehaiPan brought up some issues with builtins in #116389 Pull Request resolved: https://github.com/pytorch/pytorch/pull/116397 Approved by: https://github.com/albanD, https://github.com/jansel	2024-01-09 21:13:22 +00:00
voznesenskym	4c0d63180a	Support NNModules as dict keys (#116723 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/116723 Approved by: https://github.com/lezcano	2024-01-09 03:32:47 +00:00
voznesenskym	83e8a0721d	Reland #111196 (take 4) "Support tensors as Dict keys" (#116934 ) Fixes #ISSUE_NUMBER See that PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/116934 Approved by: https://github.com/ezyang, https://github.com/huydhn	2024-01-07 01:37:26 +00:00
PyTorch MergeBot	2dca3e99eb	Revert "Support tensors as Dict keys Re-PR of #111196 (#116785 )" This reverts commit `1badad9ce9`. Reverted https://github.com/pytorch/pytorch/pull/116785 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/116785#issuecomment-1879592261))	2024-01-06 08:22:33 +00:00
voznesenskym	1badad9ce9	Support tensors as Dict keys Re-PR of #111196 (#116785 ) This prepares the PR where we implement sets in terms of dicts. To do so, rather than storing internally a dictionary that maps literals to VariableTrackers, it stores (pretty much) a dictionary from VTs to VTs. To do so, keys are wrapped in an opaque internal class _Hashable. The Hashable class is opaque on purpose so that it fails hard if if it inadvertently leaks back into user code. We also found and fixed a number of latent bugs and inconsistencies in the way dynamo checked what can be a dict key. More generally, we make much clearer what are the things that need to be modified to add a new supported key type to Dicts. Fixes [#107595](https://www.internalfb.com/tasks?t=107595) Fixes [#111603](https://www.internalfb.com/tasks?t=111603) Re-PR of https://github.com/pytorch/pytorch/pull/111196 sadly due to reverts, we could not reuse @lezcano's original PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116785 Approved by: https://github.com/mlazos	2024-01-06 03:35:35 +00:00
Oguz Ulgen	8894a97707	[Dynamo] Fix source for autograd.function default value (#116894 ) Before this PR, the source guard would emit ``` globals()['Gradient'].__class__.forward.__defaults__[0] ``` which is incorrect Pull Request resolved: https://github.com/pytorch/pytorch/pull/116894 Approved by: https://github.com/zou3519, https://github.com/yanboliang	2024-01-06 00:36:00 +00:00
Aaron Gokaslan	63ee35c4e0	BugFix: Fix F632 bug in dynamo (if statement is always false) (#116867 ) This was flagged by a preview ruff check as the if statement always evaluating false. Likely a typo between `is` and `in`. I also micro-optimized some list construction into tuple construction, which is semantically identical, but faster. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116867 Approved by: https://github.com/lezcano, https://github.com/albanD, https://github.com/yanboliang	2024-01-05 19:15:05 +00:00
Guoliang He	0159e3abbd	[dynamo] add a handler for itertools_chain_from_iterable and test (#116849 ) 1. add a handler for itertools_chain_from_iterable 2. a test for itertools_chain_from_iterable Fixes #116463 Pull Request resolved: https://github.com/pytorch/pytorch/pull/116849 Approved by: https://github.com/ezyang	2024-01-05 15:14:18 +00:00
Edward Z. Yang	53f8d17d1e	Specialize SymNodeVariable when used as module index (#114377 ) Fixes #114171 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/114377 Approved by: https://github.com/Skylion007	2024-01-05 13:51:52 +00:00
lezcano	7c8f38700a	[dynamo] Fix np.issubdtype (#116459 ) Fixes the issue described at https://github.com/pytorch/pytorch/issues/93697#issuecomment-1828346590 This doesn't fix the full issue yet, now we hit ```python File "/home/lezcano/git/pytorch/pytorch/torch/_dynamo/symbolic_convert.py", line 744, in step getattr(self, inst.opname)(inst) File "/home/lezcano/git/pytorch/pytorch/torch/_dynamo/symbolic_convert.py", line 1366, in BUILD_MAP assert ( AssertionError ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/116459 Approved by: https://github.com/peterbell10	2024-01-05 01:48:07 +00:00
PyTorch MergeBot	75dae4f691	Revert "[dynamo] Fix np.issubdtype (#116459 )" This reverts commit `b5c33ccdb3`. Reverted https://github.com/pytorch/pytorch/pull/116459 on behalf of https://github.com/zou3519 due to Broke CI, seems to be a landrace ([comment](https://github.com/pytorch/pytorch/pull/116459#issuecomment-1877135999))	2024-01-04 14:00:11 +00:00
Tugsbayasgalan Manlaibaatar	81f98f1082	Experimental non-strict mode (#114658 ) This is proof-of-concept implementation of how people can use a marker `mark_strict` to enable torchdynamo while exporting under non-strict mode. The main idea is that `mark_strict` will turn into an HOO which then utilizes dynamo to do correctness analysis in the same way how torch.cond works today. There are some notable limitations: 1. This API is not meant for public use yet 2. Strict region can't work with arbitrary container inputs 3. We don't preserve `nn_module_stack` and other node metadata for the strict region. 4. strict_mode HOO will show up in the final graph. This is undesirable in the long term, but for short term experiments, it should be good enough. Will fix this in the follow up PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114658 Approved by: https://github.com/ydwu4	2024-01-04 12:24:58 +00:00
lezcano	b5c33ccdb3	[dynamo] Fix np.issubdtype (#116459 ) Fixes the issue described at https://github.com/pytorch/pytorch/issues/93697#issuecomment-1828346590 This doesn't fix the full issue yet, now we hit ```python File "/home/lezcano/git/pytorch/pytorch/torch/_dynamo/symbolic_convert.py", line 744, in step getattr(self, inst.opname)(inst) File "/home/lezcano/git/pytorch/pytorch/torch/_dynamo/symbolic_convert.py", line 1366, in BUILD_MAP assert ( AssertionError ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/116459 Approved by: https://github.com/peterbell10	2024-01-04 03:55:50 +00:00
ydwu4	ce2df3f690	[HigherOrderOp] set set_subgraph_inputs to flatten_manual for map (#115853 ) We change manually_set_subgraph_inputs to three modes: manual, automatic and flatten_manual. The flatten_manual wil first flatten the sub_args then recussively call set_subgrah_inputs = "manual". This allows us to control the order of the placeholder shown up in the graph, which is necessary for map, where we want to keep the mapped arguments before the rest positional arguments. Right now, map only takes a single tensor as mapped argument but it would become pretty easy to match the subgraph inputs to original proxy if we have a "flatten_manual" option. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115853 Approved by: https://github.com/zou3519	2024-01-04 00:27:07 +00:00
PyTorch MergeBot	68105da229	Revert "[Dynamo] Trace autograd.function in dynamo when inputs require grad (#116358 )" This reverts commit `97891b184c`. Reverted https://github.com/pytorch/pytorch/pull/116358 on behalf of https://github.com/izaitsevfb due to Breaks internal accuracy test, see D52491095, pytorch/benchmark/fb/test_gpu:run_test_gpu - test_train_ig_feed_over_inductor_accuracy ([comment](https://github.com/pytorch/pytorch/pull/116358#issuecomment-1875779697))	2024-01-03 18:20:51 +00:00
Aaron Gokaslan	bd10fea79a	[BE]: Enable F821 and fix bugs (#116579 ) Fixes #112371 I tried to fix as many of the bugs as I could, a few I could not figure out what the proper fix for them was though and so I left them with noqas. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116579 Approved by: https://github.com/ezyang	2024-01-01 08:40:46 +00:00
Oguz Ulgen	97891b184c	[Dynamo] Trace autograd.function in dynamo when inputs require grad (#116358 ) For training graphs (when inputs require grad), previously, we would speculate the forward and backward graph to determine if there are any graph breaks, side effect and etc but would not actually use these speculated graphs. We would just insert a call function node on the graph and later rely on autograd's tracing. This approach does not work for more generalized graphs like graphs that include user defined triton kernels because autograd is not able to do the higher order function conversation. This PR speculates the forward and backward functions and emits them in a HOF that later gets used via templating mechanism. While working on this PR, I have exposed some bugs in the current tracing due to trampoline functions losing the source information resulting in incorrect graphs being produced. I have fixed these source information bugs and killed the trampolines. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116358 Approved by: https://github.com/jansel	2023-12-30 01:51:30 +00:00
Yanbo Liang	7e12e722af	[Dynamo][12/N] Remove allowed_functions.py (#116401 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/116401 Approved by: https://github.com/angelayi	2023-12-28 21:26:06 +00:00
Yanbo Liang	d59350cc1c	[Dynamo] Consolidate common constant types (#116366 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/116366 Approved by: https://github.com/Skylion007	2023-12-27 23:54:35 +00:00
Yanbo Liang	6375eb15ef	[Dynamo][11/N] allow_in_graph/disallow_in_graph decorator refactor (#116365 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/116365 Approved by: https://github.com/jansel	2023-12-27 23:50:35 +00:00
Xuehai Pan	3149e4a667	[dynamo] fix `sum()` function with `start` argument (#116389 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/116389 Approved by: https://github.com/Skylion007, https://github.com/malfet	2023-12-27 20:42:27 +00:00
Aaron Gokaslan	bbe3261dd3	[BE]: Use `iterable.chain.from_iterable` where possible (#116376 ) This is more readable and more efficient when dealing with lots of sequences to chain together. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116376 Approved by: https://github.com/albanD	2023-12-27 19:20:07 +00:00

1 2 3 4 5 ...

945 Commits