pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
eellison	2a246c5259	update type() calling to not use unneeded device (#110163 ) Previous code path was doing an unnecessary cuda init as well as causing an unnecessary "device" to occur in the jit trace. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110163 Approved by: https://github.com/henryhu6, https://github.com/albanD	2023-09-28 17:34:46 +00:00
Randolf Scholz	837272f150	Python 3.10 Union operator \| support for JIT (#109293 ) Fixes #101777 - [x] Duplicated the tests from `test/jit/test_union.py` into [`test/jit/test_union_pep604.py`](https://github.com/pytorch/pytorch/pull/109293/files#diff-b981f6493093482b43b0e62057b0c01b004b3e932d4e63a1166c3808c0172b83), using PEP604 style Unions - [x] Exchanged custom `get_args` and `get_origin` with `typing.get_args` and `typing.get_origin` which have the same functionality and became part of the standard library in 3.8 - [x] Added utility function `pep604union_to_union` in `tree_views.h` which converts a `BinOP("\|")` node into the corresponding `Union`. This function intercepts `ScriptTypeParser::parseTypeFromExpr` and `ScriptTypeParser::parseTypeFromExprImpl` and patches the expression. - [ ] There is a single failing test, I commented it out for the moment to see if CI complains about anything else. I tried several hours to figure out how to patch it, but I am not experienced with C++ development and debugging. From what I could gather, the following fails: ```python def test_union_optional_of_union_return(self): @torch.jit.script def fn() -> None \| str \| int: y: Optional[int \| str] = "foo" return y ``` In the section: `75b954b715/torch/csrc/jit/frontend/script_type_parser.cpp (L232-L243)` When using regular `Union`, the `resolver` path is taken, whereas with the patch pep604 union, `resolveType` doesn't work. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109293 Approved by: https://github.com/ezyang	2023-09-25 15:35:54 +00:00
Nikita Shulga	55685d57c0	[JIT] Fix typed enum handling in 3.11 (#109717 ) In Python-3.11+ typed enums (such as `enum.IntEnum`) retain `__new__`,`__str__` and so on method of the base class via `__init__subclass__()` method (see https://docs.python.org/3/whatsnew/3.11.html#enum ), i.e. following code ```python import sys import inspect from enum import Enum class IntColor(int, Enum): RED = 1 GREEN = 2 class Color(Enum): RED = 1 GREEN = 2 def get_methods(cls): def predicate(m): if not inspect.isfunction(m) and not inspect.ismethod(m): return False return m.__name__ in cls.__dict__ return inspect.getmembers(cls, predicate=predicate) if __name__ == "__main__": print(sys.version) print(f"IntColor methods {get_methods(IntColor)}") print(f"Color methods {get_methods(Color)}") ``` Returns empty list for both cases for older Python, but on Python-3.11+ it returns list contains of enum constructors and others: ```shell % conda run -n py310 python bar.py 3.10.12 \| packaged by conda-forge \| (main, Jun 23 2023, 22:41:52) [Clang 15.0.7 ] IntColor methods [] Color methods [] % conda run -n py311 python bar.py 3.11.0 \| packaged by conda-forge \| (main, Oct 25 2022, 06:21:25) [Clang 14.0.4 ] IntColor methods [('__format__', <function Enum.__format__ at 0x105006ac0>), ('__new__', <function Enum.__new__ at 0x105006660>), ('__repr__', <function Enum.__repr__ at 0x1050068e0>)] Color methods [] ``` This change allows typed enums to be scriptable on 3.11, by explicitly marking several `enum.Enum` method to be dropped by jit script and adds test that typed enums are jit-scriptable. Fixes https://github.com/pytorch/pytorch/issues/108933 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109717 Approved by: https://github.com/atalman, https://github.com/davidberard98	2023-09-20 22:09:41 +00:00
David Berard	405f014c26	[jit] Skip NNAPI, test_ivalue, CPU NNC tests in fbcode (#108937 ) Summary: NNAPI: Internal test infra can't find test_nnapi.py. Easiest solution is to just skip these tests if test_nnapi.py can't be found test_ivalue: fails due to qscheme op not implemented for CPU backend. In OSS, it doesn't run because it's not included in test_jit.py. CPU NNC tests: test_working_byte_cpu_float32 is failing, but hard to repro; we don't use CPU NNC internally, so let's just skip CPU NNC tests internally. Differential Revision: D48041615 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108937 Approved by: https://github.com/eellison	2023-09-11 22:42:30 +00:00
Kurt Mohler	3f88e3105f	Reland: Remove remaining global `set_default_dtype` calls from tests (#108088 ) Fixes #68972 Relands #107246 To avoid causing Meta-internal CI failures, this PR avoids always asserting that the default dtype is float in the `TestCase.setUp/tearDown` methods. Instead, the assert is only done if `TestCase._default_dtype_check_enabled == True`. `_default_dtype_check_enabled` is set to True in the `if __name__ == "__main__":` blocks of all the relevant test files that have required changes for this issue Pull Request resolved: https://github.com/pytorch/pytorch/pull/108088 Approved by: https://github.com/ezyang	2023-09-07 03:04:34 +00:00
PyTorch MergeBot	161ea463e6	Revert "Remove remaining global `set_default_dtype` calls from tests (#107246 )" This reverts commit `aa8ea1d787`. Reverted https://github.com/pytorch/pytorch/pull/107246 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/107246#issuecomment-1693838522))	2023-08-25 19:34:55 +00:00
Kurt Mohler	aa8ea1d787	Remove remaining global `set_default_dtype` calls from tests (#107246 ) Fixes #68972 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107246 Approved by: https://github.com/ezyang	2023-08-24 16:10:48 +00:00
David Berard	25d87c8301	torch.ops.aten.: sort aten ops before jit overloads (#107138 ) Summary: In fbcode, aten and jit ops can get registered in different orders depending on build mode. In dev mode, aten is registered first; in opt mode, jit is registered first. This causes problems in torch.ops.aten. calls; these calls use `torch._C._jit_get_operation`, which selects an overload based on the inputs to the call. It searches through the overloads for the op with the given name, and chooses the first one that matches the input types. "First" depends on whether aten or jit ops were registered first - e.g. in `test_both_scalars_cuda` in opt mode, it chooses `add.complex` and returns a complex value. We also saw this issue in https://github.com/pytorch/pytorch/pull/103576. This PR sorts the list of overloads first, putting the aten ops first. Differential Revision: D48304930 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107138 Approved by: https://github.com/ezyang, https://github.com/eellison	2023-08-17 03:05:59 +00:00
Philip Meier	a926be39d4	torch.jit.script escape hatch (#106229 ) Although the sun is setting for torchscript, it is not [officially deprecated](https://github.com/pytorch/pytorch/issues/103841#issuecomment-1605017153) since nothing currently fully replaces it. Thus, "downstream" libraries like TorchVision, that started offering torchscript support still need to support it for BC. torchscript has forced us to use workaround after workaround since forever. Although this makes the code harder to read and maintain, we made our peace with it. However, we are currently looking into more elaborate API designs that are severely hampered by our torchscript BC guarantees. Although likely not intended as such, while looking for ways to enable our design while keeping a subset of it scriptable, we found the undocumented `__prepare_scriptable__` escape hatch: `0cf918947d/torch/jit/_script.py (L977)` One can define this method and if you call `torch.jit.script` on the object, the returned object of the method will be scripted rather than the original object. In TorchVision we are using exactly [this mechanism to enable BC](`3966f9558b/torchvision/transforms/v2/_transform.py (L122-L136)`) while allowing the object in eager mode to be a lot more flexible (`args, *kwargs`, dynamic dispatch, ...). Unfortunately, this escape hatch is only available for `nn.Module`'s `0cf918947d/torch/jit/_script.py (L1279-L1283)` This was fine for the example above since we were subclassing from `nn.Module` anyway. However, we recently also hit a case [where this wasn't the case](https://github.com/pytorch/vision/pull/7747#issuecomment-1642045479). Given the frozen state on JIT, would it be possible to give us a general escape hatch so that we can move forward with the design unconstrained while still keeping BC? This PR implements just this by re-using the `__prepare_scriptable__` hook. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106229 Approved by: https://github.com/lezcano, https://github.com/ezyang	2023-08-11 18:24:46 +00:00
shibo19	bb2fcc7659	unify TEST_CUDA (#106685 ) Fixes #ISSUE_NUMBER as title, unify TEST_CUDA Pull Request resolved: https://github.com/pytorch/pytorch/pull/106685 Approved by: https://github.com/zou3519	2023-08-10 09:01:36 +00:00
Jason Lu	bc88028e8e	Back out "Reland "Make adding buffers more like adding parameters (#104069 )" (#106224 )" (#106743 ) Summary: Original commit changeset: 81319beb97f3 Original Phabricator Diff: D47961182 Test Plan: revert to maintain backward compat with legacy ads_dper3 production package. Read details in: S357822 Reviewed By: atuljangra Differential Revision: D48131623 @diff-train-skip-merge (D48131623 landed internally) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106743 Approved by: https://github.com/malfet	2023-08-08 15:27:34 +00:00
David Berard	3322bfb66e	[jit] test_complexity.py - don't set default dtype in global scope (#106486 ) Summary: Depending on import order, this was sometimes causing another assert to fail: `aec8418bd9/torch/testing/_internal/jit_metaprogramming_utils.py (L20)` Differential Revision: D48011132 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106486 Approved by: https://github.com/eellison	2023-08-03 02:50:15 +00:00
Mikayla Gawarecki	d8e5f2aa6d	Reland "Make adding buffers more like adding parameters (#104069 )" (#106224 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106224 Approved by: https://github.com/atalman, https://github.com/albanD	2023-07-31 17:18:56 +00:00
Aaron Gokaslan	6d43c89f37	[BE]: Update Ruff to 0.0.280 (#105724 ) Removes unusued loop values in python dictionary iteration. Automated fix from Ruff master Pull Request resolved: https://github.com/pytorch/pytorch/pull/105724 Approved by: https://github.com/ezyang, https://github.com/janeyx99	2023-07-22 23:03:34 +00:00
Edward Z. Yang	0af18f2234	Unify TEST_CUDNN definition (#105594 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105594 Approved by: https://github.com/larryliu0820, https://github.com/voznesenskym	2023-07-20 16:10:26 +00:00
PyTorch MergeBot	154d89b224	Revert "Unify TEST_CUDNN definition (#105594 )" This reverts commit `1ea153a11d`. Reverted https://github.com/pytorch/pytorch/pull/105594 on behalf of https://github.com/PaliC due to breaks periodic test `distributed/_tensor/test_dtensor.py::TestDynamoDTensor::test_dynamo_dtensor` ([comment](https://github.com/pytorch/pytorch/pull/105594#issuecomment-1644166414))	2023-07-20 15:48:25 +00:00
Edward Z. Yang	1ea153a11d	Unify TEST_CUDNN definition (#105594 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105594 Approved by: https://github.com/larryliu0820, https://github.com/voznesenskym	2023-07-20 08:36:58 +00:00
Andrey Talman	c6653b65d8	Back out "Make adding buffers more like adding parameters (#104069 )" (#105581 ) Summary: D47537831 is breaking pyper tests: https://fb.workplace.com/groups/802176577445480/posts/1018902842439518/ with `TypeError: register_buffer() takes 3 positional arguments but 4 were given` Original commit changeset: d4b4069fbd38 Original Phabricator Diff: D47537831 Test Plan: ``` buck2 run //caffe2/torch/fb/training_toolkit/integration_tests/training_lifecycle/cogwheel_tests/pyper_release_v2:cogwheel_smallworld_inline_cvr_infer_pyper_pyper__canary_offline_training-launcher -- --run-harness-in-tupperware --build-fbpkg ads_dper3 --build-fbpkg training_platform ``` Reviewed By: atalman Differential Revision: D47600140 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105581 Approved by: https://github.com/mikaylagawarecki	2023-07-20 03:39:53 +00:00
Justin Chu	73e1455327	[BE] Enable ruff's UP rules and autoformat test/ (#105434 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105434 Approved by: https://github.com/albanD	2023-07-19 20:36:06 +00:00
ekamiti	32d422f335	Make adding buffers more like adding parameters (#104069 ) Add similar semantics for creating a buffer object similar to creating a parameter. This is done by introducing a new `Buffer` class that can be used for type disambiguation. The underlying functionality of registering a buffer remains the same as the `register_buffer` method has not been changed. The `persistent` parameter in the `Buffer` type is to indicate whether a buffer object should be persistent or not. Other non-test changes have to do with getting the new `Buffer` type recognized by inductor and dynamo. Remaining changes are test changes to make sure that the `Buffer` type can be used as a drop in replacement for `register_buffer` as it just leads to `register_buffer` being called. The addition of this new functionality still allows for normal tensors to be used as buffers so these changes are intended to be backwards compatible. Fixes #35735 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104069 Approved by: https://github.com/mikaylagawarecki	2023-07-17 17:59:05 +00:00
Kurt Mohler	ffce2492af	Remove set_default_dtype calls from jit and ops tests (#105072 ) Part of #68972 This only attempts to avoid setting the default dtype for `test_jit.py` and `test_ops.py`. There are other tests, like `test_nn.py`, which will be addressed in follow up PRs Pull Request resolved: https://github.com/pytorch/pytorch/pull/105072 Approved by: https://github.com/ezyang	2023-07-15 03:18:33 +00:00
Aaron Gokaslan	2f95a3d0fc	[BE]: Apply ruff PERF fixes to torch (#104917 ) Applies automated ruff fixes in the PERF modules and enables all automatic ones. I also updated ruff which applied some additional fixes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104917 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-07-11 20:45:21 +00:00
David Berard	13ea3d8530	[jit] Fix inspect.get_annotations usage in python >= 3.10 (#104485 ) Fixes #104484 For >= 3.10, we use `inspect.get_annotations` instead of `getattr(.., "__annotations__")`. [Docs](https://docs.python.org/3/library/inspect.html#inspect.get_annotations) say that get_annotations() "Ignores inherited annotations on classes. If a class doesn’t have its own annotations dict, returns an empty dict.". In practice though, this doesn't seem always true; until you call inspect.getmembers it seems like you still get inherited annotations. In particular, this means that if you script a certain type twice, the first time it may pass scripting but on the second try it may not pass scripting. This PR adds a more comprehensive handling of get_annotations by recursively reading the annotations of the base types. (TorchScript doesn't officially support this; but since it worked in <3.10, it's now breaking internal stuff as python gets upgraded to 3.10) Verified in #104486 that the test does actually fail before the changes in this PR were added. Differential Revision: [D47163891](https://our.internmc.facebook.com/intern/diff/D47163891) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104485 Approved by: https://github.com/eellison	2023-07-06 00:37:47 +00:00
Peter Bell	12ca224662	Add hacked_twin overloads for _unsafe indexing functions (#104127 ) Fixes #104037 This hacky workaround already exists for the normal overloads. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104127 Approved by: https://github.com/ezyang	2023-07-05 01:05:27 +00:00
cyy	54cb61f7d9	enable ASAN on some tests (#103647 ) Enabling more tests on ASAN, meanwhile we disable float-divide-by-zero and float-cast-overflow, both are disabled because they are also disabled by default in latest clang. The following cited doc explains the reasons. ``` -fsanitize=float-cast-overflow: Conversion to, from, or between floating-point types which would overflow the destination. Because the range of representable values for all floating-point types supported by Clang is [-inf, +inf], the only cases detected are conversions from floating point to integer types. -fsanitize=float-divide-by-zero: Floating point division by zero. This is undefined per the C and C++ standards, but is defined by Clang (and by ISO/IEC/IEEE 60559 / IEEE 754) as producing either an infinity or NaN value, so is not included in -fsanitize=undefined. ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/103647 Approved by: https://github.com/kit1980	2023-06-28 02:17:14 +00:00
nihuini	7157dfdd4a	[jit] fix duplicated module input and output values in tracing module (#102510 ) remap shall record the original inp pointers instead of remapped ones testcase ```python import torch import torch.nn as nn import torch.nn.functional as F class Normalize(nn.Module): def __init__(self): super().__init__() self.norm = nn.GroupNorm(num_groups=32, num_channels=32) def forward(self, x, y): if y is None: y = x else: y = self.norm(y) y = y * 2 return y class G(nn.Module): def __init__(self): super().__init__() self.norm = Normalize() def forward(self, x): A = self.norm(x, None) B = F.relu(A) return A, B class Net(nn.Module): def __init__(self): super().__init__() self.g = G() self.norm_1 = Normalize() def forward(self, x): hs = self.g(x) A, B = hs h = self.norm_1(B, A) return h net = Net() net = net.eval() x = torch.randn(1, 32, 16, 16) traced = torch.jit.trace(net, x) print(traced.graph) ``` without this patch, there are duplicated lifted values, %80, %81, %82, %83, %84, %85 ``` graph(%self.1 : __torch__.Net, %x : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu)): %norm_1 : __torch__.___torch_mangle_1.Normalize = prim::GetAttr[name="norm_1"](%self.1) %g : __torch__.G = prim::GetAttr[name="g"](%self.1) %86 : (Tensor, Tensor, Tensor, Tensor, Tensor, Tensor, Tensor) = prim::CallMethod[name="forward"](%g, %x) %79 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu), %80 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu), %81 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu), %82 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu), %83 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu), %84 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu), %85 : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu) = prim::TupleUnpack(%86) %87 : Tensor = prim::CallMethod[name="forward"](%norm_1, %79, %80, %81, %82, %83, %84, %85) return (%87) ``` with this patch ``` graph(%self.1 : __torch__.Net, %x : Float(1, 32, 16, 16, strides=[8192, 256, 16, 1], requires_grad=0, device=cpu)): %norm_1 : __torch__.___torch_mangle_1.Normalize = prim::GetAttr[name="norm_1"](%self.1) %g : __torch__.G = prim::GetAttr[name="g"](%self.1) %71 : Tensor = prim::CallMethod[name="forward"](%g, %x) %72 : Tensor = prim::CallMethod[name="forward"](%norm_1, %71) return (%72) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/102510 Approved by: https://github.com/davidberard98	2023-06-27 03:43:06 +00:00
Tarun Karuturi	6d2da6106d	Raise AttributeError in _OpsNamespace if __self__ attribute is requested (#104096 ) Summary: Trying to get the `__self__` attribute on any `_OpNamespace` object should be an invalid operation. The `__self__` attribute only exists on instance method object and not on class objects. In [dynamo](`a152b3e3b8/torch/_dynamo/variables/torch.py (L164)`) there is code that tries to access the `__self__` attribute on `TorchVariable`, this currently results in an expensive call to `torch._C._jit_get_operation` [here](`a152b3e3b8/torch/_ops.py (L740)`) which ultimately fails and throws an exception. For cases where it fails the operation turns out to be quite expensive on the order of ~0.03s. For edge use cases when exporting large models with quantized ops this exception is thrown 100's of times resulting in a lot of time wasted. By preventing the call to `torch._C._jit_get_operation` we can quickly return from this function and significantly reduce export times. On a large ASR model for example export currently takes ~405 seconds. With this change we can reduce it to ~340s. Overall this should also be a harmless change as no one should mostly ever try to access the `__self__` attribute on any `_OpNamespace` object. Test Plan: Added test case. Differential Revision: D46959879 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104096 Approved by: https://github.com/larryliu0820, https://github.com/ezyang, https://github.com/zou3519	2023-06-27 01:42:06 +00:00
ecao	223f232928	Fix shape function for transpose convolution (#102139 ) Fixes #98129. Fixes the shape function for jit conv_transpose, as defined by the documentation https://pytorch.org/docs/stable/generated/torch.nn.ConvTranspose2d.html#torch.nn.ConvTranspose2d, includes output_padding. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102139 Approved by: https://github.com/mingfeima, https://github.com/davidberard98	2023-06-21 17:50:56 +00:00
Nikita Shulga	4cfa06f706	[BE] Deprecate `has_XYZ` attributes (#103279 ) Use [`__getattr__`](https://peps.python.org/pep-0562/) to raise warningwhen one tries to access `has_XYZ` methods and recommend appropriate `torch.backends.XYZ` methods Make respective properties in `torch._C` private (by prefixing them with underscore), to exclude from `from torch._C import *`. Added `warnings.simplefilter` to workaround Python-3.11 torch.compile lineinfo issue. Fixes https://github.com/pytorch/pytorch/issues/102484 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103279 Approved by: https://github.com/janeyx99, https://github.com/Skylion007	2023-06-10 05:17:17 +00:00
shibo19	213e10dc3d	fix bug in trace model when out-operator has more than one output (#101563 ) Fixes #https://github.com/pytorch/pytorch/issues/101960 when I trace a func to run out-operator has more than one output, I got the error. This is because the situation when the output of the out operator is greater than 1 is not handled. ``` def test_trace_out_operator_with_two_output(): example_input = torch.rand(2, 8) out_1, out_2 = torch.cummax(example_input, 1) def run_cummax(example_input, out_1, out_2): output_1, output_2 = torch.cummax(example_input, 1, out=(out_1, out_2)) return output_1, output_2 trace_model = torch.jit.trace(run_cummax, (example_input, out_1, out_2)) and the error info: raise TracingCheckError( torch.jit._trace.TracingCheckError: Tracing failed sanity checks! encountered an exception while running the trace with test inputs ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101563 Approved by: https://github.com/jgong5, https://github.com/EikanWang, https://github.com/davidberard98	2023-05-31 19:39:52 +00:00
Aaron Gokaslan	738ba13b35	[BE]: enable PLE error codes in ruff and fix bugs (#101079 ) Enables PyLint error codes implemented in ruff. These are un-opinionated static analysis checks on Python code that finds common bugs. After running all the PLE error codes that are implemented in ruff, I fixed the bugs, added a few ignores for malformed Python code that is part of our JIT test script, and finally added a few ignores for a false positive on PLE0605 and submitted an issue upstream to fix in ruff https://github.com/charliermarsh/ruff/issues/4345 . Common bugs found here include analysis for malformed logging format calls, bad string format calls, invalid escape sequences, and more. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101079 Approved by: https://github.com/malfet	2023-05-11 23:57:25 +00:00
Yanbo Liang	075d36d37f	[Dynamo] Fix nested function resume execution (#100426 ) Fixes #99665 Let me explain the root cause using the unit test I added: * This bug is triggered when: * ```wrapped``` is a nested function. * ```wrapped``` is in another module which is different from the main function ```fn```. * There is a graph break inside of ```wrapped```. * The root cause is when resuming nested function, actually we are using the outermost function(```fn``` in my example)'s global variables, but ```wrapped``` calls ```inner_func``` which is not part of ```fn```'s globals, so we have to set correct globals when nested function resume execution. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100426 Approved by: https://github.com/jansel	2023-05-11 03:10:23 +00:00
PyTorch MergeBot	4b8127b90e	Revert "[Dynamo] Fix nested function resume execution (#100426 )" This reverts commit `d719f0276d`. Reverted https://github.com/pytorch/pytorch/pull/100426 on behalf of https://github.com/jeanschmidt due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/100426#issuecomment-1540915913))	2023-05-09 21:32:13 +00:00
Janet Yang	812cadf90a	[3/n] loading meta to device (#100495 ) Summary: Make it possible to `torch.jit.load(model, device)` to a device when `model` contains weights that are on device `meta`. Just leave the `meta` weights on `meta`, and load the weights that can be loaded to the target device. Reviewed By: singlaiiit, RoshanPAN, sayitmemory Differential Revision: D45099145 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100495 Approved by: https://github.com/houseroad	2023-05-08 22:14:38 +00:00
Yanbo Liang	d719f0276d	[Dynamo] Fix nested function resume execution (#100426 ) Fixes #99665 Let me explain the root cause using the unit test I added: * This bug is triggered when: * ```wrapped``` is a nested function. * ```wrapped``` is in another module which is different from the main function ```fn```. * There is a graph break inside of ```wrapped```. * The root cause is when resuming nested function, actually we are using the outermost function(```fn``` in my example)'s global variables, but ```wrapped``` calls ```inner_func``` which is not part of ```fn```'s globals, so we have to set correct globals when nested function resume execution. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100426 Approved by: https://github.com/jansel	2023-05-06 05:04:50 +00:00
David Berard	cde35b4069	[JIT] clarify errors due to non-literal indexing into ModuleList, ModuleDict (#98606 ) TorchScript only supports indexing into ModuleLists with integer literals. The error message already warns about this; but this PR adds clarifications around what a "literal" is. I'm adding this PR because, in my opinion, it's not obvious what a "literal" is and how strict its definition is. The clarification provided in this PR should make it easier for users to understand the issue and how to fix it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98606 Approved by: https://github.com/eellison, https://github.com/gmagogsfm	2023-04-18 02:53:53 +00:00
Aaron Gokaslan	85f38b8a33	[BE] Update flake8-comprehensions and adapt to rule C418 (#99178 ) Applies rule C418 and fixes all instances of it. Also updates flake8-comprehension Pull Request resolved: https://github.com/pytorch/pytorch/pull/99178 Approved by: https://github.com/ezyang	2023-04-15 15:33:42 +00:00
Lu Fang	df43fef87f	Support >4GB strings in the TorchScript model (#99104 ) Summary: The support of BINUNICODE8 is missing. So adding it. So we can support attributes > 4GB. For example, for very large model, we save the lowered model in the EngineHolder using a string attribute. Test Plan: buck2 test mode/opt //caffe2/test:jit -- --exact 'caffe2/test:jit - test_save_load_large_string_attribute (jit.test_save_load.TestSaveLoad)' Differential Revision: D44905770 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99104 Approved by: https://github.com/qihqi	2023-04-14 18:46:19 +00:00
Vivek Khandelwal	f501234be0	Add test for squeeze.dims shape function (#98144 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98144 Approved by: https://github.com/davidberard98	2023-04-13 08:43:55 +00:00
Vivek Khandelwal	bb4998b531	Add shape function for `aten::cross_entropy_loss` (#97875 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/97875 Approved by: https://github.com/davidberard98	2023-04-12 22:11:56 +00:00
Edward Z. Yang	5a7aad9681	Convert logging f-strings to use % format, part four (#98705 ) This does multi-line concatenated string literals. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98705 Approved by: https://github.com/voznesenskym	2023-04-11 13:17:59 +00:00
Aidyn-A	69eef5a4be	[CUDA12] set_device change (#94864 ) This PR adds workaround for CUDA 12 [`cudaSetDevice` change](https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__DEVICE.html#group__CUDART__DEVICE_1g159587909ffa0791bbe4b40187a4c6bb) which will always create primary context on target device. So operations like this: ```Python import torch x = torch.randn(1, device="cuda:1") ``` would always create primary context on on device `cuda:1` because it is creating a tensor on it and on device `cuda:0` because the destructor of CUDA Device guard calls `cudaSetDevice(0)`. After this PR the CUDA Device guard will not call `cudaSetDevice(0)` if primary context does not exist on `cuda:0`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94864 Approved by: https://github.com/malfet, https://github.com/atalman, https://github.com/ezyang	2023-04-10 17:31:12 +00:00
Edward Z. Yang	9a8f71f23e	Convert logging f-strings to use % format (#98697 ) Codemod done with https://gist.github.com/ezyang/2e8b0463cdc6be278478495b23ff0530 with assistance from ChatGPT. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98697 Approved by: https://github.com/voznesenskym	2023-04-10 12:19:31 +00:00
Han Qi (qihqi)	4adae2d1ae	Enable flatbuffer tests properly. (#98363 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/98363 Approved by: https://github.com/angelayi	2023-04-07 22:36:19 +00:00
PyTorch MergeBot	279ca5f9db	Revert "[CUDA12] set_device change (#94864 )" This reverts commit `c18be2b2ec`. Reverted https://github.com/pytorch/pytorch/pull/94864 on behalf of https://github.com/ezyang due to avoid affecting cuda 11	2023-04-05 14:53:00 +00:00
Aidyn-A	c18be2b2ec	[CUDA12] set_device change (#94864 ) This PR adds workaround for CUDA 12 [`cudaSetDevice` change](https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__DEVICE.html#group__CUDART__DEVICE_1g159587909ffa0791bbe4b40187a4c6bb) which will always create primary context on target device. So operations like this: ```Python import torch x = torch.randn(1, device="cuda:1") ``` would always create primary context on on device `cuda:1` because it is creating a tensor on it and on device `cuda:0` because the destructor of CUDA Device guard calls `cudaSetDevice(0)`. After this PR the CUDA Device guard will not call `cudaSetDevice(0)` if primary context does not exist on `cuda:0`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94864 Approved by: https://github.com/malfet, https://github.com/atalman, https://github.com/ezyang	2023-04-05 14:34:00 +00:00
Wang, Yi A	8564ed24a8	do not need to check if element in dict input is Tensor. (#97866 ) sometimes it's a tuple with tensor element such as past value key in text generation case Fixes https://github.com/pytorch/pytorch/issues/97229 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97866 Approved by: https://github.com/jgong5, https://github.com/davidberard98	2023-03-31 19:39:00 +00:00
Aaron Gokaslan	597b558c51	[BE]: Update flake8 and plugins and fix bugs (#97795 ) Update flake8 and flake8-plugins in lintrunner to a modern version. Enables more checks and makes flake8 checks significantly faster. Added a few additional rule ignores that will need to be fixed in the future. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97795 Approved by: https://github.com/alexsio27444, https://github.com/janeyx99, https://github.com/ezyang	2023-03-28 23:51:55 +00:00
David Berard	dc3d6fe6b0	[jit][easy] add missing quotes in namedtuple forwardref tests (#97736 ) Follow-up to #96933. This test was intended to have quotes around the type annotations for the attributes of the NamedTuple. This PR adds this missing quotes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97736 Approved by: https://github.com/eellison	2023-03-28 21:44:01 +00:00
Sergii Dymchenko	b24052b1d9	Make test_binary_shape_functions actually test the ops (#90566 ) Because of the break, only operator.__mul__ was actually tested. <!-- copilot:poem --> ### <samp>🤖 Generated by Copilot at 0e6aaa1</samp> > _`break` statement gone_ > _loop tests all shape functions_ > _symbolic winter_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/90566 Approved by: https://github.com/huydhn, https://github.com/malfet	2023-03-27 21:03:59 +00:00

1 2 3 4 5 ...

825 Commits