pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Aaron Gokaslan	9c3fbe7475	[BE] Enable flake8-simplify checks (#97984 ) Enable some sensible flake8-simplify rules. Mainly wanted to enable the SIM101, and `yield from` SIM103 checks. @kit1980 since you wanted to be tagged on this CI check. Enabling this check also helped flag one logical bug so it's definitely beneficial (also fixed in this PR). Pull Request resolved: https://github.com/pytorch/pytorch/pull/97984 Approved by: https://github.com/ezyang	2023-03-31 03:40:21 +00:00
William Wen	24a5d006f2	[dynamo 3.11] Refactor create_instruction (#96499 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96499 Approved by: https://github.com/jansel, https://github.com/albanD	2023-03-30 17:05:27 +00:00
Yanbo Liang	7fcf8b1829	[Dynamo] Support torch.{cuda/cpu}.amp.autocast (#95416 ) For Meta internal use cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95416 Approved by: https://github.com/jansel	2023-03-10 21:48:08 +00:00
PyTorch MergeBot	3ce1e15cf7	Revert "[Dynamo] Support torch.{cuda/cpu}.amp.autocast (#95416 )" This reverts commit `c88aa336aa`. Reverted https://github.com/pytorch/pytorch/pull/95416 on behalf of https://github.com/huydhn due to Sorry for reverting your PR. But it seems that the smoke test issue is related as it starts to fail consistently in trunk https://hud.pytorch.org/hud/pytorch/pytorch/master/1?per_page=50&name_filter=inductor_torchbench_smoketest_perf	2023-03-08 06:51:57 +00:00
Yanbo Liang	c88aa336aa	[Dynamo] Support torch.{cuda/cpu}.amp.autocast (#95416 ) For Meta internal use cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95416 Approved by: https://github.com/jansel	2023-03-08 01:40:27 +00:00
Yanbo Liang	12ab4f08b7	[Dynamo] No graph break on namedtuple and potential other functions (#96122 ) ```collections.namedtuple``` caused 40+ ```dynamo.export``` testing failing in 14k github models. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96122 Approved by: https://github.com/jansel, https://github.com/mlazos	2023-03-07 08:00:21 +00:00
jon-chuang	7a192cc51c	dynamo: wrap graph break inst in try except block - with context manager setup/teardown (#94758 ) Replacement to https://github.com/pytorch/pytorch/pull/94672. Follow up to https://github.com/pytorch/pytorch/pull/94137. We simply replace the set grad mode try except blocks with one for a more generic contextmanager (using `__enter__` and `__exit__`), storing the context manager into a `symbolic_local` for the duration of the try block. (see https://github.com/pytorch/torchdynamo/issues/207 for the original motivation) This allows us to handle calling inner functions with graph breaks for any arbitrarily deep nesting of live context managers subclassing `AbstractContextManager`. (see tests) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94758 Approved by: https://github.com/yanboliang	2023-03-06 14:04:17 +00:00
Yanbo Liang	6ca286df69	[Dynamo] Support call dict with list/tuple as input (#95928 ) Fixes Meta internal use case Pull Request resolved: https://github.com/pytorch/pytorch/pull/95928 Approved by: https://github.com/jansel	2023-03-04 05:52:33 +00:00
Yanbo Liang	02d44e5de4	[Dynamo] Support CUDA stream passed from outside of torch.compile decrator (#94627 ) Fixes #94499 Pull Request resolved: https://github.com/pytorch/pytorch/pull/94627 Approved by: https://github.com/jansel	2023-02-25 19:15:59 +00:00
Yanbo Liang	b5ff41a47a	[Dynamo] No graph break on calling dict & collections.OrderedDict() (#95250 ) It's common to call ```dict()``` or ```collections.OrderedDict()``` inside of ```forward``` function, so we should not graph break. This pattern has been used in many places including: * The use case in [torchvision]( `928b05cad3/torchvision/models/_utils.py (L66-L73)`). * It causes ~100 model failures(nopython=True) in the 14k github models. * Also it hits several Meta internal use cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95250 Approved by: https://github.com/jansel	2023-02-23 09:03:07 +00:00
William Wen	055a9e45aa	[dynamo 3.11] changes to LOAD_GLOBAL and function calls (#94098 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94098 Approved by: https://github.com/albanD	2023-02-21 18:47:30 +00:00
jon-chuang	d1d5d16df3	dynamo: handle straight-line graph breaks for autocast context manager with constant args (#94137 ) Fixes https://github.com/pytorch/pytorch/issues/93890 We do the following: 1. fix __init__constructor for `AutocastModeVariable` with exisiting `mode` while copying 2. `resume_execution` is made aware of constant args (`target_values`), by storing said args in `ReenterWith`. To propagate between subgraphs (in straightline code), we also store the constant args in the downstream's `code_options["co_consts"]` if not already. --- Future work: 1. handle instantiating context manager in non-inlineable functions. Simultaneously fix nested grad mode bug. 2. generalize to general `ContextManager`s 3. generalize to variable arguments passed to context manager, with guards around the variable. --- Actually, if we look at the repro: `74592a43d0/test/dynamo/test_repros.py (L1249)`, we can see that the method in this PR doesn't work for graph breaks in function calls, in particular, in function calls that don't get inlined. Why inlining functions with graph breaks is hard: - When we handle graph breaks, we create a new code object for the remainder of the code. It's hard to imagine doing this when you are inside a function, then we need a frame stack. And we just want to deal with the current frame as a sequence of straight line codes. Why propagating context manager information is hard: - If we do not inline the function, the frame does not contain any information about the parent `block_stack` or `co_consts`. So we cannot store it on local objects like the eval frame. It has to be a global object in the output_graph. --- Anyway, I'm starting to see clearly that dynamo must indeed be optimized for torch use-case. Supporting more general cases tends to run into endless corner-cases and caveats. One direction that I see as viable to handle function calls which have graph breaks and `has_tensor_in_frame` is stick with not inlining them, while installing a global `ContextManagerManager`, similar to the `CleanupManager` (which cleans up global variables). We can know which context managers are active at any given point, so that we can install their setup/teardown code on those functions and their fragments. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94137 Approved by: https://github.com/yanboliang	2023-02-14 14:00:37 +00:00
Yanbo Liang	8ad10eab4d	[Dynamo] Fix bug of calling super from class extended from metaclass (#94547 ) Fixes #94299 Pull Request resolved: https://github.com/pytorch/pytorch/pull/94547 Approved by: https://github.com/jansel	2023-02-11 18:53:17 +00:00
Xuehai Pan	5b1cedacde	[BE] [2/3] Rewrite `super()` calls in functorch and torch (#94588 ) Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied. - #94587 - #94588 - #94592 Also, methods with only a `super()` call are removed: ```diff class MyModule(nn.Module): - def __init__(self): - super().__init__() - def forward(self, ...): ... ``` Some cases that change the semantics should be kept unchanged. E.g.: `f152a79be9/caffe2/python/net_printer.py (L184-L190)` `f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94588 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-10 21:16:33 +00:00
Michael Voznesensky	68b35017a9	Tiny unimplemented improvements (#94150 ) fix names Pull Request resolved: https://github.com/pytorch/pytorch/pull/94150 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-02-08 02:57:29 +00:00
Yanbo Liang	2362b5fca3	[Dynamo] Put torch.cuda.stream into Dynamo FX graph (#93808 ) Fixes #92804 This PR only handles ```torch.cuda.stream```. If this is a right direction, I'll add support for several relevant functions, e.g, ```torch.cuda.current_stream().wait_stream(s)``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/93808 Approved by: https://github.com/jansel	2023-02-05 04:52:43 +00:00
Edward Z. Yang	ca9ebf9e2b	Delete dynamo_import and inductor_import (#93851 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93851 Approved by: https://github.com/albanD, https://github.com/jansel	2023-02-02 01:51:29 +00:00
David Berard	3e6978172e	[dynamo] Handle general tensor attributes with a getattr proxy node (#91840 ) Background: Before this PR, support in dynamo for tensor attributes (e.g. `x.H`, `x.T`, ...) need to be individually implemented one-by-one. This could potentially lead to errors, e.g. if the implementation in [variables/tensor.py](`21c7c7c72f/torch/_dynamo/variables/tensor.py (L160)`) differs from the implementation from a direct call to the attribute. For attributes that were not special-cased in tensor.py, dynamo tracing would fail. This PR adds generic support for tensor attributes that return tensors without needing to specially handle them. (Notably, for x.real and x.imag, which previously weren't supported). In this PR: This directly creates a proxy node for a `"call_function"` node with `target=getattr`, and feeds it into wrap_fx_proxy. This will produce a TensorVariable for the attribute returned. This also removes the implementations for H, T, mH, mT which were broken (previously `torch.relu(x.T)` would fail). They now fall back to this default implementation (for which `torch.relu(x.T)` passes). Further context: * Ed's original suggestion in [90463](https://github.com/pytorch/pytorch/pull/90463#discussion_r1043398340) is to use `torch.Tensor.H.__get__(x)`. I wasn't able to get this to work; fx compilation fails with `getset_descriptor does not have attribute __module__`. Basically, the `__module__` attribute which is available on most python attributes, is not available on `getset_descriptor` objects. (i.e., these are implemented in C++ as attributes on torch.Tensor, so they don't obey some assumptions made by fx) * Although both tensor attributes and methods (like `x.relu()`) both go through this, this PR should only handle attributes (e.g. see the `"getset_descriptor"` in variables/tensor.py). Methods are handled already by by GetAttrVariable. * Prior to this PR, we already returned GetAttrVariables for unsupported attrs: the parent caller would catch the NotImplementedError and fallback to returning a GetAttrVariable. But if this GetAttrVariable was ever passed into a torch.\* function (as it could quite possibly be, since most of these attrs are tensors), it would fail because its proxy node would be missing an [example_value](https://github.com/pytorch/pytorch/blob/master/torch/_dynamo/utils.py#L1017). So: before, for some tensor x, `x.real` would work fine; but `torch.relu(x.real)` would fail. Testing: added tests in test_misc.py for x.real, x.imag, x.T, x.real.T. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91840 Approved by: https://github.com/ezyang	2023-02-01 22:34:03 +00:00
Yanbo Liang	c9ce0e63e8	[Dynamo] Support context wrapping(e.g, torch.no_grad) on nested functions w/o closure (#92922 ) Fixes 14k github models: https://github.com/jansel/pytorch-jit-paritybench/blob/master/generated/test_ELEKTRONN_elektronn3.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/92922 Approved by: https://github.com/jansel, https://github.com/mlazos	2023-01-26 04:23:35 +00:00
Yanbo Liang	2a3954372a	[Dynamo] Make torch.autograd.Function.forward support graph break and no re-compilation (#91295 ) Fixes #91101 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91295 Approved by: https://github.com/jansel, https://github.com/mlazos	2023-01-20 06:25:09 +00:00
Will Constable	8e2e648f84	Propagate sources in VariableBuilder and add SuperSource (#91729 ) Motivation When adding support for default args (#90575), a lot of VariableTrackers missing sources were encountered. Currently, in a lot of cases it seems OK to skip the source for VariableTrackers created (especially during inlining), but that assumption breaks down when inlining functions with default arguments. Summary of changes - propagate the self.source of the VariableBuilder to the new variables being built, which seems like it was an omission previously - Add SuperSource to track usages of super(), so that SuperVariables can support function calls with default args Pull Request resolved: https://github.com/pytorch/pytorch/pull/91729 Approved by: https://github.com/ezyang	2023-01-12 05:04:18 +00:00
Samantha Andow	a7749ae177	[reland] rename DisableTorchFunction to DisableTorchFunctionSubclass (#88218 ) (#89221 ) Summary: First half of #87990. This doesn't change any of the behavior and is just a rename #88218 got reverted for internal breakages. This is the reland of started from internal Differential Revision: D41268423 LaMa Project: L1098534 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89221 Approved by: https://github.com/meliy-meyada, https://github.com/zou3519	2023-01-04 18:32:49 +00:00
Edward Z. Yang	dfe916ca88	Dynamo comptime, with public ComptimeContext API (#90983 ) This PR adds `@comptime`, a decorator that causes a given function to be executed at compile time when Dynamo is symbolically evaluating their program. To query the Dynamo state, we offer a public ComptimeContext API which provides a limited set of APIs for querying Dynamo's internal state. We intend for users to use this API and plan to keep it stable. Here are some things you can do with it: * You want to breakpoint Dynamo compilation when it starts processing a particular line of user code: give comptime a function that calls breakpoint * You want to manually induce a graph break for testing purposes; give comptime a function that calls unimplemented * You want to perform a debug print, but you don't want to induce a graph break; give comptime a function that prints. * You can print what the symbolic locals at a given point in time are. * You can print out the partial graph the Dynamo had traced at this point. * (My original motivating use case.) You want to add some facts to the shape env, so that a guard evaluation on an unbacked SymInt doesn't error with data-dependent. Even if you don't know what the final user API for this should be, with comptime you can hack out something quick and dirty. (This is not in this PR, as it depends on some other in flight PRs.) Check out the tests to see examples of comptime in action. In short, comptime is a very powerful debugging tool that lets you drop into Dynamo from user code, without having to manually jerry-rig pdb inside Dynamo to trigger after N calls. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90983 Approved by: https://github.com/jansel	2022-12-19 11:06:01 +00:00
Bin Bao	93ac8c4aeb	[dynamo] Refactor how autocast parameters are binded (#90953 ) Summary: Use `inspect.signature` for unified args handling Test Plan: `test_dynamo` Differential Revision: D42078621 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90953 Approved by: https://github.com/brad-mengchi	2022-12-16 23:12:49 +00:00
Yanbo Liang	e2674aafed	[Dynamo] Supports calling parent class‘s non classmethod from child class (#90682 ) Fixes https://github.com/pytorch/pytorch/issues/90558 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90682 Approved by: https://github.com/jansel	2022-12-12 22:33:46 +00:00
Michael Voznesensky	11442accc6	Make torch._guards, shuffle structures around for migration (#90636 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90636 Approved by: https://github.com/ezyang	2022-12-11 23:16:07 +00:00
PyTorch MergeBot	15a4c60383	Revert "Make torch._guards, shuffle structures around for migration (#90636 )" This reverts commit `933b6c4eed`. Reverted https://github.com/pytorch/pytorch/pull/90636 on behalf of https://github.com/huydhn due to Breaking lint on master. Please rebase and run lintrunner -a before re-merging the PR	2022-12-11 10:15:47 +00:00
Michael Voznesensky	933b6c4eed	Make torch._guards, shuffle structures around for migration (#90636 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90636 Approved by: https://github.com/ezyang	2022-12-11 06:04:17 +00:00
William Wen	ebeecbf833	Dynamo FX graph stack traceback fix (#87136 ) Migration from https://github.com/pytorch/torchdynamo/pull/1655. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87136 Approved by: https://github.com/voznesenskym	2022-12-06 02:22:16 +00:00
Michael Lazos	40dd03eeaa	[dynamo] Don't copy the graph during checkpointing (copy_graphstate) (#89232 ) copy_graphstate is called a ton, this makes copy_graphstate a lot faster, helps with https://github.com/pytorch/torchdynamo/issues/1803 tag each graph node with a timestamp, when checkpointing store the timestamp, when restoring remove nodes older than the timestamp stored in the state. This essentially has the same behavior as the original impl, just doesn't copy the whole graph. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89232 Approved by: https://github.com/jansel	2022-11-29 07:19:02 +00:00
Yanbo Liang	e4ccec6eca	[Dynamo] Fix bug of using customized torch.autograd.Function (#89397 ) Fixes https://github.com/pytorch/torchdynamo/issues/1899 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89397 Approved by: https://github.com/jansel	2022-11-24 05:28:58 +00:00
Michael Lazos	85a87e635c	[dynamo] mutable local caching to make dynamo faster at tracing mutation (#89170 ) Make mutation faster to speed up tracing optimizers, helps with https://github.com/pytorch/torchdynamo/issues/1803 `replace_all` no longer iterates over the entire variable tracker data structure every time a mutation is performed Each variable tracker internally keeps a set of contained mutable variable trackers, to provide a hint to `replace_all`. This is populated with a call to `apply` from `__post_init__` in the base `VariableTracker` Pull Request resolved: https://github.com/pytorch/pytorch/pull/89170 Approved by: https://github.com/jansel	2022-11-19 01:47:48 +00:00
Yanbo Liang	b72f5b9ae3	[Dynamo] Support typing.Mapping & Support function as argument (#88963 ) These missing features come from https://github.com/pytorch/benchmark/pull/1302, where we'd like to enable E2E hf_bert dynamo train/eval. The dependent [HuggingFace accelerate library](https://huggingface.co/docs/accelerate/index) requires these improvements. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88963 Approved by: https://github.com/jansel	2022-11-17 06:57:42 +00:00
Yanbo Liang	848e7240a1	[Dynamo] Add a dummy profiler to avoid activating real profiler (#88930 ) See context at https://github.com/pytorch/torchdynamo/issues/1721#issuecomment-1312396059 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88930 Approved by: https://github.com/jansel	2022-11-16 19:08:49 +00:00
Michael Voznesensky	06ce1338bc	[dynamo] Port all pytorch/dynamo and test/dynamo pieces over from symbolic-shapes branch (#88768 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88768 Approved by: https://github.com/jansel, https://github.com/ezyang	2022-11-13 04:50:21 +00:00
PyTorch MergeBot	ba4d5aae06	Revert "rename DisableTorchFunction to DisableTorchFunctionSubclass (#88218 )" This reverts commit `7f28be10e5`. Reverted https://github.com/pytorch/pytorch/pull/88218 on behalf of https://github.com/izaitsevfb due to BC-breaking change, D41211901	2022-11-11 19:13:05 +00:00
samdow	7f28be10e5	rename DisableTorchFunction to DisableTorchFunctionSubclass (#88218 ) First half of #87990. This doesn't change any of the behavior and is just a rename Pull Request resolved: https://github.com/pytorch/pytorch/pull/88218 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-11-10 14:51:13 +00:00
Yanbo Liang	bd1ffc6501	[Dynamo] Fix bug: GradMode doesn't carry grad state correctly after graph break (#88537 ) Fixes https://github.com/pytorch/torchdynamo/issues/1446 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88537 Approved by: https://github.com/jansel	2022-11-07 18:03:31 +00:00
PyTorch MergeBot	f3cc588d09	Revert "Dynamo FX graph stack traceback fix (#87136 )" This reverts commit `89e6078bc3`. Reverted https://github.com/pytorch/pytorch/pull/87136 on behalf of https://github.com/clee2000 due to causing a lot of tests to fail on master even though pr is green	2022-10-19 18:57:24 +00:00
William Wen	89e6078bc3	Dynamo FX graph stack traceback fix (#87136 ) Migration from https://github.com/pytorch/torchdynamo/pull/1655. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87136 Approved by: https://github.com/voznesenskym	2022-10-19 17:15:43 +00:00
Jason Ansel	c7c09722ad	Move TorchDynamo into PyTorch core (#86461 ) Context: https://github.com/pytorch/torchdynamo/issues/1588 This PR moves [TorchDynamo](https://github.com/pytorch/torchdynamo) and TorchInductor into PyTorch core. - `torchdynamo` becomes `torch._dynamo` - `torchinductor` becomes `torch._inductor` This PR was generated by running `copy_to_core.sh` in https://github.com/pytorch/torchdynamo/pull/1538 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86461 Approved by: https://github.com/voznesenskym	2022-10-13 23:18:06 +00:00

41 Commits