pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Edward Z. Yang	384d3ec2b6	Extra CR comments from #95621 (#96043 ) Specifically: `063e441471 (r1120306196)` https://github.com/pytorch/pytorch/pull/95621#discussion_r1125015510 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/96043 Approved by: https://github.com/Chillee, https://github.com/albanD	2023-03-10 01:10:48 +00:00
Horace He	5bbec680d7	Fix usages of contextmanager without finally (#96170 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96170 Approved by: https://github.com/ngimel, https://github.com/malfet	2023-03-08 20:59:27 +00:00
Edward Z. Yang	d303665d33	Make int unspecialization actually work (#95621 ) OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor. The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from https://github.com/pytorch/pytorch/pull/95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes https://github.com/pytorch/pytorch/issues/95469 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/95621 Approved by: https://github.com/jansel, https://github.com/Chillee	2023-03-04 01:22:08 +00:00
Michael Voznesensky	34a7c79eac	Rename func (#95639 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/95639 Approved by: https://github.com/ezyang	2023-03-01 23:03:09 +00:00
Edward Z. Yang	835122c89f	Add missing f-string specifiers (#95707 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/95707 Approved by: https://github.com/Skylion007, https://github.com/albanD	2023-02-28 20:20:05 +00:00
Kazuaki Ishizaki	46385b3e48	Fix typos under torch/_dynamo directory (#95599 ) This PR fixes typos in comments and messages of `.py` files under `torch/_dynamo` directory Pull Request resolved: https://github.com/pytorch/pytorch/pull/95599 Approved by: https://github.com/ezyang	2023-02-28 03:44:24 +00:00
Michael Voznesensky	eff5ae8746	Better mark_dynamic assertions (#95566 ) This PR allows us to reuse the static per tensor decision making we make at fake tensorification time. We can use this to avoid setting up dynamic dim guards later if the tensor was never a candidate. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95566 Approved by: https://github.com/ezyang	2023-02-28 00:02:22 +00:00
David Berard	a4085ab837	[dynamo] support custom __getattr__ on torch.nn.Modules (#94658 ) Summary: torch.nn.Module implementations previously did not support custom implementations of `__getattr__`; if a torch.nn.Module subclass implemented `__getattr__` and we tried to access an attribute that was expected to be present in `__getattr__`, dynamo would not check `__getattr__` and would error out with an AttributeError. This PR copies the functionality from UserDefinedObjectVariable into torch.nn.Module so that it also supports `__getattr__` Example of a module which previously would fail: ```python class MyMod(torch.nn.Module): def __init__(self): super().__init__() self.custom_dict = {"queue": [torch.rand((2, 2)) for _ in range(3)]} self.other_attr = torch.rand((2, 2)) def __getattr__(self, name): custom_dict = self.custom_dict if name in custom_dict: return custom_dict[name] return super().__getattr__(name) def forward(self, x): return x @ self.other_attr + self.queue[-1] ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94658 Approved by: https://github.com/yanboliang, https://github.com/jansel	2023-02-16 04:00:51 +00:00
jon-chuang	d1d5d16df3	dynamo: handle straight-line graph breaks for autocast context manager with constant args (#94137 ) Fixes https://github.com/pytorch/pytorch/issues/93890 We do the following: 1. fix __init__constructor for `AutocastModeVariable` with exisiting `mode` while copying 2. `resume_execution` is made aware of constant args (`target_values`), by storing said args in `ReenterWith`. To propagate between subgraphs (in straightline code), we also store the constant args in the downstream's `code_options["co_consts"]` if not already. --- Future work: 1. handle instantiating context manager in non-inlineable functions. Simultaneously fix nested grad mode bug. 2. generalize to general `ContextManager`s 3. generalize to variable arguments passed to context manager, with guards around the variable. --- Actually, if we look at the repro: `74592a43d0/test/dynamo/test_repros.py (L1249)`, we can see that the method in this PR doesn't work for graph breaks in function calls, in particular, in function calls that don't get inlined. Why inlining functions with graph breaks is hard: - When we handle graph breaks, we create a new code object for the remainder of the code. It's hard to imagine doing this when you are inside a function, then we need a frame stack. And we just want to deal with the current frame as a sequence of straight line codes. Why propagating context manager information is hard: - If we do not inline the function, the frame does not contain any information about the parent `block_stack` or `co_consts`. So we cannot store it on local objects like the eval frame. It has to be a global object in the output_graph. --- Anyway, I'm starting to see clearly that dynamo must indeed be optimized for torch use-case. Supporting more general cases tends to run into endless corner-cases and caveats. One direction that I see as viable to handle function calls which have graph breaks and `has_tensor_in_frame` is stick with not inlining them, while installing a global `ContextManagerManager`, similar to the `CleanupManager` (which cleans up global variables). We can know which context managers are active at any given point, so that we can install their setup/teardown code on those functions and their fragments. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94137 Approved by: https://github.com/yanboliang	2023-02-14 14:00:37 +00:00
Aaron Gokaslan	67d9790985	[BE] Apply almost all remaining flake8-comprehension checks (#94676 ) Applies the remaining flake8-comprehension fixes and checks. This changes replace all remaining unnecessary generator expressions with list/dict/set comprehensions which are more succinct, performant, and better supported by our torch.jit compiler. It also removes useless generators such as 'set(a for a in b)`, resolving it into just the set call. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94676 Approved by: https://github.com/ezyang	2023-02-12 01:01:25 +00:00
Aaron Gokaslan	3d82d8d0ed	[BE] Enable more flake8-comprehensions checks (#94601 ) I applied some flake8 fixes and enabled checking for them in the linter. I also enabled some checks for my previous comprehensions PR. This is a follow up to #94323 where I enable the flake8 checkers for the fixes I made and fix a few more of them. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94601 Approved by: https://github.com/ezyang	2023-02-10 23:40:29 +00:00
Xiaodong Wang	88e16849db	[pt2] Fix multiple races in log folder (#93407 ) Summary: There are a few races/permission errors in file creation, fixing OSS: 1. caffe2/torch/_dynamo/utils.py, get_debug_dir: multiple process may conflict on it even it's using us. Adding pid to it 2. caffe2/torch/_dynamo/config.py: may not be a right assumption that we have permission to cwd Test Plan: sandcastle Differential Revision: D42905908 Pull Request resolved: https://github.com/pytorch/pytorch/pull/93407 Approved by: https://github.com/soumith, https://github.com/mlazos	2023-02-09 21:10:14 +00:00
Aaron Gokaslan	8fce9a09cd	[BE]: pyupgrade Python to 3.8 - imports and object inheritance only (#94308 ) Apply parts of pyupgrade to torch (starting with the safest changes). This PR only does two things: removes the need to inherit from object and removes unused future imports. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94308 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-07 21:10:56 +00:00
Jason Ansel	ee2729890c	Refactor dynamo register_backend/BACKENDS (#93389 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93389 Approved by: https://github.com/voznesenskym	2023-02-02 19:41:48 +00:00
Edward Z. Yang	ca9ebf9e2b	Delete dynamo_import and inductor_import (#93851 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93851 Approved by: https://github.com/albanD, https://github.com/jansel	2023-02-02 01:51:29 +00:00
Edward Z. Yang	902b4dba75	Change capture_scalar_outputs to use SymInt/SymFloat rather than Tensor to model scalars (#93150 ) Previously, Dynamo faked support for item() when `capture_scalar_outputs` was True by representing it internally as a Tensor. With dynamic shapes, this is no longer necessary; we can represent it directly as a SymInt/SymFloat. Do so. Doing this requires you to use dynamic shapes; in principle we could support scalar outputs WITHOUT dynamic shapes but I won't do this unless someone hollers for it. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Differential Revision: [D42885775](https://our.internmc.facebook.com/intern/diff/D42885775) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93150 Approved by: https://github.com/voznesenskym	2023-01-31 21:23:23 +00:00
Edward Z. Yang	e5235fb62c	Convert GuardOnDataDependentSymNode into graph break (#93373 ) Extracted from https://github.com/pytorch/pytorch/pull/93150 because I need it earlier in trunk. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93373 Approved by: https://github.com/Skylion007	2023-01-31 19:31:44 +00:00
Yanbo Liang	304d8dd6c8	[Dynamo] Support enum.Enum type as dict key (#93026 ) Fixes Meta internal user case of using ```enum.Enum``` type as dict key, pleaser refer the added test case for details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/93026 Approved by: https://github.com/mlazos	2023-01-29 06:37:10 +00:00
David Berard	58acab4616	[dynamo] support [tensor].type(torch.FloatTensor) (#93043 ) for some tensor x, x.type(torch.FloatTensor) will essentially do the same thing as x.to(torch.float). x.type can be called with at least 3 types of inputs: * a string "torch.FloatTensor" * a dtype torch.float * a tensor type torch.FloatTensor the third option (torch.FloatTensor) fails in fx, because fx cannot trace torch.FloatTensor objects. So this PR will replace the torch.FloatTensor type with a string "torch.FloatTensor" Why not fix this in fx? Well, it's possible, but I'm not sure a nice way to do it. We would want to update [torch.fx.node.BaseArgumentTypes](`d88bc38b0c/torch/fx/node.py (L17)`) to contain torch.FloatTensor etc. We could hard-code a list of tensor types there (the types vary depending on build type, e.g. whether or not cuda tensors are available), but that's not great in case our hardcoded list differs from the actual list registered by python_tensor.cpp. Another option is to dynamically populate the list of types with `Union[tuple(...)])`, and fill the tuple with `torch._tensor_classes` (which is directly populated by python_tensor.cpp), but apparently this breaks most typecheckers. Pull Request resolved: https://github.com/pytorch/pytorch/pull/93043 Approved by: https://github.com/jansel	2023-01-27 21:27:13 +00:00
Michael Voznesensky	d322f82b05	Add @count util to torch, use it to track benchmark stats (#93013 ) <img width="1333" alt="image" src="https://user-images.githubusercontent.com/4755252/214687911-f766f072-c162-4298-9aed-c889f1375336.png"> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93013 Approved by: https://github.com/ezyang	2023-01-26 03:09:12 +00:00
Michael Voznesensky	5778c04a15	Add `--timing` flag, phase timing to @dynamo_timed (#92637 ) Ex output: ``` TIMING: entire_frame_compile:8.574629999999999 backend_compile:5.26806 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/92637 Approved by: https://github.com/ezyang	2023-01-21 10:52:13 +00:00
PyTorch MergeBot	44132cc4b0	Revert "Add `--timing` flag, phase timing to @dynamo_timed (#92637 )" This reverts commit `773b513435`. Reverted https://github.com/pytorch/pytorch/pull/92637 on behalf of https://github.com/malfet due to Broke lint	2023-01-20 16:23:20 +00:00
Edward Z. Yang	387357539f	Log accuracy failure in more cases (#92645 ) Fixes https://github.com/pytorch/torchdynamo/issues/1910 But not durably, it's easy to forget if you add more cases. I'd like someone else to do that refactor. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/92645 Approved by: https://github.com/Chillee	2023-01-20 15:23:35 +00:00
Michael Voznesensky	773b513435	Add `--timing` flag, phase timing to @dynamo_timed (#92637 ) Ex output: ``` TIMING: entire_frame_compile:8.574629999999999 backend_compile:5.26806 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/92637 Approved by: https://github.com/ezyang	2023-01-20 05:01:21 +00:00
Michael Lazos	cac217c80a	Fix key error formatting and move exc code to exc.py (#92593 ) Fixes https://github.com/pytorch/torchdynamo/issues/1953 and moves exception formatting code from convert_frame.py to exc.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/92593 Approved by: https://github.com/ezyang	2023-01-19 02:54:00 +00:00
lezcano	77b8aa6e43	Wrap a few more functions to ease their tracking during debugging (#92004 ) Yup Pull Request resolved: https://github.com/pytorch/pytorch/pull/92004 Approved by: https://github.com/ezyang	2023-01-17 16:53:36 +00:00
Wu, Chunyuan	a111dd9014	[dynamo] support comparing numpy ndarray (#91870 ) The output of Torchbench model `doctr_det_predictor` on CPU is a `numpy ndarray`. When running the accuracy benchmark of this model, the below error is raised: `RuntimeError: unsupported type: ndarray`. Repro CMD: ```bash python benchmarks/dynamo/torchbench.py --accuracy --float32 -dcpu -n50 --inductor --no-skip --dashboard --only doctr_det_predictor --batch_size 1 --threads 1 ``` This PR adds the support to compare `numpy ndarray` in the dynamo utils. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91870 Approved by: https://github.com/jgong5, https://github.com/Chillee	2023-01-13 12:11:49 +00:00
Andrew M. James	7cd951c21e	Properly guard all numpy usage within dynamo and remove UnspecializedNumpyVariable (#90795 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90795 Approved by: https://github.com/ngimel, https://github.com/cpuhrsch	2023-01-06 22:36:38 +00:00
Tugsbayasgalan Manlaibaatar	d4713b4c7d	[dynamo] Fix bug in tensor.item fake tensor propogation (#91668 ) When we run the node with fake value for tensor.item, it would previously error because the utility method doesn't know how to handle placeholder node. The tensor we are calling item can be input from user will be placeholder in the graph. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91668 Approved by: https://github.com/voznesenskym	2023-01-04 19:51:19 +00:00
Mengchi Zhang	d1123c94a7	[pytorch] Update troubleshooting_url (#91298 ) Summary: Update new troubleshooting_url. Old one does not exist. Test Plan: None Differential Revision: D42205626 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91298 Approved by: https://github.com/jianyuh	2022-12-22 21:29:54 +00:00
Bin Bao	8992eec781	[inductor] Update how REQUIRE_HIGHER_TOLERANCE is handled (#91227 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91227 Approved by: https://github.com/kit1980	2022-12-21 05:43:39 +00:00
PyTorch MergeBot	94262efc7d	Revert "[inductor] Rewrite Triton templates + epilogue fusion (retry) (#91105 )" This reverts commit `d6dd2e97da`. Reverted https://github.com/pytorch/pytorch/pull/91105 on behalf of https://github.com/atalman due to Broke internal builds	2022-12-21 00:02:38 +00:00
lezcano	e6fcf7ad9d	Remove breakpoint (#91128 ) This was left in https://github.com/pytorch/pytorch/pull/90026 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91128 Approved by: https://github.com/kit1980	2022-12-20 22:14:35 +00:00
Jason Ansel	d6dd2e97da	[inductor] Rewrite Triton templates + epilogue fusion (retry) (#91105 ) https://github.com/pytorch/pytorch/pull/90738 seems a bit borked. ghimport fails on it, and I unlinked it from the Phabricator diff, but it still won't land. This is an exact copy that PR without using ghstack. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91105 Approved by: https://github.com/ngimel	2022-12-20 02:38:23 +00:00
Edward Z. Yang	bbea58d500	Stop using GraphArgs for shape env guard source tracking (#90911 ) GraphArgs worked fairly well, but it was still missing sources sometimes. Now, we maintain an auxiliary data structure which we MUST populate whenever we fakeify a tensor / allocate a bare SymInt. This should guarantee once and for all that every symbol is available. Should fix swin_base_patch4_window7_224. While I was at it, I moved fakeification utility back to builder as it was only used at once call site. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90911 Approved by: https://github.com/voznesenskym	2022-12-16 05:22:56 +00:00
Edward Z. Yang	45109ec30a	Completely redo how ShapeEnv guards are generated (#90528 ) Instead of inferring shape mappings from a bunch of data structures that were plumbed in InstructionTranslator, we instead work out mappings by just iterating over the GraphArgs and mapping symbols to arguments as they show up. If multiple argument sizes/strides/offset map to the same symbol, this means they are duck sized, so we also generate extra equality tests that they must be equal. Finally, we generate 0/1 specialization guards. The resulting code is much shorter, and I think also easier to understand. TODO: Delete all the tensor ref tracking code, it's unnecessary Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90528 Approved by: https://github.com/voznesenskym	2022-12-10 13:35:04 +00:00
Edward Z. Yang	b68dead20c	Keep track of source name on all allocated SymInts (#90295 ) Wow, I had to sweat so much to get this PR out lol. This PR enforces the invariant that whenever we allocate SymInts as part of fakeification, the SymInt is associated with a Source, and in fact we store the string source name on SymbolWithSourceName. We use 'sname' as the shorthand for source name, as 'name' is already used by sympy to name symbols. In order to store source names, we have to plumb source names from Dynamo to PyTorch. This made doing this PR a bit bone crushing, because there are many points in the Dynamo codebase where we are improperly converting intermediate tensors into fake tensors, where there is no source (and there cannot be, because it's a frickin' intermediate tensor). I've fixed all of the really awful cases in earlier PRs in the stack. This PR is just plumbing in source names from places where we do have it. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90295 Approved by: https://github.com/voznesenskym	2022-12-10 13:17:34 +00:00
Bin Bao	f7cdd3a7a0	[inductor] Use a large tolerance for botnet26t_256 (#90383 ) Summary: botnet26t_256 shows random tolerance failure on CI. The root cause of this randomness is still to-be-invesitgated, but let's use a larger tolerance for now. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90383 Approved by: https://github.com/ezyang	2022-12-07 19:35:06 +00:00
Ram Rachum	351d73b97f	Fix exception causes all over the codebase (#90271 ) This is the continuation to #90134 and hopefully the final PR in this series. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90271 Approved by: https://github.com/kit1980	2022-12-07 04:29:00 +00:00
Edward Z. Yang	3d4b92b171	Ensure that we fakeify tensor subclasses when they are initially tracked (#90009 ) The old code didn't actually fakeify traceable tensor subclasses at the time they are added as a GraphArg to the module; now we do, by ignoring the subclass during fakeification and relying on Dynamo to simulate the subclass on top. See comments for more details. BTW, this codepath is super broken, see filed issues linked on the inside. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90009 Approved by: https://github.com/wconstab, https://github.com/voznesenskym	2022-12-06 22:36:32 +00:00
Michael Voznesensky	3b9a386d48	Add `TORCH_FAKE_TENSOR_DEBUG` use it to enable storage of traces on fake tensors at init time (#90215 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90215 Approved by: https://github.com/ezyang	2022-12-06 22:28:52 +00:00
Michael Voznesensky	41c3b41b92	Use dynamo fake tensor mode in aot_autograd, move aot_autograd compilation to lowering time [Merger of 89672 and 89773] (#90039 ) After all of the preparatory commits, this is a subset of the changes in https://github.com/pytorch/pytorch/pull/89392 that actually change us to propagating fake tensors to backends. Signed-off-by: Edward Z. Yang <ezyangfb.com> This is the merger of Ed's PR #89672, which is a rewrite of an older PR of mine (#89392), with CI Fixes on top of it (#89773) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90039 Approved by: https://github.com/ezyang	2022-12-05 01:56:50 +00:00
PyTorch MergeBot	4648baa911	Revert "Use dynamo fake tensor mode in aot_autograd, move aot_autograd compilation to lowering time [Merger of 89672 and 89773] (#90039 )" This reverts commit `ef0c7ec958`. Reverted https://github.com/pytorch/pytorch/pull/90039 on behalf of https://github.com/clee2000 due to broke xla tests `ef0c7ec958` https://github.com/pytorch/pytorch/actions/runs/3606308473/jobs/6077646142	2022-12-04 21:57:30 +00:00
Michael Voznesensky	ef0c7ec958	Use dynamo fake tensor mode in aot_autograd, move aot_autograd compilation to lowering time [Merger of 89672 and 89773] (#90039 ) After all of the preparatory commits, this is a subset of the changes in https://github.com/pytorch/pytorch/pull/89392 that actually change us to propagating fake tensors to backends. Signed-off-by: Edward Z. Yang <ezyangfb.com> This is the merger of Ed's PR #89672, which is a rewrite of an older PR of mine (#89392), with CI Fixes on top of it (#89773) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90039 Approved by: https://github.com/ezyang	2022-12-03 01:19:55 +00:00
Animesh Jain	3162a48a77	[dynamo][benchmarks] Call zero grad (#90026 ) Hoping that it might reduce some flakiness Pull Request resolved: https://github.com/pytorch/pytorch/pull/90026 Approved by: https://github.com/williamwen42	2022-12-02 04:05:57 +00:00
Michael Voznesensky	b5616cd5f4	Add simple assert to detect fake tensors on modules (#89723 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89723 Approved by: https://github.com/ezyang	2022-11-28 08:57:33 +00:00
Edward Z. Yang	6904324781	Remove fake_tensor_propagation (#89646 ) You always have to run dynamo with fake tensors. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89646 Approved by: https://github.com/soumith	2022-11-25 03:27:32 +00:00
Edward Z. Yang	94a88b53ed	Remove fake_tensors_available (#89637 ) As we are one repo now, they are always available. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89637 Approved by: https://github.com/anjali411	2022-11-24 19:28:10 +00:00
Shunting Zhang	e545caa50f	dynamo/torchxla integration: trace on xla rather than eager (#88904 ) In #87741 we added the inference support for dynamo/torchxla integration. Later on in #88449 we attempt to add the training support. That attempt is not smooth because - we try 2 things together 1. let dynamo trace the model on xla rather than eager 2. enable training - It turns out neither of these two tasks are trivial enough. Furthermore, item 2 (enable training) depends on item 1 (tracing on xla). We enable training via AOTAutograd. AOTAutograd lift all model parameters/buffers as graph inputs. Without item 1 being done, we would need copy all graph inputs (including model parameters/buffers) from eager device to xla devices. That hurts performance a lot. Have a cache to map eager parameter to XLA parameter does not solve the problem since the update on either will not sync automatically to the other. They will easily go out of sync. This PR let dynamo trace the model on XLA rather than eager. This is a preparation step to enabling training. Also, tracing on XLA makes the data movement more efficient. We see 1.5x geomean speedup compared to previous 1.38x. ``` +-------------------------+--------------------+-------------------------+ \| Model \| XLA (trace once) \| XLA (trace everytime) \| +=========================+====================+=========================+ \| resnet18 \| 1.38 \| 1.008 \| +-------------------------+--------------------+-------------------------+ \| resnet50 \| 1.227 \| 0.998 \| +-------------------------+--------------------+-------------------------+ \| resnext50_32x4d \| 1.544 \| 1.008 \| +-------------------------+--------------------+-------------------------+ \| alexnet \| 1.085 \| 1.045 \| +-------------------------+--------------------+-------------------------+ \| mobilenet_v2 \| 2.028 \| 1.013 \| +-------------------------+--------------------+-------------------------+ \| mnasnet1_0 \| 1.516 \| 0.995 \| +-------------------------+--------------------+-------------------------+ \| squeezenet1_1 \| 0.868 \| 1.01 \| +-------------------------+--------------------+-------------------------+ \| vgg16 \| 1.099 \| 1.008 \| +-------------------------+--------------------+-------------------------+ \| BERT_pytorch \| 3.26 \| 1.027 \| +-------------------------+--------------------+-------------------------+ \| timm_vision_transformer \| 2.182 \| 1.015 \| +-------------------------+--------------------+-------------------------+ \| geomean \| 1.50389 \| 1.01261 \| +-------------------------+--------------------+-------------------------+ ``` Example command ``` GPU_NUM_DEVICES=1 python benchmarks/dynamo/torchbench.py --randomize-input --performance --trace-on-xla --only resnet18 --backend=torchxla_trace_once ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/88904 Approved by: https://github.com/wconstab, https://github.com/JackCaoG, https://github.com/jansel	2022-11-22 03:57:04 +00:00
Animesh Jain	82713a1cc4	[inductor][compilation time] Fallback when kernel size for avg/max pool is large (#89448 ) This fixes compilation time for yolov3 from 400 seconds to 48 seconds. yolov3 has a 13x13 max_pool2d kernel, which was creating really large Triton code. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89448 Approved by: https://github.com/ngimel	2022-11-22 02:23:24 +00:00

1 2

61 Commits