pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Avik Chaudhuri	ebc7039bcb	New export API with dynamic shape specifications instead of constraints (#108448 ) Our experience using `constraints` / `dynamic_dim` with the existing export API has found it to be (subjectively) clunky and (objectively) verbose in common cases. This PR implements a new design for the export API that replaces the use of `constraints` / `dynamic_dim` with a new way of specifying dynamic shapes, involving the following concepts: * a constructor `Dim` for first-class named dynamic dimensions with ranges (similar to `functorch.dim`, and analogous to internal symbolic sizes) * a mechanism that uses the above in `export` calls to associate inputs to their dynamic shape specifications (`dynamic_shapes`) Design doc: https://docs.google.com/presentation/d/168U7XK72C_WSsZpGESP6Cho9udh193fi0gfjxCNcJ4E/edit#slide=id.p (Meta-only). Note that we only implement Option 1 in that doc. An older version of this PR also implemented Option 3, which is an alternative way of specifying dynamic shapes using tensor type annotations on the exported callable; but we have moved that to future work for now. See docs for these new features in `torch.export`. The existing `torch.export.export` is modified to use the new API, `torch._export.export__RC__`, whenever `constraints=None`. We have not deprecated the existing API yet, but will do in a follow-up. Constraint violation errors arising through use of the new API will now contain suggested fixes using the new API. No longer do we need to report all specializations for static dimensions and suggest all constraints over dynamic dimensions to fix such errors. Instead, due to the redesign, the suggested fixes are much more concise, only involving modifying the definitions of relevant `Dim`s. Differential Revision: [D48919204](https://our.internmc.facebook.com/intern/diff/D48919204/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108448 Approved by: https://github.com/suo, https://github.com/gmagogsfm	2023-09-22 06:58:26 +00:00
Edward Z. Yang	518308a740	Trace through `pytree` API with dynamo. (#108533 ) Fix: #107315 This PR enables dynamo to trace through the `pytree` API by inlining its functions. In order to do so, a few details of `pytree` had to be changed. In summary, this PR: - Introduces `TreeSpecVariable` for representing `TreeSpec` instances - Specializes `<type>.__bases__` call, returning a `TupleVariable` - Enables the call to `id` builtin function for every variable that implements `as_python_constant` method - Specializes `ConstantVariable.call_method` for its (un)flatten functions - Implements `UserDefinedObjectVariable.as_python_constant` - Modifies `pytree` by: - Make `SUPPORTED_NODES` a map of ids (instead of types) to `NodeDef` - Removed `functools.wraps` function, since it can't be inlined Pull Request resolved: https://github.com/pytorch/pytorch/pull/108533 Approved by: https://github.com/ezyang, https://github.com/voznesenskym ghstack dependencies: #109201	2023-09-20 00:04:56 +00:00
Michael Lazos	b1d2028eb0	Add compiled optimizer test for nadam (#109548 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/109548 Approved by: https://github.com/janeyx99	2023-09-19 22:54:36 +00:00
William Wen	b904432e82	[dynamo] preserve some FX node metadata of GraphModules (#107067 ) Requested from @tugsbayasgalan: we want dynamo to preserve some FX node metadata when we trace `GraphModule`s (`nn_module_stack`, `source_fn`, `stack_trace`). This is helpful for the case when we export an aten-level `GraphModule`, add some (possibly non-torch or non-aten) ops, and we want to transform the graph back into an aten-level graph. Without preserving metadata, future passes that look at metadata (e.g. quantization passes) won't work. This feature also has the additional benefit of being able to preserve origin line of code when `print_readable`'ing a `GraphModule`. This is helpful when debugging graphs that have passed through dynamo several times. The added unit test demonstrates the added functionality of this PR. ~This PR is currently a proof-of-concept implementation that shows that preserving node metadata across dynamo is possible.~ This PR preserves node metadata across dynamo by doing the following: - ~inject a counter variable into the `GraphModule` source code, which is incremented every time a node is run~ - Construct a line number -> node index map in `GraphModule` as the source code is being generated. - pass a list of node metadata and the line number map to dynamo's bytecode analyzer - ~dynamo traces the counter as a `ConstantVariable`, so when we create a new proxy, we can determine which original node index this proxy corresponds by looking at the value of the traced counter~ - When we create a new proxy, get the current instruction's line number, and get the node index using the line number map - index into the original node metadata ~using the counter variable's tracked value.~ ~Some things that should be addressed off the top of my head:~ - ~Is this feature even desirable? (Do we really want Dynamo to have special behavior for `GraphModules`? Should we expect users to re-export `GraphModules`?)~ - ~Is there a better approach than to use a counter? We considered using node names, line numbers, and assuming that proxies are created in the same order as the nodes, but each of these 3 have shortcomings. For node names, we only have access to new node names, not the old ones. Using line number is fragile. The third is problematic since not all created nodes go through `create_proxy` (e.g. inputs). We currently generate a line number to node index map when the `GraphModule`'s code is generated.~ - ~What's the best way to send data across the "CPython gap"? That is, it is not obvious how to cleanly pass data from dynamo's `eval_frame.py:_TorchDynamoContext.__call__` to `symbolic_convert.py:InstructionTranslatorBase.__init__`. In this PR, we use a global.~ Differential Revision: [D49257108](https://our.internmc.facebook.com/intern/diff/D49257108) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107067 Approved by: https://github.com/jansel	2023-09-15 23:29:14 +00:00
ydwu4	94a54b89aa	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we could error out when there is side-effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo() # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. Note: More lines are printed for debug log due to newly added context manager and guard adds . Test Plan: Removed original tests that raise on different backend and add a new test to test whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-14 15:49:30 +00:00
zhxchen17	f4e96df60a	[export] Preserve shape dynamism for unused inputs. (#109239 ) Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/109239 Approved by: https://github.com/ydwu4	2023-09-14 07:43:36 +00:00
PyTorch MergeBot	c5e7588613	Revert "[dynamo] preserve some FX node metadata of GraphModules (#107067 )" This reverts commit `1d42148fee`. Reverted https://github.com/pytorch/pytorch/pull/107067 on behalf of https://github.com/DanilBaibak due to Break internal build ([comment](https://github.com/pytorch/pytorch/pull/107067#issuecomment-1717321061))	2023-09-13 09:59:33 +00:00
William Wen	1d42148fee	[dynamo] preserve some FX node metadata of GraphModules (#107067 ) Requested from @tugsbayasgalan: we want dynamo to preserve some FX node metadata when we trace `GraphModule`s (`nn_module_stack`, `source_fn`, `stack_trace`). This is helpful for the case when we export an aten-level `GraphModule`, add some (possibly non-torch or non-aten) ops, and we want to transform the graph back into an aten-level graph. Without preserving metadata, future passes that look at metadata (e.g. quantization passes) won't work. This feature also has the additional benefit of being able to preserve origin line of code when `print_readable`'ing a `GraphModule`. This is helpful when debugging graphs that have passed through dynamo several times. The added unit test demonstrates the added functionality of this PR. ~This PR is currently a proof-of-concept implementation that shows that preserving node metadata across dynamo is possible.~ This PR preserves node metadata across dynamo by doing the following: - ~inject a counter variable into the `GraphModule` source code, which is incremented every time a node is run~ - Construct a line number -> node index map in `GraphModule` as the source code is being generated. - pass a list of node metadata and the line number map to dynamo's bytecode analyzer - ~dynamo traces the counter as a `ConstantVariable`, so when we create a new proxy, we can determine which original node index this proxy corresponds by looking at the value of the traced counter~ - When we create a new proxy, get the current instruction's line number, and get the node index using the line number map - index into the original node metadata ~using the counter variable's tracked value.~ ~Some things that should be addressed off the top of my head:~ - ~Is this feature even desirable? (Do we really want Dynamo to have special behavior for `GraphModules`? Should we expect users to re-export `GraphModules`?)~ - ~Is there a better approach than to use a counter? We considered using node names, line numbers, and assuming that proxies are created in the same order as the nodes, but each of these 3 have shortcomings. For node names, we only have access to new node names, not the old ones. Using line number is fragile. The third is problematic since not all created nodes go through `create_proxy` (e.g. inputs). We currently generate a line number to node index map when the `GraphModule`'s code is generated.~ - ~What's the best way to send data across the "CPython gap"? That is, it is not obvious how to cleanly pass data from dynamo's `eval_frame.py:_TorchDynamoContext.__call__` to `symbolic_convert.py:InstructionTranslatorBase.__init__`. In this PR, we use a global.~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/107067 Approved by: https://github.com/jansel	2023-09-11 17:11:51 +00:00
PyTorch MergeBot	38fcf77a1b	Revert "[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 )" This reverts commit `1a64ec7dd4`. Reverted https://github.com/pytorch/pytorch/pull/107337 on behalf of https://github.com/huydhn due to Sorry for reverting your change but inductor perf smoke test starts to regress after this ([comment](https://github.com/pytorch/pytorch/pull/107337#issuecomment-1710974588))	2023-09-08 02:03:48 +00:00
ydwu4	1a64ec7dd4	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we could error out when there is side-effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo() # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. 2. Then newly added context manager and guard adds more lines for debug log so we change the uppper limit from 50 to 55. Test Plan: Removed original tests that raise on different backend and add a new test to test whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-07 22:45:54 +00:00
youkaichao	b9fc6d7ded	[Dynamo] Update the implementation of _debug_get_cache_entry_list (#108335 ) In https://github.com/pytorch/pytorch/pull/106673 , I created a private API `_debug_get_cache_entry_list` to help pull out cache entries from compiled functions. Recently, I find that @anijain2305 commented in the code that this API should be revisited, and so I created this PR. First, this API cannot be removed even if cache entry becomes a first-class python class`torch._C._dynamo.eval_frame._CacheEntry`. The facts that `extra_index` is static, and `get_extra_state` is inline static, make them not accessible elsewhere. This API `_debug_get_cache_entry_list` is the only way for users to get all the cache entries from code. Second, since the`torch._C._dynamo.eval_frame._CacheEntry` class is a python class, I simplified the C-part code, and remove the necessity of creating a namedtuple for this in the python code. Third, I also add a small improvement, that if the argument is a function, we can automatically pass its `__code__` to the API. The above change will slightly change the output, from list of named tuple to list of `torch._C._dynamo.eval_frame._CacheEntry`. I will update the corresponding docs that use this API. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108335 Approved by: https://github.com/jansel, https://github.com/anijain2305	2023-09-02 16:38:59 +00:00
Will Constable	572bc4817d	Fix how DDPOptimizer clones dynamo callback (#107834 ) Instead of hardcoding a new callback creation using 'convert_frame', add an attribute to both callbacks that implement 'self cloning with new backend', so DDPOptimizer can invoke this in a consistent way. Fixes #107686 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107834 Approved by: https://github.com/ezyang	2023-08-25 17:46:36 +00:00
PyTorch MergeBot	3a3cf0e09d	Revert "[optim] Make casting to match params a hook (#106725 )" This reverts commit `9f86d85172`. Reverted https://github.com/pytorch/pytorch/pull/106725 on behalf of https://github.com/janeyx99 due to We acknowledge this is a huge risk because people do not remember to call super().__init__ from their Optimizer subclasses and so this will break lots of load_state_dict behavior ([comment](https://github.com/pytorch/pytorch/pull/106725#issuecomment-1693386137))	2023-08-25 13:47:19 +00:00
Avik Chaudhuri	bfcd26459c	improved error message for IO mismatch (#107907 ) Previously when we found some input or output mismatch between original args / traced result vs. graph-captured input / output, we would have a pretty sparse error message. (This might be partly due to the urge to reuse the same code for matching both inputs and outputs.) With this PR we now point out which input or output is problematic, what its type is, and also present the expected types along with descriptions of what they mean. We don't suggest any fixes, but the idea is that it should be evident what went wrong looking at the error message. Differential Revision: [D48668059](https://our.internmc.facebook.com/intern/diff/D48668059/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107907 Approved by: https://github.com/gmagogsfm	2023-08-25 06:08:44 +00:00
Avik Chaudhuri	cf76938f70	remove redundant dynamic_dim (#107815 ) Differential Revision: D48618472 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107815 Approved by: https://github.com/tugsbayasgalan, https://github.com/gmagogsfm	2023-08-24 10:46:24 +00:00
gmagogsfm	f8119f8bda	Move `Constraint` class to torch.export() to avoid circular dependency in _dynamo package (#107750 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107750 Approved by: https://github.com/tugsbayasgalan	2023-08-24 03:07:28 +00:00
Jane Xu	9f86d85172	[optim] Make casting to match params a hook (#106725 ) Moves the logic to casting state to match parameters into a hook so that users can choose to enable their hooks before or after the casting has happened. With this, there is a little bit of redundancy of the id_map building and the check that the param groups are still aligned in length. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106725 Approved by: https://github.com/albanD	2023-08-23 22:25:33 +00:00
Jane Xu	874d1b18b0	[BE] reorganize opt disables in dynamo for clarity (#107709 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107709 Approved by: https://github.com/Skylion007, https://github.com/mlazos	2023-08-23 00:17:34 +00:00
Tugsbayasgalan Manlaibaatar	ee72071fc7	Avoid executing side-effectful graph_module as validation step (#107271 ) Dynamo currently runs the real graph module with real inputs as a way to match the return result of graph module with the eager return type. This is unsafe when graph module is side effectful. In the long term, we will get rid of this step. But in the short term, we just fakify the graph module again and run it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107271 Approved by: https://github.com/ezyang	2023-08-22 04:22:31 +00:00
Animesh Jain	e201e3ffa1	[dynamo][eval frame] Make CacheEntry a PyObject (#107405 ) This PR makes CacheEntry a PyObject. This is prep PR for cache size changes. As CacheEntry is a py object, we can now traverse the linked list in Python and write cache size policies. It was possible to do in C, but Python is just easier to iterate upon. We call convert_frame only when we (re)compile, so a small bump in latency going from C to Python is acceptable here. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107405 Approved by: https://github.com/ezyang ghstack dependencies: #106917, #107117	2023-08-21 18:47:53 +00:00
Avik Chaudhuri	95f1591acb	error on bad input to equality constraint (#107311 ) Differential Revision: D48401664 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107311 Approved by: https://github.com/angelayi	2023-08-18 09:01:51 +00:00
Edward Z. Yang	10ce16bebb	Specify if mismatch is input or output in export (#107145 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/107145 Approved by: https://github.com/suo, https://github.com/gmagogsfm	2023-08-15 20:34:25 +00:00
ydwu4	3a300ed84e	[export] refactor and add same_signature flag to dynamo.export (#106569 ) This PR adds a same_signature flag to dynamo.export. Motivation: In https://github.com/pytorch/pytorch/pull/105679, we experimented on using dynamo to inspect the UDFs for cond in eager mode (without torch.compile). This helps us to normalize the inputs (e.g. lifting closure to inputs) and makes higher order operator more robust (e.g. forbid python side effects) and less error-prone in general. We decided to use dynamo.export (instead of torch.compile) to do the inspection (pointed out by @voznesenskym @zou3519): - We'd like a whole-graph capture for the UDF. - We'd like the dynamo inspection to be stateless. Using torch.compile would require resetting dynamo context before and after the inspection because the compile flags may be different from users' torch.compile. This will clear all dynamo cache. - We can still implement some caching based on the guards. However, this requires export to be able to handle the case where it cannot always rewrite signature: e.g. closure lifted as input. This PR makes the rewrite optional. Implementation: We just put all the code that are related to signature rewriting into a function called rewrite_signature and use a same_signature flag to optionally to the transformation. Test Plan: existing tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106569 Approved by: https://github.com/ezyang	2023-08-08 17:16:18 +00:00
youkaichao	bd3b6f1ab4	add a debug api to extract cache entry from code (#106673 ) Per the discussion with @jansel in https://dev-discuss.pytorch.org/t/how-are-guards-installed-on-frames-that-are-transient-objects/1415/7 , guards and compiled code live in `co_extra` field in pycodeobject, which cannot be accessed in a trivial way. This PR tries to add a debug API to extract the data from that field, which can make debugging torchdynamo much easier. The API is intended to be used for debug only, and should have no compatibility issues with the current system. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106673 Approved by: https://github.com/jansel	2023-08-08 16:33:46 +00:00
Edward Z. Yang	91afefb55b	Fix some fake mode confusion between inner/outer fake mode in export (#106515 ) Fixes https://github.com/pytorch/pytorch/issues/106412 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106515 Approved by: https://github.com/voznesenskym, https://github.com/BowenBao, https://github.com/thiagocrepaldi	2023-08-04 15:42:23 +00:00
Edward Z. Yang	697893568d	Improve error message when export encounters non-local input (#106403 ) Previously, you would get an error like ``` Dynamo input and output is a strict subset of traced input/output ``` now you get ``` Cannot export model which references tensors that are neither buffers/parameters/constants nor are direct inputs. For each tensor, if you'd like this tensor to be an explicit input, add it as a dummy argument to the top-level model definition you are exporting; if you would like its value to be embedded as an exported constant, wrap its access in a function marked with @assume_constant_result. G['bulbous_bouffant'], accessed at: File "test_export.py", line N, in f return bulbous_bouffant + y ``` This doesn't handle outputs, I'm going to hit that next. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106403 Approved by: https://github.com/tugsbayasgalan	2023-08-03 12:35:25 +00:00
Thiago Crepaldi	6d2162e644	Remove fake_mode arg from torch._dynamo.export API (#106345 ) #105477 removes the need of explicitly specifying `fake_mode`. The same effect can be achieved by wrapping `torch._dynamo.export` around a `torch._subclasses.FakeTensorMode` context. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106345 Approved by: https://github.com/ezyang	2023-08-01 17:52:06 +00:00
wangxiyuan	4eeda6616c	Correct URL Link for torchDynamo (#105903 ) Correct some error or 404 urls for torchDynamo doc Pull Request resolved: https://github.com/pytorch/pytorch/pull/105903 Approved by: https://github.com/malfet	2023-07-31 20:50:09 +00:00
Edward Z. Yang	1da4115702	Make _dynamo.export return a NamedTuple (#106062 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106062 Approved by: https://github.com/voznesenskym	2023-07-29 06:17:33 +00:00
Edward Z. Yang	7b9d250f06	Change _dynamo.export to be export(f)(args, *kwargs) (#106109 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106109 Approved by: https://github.com/voznesenskym	2023-07-27 21:41:13 +00:00
Edward Z. Yang	edebdaf182	Change _dynamo.explain to be explain(f)(args, *kwargs) (#106066 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106066 Approved by: https://github.com/wanchaol, https://github.com/voznesenskym	2023-07-27 03:21:52 +00:00
Edward Z. Yang	49e047e0f9	Delete dead summarize_dim_constraints (#106053 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106053 Approved by: https://github.com/ydwu4	2023-07-27 03:08:24 +00:00
Edward Z. Yang	6847c965f5	Turn on capture_dynamic_output_shape_ops/capture_scalar_outputs by default for export (#105962 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105962 Approved by: https://github.com/tugsbayasgalan	2023-07-27 01:02:09 +00:00
Edward Z. Yang	3045e84e67	Tweak dynamic=False behavior (#105715 ) Previously, dynamic=False is a no-op, and dynamic=True preemptively turns on dynamic shapes everywhere. Now, dynamic=False disables automatic dynamic, and an unset dynamic defaults to dynamic=None (which uses automatic dynamic.) This seems to be more intuitive per https://github.com/pytorch/pytorch/issues/105634#issuecomment-1644883477 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105715 Approved by: https://github.com/voznesenskym	2023-07-24 16:56:41 +00:00
Thiago Crepaldi	09b5c35911	Support torch.onnx.dynamo_export within FakeTensorMode (#105477 ) Currently, exporting a model to ONNX with fake tensor mode requires the user to load data and model within `torch.onnx.enable_fake_mode` context, but the actual call to `torch.onnx.dynamo_export` is done outside such context. With this PR, we enable `torch.onnx.dynamo_export` to be called either within `torch.onnx.enable_fake_mode` or outside of it. This feature required changes to the core PyTorch Dynamo, which were greatly supported by @ezyang In future steps we will determine which scenario we are going to support, but for now we can use either to explore different options and scenarios and asses their pros and cons. This PR also creates a separate suite of tests for fake mode specific scenarios (`TestFxToOnnxFakeTensorWithOnnxRuntime`). It was done separately to decrease the test time, but we could merge it with the default `TestFxToOnnxWithOnnxRuntime`. The additional parameters are `load_checkpoint_during_init` and `export_within_fake_mode` With the newly added supported of nested export within fake mode, the following scenarios are now supported: ```python import torch with torch.onnx.enable_fake_mode() as fake_context: fake_args = create_args() fake_kwargs = create_kwargs() fake_model = create_model() fake_model.load_state_dict(torch.load(tmp_checkpoint_file.name)) export_options = torch.onnx.ExportOptions(fake_context=fake_context) # `torch.onnx.dynamo_export` called WITHIN `torch.onnx.enable_fake_mode` export_output = torch.onnx.dynamo_export( fake_model, fake_args, *fake_kwargs, export_options=export_options, ) export_output.save("/path/to/model.onnx", model_state_dict=create_model()) ``` If we decide to only support scenarios in which `torch._dynamo.export` is called within `FakeTensorMode`, then we can remove `fake_mode` argument from `torch._dynamo.export` as a follow-up task ps: This PR is mostly Edward's https://github.com/pytorch/pytorch/pull/105468 + unit tests after an offline discussion ps: https://github.com/pytorch/pytorch/issues/105464 tracks pending tasks/limitations from this PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/105477 Approved by: https://github.com/ezyang, https://github.com/BowenBao	2023-07-22 03:50:52 +00:00
Yanbo Liang	4c73016ff2	[Dynamo] Enable torch._dynamo.config.suppress_errors by default (#105307 ) Summary: We are working toward full model compilation, where when compilation error happens, we just fall back to eager mode rather than error out. But at the same time, we should fix these issues if they are bugs. We will: * 1/ log warnings in OSS; * 2/ log warnings and write them into Scuba in fbcode; to prevent us from ignoring these issues. Test Plan: Manual test Differential Revision: D47506314 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105307 Approved by: https://github.com/jansel	2023-07-21 19:17:46 +00:00
Michael Lazos	690ea933ca	Enable more e2e foreach optimizer compilation tests (#105438 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/105438 Approved by: https://github.com/jansel	2023-07-20 02:41:19 +00:00
Justin Chu	8a688277a2	[BE] Enable ruff's UP rules and autoformat dynamo / functorch and refs (#105432 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105432 Approved by: https://github.com/ezyang	2023-07-19 13:48:44 +00:00
willfengg	8010f6bf48	[dynamo][inductor] Provide public API to get compiler options/configs (#105026 ) issues resolved: https://github.com/pytorch/pytorch/issues/101832 context: get torch.compile config for further usage. E.g, the training platform wants to get if model is compiled with cudagraph enabled and trigger further action how it is implemented * the core logic is backend.get_compiler_config() in torch/_dynamo/eval_frame.py * for backend='inductor' / _TorchCompileInductorWrapper, we have inductor-specific implementation in get_compiler_config in torch/_inductor/compile_fx.py and torch/__init__.py how to use it: Below is an example. ``` model = DummyModule() optimized_module = torch.compile( model, options={"triton.cudagraphs": True} ) compiler_config = optimized_module.get_compiler_config() if compiler_config["triton.cudagraphs"]: pass ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105026 Approved by: https://github.com/yanboliang, https://github.com/jansel	2023-07-18 06:12:06 +00:00
Michael Lazos	a63f3f4335	[dynamo] Disable fused adam op compile (#105256 ) Don't compile the hand-fused adam implementation Pull Request resolved: https://github.com/pytorch/pytorch/pull/105256 Approved by: https://github.com/Chillee	2023-07-15 06:13:40 +00:00
PyTorch MergeBot	0285366464	Revert "[dynamo] Maintainable code - Move export impl to a different file (#105071 )" This reverts commit `068f163dd3`. Reverted https://github.com/pytorch/pytorch/pull/105071 on behalf of https://github.com/clee2000 due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/105071#issuecomment-1636654945))	2023-07-15 04:18:07 +00:00
Animesh Jain	068f163dd3	[dynamo] Maintainable code - Move export impl to a different file (#105071 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105071 Approved by: https://github.com/voznesenskym	2023-07-14 09:28:33 +00:00
Animesh Jain	735e6ae801	[dynamo] Maintainable code - Move decorators in a separate file (#105070 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105070 Approved by: https://github.com/ezyang	2023-07-13 07:41:19 +00:00
Animesh Jain	8c191d8eef	[dynamo][ac] Reland #104397 - Remove disable monkeypatching of utils.checkpoint (#104665 ) NO CHANGE from before. The ancestor diff was reverted, so this diff got reverted as well. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104665 Approved by: https://github.com/wconstab	2023-07-06 00:48:02 +00:00
Animesh Jain	0444f9f85b	[dynamo] Reland #104317 - Lazy disable_dynamo API out-of-dynamo (#104664 ) Internal failed because of torch.deploy issues with disable_dynamo in fx/* and _jit/* files. Removing disable_dynamo for both. Added a comment in the code. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104664 Approved by: https://github.com/wconstab	2023-07-06 00:48:02 +00:00
Michael Lazos	a290cbf32b	Enable fused foreach Adam compilation (#104121 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/104121 Approved by: https://github.com/janeyx99	2023-07-05 23:40:03 +00:00
PyTorch MergeBot	54e320d4d1	Revert "[dynamo] Lazy disable_dynamo API out-of-dynamo (#104317 )" This reverts commit `5c12a810ac`. Reverted https://github.com/pytorch/pytorch/pull/104317 on behalf of https://github.com/huydhn due to This has been reverted internally by D47166892, so I need to also revert it on OSS to keep them in sync ([comment](https://github.com/pytorch/pytorch/pull/104317#issuecomment-1621099151))	2023-07-05 06:21:48 +00:00
PyTorch MergeBot	40f53912cf	Revert "[dynamo][ac] Remove disable monkeypatching of utils.checkpoint (#104397 )" This reverts commit `537a6c0651`. Reverted https://github.com/pytorch/pytorch/pull/104397 on behalf of https://github.com/huydhn due to This has been reverted internally by D47216591, so I need to also revert it on OSS to keep them in sync ([comment](https://github.com/pytorch/pytorch/pull/104397#issuecomment-1621086360))	2023-07-05 06:11:08 +00:00
Animesh Jain	537a6c0651	[dynamo][ac] Remove disable monkeypatching of utils.checkpoint (#104397 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104397 Approved by: https://github.com/wconstab	2023-06-30 02:27:06 +00:00
Animesh Jain	5c12a810ac	[dynamo] Lazy disable_dynamo API out-of-dynamo (#104317 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104317 Approved by: https://github.com/jansel, https://github.com/wconstab, https://github.com/mlazos	2023-06-29 13:30:17 +00:00

1 2 3 4 5

219 Commits