pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
PyTorch MergeBot	25a0fa6d13	Revert "[dynamo] Improve support for backwards hooks (#119525 )" This reverts commit `b1f4b2a63c`. Reverted https://github.com/pytorch/pytorch/pull/119525 on behalf of https://github.com/clee2000 due to broke test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_gets_cleaned_up on dynamo https://github.com/pytorch/pytorch/actions/runs/7847212828/job/21416215820 `b1f4b2a63c`. The failure exists on the PR as well, but got masked by the other test. Putting this as no signal? ([comment](https://github.com/pytorch/pytorch/pull/119525#issuecomment-1936447169))	2024-02-09 18:58:55 +00:00
Jason Ansel	b1f4b2a63c	[dynamo] Improve support for backwards hooks (#119525 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/119525 Approved by: https://github.com/yanboliang	2024-02-09 17:02:40 +00:00
Yanbo Liang	0f478d9d61	[Dynamo][15/N] Merge allow_in_graph/inline/skip trace rules check into trace_rule.lookup (#118971 ) Finally we have this PR to merge allow_in_graph/inline/skip trace rules into ```trace_rules.lookup_inner```, where we can define and lookup trace rules at both function level and file level. Going forward, this is the central place that we define and consulte Dynamo trace rule for any function. * ```trace_rules.looup``` is the API can return allow_in_graph, inline or skip. * ```skipfiles.check``` is the API can return inline or skip, since we have multiple places that only do inline/skip check. * I'll move ```skipfiles.check``` to ```trace_rules.check``` as one of the follow-ups. * Both functions consulte ```trace_rules.lookup_inner``` to get the tracing rule. To avoid a single big PR, I left a few items as the follow-ups: * Remove ```skipfiles.py``` and merge the code into ```trace_rules.py```. * We do double check in ```symbolic_convert.check_inlineable```, will refactor and simplify it. We should only do inline/skip check before generating ```SkipFilesVariable``` and ```UserFunctionVariable```. * Rename ```SkipFilesVariable``` as ```SkipFunctionVariable```, since we only handle functions. * The inline/skip reasons are not logged for some cases, since the new lookup framework doesn't always return inline/skip reasons. I'll refactor loggings to record the inline/skip reason in next step. Pull Request resolved: https://github.com/pytorch/pytorch/pull/118971 Approved by: https://github.com/jansel	2024-02-07 05:15:39 +00:00
laith sakka	c814d8e5c2	Fix handling random() calls encountered inside inlined code. (#119218 ) Fix https://github.com/pytorch/pytorch/issues/118787 In the compiled function, calls to random() are replaced with a single function call to a function that generates all the random variables . The random calls encountered during compilation used to be tracked inside a variable stored inside the instruction translator. And when there are nested translators, the tracked calls used to get lost when the inner instructions translator popped out. This diff fixes that by moving the tracked calla to the output graph which is shared across translators that are generating the same function. More details about the issue and why this solution is picked are in the github issue above. Pull Request resolved: https://github.com/pytorch/pytorch/pull/119218 Approved by: https://github.com/jansel, https://github.com/anijain2305	2024-02-06 23:48:21 +00:00
Jason Ansel	62cc1053d8	[dynamo] Fix missing guards in FunctoolsPartialVariable (#118616 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/118616 Approved by: https://github.com/yanboliang ghstack dependencies: #118901	2024-02-06 23:42:43 +00:00
lezcano	65efbf078c	Optimize dict keys guard when all the keys are constant (#118855 ) We also rename ODICT_KEYS and make it use a list rather than a string. Split from https://github.com/pytorch/pytorch/pull/118630. Pull Request resolved: https://github.com/pytorch/pytorch/pull/118855 Approved by: https://github.com/peterbell10 ghstack dependencies: #117982, #118098, #117983, #117625, #118194, #118003, #118208, #118199, #118535	2024-02-02 14:42:56 +00:00
Edward Z. Yang	d03173e88c	Unify MYPYINDUCTOR and MYPY (#118432 ) The original motivation for MYPYINDUCTOR was a faster type checking configuration that only checked a subset of files. With the removal of `follow_imports = ignore`, we are now able to use dmypy to do fast incremental typechecking, eliminating the need for this. Perhaps erroneously, when I tee'ed up this PR I elected to delete the `follow_imports = skip` designations in the mypy-inductor.ini. This lead to a number of extra type error suppressions that I manually edited. You will need to review. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118432 Approved by: https://github.com/Skylion007 ghstack dependencies: #118414, #118418	2024-01-27 17:23:20 +00:00
Yanbo Liang	dba160e676	[13/N][Dynamo] Refactor torch ctx manager classes check out of trace_rules.lookup (#118130 ) I'm going to merge inline/skip/allow_in_graph check into ```trace_rules.lookup```, so it's better to make it only handle function types. Pull Request resolved: https://github.com/pytorch/pytorch/pull/118130 Approved by: https://github.com/williamwen42	2024-01-24 22:33:41 +00:00
Oguz Ulgen	9ebaa27922	Fix types.MethodDescriptorType related bug in dynamo (#118041 ) Methods that were `types.MethodDescriptorType` were failing because the `tensor.method()` to `method(tensor)` conversion was dropping the tensor and just calling `method`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/118041 Approved by: https://github.com/yanboliang ghstack dependencies: #118000	2024-01-23 16:11:38 +00:00
voznesenskym	fed45aee54	Replace invoking self.value if there is a user defined init, avoiding arbitrary code execution (#117818 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117818 Approved by: https://github.com/ezyang	2024-01-23 03:14:58 +00:00
voznesenskym	83e8a0721d	Reland #111196 (take 4) "Support tensors as Dict keys" (#116934 ) Fixes #ISSUE_NUMBER See that PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/116934 Approved by: https://github.com/ezyang, https://github.com/huydhn	2024-01-07 01:37:26 +00:00
PyTorch MergeBot	2dca3e99eb	Revert "Support tensors as Dict keys Re-PR of #111196 (#116785 )" This reverts commit `1badad9ce9`. Reverted https://github.com/pytorch/pytorch/pull/116785 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/116785#issuecomment-1879592261))	2024-01-06 08:22:33 +00:00
voznesenskym	1badad9ce9	Support tensors as Dict keys Re-PR of #111196 (#116785 ) This prepares the PR where we implement sets in terms of dicts. To do so, rather than storing internally a dictionary that maps literals to VariableTrackers, it stores (pretty much) a dictionary from VTs to VTs. To do so, keys are wrapped in an opaque internal class _Hashable. The Hashable class is opaque on purpose so that it fails hard if if it inadvertently leaks back into user code. We also found and fixed a number of latent bugs and inconsistencies in the way dynamo checked what can be a dict key. More generally, we make much clearer what are the things that need to be modified to add a new supported key type to Dicts. Fixes [#107595](https://www.internalfb.com/tasks?t=107595) Fixes [#111603](https://www.internalfb.com/tasks?t=111603) Re-PR of https://github.com/pytorch/pytorch/pull/111196 sadly due to reverts, we could not reuse @lezcano's original PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116785 Approved by: https://github.com/mlazos	2024-01-06 03:35:35 +00:00
Yanbo Liang	f657b2b1f8	[Dynamo][10/N] Remove TorchVariable and is_allowed (#116312 ) After this refactor: * ```TorchVariable``` definition and all references are removed. * All ```is_allowed``` references except one are removed. - The only left one is in ```torch/_dynamo/decorators:_disallow_in_graph_helper```. It was called when users put ```disallow_in_graph``` decorator on a function. Since we use the lists in ```trace_rules``` to decide the function's trace rule, so the decorator would only be used as customer function rather than torch functions. I'll defer this to a separate decorator refactor PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116312 Approved by: https://github.com/jansel	2023-12-27 18:47:05 +00:00
PyTorch MergeBot	3b709d7c1e	Revert "[Dynamo][10/N] Remove TorchVariable and is_allowed (#116312 )" This reverts commit `015bd0e0a1`. Reverted https://github.com/pytorch/pytorch/pull/116312 on behalf of https://github.com/kit1980 due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/116312#issuecomment-1869825506))	2023-12-26 23:47:15 +00:00
Yanbo Liang	015bd0e0a1	[Dynamo][10/N] Remove TorchVariable and is_allowed (#116312 ) After this refactor: * ```TorchVariable``` definition and all references are removed. * All ```is_allowed``` references except one are removed. - The only left one is in ```torch/_dynamo/decorators:_disallow_in_graph_helper```. It was called when users put ```disallow_in_graph``` decorator on a function. Since we use the lists in ```trace_rules``` to decide the function's trace rule, so the decorator would only be used as customer function rather than torch functions. I'll defer this to a separate decorator refactor PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116312 Approved by: https://github.com/jansel	2023-12-23 09:44:09 +00:00
Yanbo Liang	be9de33240	[Dynamo][9/N] Make SkipFilesVariable wrap functions only (#115963 ) Make ```SkipFilesVariable``` only handle function type, and route skipped classes to ```UserDefinedClassVariable```. The reasons behind this are: * We'd like to remove ```is_allowed```, so the allowed/disallowed torch classes should have a proper place to handle. We can put them in either ```SkipFilesVariable``` and ```UserDefinedClassVariable``` under the current architecture, but it's confusing to have two places do one thing. - Going forward, let's make ```SkipFilesVariable``` only handle functions, and probably I'll rename it to ```SkippedFunctionVariable``` in the following PRs. - Let's do dispatch by value's type, all torch classes stuff would go to ```UserDefinedClassVariable``` in the next PR. * We'd merge in_graph/skip/inline trace decision into the same API ```trace_rule.lookup```, so probably we have to limit the input to only function for better organizing ```VariableBuilder._wrap``` logics. - Next step, I'll merge ```skipfiles.check``` into ```trace_rules.lookup```, and do the skipfile check before wrapping them into correct variable tracker. - Though the ```TorchCtxManagerClassVariable``` is decided by ```trace_rules.lookup```, I'll refactor it out in the following PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115963 Approved by: https://github.com/jansel	2023-12-21 01:35:07 +00:00
PyTorch MergeBot	bdfabe5e7d	Revert "[Dynamo][9/N] Make SkipFilesVariable wrap functions only (#115963 )" This reverts commit `bb5a27052f`. Reverted https://github.com/pytorch/pytorch/pull/115963 on behalf of https://github.com/jeanschmidt due to causing significant performance regression, identified by number of ops in ads, please check internal diff ([comment](https://github.com/pytorch/pytorch/pull/115963#issuecomment-1864361697))	2023-12-20 12:06:55 +00:00
Yanbo Liang	bb5a27052f	[Dynamo][9/N] Make SkipFilesVariable wrap functions only (#115963 ) Make ```SkipFilesVariable``` only handle function type, and route skipped classes to ```UserDefinedClassVariable```. The reasons behind this are: * We'd like to remove ```is_allowed```, so the allowed/disallowed torch classes should have a proper place to handle. We can put them in either ```SkipFilesVariable``` and ```UserDefinedClassVariable``` under the current architecture, but it's confusing to have two places do one thing. - Going forward, let's make ```SkipFilesVariable``` only handle functions, and probably I'll rename it to ```SkippedFunctionVariable``` in the following PRs. - Let's do dispatch by value's type, all torch classes stuff would go to ```UserDefinedClassVariable``` in the next PR. * We'd merge in_graph/skip/inline trace decision into the same API ```trace_rule.lookup```, so probably we have to limit the input to only function for better organizing ```VariableBuilder._wrap``` logics. - Next step, I'll merge ```skipfiles.check``` into ```trace_rules.lookup```, and do the skipfile check before wrapping them into correct variable tracker. - Though the ```TorchCtxManagerClassVariable``` is decided by ```trace_rules.lookup```, I'll refactor it out in the following PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115963 Approved by: https://github.com/jansel	2023-12-19 02:01:47 +00:00
Michael Lazos	869e52e3dd	Support torch function user objects (#111765 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111765 Approved by: https://github.com/jansel	2023-12-13 22:11:52 +00:00
Michael Lazos	fbeca60b1f	Remove replace_all and make VTs mutable (#113725 ) 1. Removes calls to `replace_all` and `clone` and makes VTs mutable. 2. Properly handles Tuple Iterator mutation. Previously TupleIterator variables would only be properly reconstructed if they were advanced at least once in a frame. On calls to `next`, the source information would be lost (due to constructing a new iterator without using builder), which would ensure that during codegen the variable would be reconstructed from scratch. Now that VTs are mutated, the source is never lost, so we need to properly track mutation and handle it by replaying calls to `next` at the end of the modified bytecode. 3. Added test for checking iadd side effects, this was missing in our unit test coverage. 4. Fixed two incorrect sources, DelayGraphBreakVariable, and UserMethodVariable both relied on setting the source to AttrSource(parent, name) at the callsite of `var_getattr`. 5. Fixed a bug in inplace adding for lists, it would set the resulting VariableTracker's source to `None` which would utilize a different reconstruct path in codegen. Now this is handled explicitly by reconstructing vars when allow_cache=`False`, so that during side effect replay, the mutated var is correctly updated. In subsequent PRs: * Refactoring side effect tracking to be significantly simpler (I think we only need an `is_modified` flag) * Refactor `next_variables` iterator to match the signature of `next` * Remove all references to `options` in the code * Refactor VTs representing mutable collections to implement their own mutation update handling * Remove clone and/or make it specific to lists for creating slices * Add mutation tracking/replay for sets * Add mutation tracking/replay for iter.py * Removing setting source in builder (it's set at the top level after a var is returned) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113725 Approved by: https://github.com/jansel	2023-12-10 09:31:21 +00:00
Jason Ansel	88642d44d9	[dynamo] Add RestrictedListSubclassVariable (#115057 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115057 Approved by: https://github.com/yanboliang ghstack dependencies: #115095, #115046	2023-12-05 19:01:23 +00:00
Jason Ansel	3d0bbb24a1	[dynamo] Improve support for list subclasses (#115052 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115052 Approved by: https://github.com/oulgen, https://github.com/eellison ghstack dependencies: #114830, #115047, #115048	2023-12-05 01:31:33 +00:00
Jon Chuang	9d2425c8a4	[dynamo] Be clearer about dict subtype source availability (#114069 ) ``` # [NOTE] OrderedDict, dict subtypes must always have source # We cannot instantiate such subtypes in-graph due to builtin __new__ ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/114069 Approved by: https://github.com/ezyang	2023-11-20 18:49:42 +00:00
Jon Chuang	100b9952b1	[dynamo] Fix user defined object sourceless callable (#114066 ) Fixes https://github.com/pytorch/pytorch/issues/114019 We do not need to guard on callable user object defined instantiated in graph Pull Request resolved: https://github.com/pytorch/pytorch/pull/114066 Approved by: https://github.com/ezyang	2023-11-20 18:38:03 +00:00
PyTorch MergeBot	5d170fce29	Revert "Support tensors as Dict keys (#111196 )" This reverts commit `b0805fa5d0`. Reverted https://github.com/pytorch/pytorch/pull/111196 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it is failing internally. I will provide the details there ([comment](https://github.com/pytorch/pytorch/pull/111196#issuecomment-1813410149))	2023-11-15 23:08:00 +00:00
lezcano	b0805fa5d0	Support tensors as Dict keys (#111196 ) This prepares the PR where we implement sets in terms of dicts. To do so, rather than storing internally a dictionary that maps literals to VariableTrackers, it stores (pretty much) a dictionary from VTs to VTs. To do so, keys are wrapped in an opaque internal class `_Hashable`. The Hashable class is opaque on purpose so that it fails hard if if it inadvertently leaks back into user code. We also found and fixed a number of latent bugs and inconsistencies in the way dynamo checked what can be a dict key. More generally, we make much clearer what are the things that need to be modified to add a new supported key type to Dicts. Fixes https://github.com/pytorch/pytorch/issues/107595 Fixes https://github.com/pytorch/pytorch/issues/111603 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111196 Approved by: https://github.com/jansel	2023-11-14 19:14:03 +00:00
Peter Bell	a3a2486be8	[dynamo] Avoid eager imports of classes with custom VariableTrackers (#112319 ) Currently custom VariableTrackers exist for classes that live outside of pytorch. For these cases dynamo currently eagerly imports the module to get the class object to compare against. This instead uses `sys.modules.get("module.path")` such that the module is never imported by dynamo itself, but if the user has imported the module then we will still access the module and grab the type we need to compare against. I noticed this issue because importing `KeyedJaggedTensor` fails half-way through if `fbgemm_gpu` has been built with an incompatible PyTorch version, in which case it retries the import again each time! Pull Request resolved: https://github.com/pytorch/pytorch/pull/112319 Approved by: https://github.com/lezcano, https://github.com/ezyang	2023-11-07 22:45:54 +00:00
Jason Ansel	356f3458c4	[dynamo] Remove incorrect sources (#112961 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112961 Approved by: https://github.com/voznesenskym, https://github.com/Skylion007 ghstack dependencies: #111306, #111415, #111725, #111726, #112962	2023-11-07 22:01:40 +00:00
Jason Ansel	5fe96eaaf4	[dynamo] Remove VariableTracker.propagate (#111726 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111726 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306, #111415, #111725	2023-11-07 19:55:19 +00:00
Jason Ansel	843a8ecd24	[dynamo] Remove VariableTracker.add_options (#111725 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111725 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306, #111415	2023-11-07 19:55:19 +00:00
Jason Ansel	9664190952	[dynamo] Eagerly install guards (#111415 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111415 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306	2023-11-07 19:55:19 +00:00
Jason Ansel	f5088d2e45	[dynamo] fix None routing bug during var_getattr on UDO (#111614 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111614 Approved by: https://github.com/jansel	2023-10-29 01:57:43 +00:00
Jason Ansel	c7b78fb76c	[dynamo] Replace recursively_contains with parents_tracker (#112122 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112122 Approved by: https://github.com/voznesenskym	2023-10-28 06:46:48 +00:00
Yanbo Liang	061bf1a153	[5/N] Make torch context manager a TorchCtxManagerClassVariable (#111622 ) Major change in this PR is to make torch context manager class a separate ```TorchCtxManagerClassVariable```, since we have dynamo implementation for these ctx managers. I was thinking to wrap them as ```UserDefinedClassVariable``` and do dispatch at ```USCVariable.call_function```, but it seems almost the same amount of work and this way is more clear. This is on the way of moving ```TorchVariable``` to ```TorchFunctionVariable``` which will only handle the functions who would be allowed in graph (e.g, ```torch.sin```) and constant folded (e.g, ```torch.is_floating_point```). All other torch functions would be go through skip/inline rules, and would be wrapped as ```UserFunctionVariable``` (for inlined) and ```SkipFilesVariable``` (for skipped). The next steps: * Wrap torch modules, classes, objects as regular ```PythonModuleVariable```, ```UserDefinedClassVariable``` and ```UserDefinedObjectVariable```. * Generate the allow in graph torch functions list and wrap them as ```TorchFunctionVariable```. * Finally merge ```skipfiles.check``` and ```is_allowed``` into one function ```allow_skip.check(fn)``` which would return a Enum of allow, skip and inline. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111622 Approved by: https://github.com/jansel	2023-10-27 21:26:54 +00:00
PyTorch MergeBot	5344468712	Revert "[dynamo] Properly track user-defined types for `type()` (#110794 )" This reverts commit `ad4ccf9689`. Reverted https://github.com/pytorch/pytorch/pull/110794 on behalf of https://github.com/ezyang due to looks like this actually fails internal tests ([comment](https://github.com/pytorch/pytorch/pull/110794#issuecomment-1778002262))	2023-10-24 20:42:26 +00:00
Ken Jin	ad4ccf9689	[dynamo] Properly track user-defined types for `type()` (#110794 ) Closes https://github.com/pytorch/pytorch/issues/110315. Thanks to @ezyang for the easy repro! Pull Request resolved: https://github.com/pytorch/pytorch/pull/110794 Approved by: https://github.com/ezyang	2023-10-23 17:34:23 +00:00
Tugsbayasgalan Manlaibaatar	bf7307adf8	Support inference_mode decorator (#109274 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109274 Approved by: https://github.com/williamwen42	2023-09-27 22:21:42 +00:00
Yanbo Liang	a81cb0de16	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-26 00:03:41 +00:00
PyTorch MergeBot	829b5c0949	Revert "[Dynamo] Support python class member_descriptor (#109956 )" This reverts commit `12cd776d90`. Reverted https://github.com/pytorch/pytorch/pull/109956 on behalf of https://github.com/jeanschmidt due to multiple slow jobs broken ([comment](https://github.com/pytorch/pytorch/pull/109956#issuecomment-1733706269))	2023-09-25 13:25:45 +00:00
Yanbo Liang	12cd776d90	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-25 03:15:39 +00:00
Michael Voznesensky	a902150a1e	[Easy] ConstantVariable() -> .create (#109896 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109896 Approved by: https://github.com/ezyang	2023-09-22 22:30:15 +00:00
Edward Yang	88600e7d2e	[RELAND] Force synced KJT to trace unbacked SymInt (#108960 ) (#109216 ) Summary: The basic concept behind this diff is to modify Dynamo's tracing behavior when it encounters a KeyedJaggedTensor that is synced (aka has `_length_per_key` and `_offset_per_key` populated). These fields are lists of integers; ordinarily, Dynamo will optimistically try to specialize on integers, however, for KJTs, we know that these integers will definitely vary from run-to-run. Furthermore, ordinarily, we would also specialize these integers if they are 0/1, but we will frequently expect features in KJTs to be 0/1. The fix is to detect KJTs and treat these integers as unbacked integers. This is NOT a universally sound optimization: when treating these integers as unbacked, we never report them as equal to zero or one. In return, we always generate graphs that generalize no matter the length of values on features. This is enough to trace through APS sparse arch, torchrec_dlrm and some small split-cat examples. The special integer behavior is triggered by a dynamically scoped `force_unspec_int_unbacked_size_like` variable on TracingContext, which we trigger when we wrap a KJT. There probably are other ways to do this, but this was simple and worked. Test Plan: ``` buck2 test mode/dev-nosan //pytorch/benchmark/fb/test_gpu:run_test_gpu ``` from aakhundov 1. first build feed_lower_benchmark: ``` buck2 build --show-output mode/opt -c python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true hpc/new/models/feed/benchmark:feed_lower_benchmark ``` 2. then run the lowering of the model with it: ``` TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 TORCH_LOGS="output_code,graph_code" TORCH_COMPILE_DEBUG=1 ../buck-out/v2/gen/fbcode/79c6b019ee0f9469/hpc/new/models/feed/benchmark/__feed_lower_benchmark__/feed_lower_benchmark.par --load=manifold://ig_inference_model/tree/user/facebook/fblearner/predictor/960999465/60/gpu_lowering/input.predictor --skip-trt --skip-ait --sync-mode=0 --enable-aot-inductor --lower-presets="ig_stories" --gpu-trace ``` cf https://docs.google.com/document/d/1yD30xYrdmM8r2HTdmXnZTg0-MHVexfVrAa0294m1AUE/edit?pli=1#heading=h.qiv3fp7e6zg0 From torchrec: https://www.internalfb.com/intern/wiki/Torchrec/Development/Testing_production_models/ From ge0405 baseline (without your diff): f477293168 your diff: f477292363 ``` buck2 test //caffe2/test/dynamo:test_dynamo_torchrec buck2 run 'fbcode//mode/opt' fbcode//pytorch/benchmark/fb/test_gpu:run_test_gpu -- 'pytorch.benchmark.fb.test_gpu.test_gpu.TestBenchmarkFbGpu.test_train_blue_reels_vdd_v3_inductor_speedup' ``` Differential Revision: D49236757 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109216 Approved by: https://github.com/voznesenskym	2023-09-18 14:39:44 +00:00
weifengpy	9021fb8dac	[dynamo] implement custom dict variable as a general solution for HF's ModelOutput class (#105044 ) before the PR, for HF's ModelOutput class, we use dicts.py/DataClassVariable with our own implementation on __getItem__, __setAttr__, __setItem__. There is a risk that ModelOutput logic may change since it is a user code after the PR, we inline __getItem__, __setAttr__, __setItem__ using dicts.py/CustomizedDictVariable so the logic always keep AA unit test * python test/dynamo/test_model_output.py -k test_HF_bert_model_output test on HF benchmark * python benchmarks/dynamo/huggingface.py -d cuda --inference --accuracy --progress --inductor --print-dataframe-summary 2>&1 * all metric are the same before/after the PR, including pass rate, unique_graphs, graph_breaks, unique_graph_breaks * before the PR: P790393916 * after the PR: P790368991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105044 Approved by: https://github.com/jansel	2023-09-14 17:15:50 +00:00
Michael Voznesensky	064ae9ff33	Support register_hook on input tensors (#108903 ) The strategy in this PR is pretty straightforward. There are 2 kinds of hooks: 1) Hooks on objects with sources (inputs, params) 2) Hooks on objects w/o sources (intermediaries, and outputs). Note: As outputs can be made simple by how dynamo handles residuals, they could actually be handled as if they were inputs, but, for the sake of this PR, we will refer to hooks as either hooks on inputs (sourced), or hooks on intermediaries (not sourced). The plan: For tensors w/ a source: We record registered hooks, store them as a global, and associate them with the tensor in residuals. This means that when dynamo goes to create the frame, where we produce bytecode to stitch together our PT2 modified bytecode with the original eager code, we call `register_hook`. This registration of hooks in residuals is sound because (a) it happens right after a Pt2 frame region ends and (b) we know that the tensor is alive in f_locals, f_globals, or a module in the users invoking frame. This means we can soundly know it will be around to invoke `register_hook` on. As long as we guard on the identity of the lifted function, this is sound to do. For tensors w/o a source: Graph break - we will support this in a subsequent PR Handles: An interesting new component here is the creation of a `STORE_FAST `->`LOAD_FAST` associated with the handle, the return result of `register_hook`. If the user code stored the result of `register_hook` in a handle, we need to honor that. We do so by interceding into `STORE_FAST`, and recording the name of the local variable as directed by user code. We then honor that same name in the reconstructed bytecode. If the user did not store a hook, we merely pop the produced value to preserve the stack. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108903 Approved by: https://github.com/ezyang ghstack dependencies: #108846, #109092	2023-09-14 01:52:21 +00:00
PyTorch MergeBot	1d32c9c7f2	Revert "Force synced KJT to trace unbacked SymInt (#108960 )" This reverts commit `f9a250c35b`. Reverted https://github.com/pytorch/pytorch/pull/108960 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/108960#issuecomment-1715850779))	2023-09-12 14:37:36 +00:00
Edward Yang	f9a250c35b	Force synced KJT to trace unbacked SymInt (#108960 ) Summary: The basic concept behind this diff is to modify Dynamo's tracing behavior when it encounters a KeyedJaggedTensor that is synced (aka has `_length_per_key` and `_offset_per_key` populated). These fields are lists of integers; ordinarily, Dynamo will optimistically try to specialize on integers, however, for KJTs, we know that these integers will definitely vary from run-to-run. Furthermore, ordinarily, we would also specialize these integers if they are 0/1, but we will frequently expect features in KJTs to be 0/1. The fix is to detect KJTs and treat these integers as unbacked integers. This is NOT a universally sound optimization: when treating these integers as unbacked, we never report them as equal to zero or one. In return, we always generate graphs that generalize no matter the length of values on features. This is enough to trace through APS sparse arch, torchrec_dlrm and some small split-cat examples. The special integer behavior is triggered by a dynamically scoped `force_unspec_int_unbacked_size_like` variable on TracingContext, which we trigger when we wrap a KJT. There probably are other ways to do this, but this was simple and worked. Test Plan: ``` buck2 test mode/dev-nosan //pytorch/benchmark/fb/test_gpu:run_test_gpu ``` from aakhundov 1. first build feed_lower_benchmark: ``` buck2 build --show-output mode/opt -c python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true hpc/new/models/feed/benchmark:feed_lower_benchmark ``` 2. then run the lowering of the model with it: ``` TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 TORCH_LOGS="output_code,graph_code" TORCH_COMPILE_DEBUG=1 ../buck-out/v2/gen/fbcode/79c6b019ee0f9469/hpc/new/models/feed/benchmark/__feed_lower_benchmark__/feed_lower_benchmark.par --load=manifold://ig_inference_model/tree/user/facebook/fblearner/predictor/960999465/60/gpu_lowering/input.predictor --skip-trt --skip-ait --sync-mode=0 --enable-aot-inductor --lower-presets="ig_stories" --gpu-trace ``` cf https://docs.google.com/document/d/1yD30xYrdmM8r2HTdmXnZTg0-MHVexfVrAa0294m1AUE/edit?pli=1#heading=h.qiv3fp7e6zg0 From torchrec: https://www.internalfb.com/intern/wiki/Torchrec/Development/Testing_production_models/ From ge0405 baseline (without your diff): f477293168 your diff: f477292363 Differential Revision: D49019987 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108960 Approved by: https://github.com/voznesenskym	2023-09-12 03:44:24 +00:00
Huy Do	5a4fe05a15	Revert "Force synced KJT to trace unbacked SymInt (#107788 )" (#108684 ) This reverts commit `3b92ef814d`. So let's manually revert it instead. (Not sure why the bot doesn't work on https://github.com/pytorch/pytorch/pull/107788) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108684 Approved by: https://github.com/ezyang	2023-09-06 19:15:45 +00:00
Edward Z. Yang	3b92ef814d	Force synced KJT to trace unbacked SymInt (#107788 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/107788 Approved by: https://github.com/voznesenskym	2023-09-06 03:18:26 +00:00
voznesenskym	5d85d897e0	Torchrec Enablement Fixes - Re-PR 107910 (#108018 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108018 Approved by: https://github.com/wconstab	2023-08-28 19:47:53 +00:00

1 2

93 Commits