pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Jerry Zhang	7b4d080496	[quant][pt2e] Rename _pt2e to pt2e (#104668 ) Summary: X-link: https://github.com/pytorch/executorch/pull/3 att Test Plan: Imported from OSS Differential Revision: D47202807 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104668 Approved by: https://github.com/andrewor14	2023-07-15 06:34:17 +00:00
Andrew Or	4b29829ece	[quant][pt2] Fix QAT convert for mobilenetv2 (#104110 ) Summary: QAT convert for mobilenetv2 was previously not working because we incorrectly applied dropout during eval as well as training. This is because, for exported models, model.eval() does not change the behavior of dropout, unlike models with torch ops. This commit simulates the effects of model.eval() for exported models as well by replacing the aten dropout pattern before eval. As of this commit, end-to-end QAT numerics now match for mobilenetv2 between FX and PT2. Test Plan: python test/test_quantization.py TestQuantizePT2EModels.test_qat_mobilenet_v2 Differential Revision: D46750343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104110 Approved by: https://github.com/jerryzh168	2023-07-11 18:42:42 +00:00
Jerry Zhang	c98896b76f	[quant][pt2e] Add more precise representation for quantized add (#104130 ) Summary: The planned e2e for quantization in pytorch 2.0 export is the following: float_model -> prepare_pt2e -> calibration -> convert_pt2e -> ... inside convert_pt2e, we will first produce a q/dq representation of the quantized model, similar to the previous output of convert_to_reference_fx in fx grah mode quantization: ``` torch.ops.quantized_decomposed.dequantize_per_tensor -> torch.ops.aten.add -> torch.ops.quantized_decomopsed.quantize_per_tensor torch.ops.quantized_decomposed.dequantize_per_tensor / ``` Then we'll rewrite the above to a more precise representation that express the intention in a more precise manner, since here we actually want to do int8 addition, instead of simulating the int8 addition with fp32 operations, the representation for quantized add is: ``` def quantized_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point, out_scale, out_zero_point): x = (x_scale / out_scale) * x_i8 y = (y_scale / out_scale) * y_i8 out = x + y out -= (x_zero_point * x_scale - y_zero_point * y_scale) / out_scale out += out_zero_point return out ``` Test Plan: ``` buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_add (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' ``` Reviewed By: kimishpatel Differential Revision: D45628032 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104130 Approved by: https://github.com/kimishpatel	2023-06-27 20:11:30 +00:00
Animesh Jain	75dab587ef	[dynamo] FSDP + AC + torch.compile (#103953 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103953 Approved by: https://github.com/wanchaol	2023-06-24 01:40:56 +00:00
kshitij12345	d552c271db	[pt2] grad support (#102264 ) Teach dynamo about grad Pull Request resolved: https://github.com/pytorch/pytorch/pull/102264 Approved by: https://github.com/zou3519	2023-06-21 10:13:09 +00:00
PyTorch MergeBot	e737a8486f	Revert "[pt2] grad support (#102264 )" This reverts commit `85b83954c8`. Reverted https://github.com/pytorch/pytorch/pull/102264 on behalf of https://github.com/huydhn due to This is failing in trunk `85b83954c8` and looks like a landrace ([comment](https://github.com/pytorch/pytorch/pull/102264#issuecomment-1600001309))	2023-06-21 03:02:55 +00:00
kshitij12345	85b83954c8	[pt2] grad support (#102264 ) Teach dynamo about grad Pull Request resolved: https://github.com/pytorch/pytorch/pull/102264 Approved by: https://github.com/zou3519	2023-06-21 01:37:08 +00:00
Zhengxu Chen	26bf8894b6	[export] Replicate exportdb examples and tests in oss. (#102769 ) Summary: Initial work to copy source to OSS for exportdb and make sure tests can run properly. Test Plan: test_export Differential Revision: D46369152 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102769 Approved by: https://github.com/angelayi	2023-06-04 20:01:57 +00:00
Michael Lazos	c75e064dd6	Disallow _foreach_utils.py, but allow it to be inlined (#102221 ) This function should not be allowed, but should be inlineable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102221 Approved by: https://github.com/anijain2305	2023-06-02 05:14:09 +00:00
PyTorch MergeBot	8aa48315de	Revert "Disallow _foreach_utils.py, but allow it to be inlined (#102221 )" This reverts commit `552299c42c`. Reverted https://github.com/pytorch/pytorch/pull/102221 on behalf of https://github.com/huydhn due to Sorry for reverting your PR. It starts to break dynamo jobs in trunk `552299c42c` and it looks like a landrace ([comment](https://github.com/pytorch/pytorch/pull/102221#issuecomment-1563694599))	2023-05-26 01:27:19 +00:00
Michael Lazos	552299c42c	Disallow _foreach_utils.py, but allow it to be inlined (#102221 ) This function should not be allowed, but should be inlineable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102221 Approved by: https://github.com/anijain2305	2023-05-25 23:48:36 +00:00
Kimish Patel	24e9b8f5f4	[PT2E][Quant] Use subgraph matcher annotate linear pattern (#100566 ) This diff adds subgraph matcher for pattern matching. Furthermore, we also move annotations for the matched subgraph in a way that only input and output nodes of the matched subgraph have quantization related valid annotations. Differential Revision: [D45535539](https://our.internmc.facebook.com/intern/diff/D45535539/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100566 Approved by: https://github.com/jerryzh168	2023-05-04 21:31:59 +00:00
andrewor14	9cda7b9e47	[hotfix] Do not import torch.ao.quantization._pt2e from dynamo (#100194 ) Summary: Importing torch.ao.quantization._pt2e from dynamo led to internal test failures related to memory profiling. For now, let's express the path using a simple string instead. Reviewers: jerryzh168, kimishpatel Pull Request resolved: https://github.com/pytorch/pytorch/pull/100194 Approved by: https://github.com/jerryzh168	2023-04-28 01:32:23 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	02f059c2b7	Add private _export API (#99992 ) Differential Revision: D45279206 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99992 Approved by: https://github.com/angelayi, https://github.com/gmagogsfm	2023-04-27 16:24:14 +00:00
Edward Z. Yang	3a5427baf4	Add torch.utils._content_store (#99809 ) Implements a simple content-addressable store for storages (with tensors implemented as cheap references on top), enabling incremental serialization of tensors to disk, which I intend to use in the accuracy repro extractor. Check the comment at the top of torch/utils/_content_store.py for more details on the intended use case. One major piece of this PR is implementing the content hash for tensors. For our prospective use case, we may need to repeatedly hash up to 80 GB of tensor data every time we snapshot (and we may snapshot multiple times). Using a conventional cryptographic hash and hashing each snapshot would likely take on order of minutes, which seemed too slow to me. So instead, I implemented a crappy hash function that can be run on GPU. It is at least somewhat theoretically grounded: using random parameters generated by Philox, we use the standard shift-multiply and xor sum universal hash family. The hash function is a bit dorky though; instead of properly doing 160-bit math, it just runs 32-bit hash five times and cats them together. By the way, this sets the first precedent for kernel in PyTorch library which MUST be torch.compile'd to be run (in fact, this kernel does not run in eager mode because of the use of xor_sum, which doesn't actually exist in ATen.) I had to add a few more primitives to inductor, namely randint (over the entire int range) and xor_sum. Fortunately, these primitives are natively supported by Triton/C++, and so they were very easy to plumb through. xor_sum is exposed as a prim, while randint special cases on when low/high span the entire 32-bit signed integer range. Thanks to Jeff Johnson for letting me bounce ideas of him on a Saturday morning lol. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99809 Approved by: https://github.com/voznesenskym	2023-04-26 18:02:59 +00:00
Will Constable	6427b849a3	Allow in graph einops operators (#99631 ) Coordinating with arogozhnikov from einops team, allowing specific operators in the dynamo graph avoids dynamo tracing problems provided the operators are screened for safety - they must not bake in unintended constants or take data-dependent control flow paths. Fixes #99031 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99631 Approved by: https://github.com/jansel	2023-04-21 03:14:38 +00:00
andrewor14	22af604e1b	[quant][pt2] Add Conv + BN fusion for prepare QAT (#98568 ) Summary: This commit adds the `prepare_qat_pt2e` API and the fusion logic for Conv + BN. We use the subgraph rewriter to match and replace the pattern with the existing logic in `nniqat.ConvBn2d`. Note this is not the end-to-end flow yet. In particular, the convert flow needs to swap the new subgraph with another one that merges the batchnorm stats back into conv. The Conv + BN fusion is implemented in the following steps: 1. Annotate all nodes in the pattern `[conv - bn - getitem]` 2. Match and replace this pattern with the fused QAT pattern (note that this is a larger subgraph than the original one) 3. Copy over metadata from the original nodes to the corresponding nodes in the new subgraph, to ensure the stack traces and dtype annotations are preserved 4. Prepare will insert fake quantizes in the right places based on the annotations Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_conv_bn_fusion Reviewers: jerryzh168, kimishpatel, yanboliang Pull Request resolved: https://github.com/pytorch/pytorch/pull/98568 Approved by: https://github.com/kimishpatel	2023-04-20 20:15:28 +00:00
PyTorch MergeBot	96a262d666	Revert "Allow in graph einops operators (#99478 )" This reverts commit `309b7edfe1`. Reverted https://github.com/pytorch/pytorch/pull/99478 on behalf of https://github.com/kit1980 due to dynamo/test_after_aot.py::TestAfterAot::test_save_graph_repro - AssertionError, see https://github.com/pytorch/pytorch/actions/runs/4750274195/jobs/8438535867	2023-04-20 06:42:35 +00:00
Will Constable	309b7edfe1	Allow in graph einops operators (#99478 ) Coordinating with @arogozhnikov from einops team, allowing specific operators in the dynamo graph avoids dynamo tracing problems provided the operators are screened for safety - they must not bake in unintended constants or take data-dependent control flow paths. Fixes #99031 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99478 Approved by: https://github.com/jansel	2023-04-20 03:40:50 +00:00
PyTorch MergeBot	ab08284225	Revert "Disable dynamo tracing torchrec.distributed (#97824 )" This reverts commit `df216b5736`. Reverted https://github.com/pytorch/pytorch/pull/97824 on behalf of https://github.com/izaitsevfb due to back out diff that doubles memory consumption for multitask FAIM flows. See D44978317	2023-04-17 20:34:01 +00:00
Jason Ansel	9ab5fdff81	Remove obsolete HAS_PRIMS_REFS (#99252 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99252 Approved by: https://github.com/ngimel	2023-04-17 00:27:37 +00:00
Angela Yi	1d077f28ed	[export] Constraints API (#98433 ) Wrapper for users to insert constraints into model code. The constraints will not be maintained in the graph after tracing through make_fx so retracing with dynamo/make_fx will not work. This will be supported after torch._assert supported is implemented. Then we can convert the constrain_range calls to torch._asserts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98433 Approved by: https://github.com/avikchaudhuri, https://github.com/tugsbayasgalan	2023-04-13 21:20:10 +00:00
PyTorch MergeBot	ab761605ae	Revert "[export] Constraints API (#98433 )" This reverts commit `1510eb4072`. Reverted https://github.com/pytorch/pytorch/pull/98433 on behalf of https://github.com/izaitsevfb due to Breaks internal tests, asked by author to revert	2023-04-12 23:37:19 +00:00
PyTorch MergeBot	629377ea8b	Revert "Replace _dynamo.config with an object instead of module (#96455 )" This reverts commit `420104a886`. Reverted https://github.com/pytorch/pytorch/pull/96455 on behalf of https://github.com/jansel due to BC breaking, was landed prematurely	2023-04-12 15:06:14 +00:00
Angela Yi	1510eb4072	[export] Constraints API (#98433 ) Wrapper for users to insert constraints into model code. The constraints will not be maintained in the graph after tracing through make_fx so retracing with dynamo/make_fx will not work. This will be supported after torch._assert supported is implemented. Then we can convert the constrain_range calls to torch._asserts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98433 Approved by: https://github.com/avikchaudhuri, https://github.com/tugsbayasgalan	2023-04-12 01:32:44 +00:00
Han Qi	420104a886	Replace _dynamo.config with an object instead of module (#96455 ) Summary: Replace _dynamo.config with an object instead of module Current usage patterns of setting and reading fields on config will work unchanged. Only changes needed going forward: 1. import torch._dynamo.config will not work. However, just doing import torch._dynamo is sufficient to access dynamo config as torch._dynamo.config. 2. Files inside of _dynamo folder need to access config via from torch._dynamo.config_util import config instead of from torch._dynamo import config. Because _dynamo/__init__.py imports some of the files so it would be circular import. Test Plan: Reviewers: Subscribers: Tasks: Tags: Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/96455 Approved by: https://github.com/williamwen42	2023-04-11 21:23:32 +00:00
Jason Ansel	a7892802b9	[dynamo] Add einops to skipfiles (#98661 ) This was causing failures in a torchbench model Pull Request resolved: https://github.com/pytorch/pytorch/pull/98661 Approved by: https://github.com/yanboliang	2023-04-11 03:21:36 +00:00
Yanbo Liang	a9c7e882ac	[Dynamo] Support skip fbcode modules (#98192 ) Fix Meta internal use case: * We are going to skip tracing ```torchrec.distributed```, however, in fbcode, the structure is a bit different from OSS torchrec. * Meta internally uses ```torch.package```, so we should support skip tracing files like ```<torch_package_0>.torchrec/distributed/...```. * We put the logic behind a flag ```is_fbcode``` to avoid misuse. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98192 Approved by: https://github.com/yf225	2023-04-04 06:33:55 +00:00
Yanbo Liang	df216b5736	Disable dynamo tracing torchrec.distributed (#97824 ) This was used to unblock Meta internal use cases, where ```torchrec.distributed``` was used, however, it can't be traced by dynamo properly right now. We were sending the same fix(#90087) several months ago, but was reverted due to ```fbgemm``` conflicts. This PR catches ```Exception``` rather than ```ImportError``` which can handle the conflicts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97824 Approved by: https://github.com/wconstab	2023-04-01 00:39:59 +00:00
PyTorch MergeBot	7868e4b45b	Revert "Disable dynamo tracing torchrec.distributed (#97824 )" This reverts commit `9d1d95099b`. Reverted https://github.com/pytorch/pytorch/pull/97824 on behalf of https://github.com/yanboliang due to need to catch more exception	2023-03-30 20:43:00 +00:00
Yanbo Liang	9d1d95099b	Disable dynamo tracing torchrec.distributed (#97824 ) This was used to unblock Meta internal use cases, where ```torchrec.distributed``` was used, however, it can't be traced by dynamo properly right now. We were sending the same fix(#90087) several months ago, but was reverted due to ```fbgemm``` conflicts. This PR catches ```Exception``` rather than ```ImportError``` which can handle the conflicts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97824 Approved by: https://github.com/wconstab	2023-03-29 04:29:51 +00:00
Aaron Gokaslan	3d82d8d0ed	[BE] Enable more flake8-comprehensions checks (#94601 ) I applied some flake8 fixes and enabled checking for them in the linter. I also enabled some checks for my previous comprehensions PR. This is a follow up to #94323 where I enable the flake8 checkers for the fixes I made and fix a few more of them. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94601 Approved by: https://github.com/ezyang	2023-02-10 23:40:29 +00:00
Edward Z. Yang	ca9ebf9e2b	Delete dynamo_import and inductor_import (#93851 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/93851 Approved by: https://github.com/albanD, https://github.com/jansel	2023-02-02 01:51:29 +00:00
Edward Z. Yang	dfe916ca88	Dynamo comptime, with public ComptimeContext API (#90983 ) This PR adds `@comptime`, a decorator that causes a given function to be executed at compile time when Dynamo is symbolically evaluating their program. To query the Dynamo state, we offer a public ComptimeContext API which provides a limited set of APIs for querying Dynamo's internal state. We intend for users to use this API and plan to keep it stable. Here are some things you can do with it: * You want to breakpoint Dynamo compilation when it starts processing a particular line of user code: give comptime a function that calls breakpoint * You want to manually induce a graph break for testing purposes; give comptime a function that calls unimplemented * You want to perform a debug print, but you don't want to induce a graph break; give comptime a function that prints. * You can print what the symbolic locals at a given point in time are. * You can print out the partial graph the Dynamo had traced at this point. * (My original motivating use case.) You want to add some facts to the shape env, so that a guard evaluation on an unbacked SymInt doesn't error with data-dependent. Even if you don't know what the final user API for this should be, with comptime you can hack out something quick and dirty. (This is not in this PR, as it depends on some other in flight PRs.) Check out the tests to see examples of comptime in action. In short, comptime is a very powerful debugging tool that lets you drop into Dynamo from user code, without having to manually jerry-rig pdb inside Dynamo to trigger after N calls. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90983 Approved by: https://github.com/jansel	2022-12-19 11:06:01 +00:00
Michael Lazos	1accd915a4	Re-enable optimizers (#90709 ) Fixes https://github.com/pytorch/pytorch/issues/90165 https://github.com/pytorch/torchdynamo/issues/328 Re-enables optimizer capture + compilation now that the dynamo slowdowns have been fixed and it has speedups, numbers to come soon Pull Request resolved: https://github.com/pytorch/pytorch/pull/90709 Approved by: https://github.com/anijain2305, https://github.com/jansel, https://github.com/yanboliang	2022-12-19 04:07:41 +00:00
Peter Bell	ba77afbce1	Move _test_inductor_realize into python (#90517 ) Addresses https://github.com/pytorch/pytorch/pull/90014/files#r1043625932 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90517 Approved by: https://github.com/ngimel	2022-12-14 12:40:00 +00:00
Michael Lazos	9c4189f82d	[dynamo] Add is_compiling for dynamo (#90329 ) `is_tracing` returns True during dynamo tracing and False when run in Eager Pull Request resolved: https://github.com/pytorch/pytorch/pull/90329 Approved by: https://github.com/jansel	2022-12-09 20:19:41 +00:00
Will Constable	772b726068	Revert "Disable dynamo tracing torchrec.distributed (#90087 )" (#90416 ) This reverts commit `7e9a8a1361`. This revert fixes a torchbench dlrm amp crash. Auto revert fails due to conflict. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90416 Approved by: https://github.com/yanboliang, https://github.com/malfet	2022-12-08 01:50:54 +00:00
Yanbo Liang	898b46d6cc	[Dynamo][Easy] capture more exceptions when import skip modules (#90338 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/90338 Approved by: https://github.com/williamwen42	2022-12-07 02:05:39 +00:00
Yanbo Liang	7e9a8a1361	Disable dynamo tracing torchrec.distributed (#90087 ) Summary: Context at T138318923 Test Plan: mannual test Reviewed By: yf225 Differential Revision: D41631076 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90087 Approved by: https://github.com/yf225	2022-12-06 22:17:16 +00:00
Michael Lazos	903ae4570e	Disable optimizer tracing, enable for tests only (#89500 ) Disabling optimizer tracing before launch until it can be added to the benchmark suites without increasing compile times Pull Request resolved: https://github.com/pytorch/pytorch/pull/89500 Approved by: https://github.com/anijain2305	2022-11-24 04:15:34 +00:00
Will Constable	874625e039	Graph-break on FSDP in dynamo (#87420 ) Why we want to graph-break FSDP - FSDP has communication ops during forward and backward which we currently can't trace into the graph but also want to ensure are overlapped with compute - dynamo has issues tracing into or capturing a call to fsdp module without a break (see below) How we graph-break on FSDP - marking FSDP.forward code as skip means the code frames will graph-break; but in this case all of torch.* is listed in skipfiles.py anyway, so this is taken care of - disallowing the FSDP module prevents dynamo trying to record a 'call_module(FSDPmodule)' node into a graph, which happens earlier than the graphbreak that would be caused by skip, and causes additional issues: dynamo deepcopies modules before call-module handling, and FSDP module isn't trivially deep-copyable cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87420 Approved by: https://github.com/aazzolini	2022-10-25 17:07:44 +00:00
Jason Ansel	c7c09722ad	Move TorchDynamo into PyTorch core (#86461 ) Context: https://github.com/pytorch/torchdynamo/issues/1588 This PR moves [TorchDynamo](https://github.com/pytorch/torchdynamo) and TorchInductor into PyTorch core. - `torchdynamo` becomes `torch._dynamo` - `torchinductor` becomes `torch._inductor` This PR was generated by running `copy_to_core.sh` in https://github.com/pytorch/torchdynamo/pull/1538 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86461 Approved by: https://github.com/voznesenskym	2022-10-13 23:18:06 +00:00

43 Commits