pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Yanbo Liang	7f40640342	[Dynamo] Support torch.amp.autocast as decorator (#114845 ) Fixes #114818 Pull Request resolved: https://github.com/pytorch/pytorch/pull/114845 Approved by: https://github.com/jansel	2023-11-30 23:54:57 +00:00
Yanbo Liang	870539670a	[Dynamo] Support skip/inline function by name and consolidate skip/inline check logics (#113888 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/113888 Approved by: https://github.com/mlazos	2023-11-18 21:36:29 +00:00
jiayisun	7f1cbc8b5a	remove intel_extension_for_pytorch from THIRDPARTY_SKIPLIST (#112840 ) Motivation: Since `intel_extension_for_pytorch` is added to `THIRDPARTY_SKIPLIST`, when the IPEX optimized model uses `torch.compile`, the functions defined in IPEX will be skipped, these functions will not be able to generate the corresponding FX graph through dynamo, cannot be optimized by the compiler, and unnecessary graph breaks occurred. This PR is to remove `intel_extension_for_pytorch` from `THIRDPARTY_SKIPLIST` so that IPEX and torch.compile can work better together. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112840 Approved by: https://github.com/jgong5, https://github.com/jansel	2023-11-12 09:40:51 +00:00
Jez Ng	26f907e09b	[dynamo] Enable typechecking for skipfiles.py (#112975 ) Not sure why mypy thinks `importlib.util.find_spec` is not a valid lookup, but it seems OK if I explicitly import it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112975 Approved by: https://github.com/yanboliang, https://github.com/eellison ghstack dependencies: #112130, #112970, #112971, #112972, #112973, #112974	2023-11-08 21:17:45 +00:00
Iris Zhang	b07cfd79fe	[DeviceMesh] Move DeviceMesh out from torch.distributed._tensor (#112364 ) Move DeviceMesh out as a standalone module. Once we make sure everything is migrated and doc is ready, we will make `torch.distributed._device_mesh` public in follow-up PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112364 Approved by: https://github.com/wanchaol, https://github.com/fegin, https://github.com/fduwjj	2023-11-02 04:44:25 +00:00
Yanbo Liang	e644b03775	[Forward fix] torch.fx.passes.shape_prop should not be skipped (#111771 ) Summary: As title Test Plan: All failures in T167831495 passed Differential Revision: D50542953 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111771 Approved by: https://github.com/aakhundov	2023-10-23 18:05:26 +00:00
Michael Lazos	fb8876069d	Support tracing base torch_function impl (#111731 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111731 Approved by: https://github.com/jansel ghstack dependencies: #111730	2023-10-23 07:11:32 +00:00
Yanbo Liang	bf01a7b023	[3/N] Merge skipfiles.check rules (#111451 ) This major change in this PR is to consolidate the skipfiles.check rules, the major thing done is merging the original ```FILE_INLINELIST``` with ```SUBMOD_INLINELIST``` into new ```MOD_INLINELIST``` and a legacy ```LEGACY_MOD_INLINELIST```. Let's use the following example to illustrate what is the expected behavior for this force inline list: `fa995626a8/torch/_dynamo/skipfiles.py (L344-L369)` The handling logic is: * If f2 is inlined, we will check both ```MOD_INLINELIST``` and ```LEGACY_MOD_INLINELIST``` to consultant force inline rules for f3. * If f2 is skipped, we will check ```LEGACY_MOD_INLINELIST``` only for inline rules for f3. The reason behind this design is: if f2 is skipped, if we always trace all recursively called functions, we will go to the very low level functions (e.g, ```super().__init__```) which caused graph breaks. We treated this as a signal that all functions that f2 recursively called should be skipped as well if f2 is skipped. This is also a feature that many PyTorch developers requested, they just want to skip all recursive functions if they mark the upper level functions as skipped. For PyTorch developers, we should only use ```MOD_INLINELIST``` going forward. I think most of the modules in the ```LEGACY_MOD_INLINELIST``` are legacy things to workaround when we didn't have a good skip/inline API. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111451 Approved by: https://github.com/ezyang	2023-10-22 04:35:15 +00:00
Michael Voznesensky	cff71c47dd	[dynamo] Forward fix a bunch of distributed collective allow fixes (#111171 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111171 Approved by: https://github.com/yanboliang	2023-10-13 15:49:04 +00:00
Michael Voznesensky	395d0eaea0	Dynamo - config gated torch.distributed allow, exclusion for special leaf funcs (#110894 ) `is_allowed` is a tricky bit of functionality - it sits early up in builder and is used to drive the creation of TorchVariable (more notes here, meta only https://fb.workplace.com/groups/pytorch.dev/permalink/1393563781222098/) If we are tracing distributed in full, we want to route certain calls in distributed to NOT PASS is_allowed (this does not, confusingly, mean that they are not allowed, lol, but rather that we dont want them to become TorchVariable), others, we are fine with preserving. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110894 Approved by: https://github.com/ezyang	2023-10-12 09:25:51 +00:00
Yanbo Liang	986ad3bfa6	[2/N] Dynamo supports skip by function & removes skipfiles circular import (#110835 ) Several improvements for skipfiles: * Add ```FUNC_INLINELIST``` to support function level skip/inline check. * Use ```fn.__code__``` to match function since we can't get the function object sometimes. * Use python module string name for ```FILE_INLINELIST``` and ```SUBMODULE_INLINELIST```. * Use filename to match file and python module, which can fundamentally resolved the circular import issues introduced by skipfiles. * Use ```TYPE_CHECKING``` to ensure the python module string name is correct. * Add unit tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110835 Approved by: https://github.com/ezyang	2023-10-12 00:44:41 +00:00
PyTorch MergeBot	d3205f8377	Revert "[2/N] Dynamo supports skip by function & removes skipfiles circular import (#110835 )" This reverts commit `0bd4ce728b`. Reverted https://github.com/pytorch/pytorch/pull/110835 on behalf of https://github.com/DanilBaibak due to Broken trunk ([comment](https://github.com/pytorch/pytorch/pull/110835#issuecomment-1758279590))	2023-10-11 18:39:36 +00:00
Yanbo Liang	0bd4ce728b	[2/N] Dynamo supports skip by function & removes skipfiles circular import (#110835 ) Several improvements for skipfiles: * Add ```FUNC_INLINELIST``` to support function level skip/inline check. * Use ```fn.__code__``` to match function since we can't get the function object sometimes. * Use python module string name for ```FILE_INLINELIST``` and ```SUBMODULE_INLINELIST```. * Use filename to match file and python module, which can fundamentally resolved the circular import issues introduced by skipfiles. * Use ```TYPE_CHECKING``` to ensure the python module string name is correct. * Add unit tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110835 Approved by: https://github.com/ezyang	2023-10-11 17:24:56 +00:00
Chien-Chin Huang	57f6368b8e	[collective] Add a torch.compile + functional_collectives test (#110688 ) Add a test to ensure functional_collectives + torch.compile always works. Differential Revision: [D50001491](https://our.internmc.facebook.com/intern/diff/D50001491/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110688 Approved by: https://github.com/wanchaol, https://github.com/fduwjj	2023-10-10 17:14:50 +00:00
Ken Jin	31d635803b	[Dynamo] Fx proxy for builtin all with list iterators (#109972 ) Fixes https://github.com/pytorch/pytorch/issues/109057. Fixes https://github.com/pytorch/pytorch/issues/103620. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109972 Approved by: https://github.com/ezyang	2023-10-04 07:59:26 +00:00
Yanbo Liang	9bc5e10899	[New][1/N] Dynamo skipfiles refactor (#110330 ) This is the replacement of #109567. Now I preserved all existing semantics and only focusing on API (for developers) and code structure changes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110330 Approved by: https://github.com/ezyang	2023-10-03 16:50:33 +00:00
atalman	b253fc9c93	Revert "[1/N] Dynamo skipfiles refactor (#109567 )" (#110296 ) This reverts commit `84c5435b29`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110296 Approved by: https://github.com/yanboliang	2023-09-29 20:35:46 +00:00
Yanbo Liang	84c5435b29	[1/N] Dynamo skipfiles refactor (#109567 ) This is 1/N of the dynamo skipfiles/allowed_functions refactor, the major change in this PR includes: * Refactor & define the [skipfiles rules](https://github.com/pytorch/pytorch/pull/109567/files#diff-5aa3ce9db729bf0901ea97a5d3cc51924cc8575d9c516c1c8f572a35de92544aR56) and interface * For every ```skipfiles.check```, we return both the check result and the skip/inline reason and log them for debugging. * We found several latent issues/bugs and incorrect implementations in the codebase, but I'm planning to fix them in follow-up PRs to make the refactor decoupled with bug fixes. * More details in the inline comments. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109567 Approved by: https://github.com/ezyang, https://github.com/jansel, https://github.com/anijain2305	2023-09-28 18:36:46 +00:00
PyTorch MergeBot	75462fd870	Revert "[1/N] Dynamo skipfiles refactor (#109567 )" This reverts commit `f8e0ebec8c`. Reverted https://github.com/pytorch/pytorch/pull/109567 on behalf of https://github.com/huydhn due to Many jobs are failing in trunk after this with FILENAME_ALLOWLIST is not defined error `f8e0ebec8c`. This looks like a landrace ([comment](https://github.com/pytorch/pytorch/pull/109567#issuecomment-1738344950))	2023-09-28 02:22:22 +00:00
Yanbo Liang	f8e0ebec8c	[1/N] Dynamo skipfiles refactor (#109567 ) This is 1/N of the dynamo skipfiles/allowed_functions refactor, the major change in this PR includes: * Refactor & define the [skipfiles rules](https://github.com/pytorch/pytorch/pull/109567/files#diff-5aa3ce9db729bf0901ea97a5d3cc51924cc8575d9c516c1c8f572a35de92544aR56) and interface * For every ```skipfiles.check```, we return both the check result and the skip/inline reason and log them for debugging. * We found several latent issues/bugs and incorrect implementations in the codebase, but I'm planning to fix them in follow-up PRs to make the refactor decoupled with bug fixes. * More details in the inline comments. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109567 Approved by: https://github.com/ezyang, https://github.com/jansel, https://github.com/anijain2305	2023-09-28 01:21:59 +00:00
Michael Voznesensky	b123fd168a	Higher order op for preserving leaf functions through trace, particularly for getting user defined hooks to compiled autograd (#109690 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109690 Approved by: https://github.com/ezyang	2023-09-27 20:47:15 +00:00
Kimish Patel	eb67c452c8	[Quant] Add DQ duplication pass (#107900 ) Summary: During convert step observers are first replaced by Q-DQ pair. In some scenarios like following output DQ has a fan out. ---> OP2 -> Q -> DQ / OP -> Q -> DQ - \ ---> OP3 -> Q -> DQ If either op OP2 or OP3 are configured to be quantized, then the input is expected to quantized. In this case quantized equivalent of some pattern, that quantizer asked to be quantized, should look like: [DQ -> {pattern} -> Q]. However, in scenario like above where DQ node is shared between multiple "quantized" patterns, boundary of "quantized" pattern is not clear because DQ now belongs to multiple quantized patterns. This poses challenge for: - Porting metadata: which "quantized" partition this DQ node belongs - Quantized representation, equivalently, needs to identify self-contained quantized pattern that is replaced by its equivalent pattern that captures compute in the quantized precision. Test Plan: test_duplicate_dq_pass Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48663147](https://our.internmc.facebook.com/intern/diff/D48663147) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107900 Approved by: https://github.com/jerryzh168, https://github.com/andrewor14, https://github.com/leslie-fang-intel ghstack dependencies: #107105, #107106, #107899	2023-09-02 06:20:03 +00:00
Wanchao Liang	a29b9101fa	[dynamo] fix dynamo + DTensor to work with 2d (#108329 ) pair debugged with @wconstab and we found some issue in both dynamo and the TP's fsdp extension side. This PR fixes the dynamo + DTensor integration so that the current graph break FSDP can work with tensor parallel by moving the torch.compile after FSDP wrapping. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108329 Approved by: https://github.com/Skylion007, https://github.com/wconstab	2023-08-31 22:46:26 +00:00
ydwu4	49e964cad6	Automatically turn on dynamo in cond (#108028 ) A replacement of https://github.com/pytorch/pytorch/pull/107932. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108028 Approved by: https://github.com/zou3519 ghstack dependencies: #108025, #108026, #108027	2023-08-28 10:16:41 +00:00
Tugsbayasgalan Manlaibaatar	20c5add133	[export] Refactor `constrain_as_value` and `constrain_as_size` (#106591 ) Some notable changes: 1. `constrain_as_size` allows min value to be less than 2 as it will unconditionally assume min >= 2 for compiler purposes. Instead, we add additional check to make sure max value is always greater than 2. 2. Previously, we used to runtime assert on the unbacked symint's val range which would be always between [2, max]. I modified this logic to assert on [0, max] unless user explicitly specifies the min range. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106591 Approved by: https://github.com/gmagogsfm, https://github.com/ezyang	2023-08-15 05:41:43 +00:00
Zhengxu Chen	547ccae0db	[export] Support preserving calling convention to some modules. (#106798 ) Summary: APS use this feature to swap out some submodules after unflattening. Test Plan: test_export_preserve_signature Differential Revision: D48154341 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106798 Approved by: https://github.com/tugsbayasgalan	2023-08-11 21:17:45 +00:00
PyTorch MergeBot	745d29b0cc	Revert "[export] Refactor `constrain_as_value` and `constrain_as_size` (#106591 )" This reverts commit `18989890bf`. Reverted https://github.com/pytorch/pytorch/pull/106591 on behalf of https://github.com/izaitsevfb due to Breaks inductor test on trunk ([comment](https://github.com/pytorch/pytorch/pull/106591#issuecomment-1675069091))	2023-08-11 16:37:47 +00:00
Tugsbayasgalan Manlaibaatar	18989890bf	[export] Refactor `constrain_as_value` and `constrain_as_size` (#106591 ) Some notable changes: 1. `constrain_as_size` allows min value to be less than 2 as it will unconditionally assume min >= 2 for compiler purposes. Instead, we add additional check to make sure max value is always greater than 2. 2. Previously, we used to runtime assert on the unbacked symint's val range which would be always between [2, max]. I modified this logic to assert on [0, max] unless user explicitly specifies the min range. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106591 Approved by: https://github.com/gmagogsfm, https://github.com/ezyang	2023-08-11 05:29:22 +00:00
kshitij12345	cce2c52b0b	[pt2] support vmap (#101707 ) Teach dynamo about `vmap` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101707 Approved by: https://github.com/zou3519	2023-08-09 03:39:33 +00:00
Kshiteej K	af78e139a8	[functorch] fix dynamo support for functorch.grad (#106610 ) Ref: https://github.com/pytorch/pytorch/pull/106475#discussion_r1282384503 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106610 Approved by: https://github.com/zou3519	2023-08-07 17:44:49 +00:00
Michael Voznesensky	8549abc347	Grab bag of DTensor enablement stuff (Enable whole graph capture for DTensor) (#105787 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105787 Approved by: https://github.com/ezyang	2023-07-30 00:17:45 +00:00
Jerry Zhang	3a77f9aaaf	[quant][api] Move torch.ao.quantization.pt2e.quantizer to torch.ao.quantization.quantizer (#105885 ) Summary: moving quantizer to torch.ao.quantization to make it a public api, since pt2e is a folder for implementations Test Plan: CIs sanity check: "buck test //executorch/backends/xnnpack/test:test_xnnpack_quantized_models -- test_resnet18" Differential Revision: D47727838 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105885 Approved by: https://github.com/andrewor14	2023-07-26 18:20:09 +00:00
PyTorch MergeBot	6dd4b99ec2	Revert "Disable torchrec/sparse from top-level Dynamo tracing (#105733 )" This reverts commit `60d5efdb15`. Reverted https://github.com/pytorch/pytorch/pull/105733 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/105733#issuecomment-1650931609))	2023-07-26 03:44:47 +00:00
Edward Z. Yang	60d5efdb15	Disable torchrec/sparse from top-level Dynamo tracing (#105733 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105733 Approved by: https://github.com/voznesenskym	2023-07-22 02:00:36 +00:00
Jerry Zhang	dff4e034b8	[quant][pt2e][be] Rename qnnpack quantizer to xnnpack quantizer (#105551 ) Summary: att Test Plan: sandcastle CI and OSS CI Reviewed By: andrewor14 Differential Revision: D47422894 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105551 Approved by: https://github.com/andrewor14	2023-07-20 03:52:40 +00:00
Jerry Zhang	7b4d080496	[quant][pt2e] Rename _pt2e to pt2e (#104668 ) Summary: X-link: https://github.com/pytorch/executorch/pull/3 att Test Plan: Imported from OSS Differential Revision: D47202807 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104668 Approved by: https://github.com/andrewor14	2023-07-15 06:34:17 +00:00
Andrew Or	4b29829ece	[quant][pt2] Fix QAT convert for mobilenetv2 (#104110 ) Summary: QAT convert for mobilenetv2 was previously not working because we incorrectly applied dropout during eval as well as training. This is because, for exported models, model.eval() does not change the behavior of dropout, unlike models with torch ops. This commit simulates the effects of model.eval() for exported models as well by replacing the aten dropout pattern before eval. As of this commit, end-to-end QAT numerics now match for mobilenetv2 between FX and PT2. Test Plan: python test/test_quantization.py TestQuantizePT2EModels.test_qat_mobilenet_v2 Differential Revision: D46750343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104110 Approved by: https://github.com/jerryzh168	2023-07-11 18:42:42 +00:00
Jerry Zhang	c98896b76f	[quant][pt2e] Add more precise representation for quantized add (#104130 ) Summary: The planned e2e for quantization in pytorch 2.0 export is the following: float_model -> prepare_pt2e -> calibration -> convert_pt2e -> ... inside convert_pt2e, we will first produce a q/dq representation of the quantized model, similar to the previous output of convert_to_reference_fx in fx grah mode quantization: ``` torch.ops.quantized_decomposed.dequantize_per_tensor -> torch.ops.aten.add -> torch.ops.quantized_decomopsed.quantize_per_tensor torch.ops.quantized_decomposed.dequantize_per_tensor / ``` Then we'll rewrite the above to a more precise representation that express the intention in a more precise manner, since here we actually want to do int8 addition, instead of simulating the int8 addition with fp32 operations, the representation for quantized add is: ``` def quantized_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point, out_scale, out_zero_point): x = (x_scale / out_scale) * x_i8 y = (y_scale / out_scale) * y_i8 out = x + y out -= (x_zero_point * x_scale - y_zero_point * y_scale) / out_scale out += out_zero_point return out ``` Test Plan: ``` buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_add (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' ``` Reviewed By: kimishpatel Differential Revision: D45628032 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104130 Approved by: https://github.com/kimishpatel	2023-06-27 20:11:30 +00:00
Animesh Jain	75dab587ef	[dynamo] FSDP + AC + torch.compile (#103953 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103953 Approved by: https://github.com/wanchaol	2023-06-24 01:40:56 +00:00
kshitij12345	d552c271db	[pt2] grad support (#102264 ) Teach dynamo about grad Pull Request resolved: https://github.com/pytorch/pytorch/pull/102264 Approved by: https://github.com/zou3519	2023-06-21 10:13:09 +00:00
PyTorch MergeBot	e737a8486f	Revert "[pt2] grad support (#102264 )" This reverts commit `85b83954c8`. Reverted https://github.com/pytorch/pytorch/pull/102264 on behalf of https://github.com/huydhn due to This is failing in trunk `85b83954c8` and looks like a landrace ([comment](https://github.com/pytorch/pytorch/pull/102264#issuecomment-1600001309))	2023-06-21 03:02:55 +00:00
kshitij12345	85b83954c8	[pt2] grad support (#102264 ) Teach dynamo about grad Pull Request resolved: https://github.com/pytorch/pytorch/pull/102264 Approved by: https://github.com/zou3519	2023-06-21 01:37:08 +00:00
Zhengxu Chen	26bf8894b6	[export] Replicate exportdb examples and tests in oss. (#102769 ) Summary: Initial work to copy source to OSS for exportdb and make sure tests can run properly. Test Plan: test_export Differential Revision: D46369152 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102769 Approved by: https://github.com/angelayi	2023-06-04 20:01:57 +00:00
Michael Lazos	c75e064dd6	Disallow _foreach_utils.py, but allow it to be inlined (#102221 ) This function should not be allowed, but should be inlineable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102221 Approved by: https://github.com/anijain2305	2023-06-02 05:14:09 +00:00
PyTorch MergeBot	8aa48315de	Revert "Disallow _foreach_utils.py, but allow it to be inlined (#102221 )" This reverts commit `552299c42c`. Reverted https://github.com/pytorch/pytorch/pull/102221 on behalf of https://github.com/huydhn due to Sorry for reverting your PR. It starts to break dynamo jobs in trunk `552299c42c` and it looks like a landrace ([comment](https://github.com/pytorch/pytorch/pull/102221#issuecomment-1563694599))	2023-05-26 01:27:19 +00:00
Michael Lazos	552299c42c	Disallow _foreach_utils.py, but allow it to be inlined (#102221 ) This function should not be allowed, but should be inlineable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102221 Approved by: https://github.com/anijain2305	2023-05-25 23:48:36 +00:00
Kimish Patel	24e9b8f5f4	[PT2E][Quant] Use subgraph matcher annotate linear pattern (#100566 ) This diff adds subgraph matcher for pattern matching. Furthermore, we also move annotations for the matched subgraph in a way that only input and output nodes of the matched subgraph have quantization related valid annotations. Differential Revision: [D45535539](https://our.internmc.facebook.com/intern/diff/D45535539/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100566 Approved by: https://github.com/jerryzh168	2023-05-04 21:31:59 +00:00
andrewor14	9cda7b9e47	[hotfix] Do not import torch.ao.quantization._pt2e from dynamo (#100194 ) Summary: Importing torch.ao.quantization._pt2e from dynamo led to internal test failures related to memory profiling. For now, let's express the path using a simple string instead. Reviewers: jerryzh168, kimishpatel Pull Request resolved: https://github.com/pytorch/pytorch/pull/100194 Approved by: https://github.com/jerryzh168	2023-04-28 01:32:23 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	02f059c2b7	Add private _export API (#99992 ) Differential Revision: D45279206 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99992 Approved by: https://github.com/angelayi, https://github.com/gmagogsfm	2023-04-27 16:24:14 +00:00
Edward Z. Yang	3a5427baf4	Add torch.utils._content_store (#99809 ) Implements a simple content-addressable store for storages (with tensors implemented as cheap references on top), enabling incremental serialization of tensors to disk, which I intend to use in the accuracy repro extractor. Check the comment at the top of torch/utils/_content_store.py for more details on the intended use case. One major piece of this PR is implementing the content hash for tensors. For our prospective use case, we may need to repeatedly hash up to 80 GB of tensor data every time we snapshot (and we may snapshot multiple times). Using a conventional cryptographic hash and hashing each snapshot would likely take on order of minutes, which seemed too slow to me. So instead, I implemented a crappy hash function that can be run on GPU. It is at least somewhat theoretically grounded: using random parameters generated by Philox, we use the standard shift-multiply and xor sum universal hash family. The hash function is a bit dorky though; instead of properly doing 160-bit math, it just runs 32-bit hash five times and cats them together. By the way, this sets the first precedent for kernel in PyTorch library which MUST be torch.compile'd to be run (in fact, this kernel does not run in eager mode because of the use of xor_sum, which doesn't actually exist in ATen.) I had to add a few more primitives to inductor, namely randint (over the entire int range) and xor_sum. Fortunately, these primitives are natively supported by Triton/C++, and so they were very easy to plumb through. xor_sum is exposed as a prim, while randint special cases on when low/high span the entire 32-bit signed integer range. Thanks to Jeff Johnson for letting me bounce ideas of him on a Saturday morning lol. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99809 Approved by: https://github.com/voznesenskym	2023-04-26 18:02:59 +00:00

1 2

78 Commits