pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Pian Pawakapan	284b766898	[dynamic shapes] C++ bindings for guard_or_false/true (#150148 ) C++ version. Would like to add it in one place to prove it works, but couldn't find one that doesn't expose a chain of data-dependent changes... so just gonna put up the base implementation Pull Request resolved: https://github.com/pytorch/pytorch/pull/150148 Approved by: https://github.com/laithsakka, https://github.com/jingsh	2025-03-31 17:04:25 +00:00
Yuxin Wu	40ec9d2bfa	avoid allocation when tensor_new from storage (#149797 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149797 Approved by: https://github.com/Skylion007	2025-03-24 20:02:45 +00:00
cyy	8fa81a6066	Enable misc-use-internal-linkage check and apply fixes (#148948 ) Enables clang-tidy rule [`misc-use-internal-linkage`](https://clang.llvm.org/extra/clang-tidy/checks/misc/use-internal-linkage.html). This new check was introduced in Clang-Tidy 18 and is available due to the recent update to Clang-Tidy 19. The check marks functions and variables used only in the translation unit as static. Therefore undesired symbols are not leaked into other units, more link time optimisations are possible and the resulting binaries may be smaller. The detected violations were mostly fixed by using static. In other cases, the symbols were indeed consumed by other files, so their declaring headers were included. Still some declarations were wrong and have been fixed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/148948 Approved by: https://github.com/Skylion007	2025-03-12 14:22:56 +00:00
Wei-Sheng Chin	9c9b05bc4f	Expose functions used in custom backend in torch_python dll (#148213 ) Fixes #148208. There are solutions for exposing symbols implicitly from inline functions (i.e., inline function A calls non-inline function B in foo.h. Code that includes foo.h has to see the symbol B in the DLL). Solution 1: tag the entire struct where the inline functions are defined as member functions with TORCH_PYTHON_API --- this PR does this for python_arg_parser.h. An alternative solution exists but will slow down dispatching a lot --- drop the inline keyword and move the implementation to a .cc file. Solution 2: tag individual functions with TORCH_PYTHON_API. This PR does this for python_tensor.h. Related discussion about hiding torch_python symbols: https://github.com/pytorch/pytorch/pull/142214 Pull Request resolved: https://github.com/pytorch/pytorch/pull/148213 Approved by: https://github.com/malfet	2025-03-07 02:34:37 +00:00
Zhengxu Chen	915b9c80ab	[export] Sync aoti schema to schema.py (#148017 ) Summary: Synchronizing internal AOTI schema to OSS schema.py Test Plan: CI Differential Revision: D70271151 Pull Request resolved: https://github.com/pytorch/pytorch/pull/148017 Approved by: https://github.com/yiming0416	2025-02-27 21:46:11 +00:00
vasiliy	382fbcc1e4	add the `torch.float8_e8m0fnu` dtype to PyTorch (#147466 ) Summary: Continuing the work from https://github.com/pytorch/pytorch/pull/146427 Adds the `torch.float8_e8m0fnu` dtype to PyTorch, as detailed in https://github.com/pytorch/pytorch/issues/146414 . Please see the issue for a detailed definition of the format. Example of basic functionality: ```python import torch # round trip x0 = torch.randn(4, 4, dtype=torch.float32) x1 = x0.to(torch.float8_e8m0fnu) # RNE rounding x2 = x1.to(torch.float32) # 2 ** exponent # creation with empty x0 = torch.empty(4, 4, dtype=torch.float8_e8m0fnu) # printing print(x0) ``` Done in this PR: * numerical correctness * op coverage (except for `torch._scaled_mm`): create tensor, cast to/from float32 * printing a tensor works For future PRs: * performance optimizations for casting * torch._scaled_mm * PT2 * various cleanups (detailed in comments with issue numbers) Test Plan: ``` pytest test/quantization/core/experimental/test_float8.py -s ``` Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/147466 Approved by: https://github.com/drisspg	2025-02-20 13:55:42 +00:00
Zhengxu Chen	0b84311842	[export] Generate printers/parsers for serialization enum values. (#147126 ) Summary: Generate two helper functions for enum classes in generated_serialization_types.h printEnum: will convert enum values into strings. parseEnum: will convert strings into enum values. Test Plan: CI Differential Revision: D69604850 Pull Request resolved: https://github.com/pytorch/pytorch/pull/147126 Approved by: https://github.com/yiming0416	2025-02-14 02:14:35 +00:00
Zhengxu Chen	683bb1242c	[export][ez] Update tag_ for union setters. (#146912 ) Summary: ez fix to set tag for union type fields. Test Plan: CI Differential Revision: D69467715 Pull Request resolved: https://github.com/pytorch/pytorch/pull/146912 Approved by: https://github.com/yiming0416	2025-02-12 03:52:36 +00:00
Zhengxu Chen	664550ecbf	[export] Serialize special values of float into strings for json. (#146490 ) Summary: Currently inf is serialized as Infinity in JSON which is not standard compliant. Instead we will tweak all special floating points into strings and handle them at json layer. Test Plan: see D69060784 CI Differential Revision: D69186425 Pull Request resolved: https://github.com/pytorch/pytorch/pull/146490 Approved by: https://github.com/yiming0416	2025-02-11 20:01:27 +00:00
cyy	15635b14ce	[4/N] Remove unnecessary once flag usage (#146783 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/146783 Approved by: https://github.com/albanD	2025-02-11 13:55:06 +00:00
angelayi	0c37c332da	[export] Additionally save pytree namedtuple field names (#145956 ) If a user passes in a namedtuple as an input, currently the input TreeSpec looks like: `TreeSpec(type=namedtuple, context="class_fqn", children_spec=[, ])` The user then saves the program containing this input TreeSpec. But what happens if they load it in a new environment where `class_fqn` now contains an additional field? This means that the exported program is now expected to take in another input. But since those fields were not used in the original program, users should be able to just drop those additional fields and the program will run successfully. This is needed/used in APS where they use unflattener's adapter to adapt the inputs based on the previously saved treespecs. There are a couple of [solutions](https://docs.google.com/document/d/1V4ZSdy-8PUISWc8RqvGu3DU01BVegJhHHPWqa1Io7Eg/edit?tab=t.0) for how we can address this, but eventually we settled on saving a side table mapping namedtuple types to their list of field names, which can then be accessed by the adapter. Pull Request resolved: https://github.com/pytorch/pytorch/pull/145956 Approved by: https://github.com/zhxchen17	2025-02-04 04:42:30 +00:00
Zhengxu Chen	1580f47bf4	[export][ez] Fix generated header file. (#146208 ) Summary: as title. Test Plan: CI Differential Revision: D68978788 Pull Request resolved: https://github.com/pytorch/pytorch/pull/146208 Approved by: https://github.com/yiming0416	2025-02-03 06:01:05 +00:00
Zhengxu Chen	aad9f44b2e	[export] Sync model container types to schema.py (#145959 ) Summary: Synced from D68840230 Test Plan: No behavior changes to existing API. Will be tested internally. Differential Revision: D68846532 Pull Request resolved: https://github.com/pytorch/pytorch/pull/145959 Approved by: https://github.com/yiming0416	2025-01-31 18:17:56 +00:00
cyy	116af809eb	Use std::string_view (#145906 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/145906 Approved by: https://github.com/albanD	2025-01-30 03:14:27 +00:00
cyyever	ef28df5c9e	[Reland][Environment Variable][4/N] Use thread-safe getenv functions (#140593 ) Reland of #137843 , after checking the code again. Pull Request resolved: https://github.com/pytorch/pytorch/pull/140593 Approved by: https://github.com/albanD Co-authored-by: albanD <desmaison.alban@gmail.com>	2025-01-28 20:51:49 +00:00
wengshiy	73622fc5fa	Fix Throughputbenchmark issue (#144669 ) Fixes [144461](https://github.com/pytorch/pytorch/issues/144461) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144669 Approved by: https://github.com/leslie-fang-intel, https://github.com/williamwen42, https://github.com/jansel	2025-01-26 03:37:20 +00:00
Yichen Yan	d4171b724e	Let `tensor_a.new_tensor()` be on `tensor_a.device` by default (#144958 ) Fixes #144957 Closes #73838 cc @albanD @ezyang Currently, `tensor_a.new_tensor()` returns an on-CPU tensor no matter where `tensor_a` is. This differs from the documentation and is a side effect of https://github.com/pytorch/pytorch/pull/41984. See #144957 for how the current logic breaks dynamo. This PR restores the documented behavior and adds tests for `new_tensor`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144958 Approved by: https://github.com/ezyang	2025-01-24 22:12:31 +00:00
Edward Z. Yang	b3e90c8c33	Add support for torch function on dtype arguments (#145085 ) Along the lines of https://github.com/pytorch/pytorch/issues/119194 although it doesn't actually address the FCD case. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/145085 Approved by: https://github.com/vmoens, https://github.com/Skylion007	2025-01-21 17:44:47 +00:00
garfield1997	3a5bf0bc36	expose extra torch_python apis (#144746 ) Fixes #144302 After checking the code of my third-party devices, I think these APIs are also relied on by us, so I exposed them according to the discussion in the issue. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144746 Approved by: https://github.com/albanD	2025-01-16 20:50:31 +00:00
Brian Hirsh	4831f89790	support numbers as tensors for aten.copy(Tensor, Tensor) (#141161 ) Fixes https://github.com/pytorch/pytorch/issues/141149. `aten.copy_` supports numbers as tensors in the python arg parser. So we need to give the same treatment to `aten.copy`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/141161 Approved by: https://github.com/ezyang	2025-01-16 00:08:25 +00:00
Zhengxu Chen	834086c023	[export] Load side info about pos/kw argument kind for serialization. (#144686 ) Summary: Fixing issue of nodes like ``` torch.ops.aten.linear.default(x, w, b) ``` being deserialized as ``` torch.ops.aten.linear.default(x, w, bias=b) ``` which breaks roundtripping. Test Plan: buck test mode/opt caffe2/test:test_export -- -r TestDeserialize buck test mode/opt caffe2/test:test_export -- -r TestSerialize Differential Revision: D67991410 Pull Request resolved: https://github.com/pytorch/pytorch/pull/144686 Approved by: https://github.com/angelayi	2025-01-15 19:08:38 +00:00
dilililiwhy	7c52c97a65	Expose several APIs to public (torch python APIs) (#144525 ) Fixes #144302 Try to expose several APIs to public for privateuse1 scenario. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144525 Approved by: https://github.com/cyyever, https://github.com/albanD	2025-01-15 14:34:45 +00:00
Yiming Zhou	87843ee9ab	[export] Unify single and multiple return for hops (#143227 ) Summary: Introduce `is_hop_single_tensor_return` field to the `Node` class in serialization so that during deserialization when there is a single return, we know whether it is a tuple of a single element or a single element. Test Plan: ``` buck2 run @mode/dev-nosan sigmoid/inference/test:e2e_test_cpu -- -r E2ETestCPUCond buck2 run @mode/dev-nosan sigmoid/inference/test:test_passes -- -r test_const_folding2 ``` Differential Revision: D66991624 Pull Request resolved: https://github.com/pytorch/pytorch/pull/143227 Approved by: https://github.com/zhxchen17	2025-01-13 03:31:14 +00:00
Aaron Gokaslan	bbec35f028	[BE]: Replace clone detach with detach clone to be more efficient (#144469 ) Follow up to #144270 and fix some vulkan code Pull Request resolved: https://github.com/pytorch/pytorch/pull/144469 Approved by: https://github.com/awgu	2025-01-09 18:28:39 +00:00
cyy	b0be30dd79	[19/N] Fix extra warnings brought by clang-tidy-17 (#144448 ) Apply more clang-tidy fixes. There was a bug introduced by #144014 due to incorrect namespace concatenation which is reverted here. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144448 Approved by: https://github.com/albanD	2025-01-09 15:58:05 +00:00
cyy	d0070ca07e	[18/N] Fix extra warnings brought by clang-tidy-17 (#144014 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/144014 Approved by: https://github.com/Skylion007, https://github.com/albanD	2025-01-08 17:21:55 +00:00
Aaron Gokaslan	e4a05dec0f	[BE][Ez]: Fix docs recommending inefficient tensor op order (#144270 ) `detach().clone()` is faster than `.clone().detach()` since the gradients are not cloned. Let's update all the documentation and tests so that users do not use the inefficient op ordering. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144270 Approved by: https://github.com/awgu, https://github.com/XuehaiPan	2025-01-07 17:31:32 +00:00
cyy	af629a8146	Enable readability-redundant-declaration (#143982 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/143982 Approved by: https://github.com/Skylion007	2024-12-31 00:20:10 +00:00
cyy	dca443835e	Enable more readability-redundant checks (#143963 ) They are helpful for simplifying code. Pull Request resolved: https://github.com/pytorch/pytorch/pull/143963 Approved by: https://github.com/albanD	2024-12-30 14:49:33 +00:00
Aaron Orenstein	9bf4b1c2e9	dynamo tracing perf: c++ strip_function_call: 49.12 -> 47.77 (#143063 ) See #143056 for overall docs. This PR: Convert `strip_function_call()` into C++ Pull Request resolved: https://github.com/pytorch/pytorch/pull/143063 Approved by: https://github.com/jansel ghstack dependencies: #143057, #143062	2024-12-22 06:38:46 +00:00
Shangdi Yu	c17a07ade3	Add float8 support in serde schema (#143343 ) Summary: Fix https://github.com/pytorch/pytorch/issues/141316 Bump up schema minor version. as title, add float8 support in serde schema Test Plan: ``` buck2 run 'fbcode//mode/dev-nosan' fbcode//caffe2/test:test_export -- -r test_serialize_float8 ``` Differential Revision: D67307670 Pull Request resolved: https://github.com/pytorch/pytorch/pull/143343 Approved by: https://github.com/yiming0416	2024-12-18 05:07:21 +00:00
Manav Avlani	aabe285aaf	Add 2 more APIs to the exposed public torch python APIs (#143380 ) These two APIs are being used internally for some projects and need to be exposed as the build for this is done using OSS toolchain. `af8789c056` - this change hid most apis in torch python barring the ones explicitly specified breaking the build. Pull Request resolved: https://github.com/pytorch/pytorch/pull/143380 Approved by: https://github.com/suo	2024-12-17 21:16:51 +00:00
Yu, Guangye	9706ada369	[RELAND] Add device-agnostic runtime Device/Stream C++ API (#138677 ) # Motivation This PR intends to add C++ accelerator device-agnostic APIs. # Additional Context This PR is relanded. It is reverted because `torch.Event` doesn't support mps backend. We have fixed it in https://github.com/pytorch/pytorch/pull/142468. The previous commit is `f84e533a2c` Pull Request resolved: https://github.com/pytorch/pytorch/pull/138677 Approved by: https://github.com/albanD, https://github.com/EikanWang ghstack dependencies: #143171, #133572	2024-12-16 02:18:41 +00:00
Yu, Guangye	c1d4d9d3cf	[MPS] Support torch.accelerator.synchronize() on mps (#143171 ) # Motivation Support `torch.accelerator.synchronize()` on mps. The root cause is that MPS doesn't support lazy initialization. So we must check if the current accelerator supports device lazy initialization rather than early return. # Additional Context Add a mps UT to test code change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/143171 Approved by: https://github.com/albanD	2024-12-16 02:18:32 +00:00
cyy	af8789c056	Hide torch_python symbols (#142214 ) Change symbols in torch_python to invisible by default on platforms other than Apple. Pull Request resolved: https://github.com/pytorch/pytorch/pull/142214 Approved by: https://github.com/ezyang	2024-12-16 00:59:26 +00:00
PyTorch MergeBot	cf538efd0c	Revert "Hide torch_python symbols (#142214 )" This reverts commit `da76e912a4`. Reverted https://github.com/pytorch/pytorch/pull/142214 on behalf of https://github.com/huydhn due to The MacOS failure looks legit as it shows up in trunk ([comment](https://github.com/pytorch/pytorch/pull/142214#issuecomment-2539543504))	2024-12-12 17:15:51 +00:00
cyyever	da76e912a4	Hide torch_python symbols (#142214 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/142214 Approved by: https://github.com/ezyang	2024-12-12 07:00:54 +00:00
PyTorch MergeBot	dfe5669076	Revert "[RELAND] Add device-agnostic runtime Device/Stream C++ API (#138677 )" This reverts commit `734bb01460`. Reverted https://github.com/pytorch/pytorch/pull/138677 on behalf of https://github.com/huydhn due to Sorry for reverting your change but the new test is still very flaky on MacOS even when it does not segfault anymore ([comment](https://github.com/pytorch/pytorch/pull/133572#issuecomment-2537256522))	2024-12-11 21:47:17 +00:00
Yu, Guangye	734bb01460	[RELAND] Add device-agnostic runtime Device/Stream C++ API (#138677 ) # Motivation This PR intends to add C++ accelerator device-agnostic APIs. # Additional Context This PR is relanded. It is reverted because `torch.Event` doesn't support mps backend. We have fixed it in https://github.com/pytorch/pytorch/pull/142468. The previous commit is `f84e533a2c` Pull Request resolved: https://github.com/pytorch/pytorch/pull/138677 Approved by: https://github.com/albanD, https://github.com/EikanWang ghstack dependencies: #142468, #133572	2024-12-11 02:04:52 +00:00
cyy	7d98b3dcee	[3/N] Apply bugprone-unchecked-optional-access (#142442 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/142442 Approved by: https://github.com/albanD	2024-12-11 01:39:10 +00:00
Zhengxu Chen	1986b46d63	[export] Change Tuple[()] to bool in schema to sync with thrift. (#142257 ) Summary: In thrift schema, we represent every None value as "True/False" while we represent None as () in OSS schema. This will cause some inconsistency between the type systems and the simplest thing to do here is changing Tuple[()] to bool in oss schema. This change should NOT cause version bump, because on deserializer side we never read the value from as_none fields, as it doesn't have real meaning. Therefore this schema change should be considered as safe. Test Plan: CI Reviewed By: SherlockNoMad Differential Revision: D66888892 Pull Request resolved: https://github.com/pytorch/pytorch/pull/142257 Approved by: https://github.com/yiming0416, https://github.com/hl475	2024-12-10 17:13:35 +00:00
PyTorch MergeBot	adbfdbd6a0	Revert "Add device-agnostic runtime Device/Stream C++ API (#138677 )" This reverts commit `f84e533a2c`. Reverted https://github.com/pytorch/pytorch/pull/138677 on behalf of https://github.com/malfet due to Sorry for reverting your PR, but it segfaults on MacOS ([comment](https://github.com/pytorch/pytorch/pull/133572#issuecomment-2530354401))	2024-12-10 04:42:55 +00:00
Yu, Guangye	f84e533a2c	Add device-agnostic runtime Device/Stream C++ API (#138677 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/138677 Approved by: https://github.com/albanD, https://github.com/EikanWang ghstack dependencies: #133572	2024-12-07 13:14:10 +00:00
Zhengxu Chen	1a7da6e7e9	[export] Add test to enforce consistency between synced thrift and generated thrift from schema.py (#141989 ) Summary: In this diff we implement a way to ensure the internal thrift schema from cfgr (configerator/structs/caffe2/torch/export/schema.thrift) and the schema in OSS (torch/_export/serde/schema.thrift) are in sync, by adding a unittest to reflect on the type names and fields from each schema and compare them field by field. When we detect new fields/types from torch/_export/serde/schema.thrift, there'll be a test failure on the trunk and the error message hints people to add the missing field/type to the thrift schema from cfgr, so that they are always in sync in practice. Test Plan: buck test mode/opt caffe2/test:test_export -- -r test_thrift_schema_in_sync Differential Revision: D66716834 Pull Request resolved: https://github.com/pytorch/pytorch/pull/141989 Approved by: https://github.com/yiming0416	2024-12-06 18:42:20 +00:00
bhack	ae9cda0221	Add `truediv` support in export serializer (#136364 ) Fixes #136113 - [x] Initial `truediv` coverage - [ ] Expand/reduce coverage? - [x] Add tests - [x] Re-check docstrings - [ ] Linting Pull Request resolved: https://github.com/pytorch/pytorch/pull/136364 Approved by: https://github.com/pianpwk Co-authored-by: Angela Yi <angelayi@meta.com> Co-authored-by: Pian Pawakapan <pianpwk@meta.com>	2024-12-05 17:33:33 +00:00
PyTorch MergeBot	09ce760fef	Revert "Add missing data types at torch export serialization (#138561 )" This reverts commit `1ef1b3b391`. Reverted https://github.com/pytorch/pytorch/pull/138561 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/138561#issuecomment-2513343401))	2024-12-03 01:32:50 +00:00
cyy	96be048f06	[1/N] Avoid copy in std::get (#141812 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/141812 Approved by: https://github.com/Skylion007	2024-12-01 03:53:35 +00:00
yintong-lu	1ef1b3b391	Add missing data types at torch export serialization (#138561 ) Related to #131654 Added missing FP8 data types at torch export serialization. Added test cases of FP8 data types. Pull Request resolved: https://github.com/pytorch/pytorch/pull/138561 Approved by: https://github.com/jerryzh168, https://github.com/jgong5	2024-11-28 08:35:03 +00:00
PyTorch MergeBot	6e61ff4fd3	Revert "Add `truediv` support in export serializer (#136364 )" This reverts commit `1df440dc4e`. Reverted https://github.com/pytorch/pytorch/pull/136364 on behalf of https://github.com/huydhn due to Sorry for reverting your change but its doc build failure is legit ([comment](https://github.com/pytorch/pytorch/pull/136364#issuecomment-2502620732))	2024-11-27 03:24:31 +00:00
bhack	1df440dc4e	Add `truediv` support in export serializer (#136364 ) Fixes #136113 - [x] Initial `truediv` coverage - [ ] Expand/reduce coverage? - [x] Add tests - [x] Re-check docstrings - [ ] Linting Pull Request resolved: https://github.com/pytorch/pytorch/pull/136364 Approved by: https://github.com/pianpwk Co-authored-by: Angela Yi <angelayi@meta.com> Co-authored-by: Pian Pawakapan <pianpwk@meta.com>	2024-11-27 00:31:47 +00:00
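
Note on 664550ecbf (#146490): the commit message says that `inf` was being written to JSON as `Infinity`, which is not standard JSON, and that special floating-point values are now turned into strings and handled at the JSON layer. The sketch below illustrates that general idea using only the Python standard library. It is not the code from the PR: the helper names `encode_float` and `decode_float` and the exact string spellings are assumptions made for illustration.

```python
import json
import math

# Hypothetical string spellings for the special values (an assumption,
# not taken from the PR).
_SPECIAL = {"Infinity": math.inf, "-Infinity": -math.inf, "NaN": math.nan}


def encode_float(x: float):
    """Return a JSON-safe value: special floats become strings."""
    if math.isnan(x):
        return "NaN"
    if math.isinf(x):
        return "Infinity" if x > 0 else "-Infinity"
    return x


def decode_float(v) -> float:
    """Inverse of encode_float: strings map back to special floats."""
    if isinstance(v, str):
        return _SPECIAL[v]
    return float(v)


# allow_nan=False makes json.dumps raise on a raw inf/nan, so the payload
# is guaranteed to be standard-compliant JSON.
payload = json.dumps({"value": encode_float(math.inf)}, allow_nan=False)
restored = decode_float(json.loads(payload)["value"])
```

Encoding at the value level keeps the serialized file readable by any strict JSON parser, at the cost of the reader having to know which fields are floats.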

1226 Commits
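
Note on 0c37c332da (#145956): the commit message describes saving a side table that maps namedtuple types to their field names, so that an adapter can drop fields added to the class after the program was saved. The following is a minimal sketch of that idea in plain Python. The functions `record_field_names` and `adapt_input` are hypothetical and do not reflect the actual torch.export or pytree API.

```python
from collections import namedtuple


def record_field_names(nt_type, table: dict) -> str:
    """At save time, store the namedtuple's field names under its qualified name."""
    fqn = f"{nt_type.__module__}.{nt_type.__qualname__}"
    table[fqn] = list(nt_type._fields)
    return fqn


def adapt_input(value, saved_fields):
    """At load time, keep only the fields that existed when the program was saved."""
    return tuple(getattr(value, name) for name in saved_fields)


# Save time: the class has two fields.
PointV1 = namedtuple("Point", ["x", "y"])
side_table = {}
key = record_field_names(PointV1, side_table)

# Load time: the same class has gained a field that the saved program never used.
PointV2 = namedtuple("Point", ["x", "y", "z"])
adapted = adapt_input(PointV2(1, 2, 3), side_table[key])
```

Looking fields up by name rather than by position means the adapted input stays correct even if the new field was inserted somewhere other than the end.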