pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Tugsbayasgalan Manlaibaatar	5aee22e0e0	Move export.constrain_as_* to torch._constrain_as_* (#110757 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110757 Approved by: https://github.com/avikchaudhuri ghstack dependencies: #109859	2023-10-11 02:37:55 +00:00
Edward Z. Yang	24bf9aeb6b	Fix arange with dynamic end argument. (#110979 ) Fixes https://github.com/pytorch/pytorch/issues/93468 There's a few extra tests that are sort of unrelated, but I ended up writing them while working on the fix and decided to keep them. The big idea here is to split the `_check` so that `expect_true` works; I could have probably also improved the symbolic reasoning but I'm lazy. One small logging fix too. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/110979 Approved by: https://github.com/Skylion007	2023-10-11 00:32:34 +00:00
Jon Chuang	6e770c0dda	[dynamo] Add `itertools.repeat` via polyfill (#110953 ) Fixes https://github.com/pytorch/pytorch/issues/110286 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110953 Approved by: https://github.com/ezyang	2023-10-10 20:40:33 +00:00
Jon Chuang	844ea6408b	feat(dynamo): handle accumulate kwargs ("func", "initial") (#110686 ) Follow up to: https://github.com/pytorch/pytorch/pull/110683 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110686 Approved by: https://github.com/ezyang	2023-10-08 07:06:52 +00:00
cdzhan	fa8e4ea212	Add support for hasattr on ListVariable (#110438 ) Fixes #109502 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110438 Approved by: https://github.com/jansel	2023-10-08 05:34:00 +00:00
Animesh Jain	58637c4b43	[dynamo] Remove SuperSource (#110475 ) The motivation for removing this is already present in the pre-PR comments. Copying it ~~~ # NB - SuperSource is a weird one. # it is our only source with 2 bases, so we use the objec # as the base, rather than the type, since an invocation # like super(Foo, foo) is represented here, the source object base is more spiritually # aligned with the instance, rather than the type. # This whole construction is questionable tho, and we should probably find a way to # avoid this exception to our otherwise nice source parentage invariant. ~~~ Instead of using super(a, b), we can use `type(b).__mro__[index]`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110475 Approved by: https://github.com/jansel	2023-10-08 04:45:06 +00:00
Jon Chuang	9b55194f81	fix(dynamo): Incorrect `accumulate` implementation, bad tests (#110683 ) Root cause of: https://github.com/pytorch/pytorch/issues/110287 Fixed many tests that didn't actually test due to unreliability of `CompileCounter.frame_count` in detecting graph breaks: https://github.com/pytorch/pytorch/issues/110730 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110683 Approved by: https://github.com/voznesenskym	2023-10-06 23:07:56 +00:00
William Wen	71beca4899	[dynamo, logging] Report name of defining class along side function name in Dynamo logs (#110190 ) Implement https://github.com/pytorch/pytorch/issues/109236 Sample code: ```python import torch class AAA: class DUMMY: class DUMMY2: pass def dummy(self): def dummy2(): pass class BBB: @staticmethod def CCC(): class DDD: if True: @staticmethod def EEE(): x = [torch.ones(3, 3) for _ in range(5)] return x return DDD def fn(): return AAA.BBB.CCC().EEE() opt_fn = torch.compile(fn, backend="eager") opt_fn() ``` Logs: ```bash $TORCH_LOGS="trace_source" python playground2.py [2023-09-27 17:38:35,641] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:21 in fn (fn) [2023-09-27 17:38:35,641] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] def fn(): [2023-09-27 17:38:35,642] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:22 in fn (fn) [2023-09-27 17:38:35,642] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] return AAA.BBB.CCC().EEE() [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:11 in CCC (AAA.BBB) (inline depth: 1) [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] @staticmethod [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:13 in CCC (AAA.BBB.CCC.DDD) (inline depth: 1) [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] class DDD: [2023-09-27 17:38:35,723] [1/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:17 in <listcomp> (AAA.BBB.CCC.DDD.EEE) [2023-09-27 17:38:35,723] [1/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] x = [torch.ones(3, 3) for _ in range(5)] ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/110190 Approved by: https://github.com/ezyang, https://github.com/mlazos	2023-10-05 20:41:38 +00:00
Yu Guo	2bf3ca1be7	[torchdynamo] preserve deterministic_algorithms_warn_only in convert_context (#110457 ) Summary: preserve deterministic_algorithms_warn_only in dynamo context Test Plan: modified unit tests to test warn_only Differential Revision: D49872622 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110457 Approved by: https://github.com/jansel	2023-10-04 07:12:32 +00:00
Peter Bell	a8a31bc165	[dynamo][BE] test_misc.py shouldn't change the default dtype globally (#110412 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110412 Approved by: https://github.com/jansel, https://github.com/lezcano, https://github.com/Fidget-Spinner ghstack dependencies: #110398	2023-10-03 19:25:37 +00:00
Peter Bell	dc794ec32c	[dynamo] Trace through builtin `abs` (#110398 ) In python `abs(x)` does nothing but delegate to `x.__abs__()` so we should do the same in dynamo. This also adds `SymNode.__abs__` so we can trace through indexing expressions involving `abs`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110398 Approved by: https://github.com/jansel, https://github.com/lezcano	2023-10-03 19:25:37 +00:00
Yukio Siraichi	6f48d872d0	Re-land: Break graph on `manual_seed`. (#109109 ) Re-landing: #108647 (old #107594) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109109 Approved by: https://github.com/lezcano	2023-09-28 15:28:40 +00:00
aashishthakur10	ee8983da70	109605 dynamo scalar ndarray pow gen (#109953 ) Fixes #109605 Generated code before: ``` def call(args): arg0_1, = args args.clear() assert_size_stride(arg0_1, (8, ), (1, )) buf0 = empty_strided((), (), device='cpu', dtype=torch.int64) cpp_fused_lift_fresh_0(c_void_p(buf0.data_ptr())) # Source Nodes: [wrapped_pow], Original ATen: [aten.lift_fresh, aten.pow] buf1 = aten.pow(arg0_1, reinterpret_tensor(buf0, (8, ), (0, ), 0)) del arg0_1 del buf0 buf2 = buf1 assert_size_stride(buf2, (8, ), (1, )) del buf1 return (buf2, ) ``` Generated code now: ``` def call(args): arg0_1, = args args.clear() assert_size_stride(arg0_1, (8, ), (1, )) buf0 = empty_strided((8, ), (1, ), device='cpu', dtype=torch.int64) cpp_fused_pow_0(c_void_p(arg0_1.data_ptr()), c_void_p(buf0.data_ptr())) del arg0_1 return (buf0, ) ``` @lezcano What would be a good way to add a test for this? Pull Request resolved: https://github.com/pytorch/pytorch/pull/109953 Approved by: https://github.com/lezcano	2023-09-28 13:11:06 +00:00
Yukio Siraichi	51a8c166a6	Add test for `ShapeEnv` recording fallback. (#109944 ) This PR adds a test for the previous PR in this stack: #109904. In summary, it calls functions decorated with `@record_shapeenv_event`, that don't have an explicit `ShapeEnv` parameter, with arguments that don't hold a `ShapeEnv` instance. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109944 Approved by: https://github.com/ezyang	2023-09-27 00:50:14 +00:00
PyTorch MergeBot	194d9aa0f2	Revert "[Dynamo] Match closures by code ID (#109427 )" This reverts commit `3de0857503`. Reverted https://github.com/pytorch/pytorch/pull/109427 on behalf of https://github.com/voznesenskym due to Fails test `PYTORCH_TEST_WITH_DYNAMO=1 python test_ops.py -k test_out_warning__refs_cat_cpu ([comment](https://github.com/pytorch/pytorch/pull/109427#issuecomment-1736101561))	2023-09-26 18:54:36 +00:00
PyTorch MergeBot	812bf847b7	Revert "Add test for `ShapeEnv` recording fallback. (#109944 )" This reverts commit `a4dec8d306`. Reverted https://github.com/pytorch/pytorch/pull/109944 on behalf of https://github.com/atalman due to New test failing internally ([comment](https://github.com/pytorch/pytorch/pull/109944#issuecomment-1735512734))	2023-09-26 13:11:22 +00:00
Yukio Siraichi	26e8cc0465	Add test for `ShapeEnv` state when not recording. (#109945 ) This PR adds a test for checking `ShapeEnv` state when it's built with `should_record_events=False`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109945 Approved by: https://github.com/ezyang ghstack dependencies: #109904, #109944	2023-09-26 07:20:46 +00:00
Yanbo Liang	a81cb0de16	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-26 00:03:41 +00:00
Yukio Siraichi	a4dec8d306	Add test for `ShapeEnv` recording fallback. (#109944 ) This PR adds a test for the previous PR in this stack: #109904. In summary, it calls functions decorated with `@record_shapeenv_event`, that don't have an explicit `ShapeEnv` parameter, with arguments that don't hold a `ShapeEnv` instance. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109944 Approved by: https://github.com/ezyang ghstack dependencies: #109904	2023-09-25 20:59:41 +00:00
Ken Jin	3de0857503	[Dynamo] Match closures by code ID (#109427 ) Closes https://github.com/pytorch/pytorch/issues/107866 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109427 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-09-25 19:10:35 +00:00
PyTorch MergeBot	829b5c0949	Revert "[Dynamo] Support python class member_descriptor (#109956 )" This reverts commit `12cd776d90`. Reverted https://github.com/pytorch/pytorch/pull/109956 on behalf of https://github.com/jeanschmidt due to multiple slow jobs broken ([comment](https://github.com/pytorch/pytorch/pull/109956#issuecomment-1733706269))	2023-09-25 13:25:45 +00:00
Yanbo Liang	12cd776d90	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-25 03:15:39 +00:00
Evgeni Burovski	ca5f3a7436	TST: test that numpy dtypes do not graph break (#109974 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109974 Approved by: https://github.com/lezcano	2023-09-25 01:00:39 +00:00
Animesh Jain	8ed08e5a7c	[dynamo] Graph break on rng get/set state - remove GeneratorStateSource (#109410 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109410 Approved by: https://github.com/ezyang ghstack dependencies: #109411	2023-09-22 22:31:55 +00:00
Michael Voznesensky	a902150a1e	[Easy] ConstantVariable() -> .create (#109896 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109896 Approved by: https://github.com/ezyang	2023-09-22 22:30:15 +00:00
Michael Lazos	24ba4b7059	[dynamo][`__torch_function__` 1/n] Add getset descriptor and `__get__` vars (#109542 ) Adds the MethodWrapperVariable and GetSetDescriptor variable types. These are used in `__torch_function__` tracing to represent attribute reads (`__get__`) and for comparing unbound methods. (the func argument when `__torch_function__` is dispatched from a method call) towards tracing for https://github.com/pytorch/pytorch/issues/93723 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109542 Approved by: https://github.com/jansel	2023-09-22 10:39:15 +00:00
Edward Z. Yang	09622d8d49	Allow inferring size-nature from sizes passed to empty constructor (#109720 ) This removes the need for many constrain_as_size calls as we now infer them from error checking for sizes. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/109720 Approved by: https://github.com/aakhundov	2023-09-21 17:57:40 +00:00
lezcano	8597d37536	Implement numpy(force=True) (#109636 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109636 Approved by: https://github.com/ezyang ghstack dependencies: #109634	2023-09-20 20:06:13 +00:00
lezcano	1f6828ca99	Fix numpy(force=False) (#109634 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109634 Approved by: https://github.com/ezyang	2023-09-20 20:06:13 +00:00
Edward Z. Yang	103260a43b	Re-define check for `typing` classes. (#109201 ) This PR fix the `is_typing` function: checks whether a value is an instance of a class from the `typing` package. This reverts commit b09c09f7bb3adb6a5b8a107a5b96757b569daa8d. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109201 Approved by: https://github.com/ezyang	2023-09-20 00:04:56 +00:00
aashishthakur10	9e86a093e4	add torch.device to python type (#108116 ) Fixes #107856 This PR adds torch.device instance check in the python_type method for torch variables in dynamo. @ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/108116 Approved by: https://github.com/msaroufim, https://github.com/ezyang	2023-09-18 02:20:30 +00:00
Ken Jin	f9e72acc8f	Guard default dtype in torchdynamo (#109459 ) Fixes https://github.com/pytorch/pytorch/issues/109458 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109459 Approved by: https://github.com/ezyang	2023-09-17 22:51:33 +00:00
Oguz Ulgen	b03ef1d969	[Dynamo] Fix numpy error in test_numpy_torch_operators (#109087 ) When you inplace matmul two one dimensional numpy arrays, numpy=="1.24.3" gives ``` TypeError: In-place matrix multiplication is not (yet) supported. Use 'a = a @ b' instead of 'a @= b'. ``` but numpy=="1.25.2" gives ``` ValueError: inplace matrix multiplication requires the first operand to have at least one and the second at least two dimensions. ``` This diff makes it so that newer versions of numpy does not fail on this test because we do not catch ValueError. An alternative solution would be to update the test cases to be 2 dimensional, but that would have impact on other operators being tested. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109087 Approved by: https://github.com/jansel	2023-09-16 07:37:07 +00:00
ydwu4	94a54b89aa	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we could error out when there is side-effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo() # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. Note: More lines are printed for debug log due to newly added context manager and guard adds . Test Plan: Removed original tests that raise on different backend and add a new test to test whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-14 15:49:30 +00:00
Nakul Camsamudram	109ab6a0df	Support str() on user defined functions (#108973 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108973 Approved by: https://github.com/anijain2305	2023-09-14 01:32:02 +00:00
Guilherme Leobas	d046376c4f	Dispatch `numpy.take_along_axis` to `torch.take_along_dim` (#108880 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108880 Approved by: https://github.com/lezcano ghstack dependencies: #108879	2023-09-13 23:13:09 +00:00
drisspg	ad90ab31f2	Flash Attention v2 (#105602 ) # Summary ## PR Dependencies I don't use ghstack :( this is a PR where it would have been helpful. That beings said I am going to peel off some PRs to make reviewing this easier: - [x] Separate build flags for Flash and MemEff: #107985 ### Description This pull request updates the version of _scaled_dot_product_flash_attention from version 1 to version 2. The changes are based on the flash attention code originally authored by @tridao ### Changes Made The majority of the changes in this pull request involve: - Copying over the flash_attention sources. - Updating header files. - Removing padding and slicing code from within the flash_attention kernel and relocating it to the composite implicit region of the SDPA. This was need to make the kernel functional and appease autograd. - Introducing a simple kernel generator to generate different instantiations of the forward and backward flash templates. - Adding conditional compilation (ifdef) to prevent building when nvcc is invoked with gencode < sm80. - Introducing a separate dependent option for mem_eff_attention, as flash_attention v2 lacks support for Windows and cannot be built for sm50 generation codes. - Modifying build.sh to reduce parallelization on sm86 runners and to lower the maximum parallelization on the manywheel builds. This adjustment was made to address out-of-memory issues during the compilation of FlashAttentionV2 sources. - Adding/Updating tests. ### Notes for Reviewers This is not a fun review, and I apologize in advance. Most of the files-changed are in the flash_attn/ folder. The only files of interest here IMO: - aten/src/ATen/native/transformers/cuda/flash_attn/flash_api.cpp - aten/src/ATen/native/transformers/cuda/flash_attn/kernels/generate_kernels.py ( this has been incorporated upstream to flash-attention github) There are a number of files all related to avoiding OOMs in CI/CD. These are typically shell scripts. ### Follow up items - Include the updates from `e07aa036db` and `9e5e8bc91e` \| https://github.com/pytorch/pytorch/issues/108108 ### Work Items - [x] I don't think Windows will be supported for 3.1.0 - Need to update cmakee - [x] Let multi_query/attention pass through and test \| UPDATE: I have the fast path implemented here: https://github.com/pytorch/pytorch/pull/106730 but since this will require changes to semantics of math to call repeat_interleave, I think this should be done as a followup. - [x] Had to drop cutlass back to 3.0.0 to get it to compile. Need to figure out how to upgrade to 3.1.0 and later. Spoke with Tri and he is going to be taking a look. Note: compiling with clang currently errors for the cute headers. - [x] Update test exercise above codepath - [x] Still need to disable on seq_len % 128 != 0 for backward( Tri beat me to it `a4f148b6ab`) - [x] Add determinism warning to BWD, Tri got to this one as well: 1c41d2b - [x] Update dispatcher to universally prefer FlashV2 - [x] Update tests to exercise new head_dims - [x] Move the head_dim padding from kernel to top level composite implicit function in order to make it purely functional - [x] Create template generator script - [x] Initial cmake support for building kernels/ folder - [x] Replay CudaGraph changes ### Results #### Forward only The TFlops are reported here are on a100 that is underclocked. ![flashv2_tflops_vs_seq_len](https://github.com/pytorch/pytorch/assets/32754868/152de46d-8fa6-42f0-9a9c-ef1eb7ae29e7) #### Forward+Backward Ran a sweep and for large compute bound sizes we do see a ~2x performance increase for forw+back. <img width="1684" alt="Screenshot 2023-07-20 at 3 47 47 PM" src="https://github.com/pytorch/pytorch/assets/32754868/fdd26e07-0077-4878-a417-f3a418b6fb3b"> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105602 Approved by: https://github.com/huydhn, https://github.com/cpuhrsch	2023-09-13 13:59:05 +00:00
Yukio Siraichi	12e8530b35	Record and replay for ShapeEnv. (#107989 ) This PR introduces record and replay functionality for `ShapeEnv` instances. In short, throughout the execution of a program, we record events (e.g. function calls that modify its state) so that, in the future, we are able to reproduce any intermediary state of the instance. In summary, this PR introduces the following changes (they mostly belong to _symbolic_shapes.py_ unless otherwise stated): - Create `ShapeEnvEvent` class for recording function calls + arguments - Create `record_shapeenv_event` decorator and decorate every function that changes the state of a `ShapeEnv`: it creates an appropriate event and add it to the available ShapeEnv instance (sometimes it has to extract from `SymTypes`). - Create `SymNode.with_shape_env` convenient function for replacing `ShapeEnv` references - Wraps `ShapeEnv` initialization method: so that we also save the exact way a `ShapeEnv` was constructed, i.e. arguments - Introduces a way to compare two `ShapeEnv` instances, defining a concept of state for that class. In short, the state of `ShapeEnv` is every variable that may change the execution flow - Create `check_shape_env_recorded_events` dynamo configuration for enabling the check for equality the state of `ShapeEnv` with another one that was constructed by replaying all the recorded events. This check takes place inside `produce_guards` - Create `replay_shape_env_events` function for replaying given events. It assumes the first event is `ShapeEnv` initialization function Pull Request resolved: https://github.com/pytorch/pytorch/pull/107989 Approved by: https://github.com/ezyang	2023-09-13 00:22:38 +00:00
Nakul Camsamudram	3b265e021f	Support Optional typehint without graph breaking (#108970 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108970 Approved by: https://github.com/anijain2305	2023-09-11 16:42:44 +00:00
William Wen	2c3febb273	[dynamo] disable flaky test_unhandled_exception_in_dynamo2 (#108906 ) Fix https://github.com/pytorch/pytorch/issues/106028. The test `test_unhandled_exception_in_dynamo` should cover most cases. The disabled test `test_unhandled_exception_in_dynamo2` covered some weird case that I found when implementing dynamo 3.11. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108906 Approved by: https://github.com/yanboliang	2023-09-09 01:10:09 +00:00
PyTorch MergeBot	8caaa4f4cd	Revert "Re-land: Break graph on `manual_seed`. (#108647 )" This reverts commit `c887309437`. Reverted https://github.com/pytorch/pytorch/pull/108647 on behalf of https://github.com/huydhn due to Ouch, we are hit again my another internal import error from https://github.com/pytorch/pytorch/blob/main/torch/_inductor/config.py#L205-L206 ([comment](https://github.com/pytorch/pytorch/pull/108647#issuecomment-1712230103))	2023-09-08 21:18:00 +00:00
Yanbo Liang	8990174676	[Dynamo] Should inline __new__ function rather than skipping frame (#108549 ) Fixes #107460 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108549 Approved by: https://github.com/jansel	2023-09-08 16:51:47 +00:00
Huy Do	a9c663c269	Revert "Flash Attention v2 (#105602 )" (#108827 ) This reverts commit `add45aea1c`. There are some conflicts on some benchmark csv file https://github.com/pytorch/pytorch/pull/105602#issuecomment-1710988951 so I need to revert this manually. The diff has been reverted internally. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108827 Approved by: https://github.com/kit1980	2023-09-08 07:43:04 +00:00
Jason Ansel	4965fffeda	[dynamo] Move global state guards to C++ (#108624 ) This combines a bunch of python global state guards into a single C++ guard and switches to checking them 100% of the time. It also adds a few new guards for things that change inductor's behavior. Even though we are checking more things, I expect this to be much faster. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108624 Approved by: https://github.com/anijain2305	2023-09-08 04:07:08 +00:00
PyTorch MergeBot	e45b290127	Revert "Revert "Flash Attention v2 (#105602 )" (#108827 )" This reverts commit `24e9bbe22a`. Reverted https://github.com/pytorch/pytorch/pull/108827 on behalf of https://github.com/huydhn due to I need to land this revert properly as there are new failures showing up on trunk ([comment](https://github.com/pytorch/pytorch/pull/108827#issuecomment-1711020924))	2023-09-08 03:25:45 +00:00
Huy Do	24e9bbe22a	Revert "Flash Attention v2 (#105602 )" (#108827 ) This reverts commit `add45aea1c`. There are some conflicts on some benchmark csv file https://github.com/pytorch/pytorch/pull/105602#issuecomment-1710988951 so I need to revert this manually. The diff has been reverted internally. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108827 Approved by: https://github.com/kit1980	2023-09-08 02:54:20 +00:00
PyTorch MergeBot	38fcf77a1b	Revert "[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 )" This reverts commit `1a64ec7dd4`. Reverted https://github.com/pytorch/pytorch/pull/107337 on behalf of https://github.com/huydhn due to Sorry for reverting your change but inductor perf smoke test starts to regress after this ([comment](https://github.com/pytorch/pytorch/pull/107337#issuecomment-1710974588))	2023-09-08 02:03:48 +00:00
ydwu4	1a64ec7dd4	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we could error out when there is side-effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo() # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. 2. Then newly added context manager and guard adds more lines for debug log so we change the uppper limit from 50 to 55. Test Plan: Removed original tests that raise on different backend and add a new test to test whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-07 22:45:54 +00:00
Evgeni Burovski	1f20531939	fall back to eager on `NotImplementedError` (#107863 ) Follow-up to https://github.com/pytorch/pytorch/pull/107710: Help dynamo fall back to eager when compiling unimplemented numpy constructs: - arrays of strings - (arg){min, max} for complex types - various arguments typed as NotImplemented (`np.ones(4, order="F")` etc) - numpy functions which torch._numpy does not implement To test, run (we do not implement arrays of strings) ``` import torch import numpy as np @torch.compile(fullgraph=False) def fn(): return np.asarray(["L", "U"]) ``` and observe it compiles with fullgraph=False and fails with fullgraph=True Fixes https://github.com/pytorch/pytorch/issues/107970 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107863 Approved by: https://github.com/ezyang, https://github.com/lezcano	2023-09-07 21:22:20 +00:00
Yukio Siraichi	c887309437	Re-land: Break graph on `manual_seed`. (#108647 ) Trying to re-land #107594. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108647 Approved by: https://github.com/eellison	2023-09-07 12:52:38 +00:00

1 2 3 4 5 ...

306 Commits