Commit Graph

24 Commits

Author SHA1 Message Date
rzou
76fcec74c4 [optests] Test names in failure dicts should be prefixed with test class (#110045)
We want to use the same failures dict for multiple TestCases. This is common
in e.g. fbgemm. To move towards that, we need to prefix each test name
with its test class to avoid ambiguity.
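
For illustration, a failures-dict entry after this change might look like the sketch below (the op, class, and test names are made up):

```python
# Sketch only: test names are now keyed as "<TestClass>.<test_name>", so two
# TestCases can share one failures dict without colliding.
failures_dict = {
    "mylib::my_op": {
        "TestMyOpsCPU.test_schema__test_my_op": {"status": "xfail", "comment": ""},
        "TestMyOpsCUDA.test_schema__test_my_op": {"status": "xsuccess", "comment": ""},
    },
}
```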

Differential Revision: [D49615962](https://our.internmc.facebook.com/intern/diff/D49615962/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/110045
Approved by: https://github.com/williamwen42
2023-09-26 03:21:12 +00:00
rzou
f8fcc54f70 Add torch.library.impl_abstract (#109912)
Changelog:
- torch.library.impl_abstract optionally accepts a torch.library.Library
  object. If passed in, then the lifetime of the registration is tied to
  the Library object.
- we've also changed torch.library.impl_abstract to work on all
  operators, including overloads.
- we refactored the `torch._custom_ops.*` and `torch._custom_op.*`
  impl_abstract APIs and put them under torch._library. This is the
  final resting place for them. I will follow up later by deleting
  all the `torch._custom_ops.*` stuff.
- There is a new "SimpleOperatorRegistry" where we actually collect the
  abstract_impl. We will expand this to also hold the other
  torch._custom_ops.* APIs when we move those to torch.library.

NB: We previously designed `impl_abstract` assuming a very high-level,
Python-only custom op API. We've since revisited that; impl_abstract now works
for all custom ops, whether they are written in Python or C++ and regardless
of schema. The refactored design reflects this better.
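
A rough sketch of the updated API (the namespace, op, and registration details here are illustrative assumptions, not taken from this PR's tests):

```python
import torch
from torch.library import Library, impl_abstract

lib = Library("mylib", "FRAGMENT")  # hypothetical library/namespace
lib.define("my_sin(Tensor x) -> Tensor")

# Passing `lib=` ties the lifetime of this registration to the Library object.
@impl_abstract("mylib::my_sin", lib=lib)
def my_sin_abstract(x):
    return torch.empty_like(x)
```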

Test Plan:
- existing and new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109912
Approved by: https://github.com/ezyang
2023-09-26 01:59:50 +00:00
rzou
8124a6c40c [TORCH_LIBRARY] Add impl_abstract_pystub (#109529)
We want users to be able to define custom ops in C++ but put the
abstract impl in Python (since it is easier to write them in Python and
the abstract impl better models device semantics and data-dependent
operators).

`m.impl_abstract_pystub(opname, python_module, context)` declares the
abstract_impl of the operator to exist in the given python module.
When the abstract_impl needs to be accessed (either via FakeTensor or
Meta), and it does not exist, the PyTorch Dispatcher will yell
with a descriptive error message.

Some details:
- We construct a new global AbstractImplPyStub mapping in
  Dispatcher.cpp. Reads and writes to this map are protected by the
  Dispatcher lock.
- We add a new Meta Tensor fallback kernel. The fallback errors out if there is
  no meta kernel, but also offers a nicer error message if we see that there is
  a pystub.
- We create a `torch._utils_internal.throw_abstract_impl_not_imported_error`
  helper function to throw errors. This way, we can throw different error
  messages in OSS PyTorch vs internal PyTorch. To invoke this from C++, we
  added a PyInterpreter::throw_abstract_impl_not_imported_error.

Differential Revision: [D49464753](https://our.internmc.facebook.com/intern/diff/D49464753/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109529
Approved by: https://github.com/ezyang, https://github.com/bdhirsh
2023-09-22 04:55:36 +00:00
rzou
122264a0c0 [generate_opcheck_tests] tests should ignore meta/FakeTensors (#109641)
These tests generally don't work on meta tensors because they need to
compare the data of the Tensors. For example, SchemaCheckMode errors out
if any inputs are meta or Fake because it needs to check their storages
to see if any mutation occurred and those do not have storages.

Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109641
Approved by: https://github.com/bdhirsh, https://github.com/soulitzer
ghstack dependencies: #109637, #109638, #109639, #109640
2023-09-20 06:33:37 +00:00
rzou
d3d71367b9 [generate_opcheck_tests] Always print a repro (#109640)
On failure of a test, we will always print a "repro". This repro isn't
really runnable but gives the user a sense of how to actually reproduce
the test without the test suite, because using the test suite is a bit
convoluted.

If the user passes PYTORCH_OPCHECK_PRINT_BETTER_REPRO, we will print a
fuller repro that saves the exact problematic test inputs to disk and
reads them back out.

Test Plan:
- expecttests on the generate_repro helper function
- tried this out locally.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109640
Approved by: https://github.com/bdhirsh, https://github.com/soulitzer
ghstack dependencies: #109637, #109638, #109639
2023-09-20 06:33:37 +00:00
rzou
10d575911e [generate_opcheck_tests] rename "success" to "xsuccess" (#109637)
Not BC-breaking because no existing failures dicts have "success" in them.

Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109637
Approved by: https://github.com/bdhirsh, https://github.com/soulitzer
2023-09-20 06:33:37 +00:00
ydwu4
94a54b89aa [dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337)
**Motivation:**
We are trying to make torch.cond use torch.compile automatically so that we can error out when there are side effects in the branches and correctly handle closures.

Before this PR, the following code only produced a warning unless the raise_on_backend_change config was turned on (turning it on gives an error):
```python
import torch

def foo(x):
    return x + 1

x = torch.randn(3)

# Inside torch.cond, we'd like to do something like
torch.compile(foo, backend="eager", fullgraph=True)(x)
...
# Users may then call torch.compile somewhere else.
# Dynamo will use the cached code of foo for the "eager" backend,
# but we expect dynamo to recompile with the "inductor" backend.
torch.compile(foo, backend="inductor")(x)
```

This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True).
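
A minimal sketch of how one might observe the new behavior (the function, inputs, and backends are arbitrary; the logging knob is an assumption and TORCH_LOGS="recompiles" in the environment should do the same):

```python
import torch

# Surface recompilation reasons in the log (assumed knob; see note above).
torch._logging.set_logs(recompiles=True)

def foo(x):
    return x.sin()

x = torch.randn(3)
torch.compile(foo, backend="eager")(x)      # compiles once for "eager"
torch.compile(foo, backend="aot_eager")(x)  # BACKEND_MATCH guard fails -> recompile
```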

**Implementation:**
1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag.

Note: more lines are printed in the debug log due to the newly added context manager and guard.

**Test Plan:**
Removed the original tests that raise on a backend change and added a new test that checks whether the BACKEND_MATCH guard can guard against a backend change.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337
Approved by: https://github.com/jansel
2023-09-14 15:49:30 +00:00
Edward Z. Yang
55f956f1d2 optests improvements based on torchvision usage on nms (#108929)
- Update cross-ref FakeMode test to use ShapeEnv.  Dynamic ops can now
  return an unbacked SymInt.  We always accept this as equal to whatever
  the real value was.
- Relax test so it works on all classes, not just unittest.TestCase
- Properly wrap the original method, so things like
  pytest.mark.parametrize are carried over
- Support dynamic shapes by default for make_fx `tracing_mode="fake"` without symbolifying everything else

Fixes https://github.com/pytorch/pytorch/issues/108927

Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108929
Approved by: https://github.com/zou3519
2023-09-13 13:26:15 +00:00
Richard Zou
bfa8429c6a [optests] Changed failures_dict format to json; automatic update of failures_dict (#109110)
We changed the failures_dict format from .py to json and added a way to
automatically update the failures dict (the user can set
PYTORCH_OPCHECK_ACCEPT=1 to do so), assuming the tests don't crash in the
process.

Some details:
- We introduced a FailuresDict class that handles save/load and from which one
can query a test status ("xfail", "skip", etc).
- PYTORCH_OPCHECK_ACCEPT=1 does not override everything. In particular: it
doesn't try to update the failures dict for a test marked as "skip", but it
will update it for tests marked as "xfail" or "success".
- PYTORCH_OPCHECK_ACCEPT=1 also does not override the "comment" field, unless
it is flipping an "xfail" into "success".
- I'll update the gdoc linked in the comments with how to actually use
PYTORCH_OPCHECK_ACCEPT=1 internally (it's not trivial).

Note that this isn't multithreading-safe, the current recommendation is to run
the tests sequentially if the user wants to use PYTORCH_OPCHECK_ACCEPT=1.
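
Illustrative only (field values are hypothetical): for a test previously marked "xfail" that now passes, re-running with PYTORCH_OPCHECK_ACCEPT=1 would update the corresponding entry roughly like so:

```python
# Before: the test is expected to fail.
entry = {"status": "xfail", "comment": "fake tensor rule is missing"}
# After re-running with PYTORCH_OPCHECK_ACCEPT=1 and the test now passing:
entry = {"status": "success", "comment": ""}  # comment only overridden on the xfail -> success flip
```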

Differential Revision: D49167181

Pull Request resolved: https://github.com/pytorch/pytorch/pull/109110
Approved by: https://github.com/ezyang
2023-09-13 13:24:15 +00:00
PyTorch MergeBot
38fcf77a1b Revert "[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337)"
This reverts commit 1a64ec7dd4.

Reverted https://github.com/pytorch/pytorch/pull/107337 on behalf of https://github.com/huydhn due to Sorry for reverting your change but inductor perf smoke test starts to regress after this ([comment](https://github.com/pytorch/pytorch/pull/107337#issuecomment-1710974588))
2023-09-08 02:03:48 +00:00
ydwu4
1a64ec7dd4 [dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337)
**Motivation:**
We are trying to make torch.cond use torch.compile automatically so that we can error out when there are side effects in the branches and correctly handle closures.

Before this PR, the following code only produced a warning unless the raise_on_backend_change config was turned on (turning it on gives an error):
```python
import torch

def foo(x):
    return x + 1

x = torch.randn(3)

# Inside torch.cond, we'd like to do something like
torch.compile(foo, backend="eager", fullgraph=True)(x)
...
# Users may then call torch.compile somewhere else.
# Dynamo will use the cached code of foo for the "eager" backend,
# but we expect dynamo to recompile with the "inductor" backend.
torch.compile(foo, backend="inductor")(x)
```

This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True).

**Implementation:**
1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag.

2. The newly added context manager and guard add more lines to the debug log, so we raise the upper limit from 50 to 55.

**Test Plan:**
Removed the original tests that raise on a backend change and added a new test that checks whether the BACKEND_MATCH guard can guard against a backend change.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337
Approved by: https://github.com/jansel
2023-09-07 22:45:54 +00:00
Richard Zou
df42f15e28 Improve generate_opcheck_tests, add opcheck utility (#107597)
Summary:
This PR improves `generate_opcheck_tests`:
- We shouldn't run automated testing through operators called in
  torch.jit.trace / torch.jit.script
- I improved the error message and added a guide on what to do if one of the
  tests fail.
- While dogfooding this, I realized I wanted a way to reproduce the failure
  without using the test suite. If you pass `PYTORCH_OPCHECK_PRINT_REPRO`, it
  will now print a minimal repro on failure. This involves serializing some
  tensors to disk.
- The minimal repro includes a call to a new API called `opcheck`.

The opcheck utility runs the same checks as the tests generated
by `generate_opcheck_tests`. For simplicity, it doesn't have a lot of knobs.
The general workflow is: if an autogenerated test fails, the user may find it
easier to reproduce the failure outside the test suite by using opcheck.
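
For illustration, calling opcheck directly on an operator might look roughly like this (the import path, the exact signature, and the use of a builtin op here are assumptions):

```python
import torch
from torch.testing._internal.optests import opcheck  # import path is an assumption

x = torch.randn(3)
# Runs the same checks as the generated tests (schema, autograd registration,
# fake tensor, aot_autograd) on a single operator call.
opcheck(torch.ops.aten.sin.default, (x,), {})
```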

Test Plan: - new tests

Differential Revision: D48485013

Pull Request resolved: https://github.com/pytorch/pytorch/pull/107597
Approved by: https://github.com/ezyang
2023-08-22 15:16:04 +00:00
Richard Zou
e4e9aa28a7 Add generate_opcheck_tests, a PT2 crossref testing mechanism (#106903)
This PR adds `generate_opcheck_tests`. This is a utility that adds
additional crossref tests to an existing TestCase that has tests that
invoke operators. The main use case is if you have a large test suite
that already exercises operators and want to add automated testing that
the operators are correct, without actually refactoring your code into
something like OpInfos.

Given a `test_` method of a TestCase, we will generate one new
additional test for each of {schema correctness, autograd registration,
faketensor rule, aot_autograd static shapes, aot_autograd dynamic
shapes}. Each newly generated test runs the original test method under a
special torch_function mode (OpCheckMode) that intercepts
`op(*args, **kwargs)` calls and additionally passes (op, args, kwargs) to
a separate function (e.g. SchemaCheck).
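
The interception idea can be pictured with a much-simplified TorchFunctionMode sketch (this is not the real OpCheckMode, just an illustration of the mechanism; the names are made up):

```python
import torch
from torch.overrides import TorchFunctionMode

class ToyOpCheckMode(TorchFunctionMode):
    """Toy stand-in for OpCheckMode: hand every intercepted call to a checker."""
    def __init__(self, checker):
        super().__init__()
        self.checker = checker

    def __torch_function__(self, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        self.checker(func, args, kwargs)  # e.g. a schema or fake-tensor check
        return func(*args, **kwargs)      # then run the original call

def noisy_checker(op, args, kwargs):
    print(f"checking {op} with {len(args)} positional args")

with ToyOpCheckMode(noisy_checker):
    torch.cumsum(torch.randn(3), dim=0)
```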

Nitty-gritty details:
- If a test is named test_cumsum, we end up generating new tests
(`test_schema__test_cumsum`, `test_<something>__test_cumsum`)
- Users can provide a dictionary of expected failures / skips that is indexed by
operator. This gives us a sense of which operators support PT2 and which
operators require fixing before they support PT2.

Due to some co-dev limitations, I'm planning on landing this PR first
and then using it to add crossref testing for internal tests and
fbgemms. I could squash this PR with the internal changes if we want to
see how that works out, just let me know.

Test Plan:
- We create a mini op test suite called MiniOpTests.
- Then, we use `generate_opcheck_tests` to generate tests onto it.
- We have our own test xfail list to check that the things that should
fail do fail.
- Finally, there is a separate TestGenerateOpcheckTests that checks that
the correct number of tests were generated and also tests some helper
functions.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106903
Approved by: https://github.com/ezyang, https://github.com/bdhirsh
2023-08-15 02:16:07 +00:00
Richard Zou
2932b0bf37 Extend impl_backward to be usable with torch.library operators (#106817)
- impl_save_for_backward/impl_backward only work for functional,
non-view schemas. We validate this.
- impl_save_for_backward/impl_backward raise if there already exists an
autograd implementation from torch.library / TORCH_LIBRARY.
- Operators constructed via custom_op receive an "autograd indirection
kernel". The "autograd indirection kernel" automatically pulls the
constructed autograd kernel out of a dict. When
impl_save_for_backward/impl_backward get used with torch.library
operators, we also register the "autograd indirection kernel" so we can
reuse the logic.

Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106817
Approved by: https://github.com/soulitzer
ghstack dependencies: #106799, #106800
2023-08-14 14:33:46 +00:00
Richard Zou
db9a0cf689 Extend impl_backward to handle non-Tensor outputs (#106800)
Recall that the user must give us a backward function that accepts
`(ctx, saved, *grads)`, with one grad per output. Previously,
impl_backward only worked for functions that return one or more Tensors.

The new semantics (sketched in the example below) are that if an output is:
- a TensorList, the user-provided backward function receives
a List[Tensor] of grads for that output.
- a number, the user-provided backward function receives
None as the grad.
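
For concreteness, a rough sketch of a user-provided backward under these semantics, for a hypothetical operator that returns (TensorList, int); the operator, the saved values, and the dict-style return convention are illustrative assumptions:

```python
import torch
from torch import Tensor
from typing import List

# Hypothetical op: my_split_with_count(Tensor x, int chunks, int dim) -> (Tensor[], int)
def my_split_backward(ctx, saved, grad_chunks: List[Tensor], grad_count: None):
    # grad_chunks: one grad per Tensor in the TensorList output
    # grad_count: always None, because the int output gets no grad
    # (assumed) grads are returned as a dict keyed by input name
    return {"x": torch.cat(grad_chunks, dim=saved["dim"])}
```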

Also recall that impl_backward is implemented by registering an
autograd.Function to the autograd dispatch key.
We needed to make the following changes:
- If an output is a TensorList, autograd.Function will ignore it, so we
need to tree-flatten it before returning it from the autograd.Function.
- This means that the autograd.Function receives a flat list of grads
during the backward pass. We need to tree-unflatten it into the correct
shape before passing it to the user-defined backward.
- We modify the logic of output_differentiability. Only
Tensor/TensorList outputs can be marked as differentiable. If a
TensorList is marked as non-differentiable, then this is equivalent to
all Tensors in the list being non-differentiable. There is no
finer-grained control over this (to match derivatives.yaml).

Test Plan:
- There are new `numpy_split_copy` (returns TensorList) and
`numpy_split_copy_with_int` (returns (TensorList, int)) operators in
custom_op_db
- Added tests for output_differentiability into test/test_custom_ops.py
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106800
Approved by: https://github.com/soulitzer
ghstack dependencies: #106799
2023-08-14 14:33:46 +00:00
Richard Zou
9fcce1baf1 [custom_op] Allow constructor to infer more types (#106799)
This expands the torch._custom_ops.custom_op API to be able to construct
operators that return (int, bool, float, Scalar, List[Tensor]), making
it more in line with our torch.library API.
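
A rough sketch of what that enables (the namespace and op are hypothetical, and the exact decorator usage is an assumption based on the existing torch._custom_ops API):

```python
from typing import List
from torch import Tensor
from torch._custom_ops import custom_op

# The schema is inferred from the type annotations; a List[Tensor] return
# (and int/bool/float/Scalar returns) is now accepted, not just Tensor.
@custom_op("mylib::split_copy")
def split_copy(x: Tensor, chunks: int) -> List[Tensor]:
    ...
```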

NB: the custom_op autograd registration API still needs updates. For ease
of review, those changes will go in the next PR up, but I can squash if
requested.

Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106799
Approved by: https://github.com/soulitzer
2023-08-14 14:33:43 +00:00
Richard Zou
16b6873885 [custom_ops] extend impl_abstract to work with existing torch.library ops (#106088)
This PR extends impl_abstract to work with existing
torch.library/TORCH_LIBRARY ops.

There's a question of what to do if the user calls impl_abstract
and the op already has a registration for:
- DispatchKey::Meta. We raise.
- DispatchKey::CompositeImplicitAutograd. We raise.
- DispatchKey::CompositeExplicitAutograd. To be pragmatic, we don't
raise, since the user's CompositeExplicitAutograd might work for all
other backends but Meta.

Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106088
Approved by: https://github.com/soulitzer
ghstack dependencies: #106075, #106076
2023-08-08 13:53:20 +00:00
Richard Zou
cebff39fad [custom_ops] make custom_ops.impl work on existing operators (#106076)
The design is that we construct a CustomOp object around the existing
operator and then use it to register things. It is totally OK if the
operator isn't functional (unlike torch._custom_ops.custom_op, which can
only construct functional operators).

If the operator already has an implementation from a backend (either via
direct registration to e.g. DispatchKey::CPU, or an indirect
registration like CompositeImplicitAutograd/CompositeExplicitAutograd),
we raise an error.

Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106076
Approved by: https://github.com/soulitzer
ghstack dependencies: #106075
2023-08-08 13:53:20 +00:00
Richard Zou
60a4ac3068 [custom_ops] Block overload names (#106075)
These are valid with the torch.library API, but (1) they add complexity
and (2) I have never seen a custom op actually use an overload name
before. For simplicity we block all overloads.

Test Plan:
- new test
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106075
Approved by: https://github.com/soulitzer
2023-08-08 13:53:18 +00:00
Richard Zou
26e98040da Improve AOTAutograd tests to do something when inputs don't require grad (#106558)
This PR:
- Changes the AOTAutograd tests to also check that the output of the
forward is equal under AOTAutograd and eager-mode PyTorch.
- Adds a "check_gradients" flag to `check_aot_autograd`.
  - If True, then we attempt to compute gradients and check them.
  - If False, then we just check that the outputs are equal.
  - If "auto", then we will compute gradients and check them only if
    some input and some output requires grad. This option is useful for
    crossref tests where we don't necessarily have inputs that require
    grad.

1) I need a testing utility to test "AOTAutograd for inference",
   e.g. make_fx + functionalize.
2) I want to run aot_autograd_check in crossref tests for other test
suites (e.g. fbgemm) and not all inputs require grad.

Test Plan:
- existing tests
- new tests to test the degenerate cases
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106558
Approved by: https://github.com/ezyang, https://github.com/soulitzer
2023-08-07 00:11:30 +00:00
Richard Zou
e421edf377 Add utility to test if autograd was registered correctly (#106561)
See the docstring for more details. This API is not meant to be directly
user-facing; it is meant to be used as a subtest of D47965247, which is
coming soon.

Test Plan:
- Some new tests.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106561
Approved by: https://github.com/ezyang, https://github.com/soulitzer
2023-08-04 13:39:10 +00:00
Richard Zou
dad65d09f2 Update custom op API (#105947)
As described in
https://docs.google.com/document/d/1aGWtgxV3HppuxQAdddyPrs74_aEntpkYt9MalnCKnhk/edit

This PR changes the CustomOp API to be private and adds new public
wrappers around it so that the user does not need to know about the
"CustomOp" object. We've effectively changed the "CustomOp" object to be
some metadata about the operator that the user does not directly
interact with.

The "updated custom op API" is in torch._custom_ops. Pending good customer
feedback, we will promote this module to torch.custom_ops.

NB: I cannot move around the older torch._custom_op APIs yet because
people are already using them.

Test Plan:
- I changed all of our tests to use the new `torch._custom_ops` module
instead of the old CustomOp API.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105947
Approved by: https://github.com/soulitzer
2023-07-28 13:30:58 +00:00
Richard Zou
6d553a42fe Move most custom op related tests to test_custom_ops.py (#106036)
This PR moves most custom op related tests from
test/test_python_dispatch.py to test/test_custom_ops.py. Motivation is
that I had a difficult time finding the custom op tests inside
test_python_dispatch.py.

This doesn't preserve blame, but it's OK - I'm the only person who has
really touched the moved tests so far :).

Test Plan:
- run tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106036
Approved by: https://github.com/bdhirsh, https://github.com/soulitzer
2023-07-28 13:30:58 +00:00
Richard Zou
db365e1fb5 Create test/test_custom_ops.py, move test_custom_op_testing to it (#106035)
I'm in the process of putting all the custom op tests into this file. I
got tired of trying to find the custom ops tests in
test_python_dispatch.py, which (1) is getting long and (2) should actually
be the torch_dispatch and python torch.library tests.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106035
Approved by: https://github.com/bdhirsh, https://github.com/soulitzer
2023-07-28 13:30:58 +00:00