As `FindPythonInterp` and `FindPythonLibs` have been deprecated since cmake-3.12, replace `PYTHON_EXECUTABLE` with `Python_EXECUTABLE` everywhere (CMake variable names are case-sensitive).
This makes PyTorch buildable with the python3 binary shipped with Xcode on macOS.
TODO: Get rid of `FindNumpy` as it's part of the Python package.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/124613
Approved by: https://github.com/cyyever, https://github.com/Skylion007
Summary:
This diff adds support in the ExecuTorch codegen layer to log the outputs of kernels to event_tracer. It does this by calling the `event_tracer_log_evalue` API.
When the `ET_EVENT_TRACER_ENABLED` flag is disabled, this is essentially a no-op and adds no overhead.
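A minimal, self-contained sketch of the compile-time gating pattern described above; only the `ET_EVENT_TRACER_ENABLED` flag name comes from this summary, while `EventTracer`, `log_output`, and the wrapper macro are hypothetical stand-ins rather than the real ExecuTorch API:
```cpp
#include <iostream>
#include <string>

// Hypothetical stand-in for the real event tracer; illustrative only.
struct EventTracer {
  void log_output(const std::string& kernel, double value) {
    std::cout << kernel << " -> " << value << "\n";
  }
};

#ifdef ET_EVENT_TRACER_ENABLED
// Tracing enabled: forward the kernel output to the tracer.
#define LOG_KERNEL_OUTPUT(tracer, kernel, value) \
  (tracer).log_output((kernel), (value))
#else
// Tracing disabled: the call compiles away entirely, so it adds no overhead.
#define LOG_KERNEL_OUTPUT(tracer, kernel, value) ((void)0)
#endif

int main() {
  EventTracer tracer;
  double out = 1.0 + 2.0;  // stand-in for a generated kernel call
  LOG_KERNEL_OUTPUT(tracer, "aten::add.out", out);
  return 0;
}
```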
Test Plan: CI
Reviewed By: larryliu0820
Differential Revision: D51534590
Pull Request resolved: https://github.com/pytorch/pytorch/pull/114584
Approved by: https://github.com/larryliu0820
Summary:
Exposing a codegen mode for generating a hook for users to register their kernels.
If we pass the `--manual-registration` flag to `gen_executorch.py`, we will generate the following files:
1. `RegisterKernels.h`, which declares a `register_all_kernels()` API inside the `torch::executor` namespace.
2. `RegisterKernelsEverything.cpp`, which implements `register_all_kernels()` by defining an array of generated kernels.
This way users can depend on the library declared by the `executorch_generated_lib` macro (with `manual_registration=True`) and include `RegisterKernels.h`. They can then call `register_all_kernels()` manually instead of relying on the C++ static initialization mechanism, which is not available on some embedded systems.
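For illustration, a self-contained sketch of the intended call pattern, assuming the `register_all_kernels()` shape described above; the `Error` enum and the stub definition are placeholders so the sketch compiles without the generated files:
```cpp
#include <cstdio>

namespace torch {
namespace executor {

// Placeholder; the real error type comes from the ExecuTorch runtime.
enum class Error { Ok, Internal };

// In a real build this declaration comes from the generated RegisterKernels.h
// and the definition from RegisterKernelsEverything.cpp, which walks an array
// of generated kernels and registers each one.
Error register_all_kernels() {
  return Error::Ok;  // stub so the sketch links on its own
}

}  // namespace executor
}  // namespace torch

int main() {
  // Called explicitly at startup instead of relying on C++ static
  // initializers, which some embedded toolchains do not run.
  if (torch::executor::register_all_kernels() != torch::executor::Error::Ok) {
    std::fprintf(stderr, "kernel registration failed\n");
    return 1;
  }
  return 0;
}
```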
Test Plan:
Rely on the unit test:
```
buck2 test fbcode//executorch/runtime/kernel/test:test_kernel_manual_registration
```
Reviewed By: cccclai
Differential Revision: D49439673
Pull Request resolved: https://github.com/pytorch/pytorch/pull/110086
Approved by: https://github.com/cccclai
Summary: Split out from D48975975, this handles the PyTorch-specific changes to add support for event_tracer in the codegen layer.
Test Plan: CI
Reviewed By: dbort
Differential Revision: D49487710
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109990
Approved by: https://github.com/Jack-Khuu
Based on this [code search](https://fburl.com/code/gjcnw8ly) (`*.yaml` files containing `dispatch: CPU:`), update all files found to use
```
kernels:
  - arg_meta: None
    kernel_name:
```
instead of
```
dispatch:
  CPU:
```
---
## Code changes:
- `fbcode/executorch/codegen/tools/gen_oplist.py`
- Strip ET-specific fields prior to calling `parse_native_yaml_struct`
---
## Files edited that are not `*functions.yaml` or `custom_ops.yaml`
- fbcode/executorch/kernels/optimized/optimized.yaml
- fbcode/executorch/kernels/quantized/quantized.yaml
- fbcode/executorch/kernels/test/custom_kernel_example/my_functions.yaml
---
## Files found that were not edited
**Dispatched to more than just CPU**
- fbcode/caffe2/aten/src/ATen/native/native_functions.yaml
- xplat/caffe2/aten/src/ATen/native/native_functions.yaml
- xros/third-party/caffe2/caffe2/aten/src/ATen/native/native_functions.yaml
**Grouped ops.yaml path**
- fbcode/on_device_ai/Assistant/Jarvis/min_runtime/operators/ops.yaml
---
**Design Doc:** https://docs.google.com/document/d/1gq4Wz2R6verKJ2EFseLyPdAF0wqomnCrVDDJpRkYsRw/edit?kh_source=GDOCS#heading=h.8raqyft9y50
Differential Revision: [D46952067](https://our.internmc.facebook.com/intern/diff/D46952067/)
**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D46952067/)!
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104070
Approved by: https://github.com/larryliu0820
Summary: Currently we rely on root operators, but we also need to check `et_kernel_metadata` for the specialized kernels that are used.
Test Plan: contbuild & OSS CI
Reviewed By: Jack-Khuu
Differential Revision: D46882119
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104005
Approved by: https://github.com/Jack-Khuu
Prefer dashes over underscores in command-line options. Add `--command-arg-name` to the argument parser; the old underscore arguments (`--command_arg_name`) are kept for backward compatibility.
Both dashes and underscores are used in the PyTorch codebase. Some argument parsers accept only dashes or only underscores; for example, the `torchrun` utility for distributed training only accepts underscore arguments (e.g., `--master_port`). Dashes are more common in other command-line tools, and they appear to be the default choice in the Python standard library:
`argparse.BooleanOptionalAction`: 4a9dff0e5a/Lib/argparse.py (L893-L895)
```python
class BooleanOptionalAction(Action):
    def __init__(...):
        if option_string.startswith('--'):
            option_string = '--no-' + option_string[2:]
            _option_strings.append(option_string)
```
It adds `--no-argname`, not `--no_argname`. Also, typing `_` requires holding the Shift key, while `-` does not.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94505
Approved by: https://github.com/ezyang, https://github.com/seemethere
As titled. To register a custom op into ExecuTorch, we need:
* `custom_ops.yaml`, which defines the operator schema and the corresponding native function.
* `custom_ops.cpp`, which defines the kernel.
* `RegisterDispatchKeyCustomOps.cpp`, a template to register the operator into PyTorch.
Added a new test for custom ops. The custom op `custom::add_3.out` takes three tensors and adds them together. The test makes sure it is registered correctly and then verifies that the outcome is correct.
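For illustration only, a sketch of what the `custom::add_3.out` kernel computes, using a minimal stand-in `Tensor` so the example is self-contained; the real test uses ExecuTorch/ATen tensor types and the codegen'd registration path:
```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Minimal stand-in for a tensor; illustrative only.
struct Tensor {
  std::vector<float> data;
};

namespace custom {

// custom::add_3.out: element-wise sum of three inputs written into `out`.
Tensor& add_3_out(const Tensor& a, const Tensor& b, const Tensor& c, Tensor& out) {
  assert(a.data.size() == b.data.size() && b.data.size() == c.data.size());
  out.data.resize(a.data.size());
  for (std::size_t i = 0; i < a.data.size(); ++i) {
    out.data[i] = a.data[i] + b.data[i] + c.data[i];
  }
  return out;
}

}  // namespace custom

int main() {
  Tensor a{{1, 2}}, b{{10, 20}}, c{{100, 200}}, out;
  custom::add_3_out(a, b, c, out);
  assert(out.data[0] == 111.f && out.data[1] == 222.f);
  return 0;
}
```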
Differential Revision: [D42204263](https://our.internmc.facebook.com/intern/diff/D42204263/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91291
Approved by: https://github.com/ezyang
`--whole-archive` is a linker option (note that the flag is passed as `-Wl,--whole-archive`), and `-force_load` is indeed available on macOS (below is a quote from `man ld`):
-force_load path_to_archive
        Loads all members of the specified static archive library. Note:
        -all_load forces all members of all archives to be loaded. This
        option allows you to target a specific archive.
Quote from malfet
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91736
Approved by: https://github.com/larryliu0820
## Job
The test runs on most CI jobs.
## Test binary
* `test_main.cpp`: entry for gtest
* `test_operator_registration.cpp`: test cases for gtest
## Helper sources
* `operator_registry.h/cpp`: a simple operator registry for testing purposes (a minimal sketch follows this list).
* `Evalue.h`: a boxed data type that wraps ATen types, for testing purposes.
* `selected_operators.yaml`: operators ExecuTorch cares about so far; we should cover all of them.
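As a rough idea of what a "simple operator registry" looks like, here is a minimal sketch that maps a schema string to a boxed function operating on a value stack; all names and types here are illustrative, not the contents of the files above:
```cpp
#include <cassert>
#include <functional>
#include <map>
#include <string>
#include <vector>

// Stand-in for a boxed value (a real registry would use an EValue-like type).
using BoxedValue = double;
using OpFunction = std::function<void(std::vector<BoxedValue>&)>;

class OperatorRegistry {
 public:
  bool register_op(const std::string& name, OpFunction fn) {
    return ops_.emplace(name, std::move(fn)).second;  // false if already present
  }
  bool has_op(const std::string& name) const { return ops_.count(name) != 0; }
  void call(const std::string& name, std::vector<BoxedValue>& stack) const {
    ops_.at(name)(stack);  // throws if the op was never registered
  }

 private:
  std::map<std::string, OpFunction> ops_;
};

int main() {
  OperatorRegistry registry;
  // Boxed kernel: pops two values off the stack and pushes their sum.
  registry.register_op("aten::add.out", [](std::vector<BoxedValue>& stack) {
    BoxedValue b = stack.back(); stack.pop_back();
    BoxedValue a = stack.back(); stack.pop_back();
    stack.push_back(a + b);
  });
  std::vector<BoxedValue> stack{1.0, 2.0};
  assert(registry.has_op("aten::add.out"));
  registry.call("aten::add.out", stack);
  assert(stack.size() == 1 && stack[0] == 3.0);
  return 0;
}
```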
## Templates
* `NativeFunctions.h`: for generating headers for native functions. (not compiled in the test, since we will be using `libtorch`)
* `RegisterCodegenUnboxedKernels.cpp`: for registering boxed operators.
* `Functions.h`: for declaring operator C++ APIs. Generated `Functions.h` merely wraps `ATen/Functions.h`.
## Build files
* `CMakeLists.txt`: generates code to register ops.
* `build.sh`: driver script, to be called by the CI job.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89596
Approved by: https://github.com/ezyang