pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

chunyuan 8b11d81058 [Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1)	2022-04-29 01:01:33 +00:00

Re-landing https://github.com/pytorch/pytorch/pull/68111

## Description
Preview4 PR of this [RFC](https://github.com/pytorch/pytorch/issues/49444).

On the basis of https://github.com/pytorch/pytorch/pull/50256, the below improvements are included:

- The [preview4 release branch](https://github.com/oneapi-src/oneDNN/releases/tag/graph-v0.4.1) of the oneDNN Graph API is used
- The fuser now works with the profiling graph executor. We have inserted type check nodes to guard the profiled tensor properties.

### User API:
The optimization pass is disabled by default. Users could enable it by:

```
torch.jit.enable_onednn_fusion(True)
```

### Performance:
[pytorch/benchmark](https://github.com/pytorch/benchmark) tool is used to compare the performance:

- SkyLake 8180 (1 socket of 28 cores):

  ![image](https://user-images.githubusercontent.com/65992142/151162305-05e44425-a24e-4d5e-94e1-743b40b87a8c.png)

- SkyLake 8180 (single thread):

  ![image](https://user-images.githubusercontent.com/65992142/151162528-69f90b79-d08d-46b8-8775-d80a6ccbce8a.png)

\* By mapping hardswish to oneDNN Graph, it’s 8% faster than PyTorch JIT (NNC + OFI)

\** We expect performance gain after mapping transpose, contiguous & view to oneDNN graph ops

### Directory structure of the integration code
Fuser-related code are placed under:

```
torch/csrc/jit/codegen/onednn/
```

Optimization pass registration is done in:

```
torch/csrc/jit/passes/onednn_graph_fuser.h
```

CMake for the integration code is:

```
caffe2/CMakeLists.txt
```

## Limitations

- In this PR, we have only supported the optimization on Linux platform. The support on Windows and MacOS will be enabled as the next step.
- We have only optimized the inference use case.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/74596
Approved by: https://github.com/malfet
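The commit message states that the oneDNN fusion pass is off by default, is switched on with `torch.jit.enable_onednn_fusion(True)`, targets inference only, and is supported on Linux only. A minimal sketch of how that switch might be exercised is given here. The toy model, the input shape, the warm-up count, the tolerance, and the helper name `fused_matches_eager` are all assumptions made for illustration and do not come from the commit; the sketch also assumes a PyTorch build that ships this API and quietly returns `None` otherwise.

```python
import sys

try:
    import torch
except ImportError:  # PyTorch not installed; the sketch then does nothing
    torch = None


def fused_matches_eager():
    """Run a tiny frozen TorchScript model with oneDNN fusion requested.

    Hypothetical helper: returns True/False for whether the scripted output
    agrees with eager mode, or None when the fuser cannot be requested here.
    """
    if torch is None or not hasattr(torch.jit, "enable_onednn_fusion"):
        return None
    if not sys.platform.startswith("linux"):
        return None  # the commit message lists Linux as the only supported platform

    # Toy inference model (an assumption, not taken from the commit).
    model = torch.nn.Sequential(
        torch.nn.Conv2d(3, 8, kernel_size=3),
        torch.nn.ReLU(),
    ).eval()
    x = torch.rand(1, 3, 32, 32)

    torch.jit.enable_onednn_fusion(True)
    try:
        with torch.no_grad():
            expected = model(x)
            scripted = torch.jit.freeze(torch.jit.trace(model, x))
            # The fuser works with the profiling graph executor, so run a
            # few warm-up iterations before keeping the result.
            for _ in range(3):
                actual = scripted(x)
    finally:
        # Restore the documented default (pass disabled).
        torch.jit.enable_onednn_fusion(False)
    return bool(torch.allclose(expected, actual, atol=1e-3))


if __name__ == "__main__":
    print("fused output matches eager:", fused_matches_eager())
```

Whether any ops are actually fused depends on the build and on which ops the fuser maps, so the sketch only checks numerical agreement with eager mode and makes no claim about speed.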
_static	clarify the documentation of `torch.meshgrid` (#62977)	2021-08-18 04:01:22 -07:00
_templates	DOC: Merge extraheader block from theme instead of override (#70187)	2022-01-05 06:42:38 -08:00
community	Update persons of interest for ONNX (#72072)	2022-02-16 23:01:13 +00:00
elastic	(torchelastic) make --max_restarts explicit in the quickstart and runner docs (#65838)	2021-09-29 19:29:01 -07:00
notes	fix docs error in Autograd Mechanics	2022-03-29 18:32:16 +00:00
rpc	Support Union in TorchScript (#64234)	2021-09-03 06:12:24 -07:00
scripts	[quant][fx] Move backend_config folder to torch.ao.quantization	2022-04-19 15:38:57 +00:00
amp.rst	add autocast cpu doc	2022-03-22 02:02:43 +00:00
autograd.rst	Targeted documentation updates in autograd.functional (#72111)	2022-02-02 03:19:31 +00:00
backends.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
benchmark_utils.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
bottleneck.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
checkpoint.rst
complex_numbers.rst	Grammatical update of tech docs (#61547)	2021-07-14 14:01:59 -07:00
conf.py	[quant][fx] Move backend_config folder to torch.ao.quantization	2022-04-19 15:38:57 +00:00
config_mod.rst	rename config module file to work with gh pages better	2022-03-10 20:41:44 +00:00
cpp_extension.rst	Check clang++/g++ version when compiling CUDA extensions (#63230)	2022-02-24 08:32:32 +00:00
cpp_index.rst
cuda.rst	Document `torch.cuda.ExternalStream`, `torch.cuda.caching_allocator_alloc` and `torch.cuda.caching_allocator_delete` (#70126)	2022-01-12 15:44:40 -08:00
cudnn_persistent_rnn.rst	Remove orphan from cuDNN persistent note (#65160)	2021-09-21 11:09:47 -07:00
cudnn_rnn_determinism.rst	Forbid trailing whitespace (#53406)	2021-03-05 17:22:55 -08:00
data.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
ddp_comm_hooks.rst	[DDP Comm Hook] Add debugging communication hooks to ddp_comm_hooks.rst (#64352)	2021-09-01 17:37:19 -07:00
deploy.rst	[deploy] docs (#69251)	2021-12-01 21:55:18 -08:00
distributed.algorithms.join.rst	Add tutorial link (#62785)	2021-08-05 17:28:02 -07:00
distributed.elastic.rst	[1/n][torch/elastic] Move torchelastic docs *.rst (#148)	2021-05-04 00:57:56 -07:00
distributed.optim.rst	[distributed][docs] Delete distributed optimimzer section from RPC and add reference to namespace docs page (#68068)	2021-11-09 15:01:54 -08:00
distributed.rst	[PyTorch Distributed] Update documentation about NCCL environment variables (#74006)	2022-03-11 23:57:17 +00:00
distributions.rst	[Reinstate] Wishart distribution (#70377)	2021-12-30 11:41:46 -08:00
dlpack.rst	Lint trailing newlines (#54737)	2021-03-30 13:09:52 -07:00
docutils.conf
fft.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
fsdp.rst	make fsdp folder to be public (#72084)	2022-02-02 15:50:14 +00:00
futures.rst	Update docs to mention CUDA support for Future (#50048)	2021-05-11 08:26:33 -07:00
fx.rst	Fix doc build	2022-04-19 04:07:47 +00:00
hub.rst	Add more details to the known limitations section of torchhub docs (#69970)	2021-12-16 02:43:48 -08:00
index.rst	Update Index.rst to add TorchRec to domain list.	2022-04-15 02:39:12 +00:00
jit_builtin_functions.rst	Lint trailing newlines (#54737)	2021-03-30 13:09:52 -07:00
jit_language_reference_v2.rst	Add Union type to TorchScript Language Ref (#69514)	2021-12-07 12:53:54 -08:00
jit_language_reference.rst	fix typos in jit_language_reference.rst (#68706)	2021-11-22 19:09:06 -08:00
jit_python_reference.rst	[JIT] improve documentation (#57991)	2021-05-19 11:47:32 -07:00
jit_unsupported.rst	[JIT] Update docs for recently added features (#45232)	2020-09-28 18:17:42 -07:00
jit.rst	[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1)	2022-04-29 01:01:33 +00:00
linalg.rst	Add torch.linalg.ldl_factor_ex and torch.linalg.ldl_solve	2022-04-28 19:23:37 +00:00
math-quantizer-equation.png
mobile_optimizer.rst	Mod lists to neutral+descriptive terms in caffe2/docs (#49803)	2020-12-23 11:37:11 -08:00
model_zoo.rst
monitor.rst	torch/monitor: merge Interval and FixedCount stats (#72009)	2022-01-30 23:21:59 +00:00
multiprocessing.rst	Forbid trailing whitespace (#53406)	2021-03-05 17:22:55 -08:00
name_inference.rst	Abladawood patch 1 (#58496)	2021-05-20 10:32:18 -07:00
named_tensor.rst	Forbid trailing whitespace (#53406)	2021-03-05 17:22:55 -08:00
nested.rst	Minimal NestedTensor (#72881)	2022-03-02 16:31:51 +00:00
nn.functional.rst	Revert D34154832: [pytorch][PR] Add `multi_head_attention_forward` to functional rst docs	2022-02-11 05:08:46 +00:00
nn.init.rst
nn.rst	move the stateless util to public API!	2022-04-21 13:42:24 +00:00
onnx_supported_aten_ops.rst	Add list of supported ATen ops by ONNX converter into torch.onnx page	2022-04-07 00:05:44 +00:00
onnx.rst	Update onnx.rst	2022-04-08 20:07:01 +00:00
optim.rst	To add SequentialLR to PyTorch Core Schedulers (#64037)	2021-09-09 09:36:32 -07:00
package.rst	Fix some typos.	2022-04-11 21:55:59 +00:00
pipeline.rst	Minor changes in documentation (#68557)	2021-11-18 17:57:16 -08:00
profiler.rst	Add low level torch.profiler.kineto_profile base class (#63302)	2021-12-14 14:47:43 -08:00
quantization-backend-configuration.rst	quantization: autogenerate quantization backend configs for documentation (#75126)	2022-04-04 22:22:30 +00:00
quantization-support.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
quantization.rst	[quant][docs] Fix formatting for quantization.rst (#76223)	2022-04-26 03:16:39 +00:00
random.rst
rpc.rst	Add note in RPC docs about retries. (#73601)	2022-03-03 00:29:31 +00:00
sparse.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
special.rst	Implement torch.special.log_ndtr	2022-03-29 23:13:37 +00:00
storage.rst	Virtualize `<type>Storage` classes (#66970)	2022-03-22 23:44:48 +00:00
tensor_attributes.rst	fix wrong indexing of class names in docs	2022-03-02 22:21:21 +00:00
tensor_view.rst	Correcting a minor typo: "Users should pay" instead of "Users should be pay" (#72500)	2022-02-08 23:08:25 +00:00
tensorboard.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
tensors.rst	[complex32] add chalf alias for complex32 and chalf method	2022-04-20 23:44:47 +00:00
testing.rst	promote torch.testing to stable (#73348)	2022-02-25 06:30:31 +00:00
torch.ao.ns._numeric_suite_fx.rst	Quantization docs: add pages for Numeric Suite (Eager and FX) (#66380)	2021-10-11 18:47:58 -07:00
torch.ao.ns._numeric_suite.rst	Quantization docs: add pages for Numeric Suite (Eager and FX) (#66380)	2021-10-11 18:47:58 -07:00
torch.overrides.rst	Add documentation for torch.overrides submodule. (#48170)	2020-11-30 11:25:31 -08:00
torch.rst	Cleanup all module references in doc (#73983)	2022-03-10 22:26:29 +00:00
type_info.rst	[Docs] Mention `torch.bfloat16` in `torch.finfo` (#68496)	2021-11-18 17:52:41 -08:00