pytorch/torch/csrc/jit/codegen/cuda/runtime
jiej 76d282d447 Nvfuser code bump 12 5 (#69964)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/69964

Things added in this PR that requires review:
1. cuLaunchCooperativeKernel driver API added
aten/src/ATen/cuda/detail/LazyNVRTC.cpp
aten/src/ATen/cuda/nvrtc_stub/ATenNVRTC.h

nvfuser code update:
1. perf turning on codegen scheduler that improves performance.
2. permutation support has been extended beyond contiguous/channels-last. (The improvements could be observed on PW benchmark)

Things reverted from local changes:
1. aten::gelu with approximation
2. local changes that is upstreamed in PR https://github.com/pytorch/pytorch/issues/68804

Pull Request resolved: https://github.com/pytorch/pytorch/pull/69428

Reviewed By: ngimel

Differential Revision: D33073817

Pulled By: wconstab

fbshipit-source-id: e77d32e81d037d7370822b040456fd4c3bd68edb
2021-12-16 08:28:54 -08:00
..
bf16_support.cu Nvfuser code bump 11 5 (#67943) 2021-11-17 01:22:17 -08:00
block_reduction.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
block_sync_atomic.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
block_sync_default.cu Nvfuser code bump 11 5 (#67943) 2021-11-17 01:22:17 -08:00
broadcast.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
fp16_support.cu Nvfuser code bump 11 5 (#67943) 2021-11-17 01:22:17 -08:00
grid_broadcast.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
grid_reduction.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
grid_sync.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
helpers.cu Nvfuser code bump 11 5 (#67943) 2021-11-17 01:22:17 -08:00
index_utils.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00
random_numbers.cu Revert "Revert D30752939: [pytorch][PR] nvfuser update" (#65137) 2021-09-22 04:54:51 -07:00
tensor.cu Revert "Revert D30752939: [pytorch][PR] nvfuser update" (#65137) 2021-09-22 04:54:51 -07:00
warp.cu Nvfuser code bump 11 5 (#67943) 2021-11-17 01:22:17 -08:00
welford.cu Nvfuser code bump 12 5 (#69964) 2021-12-16 08:28:54 -08:00