pytorch/benchmarks/cpp/tensorexpr
Wang, Eikan 429a80dded [NNC] Lowering function generates the output buffer with the specified stride (#76529)
Summary:
Pass stride information to lowering function to generate the output bufer with proper memory layout.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76529

Reviewed By: ZolotukhinM

Differential Revision: D36116712

Pulled By: IvanKobzarev

fbshipit-source-id: d3901f756b3710ecce172d6db3ecb0b7c12fb929
(cherry picked from commit b6cd53c91c01db36ea0e99167dc0ce0ae1d3aa23)
2022-05-04 20:04:22 +00:00
..
bench_approx.cpp [bench] Fix build issues with TensorExpr cpp benchmarks (#72196) 2022-02-07 07:01:19 +00:00
bench_batchnorm.cpp [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
bench_compile.cpp [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
bench_concat.cpp [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
bench_fuser_overhead.cpp use irange for loops 2 (#66746) 2021-12-10 04:26:23 -08:00
bench_gemm.cpp [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
bench_kernels.cpp [bench] Adding a cpp benchmark to compare performance of nnc with static and symbolic shapes (#72197) 2022-02-07 07:01:19 +00:00
bench_ops.py Updates internal assert_allclose callsites in favor of assert_close (#61841) 2021-08-19 12:50:41 -07:00
bench_parallel.cpp [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
bench_prefix_sum.cpp [PyTorch] Add int version of vectorized PrefixSum to Benchmark (#67865) 2021-11-04 14:00:19 -07:00
bench_reduce.cpp [NNC] Lowering function generates the output buffer with the specified stride (#76529) 2022-05-04 20:04:22 +00:00
bench_signed_log1p.cpp [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
CMakeLists.txt [bench] Adding a cpp benchmark to compare performance of nnc with static and symbolic shapes (#72197) 2022-02-07 07:01:19 +00:00
main.cpp