pytorch/test/cpp/tensorexpr
PaulZhang12 3ed5f1fb77 [CUDA][cuBLAS] Aten GEMM overload for FP32 output from FP16/BF16 inputs (#150812)
Enable FP32 output from FP16/BF16 GEMMs in aten with cuBLAS. Accumulation for these GEMMs are generally already done in FP32. Adds the functionality to the following aten operators:
* mm
* bmm
* addmm
* baddmm

Follow up of customer issue: https://github.com/pytorch/pytorch/issues/146241#issuecomment-2781889390

Differential Revision: [D73126191](https://our.internmc.facebook.com/intern/diff/D73126191)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/150812
Approved by: https://github.com/ngimel, https://github.com/eqy
2025-04-18 01:53:26 +00:00
..
CMakeLists.txt Set RUNPATH so installed tests can find the required shared libraries (#136627) 2024-10-25 09:38:08 +00:00
gtest_assert_float_eq.h
padded_buffer.cpp
padded_buffer.h
README.md
test_approx.cpp
test_aten.cpp
test_base.h [11/N] Fix clang-tidy warnings in jit (#132131) 2024-07-31 03:45:52 +00:00
test_boundsinference.cpp
test_conv.cpp
test_cpp_codegen.cpp
test_cuda.cpp
test_dynamic_shapes.cpp
test_expr.cpp
test_external_calls.cpp [CUDA][cuBLAS] Aten GEMM overload for FP32 output from FP16/BF16 inputs (#150812) 2025-04-18 01:53:26 +00:00
test_graph_opt.cpp
test_ir_printer.cpp Fix floating point literals in IRPrinter (#142119) 2024-12-18 21:59:48 +00:00
test_ir_verifier.cpp
test_kernel.cpp
test_llvm.cpp Check unused variables in tests (#127498) 2024-06-04 05:35:25 +00:00
test_loopnest.cpp [11/N] Fix clang-tidy warnings in jit (#132131) 2024-07-31 03:45:52 +00:00
test_memdependency.cpp
test_memplanning.cpp
test_ops.cpp
test_quantization.cpp
test_reductions.cpp C10_UNUSED to [[maybe_unused]] (#6357) (#138364) 2024-10-19 13:17:43 +00:00
test_registerizer.cpp
test_simplify.cpp C10_UNUSED to [[maybe_unused]] (#6357) (#138364) 2024-10-19 13:17:43 +00:00
test_te_fuser_pass.cpp
test_type_specializations.cpp
test_type.cpp
test_utils.h
tutorial.cpp

TensorExpr C++ Tests

How to add a new test

First, create a new test file. Test files should have be placed in this directory, with a name that starts with test_, like test_foo.cpp.

Here is an example test file you can copy-paste.

#include <test/cpp/tensorexpr/test_base.h>

// Tests go in torch::jit
namespace torch {
namespace jit {

// 1. Test cases are void() functions.
// 2. They start with the prefix `test`
void testCaseOne() {
    // ...
}

void testCaseTwo() {
    // ...
}
}
}

Then, register your test in tests.h:

// Add to TH_FORALL_TESTS_CUDA instead for CUDA-requiring tests
#define TH_FORALL_TESTS(_)             \
  _(ADFormulas)                        \
  _(Attributes)                        \
  ...
  _(CaseOne)  // note that the `test` prefix is omitted.
  _(CaseTwo)

We glob all the test files together in CMakeLists.txt so that you don't have to edit it every time you add a test. Unfortunately, this means that in order to get the build to pick up your new test file, you need to re-run cmake:

python setup.py build --cmake

How do I run the tests?

The following commands assume you are in PyTorch root.

# (re)build the test binary
ninja build/bin/test_tensorexpr
# run
build/bin/test_tensorexpr --gtest_filter='glob_style_filter*'