pytorch/benchmarks/cpp
Nikita Shulga 80ea6955af Add cuda-11.3+clang9 build workflow (take 2)
To be able to detect unused captures in GPU code lambdas (as gcc does not support this diagnostic)

Remove unused opts lambda capture in `ProcessGroupMPI.cpp` and `Distributions.cu`

Fix sign-compare in nvfuser benchmark and ignore signed unsigned comparison in nvfuser tests
Fixes https://github.com/pytorch/pytorch/issues/75475 by aliasing CMAKE_CUDA_HOST_COMPILER to C_COMPILER when clang is used
Pull Request resolved: https://github.com/pytorch/pytorch/pull/75293
Approved by: https://github.com/atalman, https://github.com/seemethere
2022-04-11 17:13:01 +00:00
..
nvfuser Add cuda-11.3+clang9 build workflow (take 2) 2022-04-11 17:13:01 +00:00
tensorexpr [TensorExpr] Delet DimArg class. (#72390) 2022-02-11 01:21:59 +00:00
CMakeLists.txt CPU Convolution benchmark harness for some popular models (#56455) 2021-04-22 22:14:36 -07:00
convolution.cpp Disable avoid-non-const-global-variables lint check (#62008) 2021-07-22 18:04:40 -07:00