[CUDA][cuBLAS] Bump test_cublas_baddbmm_large_input tolerances (#117889)

Unfortunately, the current `rtol=1e-5` fails on L40 with a literal 1 / 1000000 element mismatch (observed relative difference `1.04e-5`). Bump `rtol` to `1.1e-5` so the test passes there.
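For context, a minimal sketch of why so small a bump is enough, assuming the usual closeness check `|actual - expected| <= atol + rtol * |expected|` (the form documented for `torch.testing.assert_close`). The helper `is_close` and the value `expected = 1000.0` are hypothetical and chosen only for illustration; they are not taken from the test.

```python
def is_close(actual, expected, rtol, atol):
    """Elementwise closeness check: |actual - expected| <= atol + rtol * |expected|."""
    return abs(actual - expected) <= atol + rtol * abs(expected)


# Hypothetical element whose relative difference is 1.04e-5,
# the figure reported in the commit message.
expected = 1000.0
actual = expected * (1 + 1.04e-5)

# Old tolerances: the allowed error is just below the observed error.
old_ok = is_close(actual, expected, rtol=1e-5, atol=1e-5)

# New tolerances: rtol=1.1e-5 leaves a small margin above 1.04e-5.
new_ok = is_close(actual, expected, rtol=1.1e-5, atol=1e-5)

print(old_ok, new_ok)
```

Because `atol` is unchanged and small relative to `rtol * |expected|` for large values, the relative term is what decides the outcome for such an element.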

CC @ptrblck

Pull Request resolved: https://github.com/pytorch/pytorch/pull/117889
Approved by: https://github.com/atalman
eqy 2024-02-27 19:05:20 +00:00 committed by PyTorch MergeBot
parent 5b5c167adc
commit 65efece3a4

@@ -150,7 +150,7 @@ class TestMatmulCuda(TestCase):
     @onlyCUDA
     @unittest.skipIf(IS_JETSON, "Too large for Jetson")
-    @toleranceOverride({torch.float32: xtol(atol=1e-5, rtol=1e-5)})
+    @toleranceOverride({torch.float32: xtol(atol=1e-5, rtol=1.1e-5)})
     @dtypes(*([torch.float32, torch.float16] +
               [torch.bfloat16] if TEST_WITH_ROCM or SM53OrLater else []))
     @parametrize(