pytorch/caffe2/perfkernels
Evgeny Fiksman c3b28491c8 [caffe2] Add AVX512 support for box_cox operator (#143627)
Summary:
Reuse templetized implementation of box_cox caffe2 operator.
* Duplicate .cc file of AVX2
* change intrinsics functions to use AVX512 instructions
* override templates
* extend the caller to use new methods
* guard AVX512 with a gflag to allow smooth transition

Differential Revision: D67433457

Pull Request resolved: https://github.com/pytorch/pytorch/pull/143627
Approved by: https://github.com/hl475
2025-01-07 09:54:39 +00:00
..
batch_box_cox_avx512.cc [caffe2] Add AVX512 support for box_cox operator (#143627) 2025-01-07 09:54:39 +00:00
batch_box_cox_vec.h [caffe2] Add AVX512 support for box_cox operator (#143627) 2025-01-07 09:54:39 +00:00
CMakeLists.txt Add SVE implementation of embedding_lookup_idx (#133995) 2024-10-15 18:52:44 +00:00
common_avx.cc Revert "Fix clang-tidy warnings in Caffe2 code (#134935)" 2024-09-13 16:42:37 +00:00
common_avx2.cc Revert "Fix clang-tidy warnings in Caffe2 code (#134935)" 2024-09-13 16:42:37 +00:00
common_sve.cc Add SVE implementation of embedding_lookup_idx (#133995) 2024-10-15 18:52:44 +00:00
common.h Add SVE implementation of embedding_lookup_idx (#133995) 2024-10-15 18:52:44 +00:00
embedding_lookup_idx_avx2.cc
embedding_lookup_idx_sve.cc Add SVE implementation of embedding_lookup_idx (#133995) 2024-10-15 18:52:44 +00:00
embedding_lookup_idx.cc [6/N] Fix Wextra-semi warning (#139605) 2024-11-04 13:43:16 +00:00
embedding_lookup_idx.h
hp_emblookup_codegen.py Fix ruff warnings in caffe2 and functorch (#144182) 2025-01-04 04:15:01 +00:00
sve_emblookup_codegen.py Fix ruff warnings in caffe2 and functorch (#144182) 2025-01-04 04:15:01 +00:00