pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 00:20:18 +01:00

History

Michael Melesse 785ebb9d6d [ROCM] Navi21 Enablement 3: Embedding kernels (#72809 ) Summary: This PR is a follow up to the following prs. https://github.com/pytorch/pytorch/pull/69942 https://github.com/pytorch/pytorch/pull/72682 We are adding support to Navi21 GPUs which have a warpsize of 32. We cannot rely on a constant so we have to dynamically look up the warpsize when launching the kernel on the host side. Inside device functions this is not needed and the compiler can correctly detect the correct warpsize to replace the C10_WARP_SIZE constant. Pull Request resolved: https://github.com/pytorch/pytorch/pull/72809 Reviewed By: mruberry Differential Revision: D34400737 Pulled By: ngimel fbshipit-source-id: 1a1374465d4006e485d4d11531a4c78ddb178cdf (cherry picked from commit `94211fe1f0`)		2022-02-23 04:26:58 +00:00
..
conda
src	[ROCM] Navi21 Enablement 3: Embedding kernels (#72809 )	2022-02-23 04:26:58 +00:00
tools	Do not build PyTorch with caffe2 by default (#66658 )	2021-10-21 20:32:47 -07:00
CMakeLists.txt	Define TORCH_ASSERT_ONLY_METHOD_OPERATORS in ATen/core (#72344 )	2022-02-07 15:48:56 +00:00