pytorch/c10
Banit Agrawal a575ce0dc6 [PyTorch Pinned Allocator] Add support of background thread to process events (#135524)
Summary: Currently we process events in the regular allocation path and we call cudaEventQuery to check on the events and this path can take some locks in libcuda driver. Its not entirely needed to do process events in the allocation path, we could move this to a background thread and keep processing events regularly and put the freed block to the free list.

Differential Revision: D62396585

Pull Request resolved: https://github.com/pytorch/pytorch/pull/135524
Approved by: https://github.com/zyan0
2024-09-17 21:08:10 +00:00
..
benchmark [Split Build] Add option to create libtorch wheel and use it to build pytorch as a separate wheel (#126328) 2024-05-29 04:33:56 +00:00
core [MTIA tensor] allow shallow copy between CPU and MTIA tensors (#135871) 2024-09-13 22:13:58 +00:00
cuda [PyTorch Pinned Allocator] Add support of background thread to process events (#135524) 2024-09-17 21:08:10 +00:00
hip [Split Build] Add option to create libtorch wheel and use it to build pytorch as a separate wheel (#126328) 2024-05-29 04:33:56 +00:00
macros add msg to _assert_async (#134813) 2024-09-03 06:33:18 +00:00
mobile Fix broken lint after #116876 (#122253) 2024-03-20 04:09:00 +00:00
test [Fix] Check name when registering privateuse1 backend (#134071) 2024-08-27 20:28:30 +00:00
util [pytorch][monitoring] Dynamic backend for WaitCounter (#135967) 2024-09-15 18:07:49 +00:00
xpu Fix xpu memory stats error (#135818) 2024-09-13 02:41:21 +00:00
BUCK.oss [1/4] Intel GPU Runtime Upstreaming for Device (#116019) 2024-01-12 07:36:25 +00:00
BUILD.bazel
build.bzl Revert "Move at::{Refcounted,}MapAllocator to c10 (#109881)" 2023-10-13 17:57:53 +00:00
CMakeLists.txt [pytorch][monitoring] Dynamic backend for WaitCounter (#135967) 2024-09-15 18:07:49 +00:00
ovrsource_defs.bzl [NeuralNetInference] Bring up iOS builds (#131917) 2024-07-30 23:01:09 +00:00