pytorch/docs/source/notes
Banit Agrawal f39789cdab [PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (#164501)
Summary:
This diff adds the feature of allocating a large pinned memory segment upfront based on the provided config. This large segment is then used to serve all the small pinned memory requests to avoid expensive device level APIs (slow paths).

Example:

PYTORCH_CUDA_ALLOC_CONF=pinned_reserve_segment_size_mb:2048

This reserves a 2GB pinned memory segment for the process and then all incoming small requests are just served from this segment and no cudaHostAlloc/cudaHostRegister apis are being called.

Differential Revision: D83779074

Pull Request resolved: https://github.com/pytorch/pytorch/pull/164501
Approved by: https://github.com/yangw-dev
2025-10-03 18:11:27 +00:00
..
amp_examples.rst
autograd.rst [doc] Add documentation for division by zero behavior in autograd (#155987) 2025-06-16 19:02:12 +00:00
broadcasting.rst Fix comment on broadcasting example to clarify dimension mismatch (#162177) 2025-09-29 16:47:48 +00:00
cpu_threading_torchscript_inference.rst [3/n] Remove references to TorchScript in PyTorch docs (#158315) 2025-07-15 21:14:18 +00:00
cuda.rst [PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (#164501) 2025-10-03 18:11:27 +00:00
custom_operators.rst
ddp.rst
extending.func.rst
extending.rst [autograd][docs] Add more details on why save_for_backward is important in extending autograd note (#153005) 2025-05-09 16:36:57 +00:00
faq.rst
get_start_xpu.rst update supported OS for Intel client GPU (#161699) 2025-09-01 05:45:09 +00:00
gradcheck.rst [BE] fix typos in docs/ (#156080) 2025-06-21 02:47:32 +00:00
hip.rst [ROCm] Ck backend UX refactor (#152951) 2025-08-08 18:40:17 +00:00
large_scale_deployments.rst [3/n] Remove references to TorchScript in PyTorch docs (#158315) 2025-07-15 21:14:18 +00:00
libtorch_stable_abi.md Add ScalarType -> shim conversion, add stable::Tensor.scalar_type (#160557) 2025-08-19 22:13:47 +00:00
mkldnn.rst Enable TF32 as fp32 internal precision for matmul/linear/conv (#157520) 2025-07-17 08:57:34 +00:00
modules.rst Fix to modules.rst: indent line with activation functions (#139667) 2024-11-08 01:12:52 +00:00
mps.rst
multiprocessing.rst [BE] fix typos in docs/ (#156080) 2025-06-21 02:47:32 +00:00
numerical_accuracy.rst Update warning of TF32 (#158209) 2025-07-16 01:28:50 +00:00
out.rst add Out Notes (#151306) 2025-04-24 20:25:09 +00:00
randomness.rst [cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks (#161749) 2025-10-03 00:09:47 +00:00
serialization.rst Delete sections referencing torchscript in serialization docs (#156648) 2025-06-25 23:41:24 +00:00
windows.rst Removing conda references from PyTorch Docs (#152702) 2025-05-20 20:33:28 +00:00