pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

History

Banit Agrawal f39789cdab [PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (#164501 ) Summary: This diff adds the feature of allocating a large pinned memory segment upfront based on the provided config. This large segment is then used to serve all the small pinned memory requests to avoid expensive device level APIs (slow paths). Example: PYTORCH_CUDA_ALLOC_CONF=pinned_reserve_segment_size_mb:2048 This reserves a 2GB pinned memory segment for the process and then all incoming small requests are just served from this segment and no cudaHostAlloc/cudaHostRegister apis are being called. Differential Revision: D83779074 Pull Request resolved: https://github.com/pytorch/pytorch/pull/164501 Approved by: https://github.com/yangw-dev		2025-10-03 18:11:27 +00:00
..
amp_examples.rst
autograd.rst	[doc] Add documentation for division by zero behavior in autograd (#155987 )	2025-06-16 19:02:12 +00:00
broadcasting.rst	Fix comment on broadcasting example to clarify dimension mismatch (#162177 )	2025-09-29 16:47:48 +00:00
cpu_threading_torchscript_inference.rst	[3/n] Remove references to TorchScript in PyTorch docs (#158315 )	2025-07-15 21:14:18 +00:00
cuda.rst	[PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (#164501 )	2025-10-03 18:11:27 +00:00
custom_operators.rst
ddp.rst
extending.func.rst
extending.rst	[autograd][docs] Add more details on why save_for_backward is important in extending autograd note (#153005 )	2025-05-09 16:36:57 +00:00
faq.rst
get_start_xpu.rst	update supported OS for Intel client GPU (#161699 )	2025-09-01 05:45:09 +00:00
gradcheck.rst	[BE] fix typos in docs/ (#156080 )	2025-06-21 02:47:32 +00:00
hip.rst	[ROCm] Ck backend UX refactor (#152951 )	2025-08-08 18:40:17 +00:00
large_scale_deployments.rst	[3/n] Remove references to TorchScript in PyTorch docs (#158315 )	2025-07-15 21:14:18 +00:00
libtorch_stable_abi.md	Add ScalarType -> shim conversion, add stable::Tensor.scalar_type (#160557 )	2025-08-19 22:13:47 +00:00
mkldnn.rst	Enable TF32 as fp32 internal precision for matmul/linear/conv (#157520 )	2025-07-17 08:57:34 +00:00
modules.rst	Fix to modules.rst: indent line with activation functions (#139667 )	2024-11-08 01:12:52 +00:00
mps.rst
multiprocessing.rst	[BE] fix typos in docs/ (#156080 )	2025-06-21 02:47:32 +00:00
numerical_accuracy.rst	Update warning of TF32 (#158209 )	2025-07-16 01:28:50 +00:00
out.rst	add Out Notes (#151306 )	2025-04-24 20:25:09 +00:00
randomness.rst	[cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks (#161749 )	2025-10-03 00:09:47 +00:00
serialization.rst	Delete sections referencing torchscript in serialization docs (#156648 )	2025-06-25 23:41:24 +00:00
windows.rst	Removing conda references from PyTorch Docs (#152702 )	2025-05-20 20:33:28 +00:00