pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Emilio Castillo c9d4390d13 Add Pluggable CUDA allocator backend (#86786 ) Fixes #43144 This uses the Backend system added by [82682](https://github.com/pytorch/pytorch/pull/82682) to change allocators dynamically during the code execution. This will allow us to use RMM, use CUDA managed memory for some portions of the code that do not fit in GPU memory. Write static memory allocators to reduce fragmentation while training models and improve interoperability with external DL compilers/libraries. For example, we could have the following allocator in c++ ```c++ #include <sys/types.h> #include <cuda_runtime_api.h> #include <iostream> extern "C" { void* my_malloc(ssize_t size, int device, cudaStream_t stream) { void ptr; std::cout<<"alloc "<< size<<std::endl; cudaMalloc(&ptr, size); return ptr; } void my_free(void ptr) { std::cout<<"free "<<std::endl; cudaFree(ptr); } } ``` Compile it as a shared library ``` nvcc allocator.cc -o alloc.so -shared --compiler-options '-fPIC' ``` And use it from PyTorch as follows ```python import torch # Init caching # b = torch.zeros(10, device='cuda') new_alloc = torch.cuda.memory.CUDAPluggableAllocator('alloc.so', 'my_malloc', 'my_free') old = torch.cuda.memory.get_current_allocator() torch.cuda.memory.change_current_allocator(new_alloc) b = torch.zeros(10, device='cuda') # This will error since the current allocator was already instantiated torch.cuda.memory.change_current_allocator(old) ``` Things to discuss - How to test this, needs compiling external code ... Pull Request resolved: https://github.com/pytorch/pytorch/pull/86786 Approved by: https://github.com/albanD		2022-11-23 17:54:36 +00:00
..
_static	[maskedtensor] add docs (#84887 )	2022-10-19 20:44:34 +00:00
_templates	Fix left nav (#78552 )	2022-06-01 00:49:53 +00:00
community	Fix dev-discuss link in the maintainer docs (#89493 )	2022-11-22 19:33:21 +00:00
elastic	Add watchdog to TorchElastic agent and trainers (#84081 )	2022-09-07 00:17:20 +00:00
notes	Add Pluggable CUDA allocator backend (#86786 )	2022-11-23 17:54:36 +00:00
rpc
scripts	[ONNX] Document ONNX diagnostics (#88371 )	2022-11-16 19:21:46 +00:00
_dynamo.rst	Add `torch._dynamo` to docs (#89510 )	2022-11-23 16:33:13 +00:00
amp.rst	Remove deprecated torch.matrix_rank (#70981 )	2022-09-22 17:40:46 +00:00
autograd.rst	Change torch.autograd.graph.disable_saved_tensors_hooks to be public API (#85994 )	2022-10-03 16:25:01 +00:00
backends.rst	Create native function for determining which implementation of SDP to call (#89029 )	2022-11-16 03:07:54 +00:00
benchmark_utils.rst
bottleneck.rst	add itt unit test and docstrings (#84848 )	2022-09-28 01:39:58 +00:00
checkpoint.rst
complex_numbers.rst	Add a note on CUDA 11.6 (#80363 )	2022-06-27 21:34:24 +00:00
conf.py	[ONNX] Document ONNX diagnostics (#88371 )	2022-11-16 19:21:46 +00:00
config_mod.rst
cpp_extension.rst
cpp_index.rst
cuda._sanitizer.rst	Fix typos under docs directory (#88033 )	2022-10-31 19:31:56 +00:00
cuda.rst	Add Pluggable CUDA allocator backend (#86786 )	2022-11-23 17:54:36 +00:00
cudnn_persistent_rnn.rst
cudnn_rnn_determinism.rst
data.rst	[DataLoader] Removing DataLoader2 related code (#88848 )	2022-11-11 22:27:01 +00:00
ddp_comm_hooks.rst	Fix two small typos in ddp_comm_hooks.rst (#82047 )	2022-07-23 19:10:57 +00:00
deploy.rst	Delete torch::deploy from pytorch core (#85953 )	2022-10-06 07:20:16 +00:00
distributed.algorithms.join.rst
distributed.checkpoint.rst	[PT-D][Checkpointing] Move distributed checkpointing from torch.distributed._shard.checkpoint to torch.distributed.checkpoint (#88698 )	2022-11-16 21:06:38 +00:00
distributed.elastic.rst
distributed.optim.rst
distributed.rst	Add torch.distributed.DistBackendError exception type, thrown from C10D_NCCL_CHECK (#88134 )	2022-11-08 13:26:42 +00:00
distributions.rst
dlpack.rst
docutils.conf
fft.rst
fsdp.rst	[FSDP()][3/N] Refactor public APIs (#87917 )	2022-10-31 16:45:21 +00:00
futures.rst
fx.rst	prepare removal of deprecated functionality in torch.testing (#87969 )	2022-11-02 14:04:48 +00:00
hub.rst
index.rst	Add `torch._dynamo` to docs (#89510 )	2022-11-23 16:33:13 +00:00
jit_builtin_functions.rst
jit_language_reference_v2.rst	Fix typos in docs (#80602 )	2022-08-29 23:32:44 +00:00
jit_language_reference.rst	(Re-open) Adds cudaMallocAsync as an alternative backend for the CUDA allocator (#82682 )	2022-10-12 03:44:21 +00:00
jit_python_reference.rst
jit_unsupported.rst	(Re-open) Adds cudaMallocAsync as an alternative backend for the CUDA allocator (#82682 )	2022-10-12 03:44:21 +00:00
jit_utils.rst	Create __init__.py (#78629 )	2022-06-03 18:14:21 +00:00
jit.rst	torch.jit doc link for nvfuser readme.md (#77780 )	2022-07-07 23:25:35 +00:00
library.rst	Add docs for Python Registration	2022-06-13 23:21:23 +00:00
linalg.rst	Add a note on the stability of linalg functions. (#88313 )	2022-11-07 22:44:23 +00:00
masked.rst	Fix typos in .md and .rst files (#88962 )	2022-11-17 03:37:02 +00:00
math-quantizer-equation.png
mobile_optimizer.rst	[Vulkan] Add Vulkan Rewrite to Transfer Inputs and Outputs to Vulkan and CPU Backends Respectively (#87432 )	2022-10-31 14:18:45 +00:00
model_zoo.rst
monitor.rst
multiprocessing.rst
name_inference.rst
named_tensor.rst	Add torch.unflatten and improve its docs (#81399 )	2022-07-29 15:02:42 +00:00
nested.rst	Removed unecessary check in `select_nested` (#89150 )	2022-11-16 22:11:37 +00:00
nn.functional.rst	Add Dropout1d module	2022-06-15 14:39:07 +00:00
nn.init.rst	update nn.init doc to reflect the no_grad (#80882 )	2022-07-07 17:19:29 +00:00
nn.rst	Add Dropout1d module	2022-06-15 14:39:07 +00:00
onnx_diagnostics.rst	[ONNX] Document ONNX diagnostics (#88371 )	2022-11-16 19:21:46 +00:00
onnx_supported_aten_ops.rst	[ONNX] Update ONNX documentation to include unsupported operators (#84496 )	2022-09-16 23:48:37 +00:00
onnx.rst	[ONNX] Add onnx-script into ONNX docs (#89078 )	2022-11-17 06:27:17 +00:00
optim.rst	[doc] LR scheduler example fix (#86629 )	2022-10-11 21:41:50 +00:00
package.rst	Fix typos in `torch.package` documentation (#82994 )	2022-08-08 20:19:17 +00:00
pipeline.rst
profiler.rst	Fix ITT unit-tests if PyTorch is compiled with `USE_ITT=OFF` (#86199 )	2022-10-04 21:57:05 +00:00
quantization-accuracy-debugging.rst	Fix typo under docs directory (#87583 )	2022-10-24 23:52:44 +00:00
quantization-backend-configuration.rst
quantization-support.rst	Fix typos under docs directory (#88033 )	2022-10-31 19:31:56 +00:00
quantization.rst	Fix typos under docs directory (#88033 )	2022-10-31 19:31:56 +00:00
random.rst
rpc.rst	Fix typo under docs directory and RELEASE.md (#85896 )	2022-09-29 21:41:59 +00:00
signal.rst	Reimplement Kaiser window (#87330 )	2022-10-27 21:01:01 +00:00
sparse.rst	Fix typos in .md and .rst files (#88962 )	2022-11-17 03:37:02 +00:00
special.rst	[primTorch] special: j0, j1, spherical_j0 (#86049 )	2022-10-04 18:21:46 +00:00
storage.rst	Deprecate TypedStorage, its derived classes, and all of their public methods (#85303 )	2022-11-08 18:11:01 +00:00
tensor_attributes.rst	Chore: Add 'mps' to the docs of tensor_attributes (#86585 )	2022-10-14 19:59:33 +00:00
tensor_view.rst
tensorboard.rst
tensors.rst	Remove deprecated torch.lstsq (#70980 )	2022-09-23 00:16:55 +00:00
testing.rst	Fix links in `torch.testing` docs (#80353 )	2022-07-11 19:15:53 +00:00
torch.ao.ns._numeric_suite_fx.rst
torch.ao.ns._numeric_suite.rst
torch.overrides.rst	Revert "Revert "Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 )""	2022-05-18 18:40:57 +00:00
torch.rst	[primTorch] Add a ref for `narrow_copy` (#86748 )	2022-10-17 10:16:05 +00:00
type_info.rst	ENH: Convert finfo.tiny to finfo.smallest_normal (#76292 )	2022-05-20 00:59:48 +00:00