pytorch/torch
Mikayla Gawarecki 018e48c337 [Reland] Add wrappers for synchronous GPUDirect Storage APIs (#133489)
Reland #130633

USE_CUFILE turned off by default in this version
Pull Request resolved: https://github.com/pytorch/pytorch/pull/133489
Approved by: https://github.com/albanD
2024-08-15 17:11:52 +00:00
..
_awaits [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
_C [Reland] Add wrappers for synchronous GPUDirect Storage APIs (#133489) 2024-08-15 17:11:52 +00:00
_C_flatbuffer
_custom_op Tighten torch.library.infer_schema input types (#130705) 2024-07-29 16:01:19 +00:00
_decomp Add a private _safe_softmax (#131060) 2024-08-08 23:09:38 +00:00
_dispatch [BE][Easy][15/19] enforce style for empty lines in import segments in torch/_d*/ (#129767) 2024-07-31 21:18:11 +00:00
_dynamo [export] refactor _process_dynamic_shapes (#133391) 2024-08-15 16:21:21 +00:00
_export [export] refactor _process_dynamic_shapes (#133391) 2024-08-15 16:21:21 +00:00
_functorch [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
_higher_order_ops Add support for returning LSE from FlexAttention (and also differentiating through it) (#133159) 2024-08-11 10:29:16 +00:00
_inductor Fix triton codegen with math.trunc (#133354) 2024-08-15 16:38:26 +00:00
_lazy Add None return type to init (#132335) 2024-08-01 15:26:45 +00:00
_library [custom_ops] torch.library.{custom_op, register_kernel} disable Dynamo (#133125) 2024-08-12 18:29:18 +00:00
_logging Add ChromiumEventLogger, log FXGraphCache and AOTAutogradCache (#132864) 2024-08-10 01:15:53 +00:00
_numpy [test/torch_np] Fix usages of deprecated NumPy 2.0 APIs in numpy_tests (#131909) 2024-08-05 16:21:08 +00:00
_prims [BE] Simplify code interacting with get_proxy_mode/enable_tracing (#132675) 2024-08-08 12:03:00 +00:00
_prims_common [BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765) 2024-07-31 10:42:50 +00:00
_refs Don't attempt to compute hints for unbacked expressions (#132060) 2024-08-02 16:39:14 +00:00
_strobelight [BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765) 2024-07-31 10:42:50 +00:00
_subclasses Reland "Construct NJT without graph breaks" (#133196) 2024-08-14 01:11:13 +00:00
_vendor
amp Revert "[MPS] Add support for autocast in MPS (#99272)" 2024-08-05 19:59:04 +00:00
ao [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
autograd [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
backends [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
compiler
contrib
cpu [BE][Easy][17/19] enforce style for empty lines in import segments in torch/[a-c]*/ and torch/[e-n]*/ (#129769) 2024-08-04 10:24:09 +00:00
csrc [Reland] Add wrappers for synchronous GPUDirect Storage APIs (#133489) 2024-08-15 17:11:52 +00:00
cuda [Reland] Add wrappers for synchronous GPUDirect Storage APIs (#133489) 2024-08-15 17:11:52 +00:00
distributed [CP] Rewrite ring attention backward algorithm and enablement APIs (#131351) 2024-08-15 16:41:51 +00:00
distributions [BE][Easy][18/19] enforce style for empty lines in import segments in torch/d*/ (#129770) 2024-08-01 04:22:50 +00:00
export [export] refactor _process_dynamic_shapes (#133391) 2024-08-15 16:21:21 +00:00
fft
func
futures
fx [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
jit [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
legacy
lib
linalg fix a typo in the householder_product docs (#124279) 2024-08-15 09:34:17 +00:00
masked [BE][Easy][17/19] enforce style for empty lines in import segments in torch/[a-c]*/ and torch/[e-n]*/ (#129769) 2024-08-04 10:24:09 +00:00
monitor [pytorch][counters] Pybind for WaitCounter (#132357) 2024-08-02 16:08:10 +00:00
mps [BE][Easy][17/19] enforce style for empty lines in import segments in torch/[a-c]*/ and torch/[e-n]*/ (#129769) 2024-08-04 10:24:09 +00:00
mtia [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
multiprocessing [BE][Easy][17/19] enforce style for empty lines in import segments in torch/[a-c]*/ and torch/[e-n]*/ (#129769) 2024-08-04 10:24:09 +00:00
nested Reland "Construct NJT without graph breaks" (#133196) 2024-08-14 01:11:13 +00:00
nn Rewording doc string for clip_grad_norm_ (#133406) 2024-08-15 16:21:27 +00:00
onnx [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
optim Add Adafactor foreach impl (#132336) 2024-08-15 17:00:33 +00:00
package [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
profiler [Profiler] Add API for Dynamic Activity Toggling [2/n] (#133035) 2024-08-09 21:54:54 +00:00
quantization [BE][Easy][19/19] enforce style for empty lines in import segments in torch/[o-z]*/ (#129771) 2024-08-01 17:07:14 +00:00
signal Revert "[BE] typing for decorators - signal/windows/windows (#131582)" 2024-07-28 03:29:31 +00:00
sparse Add tests to bsr_dense_addmm_meta. Tune bsr_dense_addmm kernel for ViT shapes. (#132646) 2024-08-05 20:22:33 +00:00
special
testing Add Adafactor foreach impl (#132336) 2024-08-15 17:00:33 +00:00
utils [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
xpu [BE][Easy][19/19] enforce style for empty lines in import segments in torch/[o-z]*/ (#129771) 2024-08-01 17:07:14 +00:00
__config__.py
__future__.py
__init__.py [Reland] Add wrappers for synchronous GPUDirect Storage APIs (#133489) 2024-08-15 17:11:52 +00:00
_appdirs.py [BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765) 2024-07-31 10:42:50 +00:00
_classes.py Add None return type to init (#132335) 2024-08-01 15:26:45 +00:00
_compile.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_custom_ops.py [BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765) 2024-07-31 10:42:50 +00:00
_deploy.py
_guards.py [dynamo][inline_inbuilt_nn_modules] Mark nn module tensor static for cudagraphs (#132736) 2024-08-06 20:13:28 +00:00
_jit_internal.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_linalg_utils.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_lobpcg.py [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
_lowrank.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_meta_registrations.py [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
_namedtensor_internals.py [BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765) 2024-07-31 10:42:50 +00:00
_ops.py Make the __module__ name of HOO to be always "torch.ops.higher_order" (#132775) 2024-08-08 16:55:09 +00:00
_python_dispatcher.py Add None return type to init (#132335) 2024-08-01 15:26:45 +00:00
_size_docs.py
_sources.py
_storage_docs.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_streambase.py
_tensor_docs.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_tensor_str.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_tensor.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_torch_docs.py Prevent an unnecessary device -> host copy for CuPy arrays when not explicitly setting a device in torch.as_tensor. (#132595) 2024-08-08 00:26:58 +00:00
_utils_internal.py Turn off remote caching in unit tests unless explicitly on (#133258) 2024-08-13 02:49:43 +00:00
_utils.py Revert "Deprecate torch._utils.is_compiling() and torch._dynamo.external_utils.is_compiling() (#127690)" 2024-08-07 00:05:20 +00:00
_VF.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
_vmap_internals.py
_weights_only_unpickler.py Add _codecs.encode and builtins.bytearray to _get_allowed_globals to support bytes and bytearray serialization (#133189) 2024-08-13 02:20:28 +00:00
abi-check.cpp
CMakeLists.txt [Reland] Add wrappers for synchronous GPUDirect Storage APIs (#133489) 2024-08-15 17:11:52 +00:00
custom_class_detail.h
custom_class.h
extension.h
functional.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
hub.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
library.h [3/N] Fix Wunused-parameter warnings (#131271) 2024-07-20 23:31:03 +00:00
library.py [custom_ops] torch.library.{custom_op, register_kernel} disable Dynamo (#133125) 2024-08-12 18:29:18 +00:00
overrides.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
py.typed
quasirandom.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
random.py
README.txt
return_types.py
script.h
serialization.py [BE] Format uncategorized Python files with ruff format (#132576) 2024-08-04 17:13:31 +00:00
storage.py Fix public API tests (#131386) 2024-07-30 18:42:54 +00:00
torch_version.py Add mypy typing to torch_version.py (#131447) 2024-07-23 17:31:07 +00:00
types.py FakeTensor cache SymInt support (#127596) 2024-07-21 19:26:38 +00:00
version.py.tpl

Note [TH abstraction violation]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

TH/THC provide some hpp headers, which are proper C++ headers rather than
C headers.  These headers serve double duty as *internal implementation
detail* headers, whose contents should largely not be used by external
clients.

Ideally, we would not install these headers at all; instead, you should
use public functions (in headers like `THTensor.h`, NOT `THTensor.hpp`)
to manipulate these structs.  However, there are a few places
in torch/csrc where we violate this abstraction.  They are marked with
a pointer to this note.  Each of those sites will have to be refactored
when we refactor the guts of THTensor and related structures.