If PyTorch is installed system-wide (via the OS package manager) or by an alternative package manager like `uv`, pip may not be available, causing an error in `collect_env`.
However, it is still possible to collect exactly the same list using the `importlib` API, which is always available.
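A minimal sketch of the `importlib`-based approach (function name and the default patterns here are illustrative, not the exact `collect_env` code):

```python
from importlib import metadata

def list_relevant_packages(patterns=("torch", "numpy", "mypy")):
    """Return name==version strings for installed distributions whose
    name contains any of the given substrings; no pip required."""
    found = []
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        if name and any(p in name.lower() for p in patterns):
            found.append(f"{name}=={dist.version}")
    return sorted(found)
```

`importlib.metadata` is in the standard library since Python 3.8, so this works regardless of how the environment was provisioned.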
Fixes #144615
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144616
Approved by: https://github.com/malfet
Updates flake8 to v6.1.0 and fixes a few lints using sed and some ruff tooling.
- Replace `assert(0)` with `raise AssertionError()`
- Remove extraneous parentheses, e.g.:
  - `assert(a == b)` -> `assert a == b`
  - `if(x > y or y < z):` -> `if x > y or y < z:`
  - `return('...')` -> `return '...'`
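An illustration of the rewrites above (both forms behave identically; the fixed style is simply idiomatic Python):

```python
def fail_fast():
    # before: assert(0)
    raise AssertionError()

def pick(x, y, z):
    # before: if(x > y or y < z):
    if x > y or y < z:
        # before: return('gt')
        return 'gt'
    return 'le'
```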
Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/116591
Approved by: https://github.com/albanD, https://github.com/malfet
I'm looking to repurpose some logic in `torch.utils.collect_env` for the `geowatch` package. I'm mostly able to just use this script as a library, which is great because it reduces code in my package. However, the issue is that the package patterns that are relevant to torch are hard-coded inside of `get_conda_packages` and `get_pip_packages`.
The changes I made are simple. I defined the default package patterns as two global sets, and I added an argument to each function that lets the user customize exactly what package patterns are relevant. If they are not specified the defaults are used.
I was considering extending the power of the patterns by utilizing `fnmatch`, `re` (or [xdev.pattern](https://github.com/Erotemic/xdev/blob/main/xdev/patterns.py) which abstracts them both), but instead I opted to just use the existing `__contains__` test to keep things simple.
From torch's perspective this should make maintaining this file slightly easier: to update the relevant packages, the developer now updates two neighboring top-level globals instead of two separate local variables. It does add an argument to two functions, and that argument isn't used in torch itself, so there is a case for removing it; users *could* still exert some control by modifying the globals. I think the way I did it balances the tradeoffs well, though.
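A hedged sketch of the shape of the change (the names and stand-in package lines are illustrative, not the real implementation):

```python
# Defaults live at module level, so a developer updates one place.
DEFAULT_PIP_PATTERNS = {"torch", "numpy", "mypy"}

def filter_relevant_packages(lines, patterns=None):
    # Callers may pass their own patterns; torch itself uses the defaults.
    if patterns is None:
        patterns = DEFAULT_PIP_PATTERNS
    # The simple `in` (__contains__) test from the existing code is kept.
    return [line for line in lines if any(p in line for p in patterns)]
```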
Pull Request resolved: https://github.com/pytorch/pytorch/pull/112993
Approved by: https://github.com/zou3519
Printing just the device name is not helpful when investigating PyTorch issues filed for specific AMD GPUs, as the support/issue might depend on the gfx arch, which is part of the gcnArchName property.
`torch.cuda.get_device_properties(0).gcnArchName` will print the value of the `gcnArchName` property, e.g.:
```
>>> torch.cuda.get_device_properties(0).gcnArchName
'gfx906:sramecc+:xnack-'
```
```
root@6f064e3c19fb:/data/pytorch/test# python ../torch/utils/collect_env.py
...
GPU models and configuration: AMD Radeon Graphics(gfx906:sramecc+:xnack-)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107477
Approved by: https://github.com/albanD
Remove unnecessary collection casts (redundant calls to `list`, `tuple`, and `dict`) and simplify calls to the `sorted` builtin. This should strictly improve speed and readability.
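Examples of the kind of simplification meant here (illustrative, not the actual diff):

```python
items = {3, 1, 2}

# before: sorted(list(items)) -- the intermediate list() is redundant,
# since sorted() accepts any iterable and already returns a new list
ordered = sorted(items)

# before: dict(list(d.items())) -- dict() consumes the items view directly
d = {"a": 1, "b": 2}
copied = dict(d.items())
```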
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94323
Approved by: https://github.com/albanD
This PR sets CUDA_MODULE_LOADING if it's not set by the user. By default, it sets it to "LAZY".
It was tested using the following commands:
```
python -c "import torch; tensor=torch.randn(20, 16, 50, 100).cuda(); free, total = torch.cuda.cudart().cudaMemGetInfo(0); print(total-free)"
```
which shows a memory usage of: 287,047,680 bytes
vs
```
CUDA_MODULE_LOADING="DEFAULT" python -c "import torch; tensor=torch.randn(20, 16, 50, 100).cuda(); free, total = torch.cuda.cudart().cudaMemGetInfo(0); print(total-free)"
```
which shows 666,632,192 bytes.
C++ implementation is needed for the libtorch users (otherwise it could have been a pure python functionality).
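For reference, the Python-side equivalent of the default-setting logic might look like this sketch (the actual change lives in C++ so that libtorch users benefit too):

```python
import os

# Respect an explicit user setting; otherwise default to lazy loading.
os.environ.setdefault("CUDA_MODULE_LOADING", "LAZY")
```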
cc: @ptrblck @ngimel @malfet
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85692
Approved by: https://github.com/malfet
Users were reporting errors when trying to use collect_env with
older versions of Python. This adds a test to ensure that we maintain
compatibility of this script with older Python versions.
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78946
Approved by: https://github.com/janeyx99
Summary:
Fixes a bug where collect_env.py could not be run without torch installed.
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74342
Reviewed By: malfet, janeyx99
Differential Revision: D34943464
Pulled By: seemethere
fbshipit-source-id: dbaa0004b88cb643a9c6426c9ea7c5be3d3c9ef5
(cherry picked from commit 4f39ebb823f88df0c3902db15deaffc6ba481cb3)
Summary:
- Target SHA1: ae108ef49aa5623b896fc93d4298c49d1750d9ba
- Make USE_XNNPACK an option dependent on a CMake minimum version of 3.12
- Print USE_XNNPACK under the CMake options summary, and print the
availability from collect_env.py
- Skip XNNPACK-based tests when XNNPACK is not available
- Add a SkipIfNoXNNPACK wrapper to skip tests
- Update the CMake version for the xenial-py3.7-gcc5.4 image to 3.12.4
- This is required for the backwards-compatibility test.
The PyTorch op schema is XNNPACK-dependent; see
aten/src/ATen/native/xnnpack/RegisterOpContextClass.cpp for an
example. The nightly version is assumed to have USE_XNNPACK=ON,
so with this change we ensure that the test build can also
have XNNPACK.
- HACK: skipping test_xnnpack_integration tests on ROCm
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72642
Reviewed By: kimishpatel
Differential Revision: D34456794
Pulled By: digantdesai
fbshipit-source-id: 85dbfe0211de7846d8a84321b14fdb061cd6c037
(cherry picked from commit 6cf48e7b64d6979962d701b5d493998262cc8bfa)
Summary:
Invoking `pip` or `pip3` yields the list of packages for whichever `pip` alias is first on the path, rather than for the interpreter currently being executed. Changed `get_pip_packages` to invoke `sys.executable` with `-mpip` instead.
Also, add mypy to the list of packages of interest.
Discovered while looking at https://github.com/pytorch/pytorch/issues/63279
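The fix can be sketched as follows (a minimal stand-alone version, not the exact `get_pip_packages` code):

```python
import subprocess
import sys

def get_pip_list():
    # Query the pip that belongs to *this* interpreter, not whatever
    # `pip`/`pip3` happens to be first on PATH.
    proc = subprocess.run(
        [sys.executable, "-mpip", "list", "--format=freeze"],
        capture_output=True, text=True,
    )
    return proc.stdout.splitlines()
```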
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63321
Reviewed By: walterddr
Differential Revision: D30342099
Pulled By: malfet
fbshipit-source-id: fc8d17cf2ddcf18236cfde5c1b9edb4e72804ee0
Summary:
Fixes https://github.com/pytorch/pytorch/issues/35901
This change is designed to prevent fragmentation in the Caching Allocator. Permissive block splitting in the allocator allows very large blocks to be split into many pieces. Once split too finely, it is unlikely that all pieces will be free at the same time, so the original allocation can never be returned. Anecdotally, we've seen a model run out of memory failing to allocate a 50 MB block on a 32 GB card while the caching allocator was holding 13 GB of 'split free blocks'.
Approach:
- Large blocks above a certain size are designated "oversize". This limit is currently set one decade above the 'large' threshold, at 200 MB
- Oversize blocks cannot be split
- Oversize blocks must closely match the requested size (e.g. a 200 MB request will match an existing 205 MB block, but not a 300 MB block)
- In lieu of splitting oversize blocks, there is a mechanism to quickly free a single oversize block (to the system allocator) so that an appropriately sized block can be allocated. This is activated under memory pressure and prevents `release_cached_blocks()` from triggering
Initial performance tests show this is similar or quicker than the original strategy. Additional tests are ongoing.
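A toy illustration of the close-match rule (the threshold and slack values are illustrative, not the allocator's real constants):

```python
MB = 1024 * 1024
OVERSIZE_THRESHOLD = 200 * MB  # blocks above this size are never split

def oversize_block_fits(requested, block_size, slack=20 * MB):
    # An oversize free block only serves requests close to its own size,
    # so a 300 MB block is not consumed by a 200 MB request.
    return block_size >= requested and block_size - requested <= slack
```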
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44742
Reviewed By: zou3519
Differential Revision: D29186394
Pulled By: ezyang
fbshipit-source-id: c88918836db3f51df59de6d1b3e03602ebe306a9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55647
This adds [breakpad](https://github.com/google/breakpad), which comes with out-of-the-box utilities to register a signal handler that writes out a minidump on an unhandled exception. Right now this is gated behind a flag in `torch.utils`, but in the future it could be on by default. Size-wise this adds about 500 KB to `libtorch_cpu.so` (187275968 B to 187810016 B).
```bash
$ cat <<EOF > test.py
import torch
torch.utils.enable_minidump_collection()
# temporary util that just segfaults
torch._C._crash()
EOF
$ python test.py
Wrote minidump to /tmp/pytorch_crashes/6a829041-50e9-4247-ea992f99-a74cf47a.dmp
fish: “python test.py” terminated by signal SIGSEGV (Address boundary error)
$ minidump-2-core /tmp/pytorch_crashes/6a829041-50e9-4247-ea992f99-a74cf47a.dmp -o core.dmp
$ gdb python core.dmp
... commence debugging ...
```
Right now, exceptions that get passed up to Python do not trigger the signal handler (which by default only
handles [these](https://github.com/google/breakpad/blob/main/src/client/linux/handler/exception_handler.cc#L115)). It would be possible for PyTorch exceptions to explicitly write a minidump when passed up to Python (perhaps only when the exception is unhandled).
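One way the "explicit minidump on unhandled Python exception" idea could be prototyped is via `sys.excepthook`; `write_minidump` below is a hypothetical stand-in, not an existing torch API:

```python
import sys

dump_log = []

def write_minidump():
    # Hypothetical: would call into the breakpad handler; here we just
    # record that a dump was requested.
    dump_log.append("/tmp/pytorch_crashes/example.dmp")

def dumping_excepthook(exc_type, exc, tb):
    write_minidump()
    sys.__excepthook__(exc_type, exc, tb)

# Only unhandled exceptions reach the hook, matching the idea above.
sys.excepthook = dumping_excepthook
```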
Test Plan: Imported from OSS
Reviewed By: ailzhang
Differential Revision: D27679767
Pulled By: driazati
fbshipit-source-id: 1ab3b5160b6dc405f5097eb25acc644d533358d7
Summary:
Fixes https://github.com/pytorch/pytorch/issues/35901
This change is designed to prevent fragmentation in the Caching Allocator. Permissive block splitting in the allocator allows very large blocks to be split into many pieces. Once split too finely, it is unlikely that all pieces will be free at the same time, so the original allocation can never be returned. Anecdotally, we've seen a model run out of memory failing to allocate a 50 MB block on a 32 GB card while the caching allocator was holding 13 GB of 'split free blocks'.
Approach:
- Large blocks above a certain size are designated "oversize". This limit is currently set one decade above the 'large' threshold, at 200 MB
- Oversize blocks cannot be split
- Oversize blocks must closely match the requested size (e.g. a 200 MB request will match an existing 205 MB block, but not a 300 MB block)
- In lieu of splitting oversize blocks, there is a mechanism to quickly free a single oversize block (to the system allocator) so that an appropriately sized block can be allocated. This is activated under memory pressure and prevents `release_cached_blocks()` from triggering
Initial performance tests show this is similar or quicker than the original strategy. Additional tests are ongoing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44742
Reviewed By: ngimel
Differential Revision: D23752058
Pulled By: ezyang
fbshipit-source-id: ccb7c13e3cf8ef2707706726ac9aaac3a5e3d5c8
Summary:
Inspired by https://github.com/pytorch/pytorch/issues/47993, this fixes the import error in `collect_env.py` with older versions of PyTorch where `torch.version` does not have the `hip` attribute.
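The guarded-access pattern for this kind of fix can be sketched like so (`_FakeVersion` stands in for `torch.version` on an old build):

```python
class _FakeVersion:
    # Older torch builds expose cuda but not hip.
    cuda = "10.2"

# Probe with getattr instead of touching the attribute directly, so a
# missing `hip` yields None rather than raising AttributeError.
hip_version = getattr(_FakeVersion, "hip", None)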
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48076
Reviewed By: seemethere, xuzhao9
Differential Revision: D25024352
Pulled By: samestep
fbshipit-source-id: 7dff9d2ab80b0bd25f9ca035d8660f38419cdeca
Summary:
Moved all torch-specific checks under the `if TORCH_AVAILABLE` block.
Embedded the gpu_info dict back into the SystemEnv constructor and deduplicated some code between the HIP and CUDA cases.
Fixes https://github.com/pytorch/pytorch/issues/47397
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47398
Reviewed By: walterddr
Differential Revision: D24740421
Pulled By: malfet
fbshipit-source-id: d0a1fe5b428617cb1a9d027324d24d7371c68d64
Summary:
This adds HIP version info to the `collect_env.py` output.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44106
Reviewed By: VitalyFedyunin
Differential Revision: D23652341
Pulled By: zou3519
fbshipit-source-id: a1f5bce8da7ad27a1277a95885934293d0fd43c5
Summary:
No type annotations can be added to the script, as it still has to be Python 2 compliant.
Make changes to avoid variable type redefinition.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43062
Reviewed By: zou3519
Differential Revision: D23132991
Pulled By: malfet
fbshipit-source-id: 360c02e564398f555273e5889a99f834a5467059
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35615
Python 2 has reached end-of-life and is no longer supported by PyTorch.
Now we can clean up a lot of cruft that we put in place to support it.
These changes were all done manually, and I skipped anything that seemed
like it would take more than a few seconds, so I think it makes sense to
review it manually as well (though using side-by-side view and ignoring
whitespace change might be helpful).
Test Plan: CI
Differential Revision: D20842886
Pulled By: dreiss
fbshipit-source-id: 8cad4e87c45895e7ce3938a88e61157a79504aed