pytorch/tools
Natalia Gimelshein 55e62ff74a bf16 grouped gemm (#150374)
Enabled bf16 grouped gemm with an API similar to _scaled_group_gemm, except without scale and fast accum arguments. All transpose variants are enabled, unlike scaled gemm. Ideally we'd factor out a lot more code from scaled gemm, currently there's a lot of repetition between scaled and non-scaled versions. I factored out only a helper kernel that prepares arguments.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/150374
Approved by: https://github.com/drisspg
2025-04-06 04:53:24 +00:00
..
alerts
amd_build Add a stable TORCH_LIBRARY to C shim (#148124) 2025-03-11 19:12:46 +00:00
autograd bf16 grouped gemm (#150374) 2025-04-06 04:53:24 +00:00
bazel_tools
build/bazel Add networkx as bazel dep to fix CI failure (#143995) 2025-01-02 19:42:18 +00:00
build_defs
code_analyzer [3/N] Apply py39 ruff fixes (#142115) 2024-12-11 17:50:10 +00:00
code_coverage [BE] Upgrade to mypy 1.14 (#145966) 2025-03-04 20:58:26 +00:00
config
coverage_plugins_package
dynamo Revert "Use the device interface for detecting Triton availability (#139171)" 2025-03-11 18:49:21 +00:00
flight_recorder [fr] Added protection against missing stack frames in fr cont. (#150133) 2025-04-01 03:07:59 +00:00
gdb Add gdb print methods support same as pytorch-lldb (#140935) 2024-11-19 01:28:30 +00:00
github [BE][CI] bump ruff to 0.9.0: string quote styles (#144569) 2025-02-24 19:56:09 +00:00
iwyu [BE][CI] bump ruff to 0.9.0: string quote styles (#144569) 2025-02-24 19:56:09 +00:00
jit Revert "Use absolute path path.resolve() -> path.absolute() (#129409)" 2025-01-04 14:17:20 +00:00
linter [BE][CI] Update actionlint to 1.7.7 (#149919) 2025-03-25 14:37:10 +00:00
lite_interpreter [BE][CI] bump ruff to 0.9.0: string quote styles (#144569) 2025-02-24 19:56:09 +00:00
lldb
onnx [ONNX] Clean up the diagnostics module (#149864) 2025-03-26 05:58:32 +00:00
packaging [BE][CI] bump ruff to 0.9.0: string quote styles (#144569) 2025-02-24 19:56:09 +00:00
pyi add torch.float4_e2m1fn_x2 to PyTorch (#148791) 2025-03-27 17:32:20 +00:00
rules
rules_cc [BE] Fix incompatible-std-redefinition warning (#141630) 2024-11-27 05:06:36 +00:00
setup_helpers [Build] Remove pre-CXX11 ABI logic from build script (#149888) 2025-03-25 03:17:16 +00:00
shared
stats [CI] test upload: better check for if job is rerun disabled tests (#148027) 2025-02-28 00:04:33 +00:00
test [inductor] Fix issue with set_linter, improve linter framework (#144620) 2025-03-13 09:49:40 +00:00
testing [TD] test_cpp_extensions_aot_ninja corresponds to things in test/cpp_extensions (#148992) 2025-03-12 15:40:06 +00:00
__init__.py
bazel.bzl
BUCK.bzl [targets2buck] Remove tombstone messages proactively (#147897) 2025-03-26 22:15:17 +00:00
BUCK.oss
build_libtorch.py [BE][Easy] use pathlib.Path instead of dirname / ".." / pardir (#129374) 2024-12-29 17:23:13 +00:00
build_pytorch_libs.py Do not fetch NCCL when system NCCL is used (#149607) 2025-03-28 05:06:49 +00:00
build_with_debinfo.py [inductor] Remove _get_grid_fn_str (#146800) 2025-02-10 23:14:30 +00:00
download_mnist.py
extract_scripts.py [3/N] Apply py39 ruff fixes (#142115) 2024-12-11 17:50:10 +00:00
gen_flatbuffers.sh
gen_vulkan_spv.py [BE][Ez]: FURB148 - remove useless enumerate calls (#145619) 2025-01-24 23:37:15 +00:00
generate_torch_version.py Add "xpu" to __all__ for torch/version.py (#149695) 2025-04-01 08:44:51 +00:00
generated_dirs.txt
git_add_generated_dirs.sh
git_reset_generated_dirs.sh
nightly_hotpatch.py [3/N] Apply py39 ruff fixes (#142115) 2024-12-11 17:50:10 +00:00
nightly.py Set proper LD_LIBRARY_PATH on Linux in nightly venv in nightly pull tool (#143262) 2025-04-01 16:51:02 +00:00
nvcc_fix_deps.py Revert "Use absolute path path.resolve() -> path.absolute() (#129409)" 2025-01-04 14:17:20 +00:00
README.md
render_junit.py
substitute.py
update_masked_docs.py
vscode_settings.py

This folder contains a number of scripts which are used as part of the PyTorch build process. This directory also doubles as a Python module hierarchy (thus the __init__.py).

Overview

Modern infrastructure:

  • autograd - Code generation for autograd. This includes definitions of all our derivatives.
  • jit - Code generation for JIT
  • shared - Generic infrastructure that scripts in tools may find useful.
    • module_loader.py - Makes it easier to import arbitrary Python files in a script, without having to add them to the PYTHONPATH first.

Build system pieces:

  • setup_helpers - Helper code for searching for third-party dependencies on the user system.
  • build_pytorch_libs.py - cross-platform script that builds all of the constituent libraries of PyTorch, but not the PyTorch Python extension itself.
  • build_libtorch.py - Script for building libtorch, a standalone C++ library without Python support. This build script is tested in CI.

Developer tools which you might find useful:

Important if you want to run on AMD GPU:

  • amd_build - HIPify scripts, for transpiling CUDA into AMD HIP. Right now, PyTorch and Caffe2 share logic for how to do this transpilation, but have separate entry-points for transpiling either PyTorch or Caffe2 code.
    • build_amd.py - Top-level entry point for HIPifying our codebase.

Tools which are only situationally useful: