mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

History

Gabriel Ferns 254293b777 Add `flag _metrics_log_runtime` to disable runtime metric logging by default (#153506 ) https://github.com/pytorch/pytorch/pull/152708 expanded support of `get_estimated_runtime` to many more types of `SchedulerNodes`. This caused an increase in compile time because we're always calling `get_estimated_runtime` to populate the metrics table. This PR adds a flag for this logging, which reduces the instruction count by 8%. Long term, we should probably merge metrics.py with TORCH_LOGS/tlparse (suggestion from @xmfan). Update: added support for TORCH_LOGS for the metrics logging. Test Plan: mm_loop.py and many existing tests cover. Pull Request resolved: https://github.com/pytorch/pytorch/pull/153506 Approved by: https://github.com/eellison		2025-05-22 01:02:11 +00:00
..
distributed/ddp	[BE] Remove outdated RPC benchmark (#146716 )	2025-03-29 04:44:36 +00:00
dynamo	Add `flag _metrics_log_runtime` to disable runtime metric logging by default (#153506 )	2025-05-22 01:02:11 +00:00
fastrnns	[BE]: Enable ruff rule SIM113 (#147290 )	2025-02-16 22:41:16 +00:00
framework_overhead_benchmark	Fix unused Python variables outside torch/ and test/ (#136359 )	2024-12-11 17:10:23 +00:00
functional_autograd_benchmark	Fix broken URLs (#152237 )	2025-04-27 09:56:42 +00:00
fuser	Fix unused Python variables outside torch/ and test/ (#136359 )	2024-12-11 17:10:23 +00:00
gpt_fast	[BE][CI] bump `ruff` to 0.9.2: multiline `assert` statements (#144546 )	2025-02-27 20:46:16 +00:00
inductor_backends	[cutlass backend] add addmm and bmm for cutlass backend benchmark (#152163 )	2025-04-28 20:16:17 +00:00
inference	[BE][Easy][3/19] enforce style for empty lines in import segments in `benchmarks/` (#129754 )	2024-07-17 14:34:42 +00:00
instruction_counts	[BE][CI] bump `ruff` to 0.9.2: multiline `assert` statements (#144546 )	2025-02-27 20:46:16 +00:00
nested	Fix unused Python variables outside torch/ and test/ (#136359 )	2024-12-11 17:10:23 +00:00
operator_benchmark	unbreak fb:operator_benchmark_test (#152049 )	2025-05-15 03:38:48 +00:00
overrides_benchmark	[BE][Easy][3/19] enforce style for empty lines in import segments in `benchmarks/` (#129754 )	2024-07-17 14:34:42 +00:00
profiler_benchmark	Apply TorchFix TOR203 fixes (#143691 )	2024-12-23 18:21:03 +00:00
record_function_benchmark	[Caffe2]Remove Caffe2 scripts and benchmarks (#126747 )	2024-06-05 23:46:31 +00:00
serialization	Fix unused Python variables outside torch/ and test/ (#136359 )	2024-12-11 17:10:23 +00:00
sparse	Clean up conda usage in benchmark scripts (#152552 )	2025-04-30 21:27:29 +00:00
static_runtime	[3/N] Use internal linkage in C++ files (#151297 )	2025-05-05 17:48:39 +00:00
tensorexpr	Fix broken URLs (#152237 )	2025-04-27 09:56:42 +00:00
transformer	Add sparsity (#148513 )	2025-03-07 01:47:52 +00:00
compare-fastrnn-results.py	[BE][Easy][3/19] enforce style for empty lines in import segments in `benchmarks/` (#129754 )	2024-07-17 14:34:42 +00:00
compare.sh
README.md	Removing conda references from PyTorch Docs (#152702 )	2025-05-20 20:33:28 +00:00
upload_scribe.py	Fix broken URLs (#152237 )	2025-04-27 09:56:42 +00:00

README.md

PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.

Setup environment

Make sure you're on a machine with CUDA, torchvision, and pytorch installed. Install in the following order:

# Install torchvision. It comes with the pytorch stable release binary
pip3 install torch torchvision

# Install the latest pytorch master from source.
# It should supersede the installation from the release binary.
cd $PYTORCH_HOME
python setup.py build develop

# Check the pytorch installation version
python -c "import torch; print(torch.__version__)"

Benchmark List

Please refer to each subfolder to discover each benchmark suite. Links are provided where descriptions exist: