Name | Latest commit | Date
distributed/ddp | [BE] Remove outdated RPC benchmark (#146716) | 2025-03-29 04:44:36 +00:00
dynamo | [Reopen] [Intel GPU] Set higher tolerance for some models only on XPU Device (#144756) | 2025-04-17 00:26:55 +00:00
fastrnns | [BE]: Enable ruff rule SIM113 (#147290) | 2025-02-16 22:41:16 +00:00
framework_overhead_benchmark | Fix unused Python variables outside torch/ and test/ (#136359) | 2024-12-11 17:10:23 +00:00
functional_autograd_benchmark | [BE][CI] bump ruff to 0.9.2: multiline assert statements (#144546) | 2025-02-27 20:46:16 +00:00
fuser | Fix unused Python variables outside torch/ and test/ (#136359) | 2024-12-11 17:10:23 +00:00
gpt_fast | [BE][CI] bump ruff to 0.9.2: multiline assert statements (#144546) | 2025-02-27 20:46:16 +00:00
inductor_backends | [cutlass backend] Add more logs for cutlass backend benchmark (#150639) | 2025-04-15 04:19:51 +00:00
inference | [BE][Easy][3/19] enforce style for empty lines in import segments in benchmarks/ (#129754) | 2024-07-17 14:34:42 +00:00
instruction_counts | [BE][CI] bump ruff to 0.9.2: multiline assert statements (#144546) | 2025-02-27 20:46:16 +00:00
nested | Fix unused Python variables outside torch/ and test/ (#136359) | 2024-12-11 17:10:23 +00:00
operator_benchmark | [CI] enable operator benchmark on CPU (#143733) | 2025-03-21 16:46:03 +00:00
overrides_benchmark | [BE][Easy][3/19] enforce style for empty lines in import segments in benchmarks/ (#129754) | 2024-07-17 14:34:42 +00:00
profiler_benchmark | Apply TorchFix TOR203 fixes (#143691) | 2024-12-23 18:21:03 +00:00
record_function_benchmark | [Caffe2] Remove Caffe2 scripts and benchmarks (#126747) | 2024-06-05 23:46:31 +00:00
serialization | Fix unused Python variables outside torch/ and test/ (#136359) | 2024-12-11 17:10:23 +00:00
sparse | [BE][CI] bump ruff to 0.9.2: multiline assert statements (#144546) | 2025-02-27 20:46:16 +00:00
static_runtime | [StaticRuntime] Fix a bug that memory planner ignores subblocks (#146728) (#146855) | 2025-02-11 13:59:54 +00:00
tensorexpr | [BE][CI] bump ruff to 0.8.4 (#143753) | 2024-12-24 12:24:10 +00:00
transformer | Add sparsity (#148513) | 2025-03-07 01:47:52 +00:00
compare-fastrnn-results.py | [BE][Easy][3/19] enforce style for empty lines in import segments in benchmarks/ (#129754) | 2024-07-17 14:34:42 +00:00
compare.sh | |
README.md | Add more child links to benchmark readme (#104627) | 2023-07-06 12:11:00 +00:00
upload_scribe.py | Apply UFMT to all files in benchmarks/ (#105928) | 2023-07-26 01:18:48 +00:00

PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.
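
For orientation, here is a minimal sketch of the kind of measurement these suites automate, using the public torch.utils.benchmark API. The snippet is illustrative only and is not part of any suite in this folder; the matmul workload and shapes are arbitrary choices.

# Illustrative example: time a single matmul with torch.utils.benchmark
import torch
from torch.utils import benchmark

x = torch.randn(1024, 1024)
y = torch.randn(1024, 1024)

timer = benchmark.Timer(
    stmt="x @ y",                # statement being timed
    globals={"x": x, "y": y},    # names visible to stmt
    label="matmul",
    description="1024x1024 fp32 on CPU",
)
# blocked_autorange() picks the number of iterations automatically and
# reports summary statistics (median, IQR) of the per-call wall time.
print(timer.blocked_autorange(min_run_time=1.0))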

Setup environment

Make sure you're on a machine with CUDA, torchvision, and PyTorch installed. Install them in the following order:

# Install torchvision; it ships with the PyTorch stable release binary.
conda install pytorch torchvision -c pytorch

# Install the latest PyTorch from source.
# The source build supersedes the PyTorch installed from the release binary above.
cd $PYTORCH_HOME   # path to your PyTorch source checkout
python setup.py build develop

# Check which PyTorch version is actually picked up
python -c "import torch; print(torch.__version__)"

Benchmark List

Please refer to each subfolder to discover its benchmark suite. Links are provided where descriptions exist: