mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00


leslie-fang-intel 950b484356 skip three pyhpc models with dynamic shape test (#120599)

As reported in https://github.com/pytorch/pytorch/issues/119434, `pyhpc_isoneutral_mixing`, `pyhpc_equation_of_state` and `pyhpc_turbulent_kinetic_energy` fail under dynamic shape testing. This PR proposes to skip the dynamic batch size testing of these 3 models.

* The error message is:
```
  File "/localdisk/leslie/torch_inductor_community/pytorch/benchmarks/dynamo/common.py", line 3879, in run
    assert marked, f"nothing in example_inputs had a dim with {batch_size}"
AssertionError: nothing in example_inputs had a dim with 1048576
```
* The root cause is:
  * The benchmark code only annotates an input's dim as dynamic when its size equals the batch size `c617e7b407/benchmarks/dynamo/common.py (L3867-L3871)`. If it fails to find any dim equal to the batch size, the above error is raised.
  * However, for these 3 models, none of the inputs' dims equals the input batch size, because of the [relationship of dim sizes](`26b85eadde/torchbenchmark/models/pyhpc_equation_of_state/__init__.py (L12-L16)`):
```
shape = (
    math.ceil(2 * size ** (1/3)),
    math.ceil(2 * size ** (1/3)),
    math.ceil(0.25 * size ** (1/3)),
)
```
  * In addition, `pyhpc_isoneutral_mixing` and `pyhpc_equation_of_state` can pass the dynamic batch size accuracy testing, because the batch size is set to 4 in accuracy testing (`c617e7b407/benchmarks/dynamo/common.py (L3456)`) and `math.ceil(2 * size ** (1/3))` happens to equal 4.
* Since the dim sizes of the input have the above relationship, running these models with dynamic shapes may require annotating `dim[0](s0) = dim[2](s1) * 8`. Per the discussion in https://github.com/pytorch/pytorch/issues/117477#issuecomment-1897108756 (@avikchaudhuri), it looks like this case is not expressible. So we may need to skip the dynamic batch size testing for these 3 models.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/120599 Approved by: https://github.com/jgong5, https://github.com/desertfire		2024-02-29 00:38:06 +00:00
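The mismatch described in the commit message can be reproduced with plain arithmetic. The sketch below is only an illustration, not the benchmark harness: `pyhpc_shape` copies the shape formula quoted above, while `dims_matching_batch_size` is a hypothetical helper that mimics the "dim equals batch size" check in spirit.

```python
import math


def pyhpc_shape(size):
    """Input shape formula quoted in the commit message."""
    return (
        math.ceil(2 * size ** (1 / 3)),
        math.ceil(2 * size ** (1 / 3)),
        math.ceil(0.25 * size ** (1 / 3)),
    )


def dims_matching_batch_size(shape, batch_size):
    """Hypothetical helper: indices of dims whose size equals batch_size."""
    return [i for i, dim in enumerate(shape) if dim == batch_size]


# 4 is the accuracy-test batch size; 1048576 is the size from the error message.
for size in (4, 1048576):
    shape = pyhpc_shape(size)
    print(size, shape, dims_matching_batch_size(shape, size))
# prints:
# 4 (4, 4, 1) [0, 1]
# 1048576 (204, 204, 26) []
```

With a batch size of 4 the first two dims happen to match, so there is something to mark as dynamic; with 1048576 no dim matches, which is the situation the assertion reports.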
distributed	[BE]: Apply FURB118 (prev): replaces unnecessary lambdas with operator. (#116027)	2023-12-20 19:35:08 +00:00
dynamo	skip three pyhpc models with dynamic shape test (#120599)	2024-02-29 00:38:06 +00:00
fastrnns	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
framework_overhead_benchmark	[BE]: Enable F821 and fix bugs (#116579)	2024-01-01 08:40:46 +00:00
functional_autograd_benchmark	[BE]: Enable F821 and fix bugs (#116579)	2024-01-01 08:40:46 +00:00
fuser	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
inference	Allow more backend worker threads with each using a separate cuda stream (#116190)	2023-12-20 22:08:29 +00:00
instruction_counts	Use strict to toggle strict options in MYPYSTRICT (#118479)	2024-01-28 19:22:22 +00:00
nested	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
operator_benchmark	highlight readme code block (#120228)	2024-02-22 21:23:08 +00:00
overrides_benchmark	[BE]: Update ruff to 0.285 (#107519)	2023-08-22 23:16:38 +00:00
profiler_benchmark	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
record_function_benchmark	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
serialization	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
sparse	[BE]: Enable F821 and fix bugs (#116579)	2024-01-01 08:40:46 +00:00
static_runtime	[PyTorch] fix mixed int32/int64 indices/offsets for embedding_bag_out (#120752)	2024-02-28 20:13:30 +00:00
tensorexpr	[BE]: Enable F821 and fix bugs (#116579)	2024-01-01 08:40:46 +00:00
transformer	[CUDNN][SDPA] Experimental cuDNN Flash Attention v2 Inference (#115663)	2024-02-14 22:02:06 +00:00
compare-fastrnn-results.py	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00
compare.sh
README.md	Add more child links to benchmark readme (#104627)	2023-07-06 12:11:00 +00:00
upload_scribe.py	Apply UFMT to all files in benchmarks/ (#105928)	2023-07-26 01:18:48 +00:00

README.md

PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.

Setup environment

Make sure you're on a machine with CUDA, torchvision, and PyTorch installed. Install them in the following order:

# Install torchvision. It comes with the pytorch stable release binary
conda install pytorch torchvision -c pytorch

# Install the latest pytorch master from source.
# It should supersede the installation from the release binary.
cd $PYTORCH_HOME
python setup.py build develop

# Check the pytorch installation version
python -c "import torch; print(torch.__version__)"
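Before running any benchmark, it can help to confirm that both packages are discoverable from the active Python environment. The following is a minimal sketch using only the standard library; the helper name `installed` is an assumption made for illustration and is not part of this repository.

```python
import importlib.util


def installed(name):
    """Return True if a top-level package can be located without importing it."""
    return importlib.util.find_spec(name) is not None


# Report which of the packages required by the setup steps are visible.
for pkg in ("torch", "torchvision"):
    print(f"{pkg}: {'found' if installed(pkg) else 'missing'}")
```

This only checks that the packages can be found; it does not verify versions or CUDA support.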

Benchmark List

Please refer to each subfolder to discover each benchmark suite. Links are provided where descriptions exist: