pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Xuehai Pan b6bdb67f82 [BE][Easy] use `pathlib.Path` instead of `dirname` / `".."` / `pardir` (#129374 ) Changes by apply order: 1. Replace all `".."` and `os.pardir` usage with `os.path.dirname(...)`. 2. Replace nested `os.path.dirname(os.path.dirname(...))` call with `str(Path(...).parent.parent)`. 3. Reorder `.absolute()` ~/ `.resolve()`~ and `.parent`: always resolve the path first. `.parent{...}.absolute()` -> `.absolute().parent{...}` 4. Replace chained `.parent x N` with `.parents[${N - 1}]`: the code is easier to read (see 5.) `.parent.parent.parent.parent` -> `.parents[3]` 5. ~Replace `.parents[${N - 1}]` with `.parents[${N} - 1]`: the code is easier to read and does not introduce any runtime overhead.~ ~`.parents[3]` -> `.parents[4 - 1]`~ 6. ~Replace `.parents[2 - 1]` with `.parent.parent`: because the code is shorter and easier to read.~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/129374 Approved by: https://github.com/justinchuby, https://github.com/malfet		2024-12-29 17:23:13 +00:00
..
gen_data_mm.py	[BE][Easy] use `pathlib.Path` instead of `dirname` / `".."` / `pardir` (#129374 )	2024-12-29 17:23:13 +00:00
gen_heuristic_a100.sh
gen_heuristic_h100.sh	AutoHeuristic: mm ranking heuristic h100 (#133608 )	2024-08-16 16:20:38 +00:00
get_mm_dataset.sh	AutoHeuristic: mm ranking heuristic h100 (#133608 )	2024-08-16 16:20:38 +00:00
README.md	AutoHeuristic: documentation for mm (#133611 )	2024-08-16 16:20:38 +00:00
train_decision_mm.py	[BE][Easy] use `pathlib.Path` instead of `dirname` / `".."` / `pardir` (#129374 )	2024-12-29 17:23:13 +00:00

README.md

If you just want to re-generate existing heuristics with already collected data for mm for A100/H100, run the following scripts:

bash get_mm_dataset.sh # Downloads A100 and H100 datasets bash gen_heuristic_a100.sh # Generates A100 heuristic bash gen_heuristic_h100.sh # Generates H100 heuristic

If you want to collect new data, or generate a heuristic for another GPU, use the generate_heuristic_mm.sh script: First, go into the generate_heuristic_mm.sh and modify the variables according to the comments. Then, run the script to perform benchmarks and collect training data:

bash generate_heuristic.sh collect

This will collect training data on random inputs. Depending on how many GPUs you are using, this might take a day. If you use multiple GPU, you will have one file per GPU, e.g. "data_6.txt", "data_7.txt" if you used GPUs with id 6 and 7. To merge this into a single file run: python torchgen/_autuoheuristic/merge_data.py mm_train.txt data_6.txt data_7.txt

For mm, we also want to incorporate data from huggingface and TIMM models into the training data.

To collect data for huggingface, run the following command:

TORCHINDUCTOR_AUTOHEURISTIC_USE="" TORCHINDUCTOR_AUTOHEURISTIC_COLLECT="mm" TORCHINDUCTOR_AUTOHEURISTIC_LOG_PATH="hf_train_mm.txt" TORCHINDUCTOR_MAX_AUTOTUNE=1 time python ../../../benchmarks/dynamo/huggingface.py --ci --performance --timing --explain --inductor --device cuda --train --amp

To collect data for TIMM models, run the following command

TORCHINDUCTOR_AUTOHEURISTIC_USE="" TORCHINDUCTOR_AUTOHEURISTIC_COLLECT="mm" TORCHINDUCTOR_AUTOHEURISTIC_LOG_PATH="timm_train_mm.txt" TORCHINDUCTOR_MAX_AUTOTUNE=1 time python ../../../benchmarks/dynamo/timm_models.py --ci --performance --timing --explain --inductor --device cuda --train --amp

Afterwards, run the script in order to learn the heuristic:

bash generate_heuristic_mm.sh generate