pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Yanbo Liang	a489792bb2	[GPT-benchmark] Fix memory bandwidth for MoE (#128783 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/128783 Approved by: https://github.com/Chillee ghstack dependencies: #128768	2024-06-17 21:04:57 +00:00
Yanbo Liang	8c06eae17e	[GPT-benchmark] Add metric: compilation time for GPT models (#128768 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/128768 Approved by: https://github.com/Chillee	2024-06-17 21:04:57 +00:00
Huy Do	f37121bb74	Add model name, quantization and device to gpt_fast micro benchmark output (#128091 ) A small enhancement to https://hud.pytorch.org/benchmark/llms with these columns in the output. Pull Request resolved: https://github.com/pytorch/pytorch/pull/128091 Approved by: https://github.com/yanboliang	2024-06-15 01:39:48 +00:00
Yanbo Liang	0be06b08fc	[GPT-fast benchmark] Merge GPT-fast and micro benchmark output as one CSV file (#127586 ) Consolidate GPT-fast models benchmark with micro-benchmark, and save output as one CSV file with the same format as https://github.com/pytorch/pytorch/pull/126754#issue-2307296847. Pull Request resolved: https://github.com/pytorch/pytorch/pull/127586 Approved by: https://github.com/Chillee	2024-05-31 18:50:49 +00:00