pytorch/benchmarks/dynamo/pr_time_benchmarks/expected_results.csv
Laith Sakka 39df901b2a introduce definitely_contiguous and use it for reshape and tensor meta data computation. (#153432)
When a tensor has unbacked symbols, it can be general enough to represent both contiguous and non-contiguous tensors.
In that case we can't really evaluate is_contiguous. In many places in the code base we check is_contiguous to take a fast path, but the general path usually works for both contiguous and non-contiguous tensors. In those cases we probably want
to use the definitely_contiguous API.

This is applied to reshape in this PR and also to tensor metadata computation. The metadata now has an attribute that says the tensor is contiguous when it is always contiguous; we store it only if definitely_contiguous is true.
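As a rough illustration of the idea described above, the sketch below models "contiguous only when it can be proven" in plain Python. It is not the PyTorch implementation: `definitely_contiguous_sketch` and `pick_reshape_path` are hypothetical names, `None` is an assumed stand-in for an unbacked symbol, and the check is deliberately more conservative than a real symbolic reasoner would be.

```python
from typing import Optional, Sequence

# Assumption: None stands in for an unbacked (unknown) symbolic value.
Dim = Optional[int]


def definitely_contiguous_sketch(sizes: Sequence[Dim], strides: Sequence[Dim]) -> bool:
    """Return True only when row-major contiguity can be proven."""
    # A known zero-sized dimension makes the tensor empty, which counts as contiguous.
    if any(s == 0 for s in sizes):
        return True
    expected_stride = 1
    # Walk dimensions from innermost to outermost.
    for size, stride in zip(reversed(sizes), reversed(strides)):
        if size is None or stride is None:
            # Unknown value: contiguity cannot be proven, so answer False
            # instead of failing or guessing.
            return False
        if size == 1:
            # The stride of a size-1 dimension does not affect contiguity.
            continue
        if stride != expected_stride:
            return False
        expected_stride *= size
    return True


def pick_reshape_path(sizes: Sequence[Dim], strides: Sequence[Dim]) -> str:
    """Take the fast path only when contiguity is certain (hypothetical helper)."""
    return "fast" if definitely_contiguous_sketch(sizes, strides) else "general"
```

With these definitions, `pick_reshape_path([2, 3], [3, 1])` selects the fast path, while an input with an unknown dimension such as `[None, 3]` falls back to the general path, which is assumed to handle both layouts.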

Pull Request resolved: https://github.com/pytorch/pytorch/pull/153432
Approved by: https://github.com/bobrenjc93
2025-05-28 03:41:26 +00:00

1.6 KiB

add_loop_eager,compile_time_instruction_count,2953000000,0.015
add_loop_eager_dynamic,compile_time_instruction_count,5738000000,0.025
add_loop_inductor,compile_time_instruction_count,29370000000,0.015
add_loop_inductor_dynamic_gpu,compile_time_instruction_count,44490000000,0.025
add_loop_inductor_gpu,compile_time_instruction_count,25900000000,0.015
basic_modules_ListOfLinears_eager,compile_time_instruction_count,939900000,0.015
basic_modules_ListOfLinears_inductor,compile_time_instruction_count,18270000000,0.015
basic_modules_ListOfLinears_inductor_gpu_force_shape_pad,compile_time_instruction_count,16310000000,0.015
basic_modules_ListOfLinears_inductor_gpu,compile_time_instruction_count,10370000000,0.2
update_hint_regression,compile_time_instruction_count,1700000000,0.02
float_args,compile_time_instruction_count,452500000,0.015
sum_floordiv_regression,compile_time_instruction_count,998600000,0.015
symint_sum,compile_time_instruction_count,3252000000,0.015
symint_sum_loop,compile_time_instruction_count,4262000000,0.015
aotdispatcher_inference_nosubclass_cpu,compile_time_instruction_count,2112000000,0.015
aotdispatcher_inference_subclass_cpu,compile_time_instruction_count,6022000000,0.015
aotdispatcher_partitioner_cpu,compile_time_instruction_count,8672000000,0.015
aotdispatcher_partitioner_cpu2,compile_time_instruction_count,1917000000,0.015
aotdispatcher_training_nosubclass_cpu,compile_time_instruction_count,3859000000,0.015
aotdispatcher_training_subclass_cpu,compile_time_instruction_count,10420000000,0.015
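
Each row appears to carry a benchmark name, a metric name, an expected instruction count, and a relative tolerance. That reading of the last two columns is an assumption, not something the file states. Under that assumption, a check against a measured value might look like the sketch below. `load_expected` and `within_margin` are hypothetical helpers, not part of the benchmark tooling, and the sample rows are copied from the first two entries.

```python
import csv
import io

# Two rows copied from the file, used here as sample input.
SAMPLE = """add_loop_eager,compile_time_instruction_count,2953000000,0.015
add_loop_eager_dynamic,compile_time_instruction_count,5738000000,0.025
"""


def load_expected(text: str) -> dict:
    """Parse rows into {(benchmark, metric): (expected_value, margin)}."""
    expected = {}
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue  # tolerate blank separator lines
        name, metric, value, margin = row
        expected[(name, metric)] = (int(value), float(margin))
    return expected


def within_margin(expected_value: int, margin: float, measured: int) -> bool:
    """True if measured lies within +/- margin (relative) of expected_value."""
    low = expected_value * (1 - margin)
    high = expected_value * (1 + margin)
    return low <= measured <= high


table = load_expected(SAMPLE)
value, margin = table[("add_loop_eager", "compile_time_instruction_count")]
print(within_margin(value, margin, 2_960_000_000))
```

A measured count outside the band would then signal either a regression or an improvement large enough that the expected value should be updated.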