mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-07 00:21:07 +01:00
Summary:
During inference time, the intermediate graphs for optimization are not used, so the Executor's graph is the only graph we need to keep around these two flags

Test Plan:
the FLAGS are all off by default

baseline
```
buck run mode/opt-clang sigrid/predictor/client/localnet:run_model -- --model_id_to_load=951679039 --model_snapshot_to_load=244 --torch_jit_do_not_store_optimized_graph=true
I1212 10:24:20.407408 401092 SigridPredictorLocalModelFactory.cpp:32] Memory usage for 951679039_244 is 182863 Kb
```
```
buck run mode/opt-clang sigrid/predictor/client/localnet:run_model -- --model_id_to_load=951679039 --model_snapshot_to_load=244 --torch_jit_do_not_store_optimized_graph=true --torch_jit_release_profiling_graph_after_optimization=true
I1212 10:31:37.663487 464000 SigridPredictorLocalModelFactory.cpp:32] Memory usage for 951679039_244 is 186127 Kb
```
```
buck run mode/opt-clang sigrid/predictor/client/localnet:run_model -- --model_id_to_load=951679039 --model_snapshot_to_load=244 --torch_jit_do_not_store_optimized_graph=true --torch_jit_release_profiling_graph_after_optimization=true --torch_jit_execution_plan_avoid_extra_graph_copy=true
I1212 10:29:42.848093 447218 SigridPredictorLocalModelFactory.cpp:32] Memory usage for 951679039_244 is 129451 Kb
```

Differential Revision: D52081631
Pull Request resolved: https://github.com/pytorch/pytorch/pull/115657
Approved by: https://github.com/houseroad
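To compare the three memory readings in the test plan, the relative change can be worked out with a few lines of arithmetic. The sketch below is illustrative only: the helper name `percent_change` and the run labels are assumptions made for this example and are not part of PyTorch or the test tooling. The figures are copied from the log lines above, and a single run per configuration says nothing about run-to-run noise.

```python
# Memory usage in Kb, copied from the three log lines in the test plan.
# The keys are hypothetical labels for this example: runs in the order listed.
usage_kb = {
    "run_1": 182863,  # first command in the test plan
    "run_2": 186127,  # adds --torch_jit_release_profiling_graph_after_optimization
    "run_3": 129451,  # adds --torch_jit_execution_plan_avoid_extra_graph_copy
}


def percent_change(before_kb: int, after_kb: int) -> float:
    """Return the change from before_kb to after_kb as a percentage of before_kb."""
    return (after_kb - before_kb) / before_kb * 100.0


if __name__ == "__main__":
    first = usage_kb["run_1"]
    for label, value in usage_kb.items():
        delta = percent_change(first, value)
        print(f"{label}: {value} Kb ({delta:+.1f}% vs run_1)")
```

On these numbers, the third run uses roughly 29% less memory than the first, while the second run is slightly higher than the first.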
| Name | | |
|---|---|---|
| interpreter | ||
| static | ||
| argument_spec.cpp | ||
| argument_spec.h | ||
| autodiff.cpp | ||
| autodiff.h | ||
| calculate_necessary_args.h | ||
| custom_operator.h | ||
| decomposition_registry_util.cpp | ||
| decomposition_registry_util.h | ||
| decomposition_registry.cpp | ||
| decomposition_registry.h | ||
| exception_message.h | ||
| graph_executor_impl.h | ||
| graph_executor.cpp | ||
| graph_executor.h | ||
| graph_iterator.h | ||
| instruction.cpp | ||
| instruction.h | ||
| interpreter.cpp | ||
| interpreter.h | ||
| jit_exception.cpp | ||
| jit_exception.h | ||
| jit_trace.cpp | ||
| jit_trace.h | ||
| logging.cpp | ||
| logging.h | ||
| operator_options.h | ||
| operator.cpp | ||
| operator.h | ||
| print_handler.cpp | ||
| print_handler.h | ||
| profiling_graph_executor_impl.cpp | ||
| profiling_graph_executor_impl.h | ||
| profiling_record.cpp | ||
| profiling_record.h | ||
| register_c10_ops.cpp | ||
| register_cuda_ops.cpp | ||
| register_distributed_ops.cpp | ||
| register_ops_utils.cpp | ||
| register_ops_utils.h | ||
| register_prim_ops_fulljit.cpp | ||
| register_prim_ops.cpp | ||
| register_special_ops.cpp | ||
| script_profile.cpp | ||
| script_profile.h | ||
| serialized_shape_function_registry.cpp | ||
| serialized_shape_function_registry.h | ||
| shape_function_registry.h | ||
| simple_graph_executor_impl.cpp | ||
| simple_graph_executor_impl.h | ||
| slice_indices_adjust.cpp | ||
| slice_indices_adjust.h | ||
| symbolic_script.cpp | ||
| symbolic_script.h | ||
| symbolic_shape_registry_util.cpp | ||
| symbolic_shape_registry_util.h | ||
| symbolic_shape_registry.cpp | ||
| symbolic_shape_registry.h | ||
| vararg_functions.cpp | ||
| vararg_functions.h | ||
| variable_tensor_list.h | ||