pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

History

Yiming Zhou 089ad1d88b [1/n][export] Refactor PT2 Archive weight saving and loading (#160394 ) Summary: We split the refactoring in two parts for forward compatibility concerns First, we land the deserialization (loading part) Then, we land the serialization (saving part) Save weights and constants as individual files in PT2 archive. Each weight/constant will be saved as raw bytes, unless it is a custom object (TorchBind object) or a non-fake tensor subclass, for these two special cases we still save them using pickle. The metadata of saved tensors along with the file name will be saved as `PayloadMeta`. The mapping from FQN to `PayloadMeta` will be saved as `PayloadConfig` under `WEIGHTS_CONFIG_FORMAT` and `CONTANTS_CONFIG_FORMAT` This changes the serialization in python side when calling `torch.export.save()`. For deserialization in python `torch.export.load()`, we make it BC-safe by allowing loading legacy format weights/constants. For deserialization in C++ `torch/nativert/ModelRunner.cpp`, we make this a BC breaking change as currently the OSS ModelRunner API is not being used. The file structure ``` ├── archive_format ├── archive_version ├── byteorder ├── .data │ ├── serialization_id │ └── version ├── data │ ├── sample_inputs │ │ └── model.pt │ ├── constants │ │ ├── tensor_0 │ │ ├── tensor_1 │ │ └── model_constants_config.json │ └── weights │ ├── weight_0 │ ├── weight_1 │ ├── weight_2 │ ├── weight_3 │ └── model_weights_config.json └── models └── model.json ``` Test Plan: CI Rollback Plan: Differential Revision: D80035490 Pull Request resolved: https://github.com/pytorch/pytorch/pull/160394 Approved by: https://github.com/SherlockNoMad		2025-08-26 01:15:42 +00:00
..
example_upgraders.cpp	[schema_upgrader] add C++ upgrader for json based upgrading (#156761 )	2025-06-28 18:15:06 +00:00
example_upgraders.h	[schema_upgrader] add C++ upgrader for json based upgrading (#156761 )	2025-06-28 18:15:06 +00:00
pt2_archive_constants.h	[1/n][export] Refactor PT2 Archive weight saving and loading (#160394 )	2025-08-26 01:15:42 +00:00
pybind.cpp	[schema_upgrader] add C++ upgrader for json based upgrading (#156761 )	2025-06-28 18:15:06 +00:00
pybind.h
upgrader.cpp	[schema_upgrader] add C++ upgrader for json based upgrading (#156761 )	2025-06-28 18:15:06 +00:00
upgrader.h	[schema_upgrader] add C++ upgrader for json based upgrading (#156761 )	2025-06-28 18:15:06 +00:00