pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

History

yiliu30 4669c6d3ae [quant][pt2e][quantizer] Support `set_module_name_qconfig` in X86InductorQuantizer (#126044 ) Summary: Added `set_module_name_qconfig` support to allow users to set configurations based on module name in `X86InductorQuantizer`. For example, only quantize the `sub`: ```python class M(torch.nn.Module): def __init__(self): super().__init__() self.linear = torch.nn.Linear(5, 5) self.sub = Sub() def forward(self, x): x = self.linear(x) x = self.sub(x) return x m = M().eval() example_inputs = (torch.randn(3, 5),) # Set config for a specific submodule. quantizer = X86InductorQuantizer() quantizer.set_module_name_qconfig("sub", xiq.get_default_x86_inductor_quantization_config()) ``` - Added `set_module_name_qconfig` to allow user set the configuration at the `module_name` level. - Unified the annotation process to follow this order: `module_name_qconfig`, `operator_type_qconfig`, and `global_config`. - Added `config_checker` to validate all user configurations and prevent mixing of static/dynamic or QAT/non-QAT configs. - Moved `_get_module_name_filter` from `xnnpack_quantizer.py` into `utils.py` as it common for all quantizer. Test Plan ```bash python -m pytest quantization/pt2e/test_x86inductor_quantizer.py -k test_set_module_name ``` @Xia-Weiwen @leslie-fang-intel @jgong5 Pull Request resolved: https://github.com/pytorch/pytorch/pull/126044 Approved by: https://github.com/jgong5, https://github.com/leslie-fang-intel, https://github.com/jerryzh168		2024-06-14 07:13:10 +00:00
..
ao_migration	Enable UFMT on all of test/quantization/ao_migration &bc (#123994 )	2024-04-13 06:36:10 +00:00
bc	Enable UFMT on all of test/quantization/ao_migration &bc (#123994 )	2024-04-13 06:36:10 +00:00
core	Revert "[cuDNN][Quantization] Don't print when plan finalization fails in cuDNN quantization backend (#128177 )"	2024-06-12 02:20:15 +00:00
eager	Support min/max carry over for eager mode from_float method (#127309 )	2024-05-29 19:33:26 +00:00
fx	[BE]: Update ruff to v0.4.4 (#125031 )	2024-05-12 20:02:37 +00:00
jit	Enable UFMT on all of test/quantization/jit &pt2e (#124010 )	2024-04-14 06:07:23 +00:00
pt2e	[quant][pt2e][quantizer] Support `set_module_name_qconfig` in X86InductorQuantizer (#126044 )	2024-06-14 07:13:10 +00:00
serialized
__init__.py