pytorch/test/quantization
leslie-fang-intel d83ab88f81 [Inductor] [Quant] Enable lowering of quant per tensor and refactor quant pattern (#124041)
**Summary**
Per the discussion in https://github.com/pytorch/pytorch/pull/123444, the `decomposed quant/dequant` patterns changed after https://github.com/pytorch/pytorch/pull/123445. To avoid depending on those changes, we can move the optimization of `decomposed quant/dequant` from the Inductor decomposition phase into the lowering phase (a reference sketch of the quant/dequant semantics follows the list below). This way we can:

- Avoid the pattern matcher failure introduced in https://github.com/pytorch/pytorch/pull/123445
- Make the quantization pattern clearer in the pattern matcher phase, since the `quant/dequant` nodes have not yet been decomposed.
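
For context, this is a minimal runnable sketch of the reference semantics of the decomposed per-tensor quant/dequant ops in plain PyTorch (not the PR's Inductor lowering code), assuming the usual `(input, scale, zero_point, quant_min, quant_max, dtype)` argument order of `torch.ops.quantized_decomposed.quantize_per_tensor`:

```python
# Minimal sketch of the reference semantics of the decomposed per-tensor
# quant/dequant ops (plain PyTorch, not the PR's Inductor lowering code).
import torch

def quantize_per_tensor(x, scale, zero_point, quant_min, quant_max, dtype):
    # q = clamp(round(x / scale) + zero_point, quant_min, quant_max), cast to dtype
    return torch.clamp(
        torch.round(x * (1.0 / scale)) + zero_point, quant_min, quant_max
    ).to(dtype)

def dequantize_per_tensor(q, scale, zero_point, quant_min, quant_max, dtype):
    # x_hat = (q - zero_point) * scale, computed in float32
    return (q.to(torch.float32) - zero_point) * scale

x = torch.randn(4)
q = quantize_per_tensor(x, 0.05, 128, 0, 255, torch.uint8)
x_hat = dequantize_per_tensor(q, 0.05, 128, 0, 255, torch.float32)
```

Before this PR, each quant/dequant node was decomposed into this handful of primitive ops (mul, round, add, clamp, cast) early on; the PR instead keeps the composite nodes intact until lowering.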

**Changes in this PR**

- Move the optimization of `decomposed quant/dequant` from the Inductor decomposition phase into the lowering phase.
- Make corresponding changes in the quantization pattern matcher to avoid breaking backward compatibility (an illustrative pattern sketch follows below).
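
As a hedged illustration (not the exact patterns from this PR), keeping the composite nodes alive means a pattern can be written directly against `quantized_decomposed.dequantize_per_tensor.default` using Inductor's pattern-matcher building blocks, rather than against the primitive ops it used to decompose into:

```python
# Illustrative only: a dequantize-per-tensor pattern expressed against the
# undecomposed op, using Inductor's pattern-matcher building blocks.
import torch
import torch.ao.quantization.fx._decomposed  # noqa: F401  registers quantized_decomposed ops
from torch._inductor.pattern_matcher import CallFunction, KeywordArg

quantized_decomposed = torch.ops.quantized_decomposed

dequantize_per_tensor_pattern = CallFunction(
    quantized_decomposed.dequantize_per_tensor.default,
    KeywordArg("x"),
    KeywordArg("scale"),
    KeywordArg("zero_point"),
    KeywordArg("quant_min"),
    KeywordArg("quant_max"),
    KeywordArg("dtype"),
)
```

Matching one composite node is both simpler and more robust than matching a multi-op subgraph whose exact shape depends on the current decomposition rules.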

**Test Plan**
```
python -u -m pytest -s -v test/inductor/test_mkldnn_pattern_matcher.py -k test_q
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/124041
Approved by: https://github.com/peterbell10, https://github.com/jgong5
2024-05-09 08:40:44 +00:00
ao_migration Enable UFMT on all of test/quantization/ao_migration &bc (#123994) 2024-04-13 06:36:10 +00:00
bc Enable UFMT on all of test/quantization/ao_migration &bc (#123994) 2024-04-13 06:36:10 +00:00
core [Inductor] [Quant] Enable lowering of quant per tensor and refactor quant pattern (#124041) 2024-05-09 08:40:44 +00:00
eager [BE]: Update flake8 to v6.1.0 and fix lints (#116591) 2024-01-03 06:04:44 +00:00
fx Add testing and fix weights_only load for quantized types and nn.Parameters with python attrs (#124330) 2024-04-23 04:13:26 +00:00
jit Enable UFMT on all of test/quantization/jit &pt2e (#124010) 2024-04-14 06:07:23 +00:00
pt2e [quant][pt2e] Fix conv-bn weight + bias per channel QAT (#125208) 2024-04-30 18:12:25 +00:00
serialized
__init__.py