pytorch/test/quantization
Jerry Zhang 1b51d29b66 [quant][pt2e] Enable constant folding for quantize ops (#109343)
Summary:
This PR adds constant folding for quantize ops so that, instead of storing fp32 weights in the
quantized model, we store int8/int16 (etc.) weights.
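For context, a minimal sketch of the flow this affects, assuming the PT2E quantization APIs available around PyTorch 2.1 (`capture_pre_autograd_graph`, `prepare_pt2e`, `convert_pt2e`, `XNNPACKQuantizer`); exact capture entry points and flag names may differ by version, and later releases expose a `fold_quantize` flag on `convert_pt2e`:

```python
# Sketch only: assumes PyTorch ~2.1 PT2E quantization APIs.
import torch
from torch._export import capture_pre_autograd_graph
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(8, 8)

    def forward(self, x):
        return self.linear(x)

example_inputs = (torch.randn(1, 8),)

# Capture the model into an FX graph, annotate it, calibrate, then convert.
m = capture_pre_autograd_graph(M(), example_inputs)
quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
m = prepare_pt2e(m, quantizer)
m(*example_inputs)  # calibrate with representative inputs
m = convert_pt2e(m)

# With constant folding, the weight constant stored in the converted graph is
# the already-quantized int8 tensor, i.e. the chain
#   fp32 weight -> quantize_per_tensor -> dequantize -> linear
# is folded to
#   int8 weight -> dequantize -> linear
```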

Test Plan:
python test/test_quantization.py TestQuantizePT2E.test_fold_quantize

Will also verify in ExecuTorch later.


Differential Revision: [D49399210](https://our.internmc.facebook.com/intern/diff/D49399210)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109343
Approved by: https://github.com/kimishpatel, https://github.com/jgong5
2023-09-27 06:04:45 +00:00
| Name | Last commit | Date |
| --- | --- | --- |
| ao_migration | ao migration: remove package test as this behavior is tested by other things (#94422) | 2023-02-13 16:33:40 +00:00 |
| bc | [BE] Enable ruff's UP rules and autoformat test/ (#105434) | 2023-07-19 20:36:06 +00:00 |
| core | [Quant] Add int8 linear op impl for quantization PT2E with Inductor. Input is an int8 CPU tensor; weight is an int8 MkldnnCPU tensor. (#105818) | 2023-08-27 08:13:12 +00:00 |
| eager | [pytorch][ao] Add torch.matmul in FloatFunctional/QFunctional (#106831) | 2023-08-10 22:43:36 +00:00 |
| fx | Revert "[quant][pt2e] store scale/zero_point as tensor attributes to support serialization (#105894)" | 2023-07-28 01:16:02 +00:00 |
| jit | Reland: Remove remaining global set_default_dtype calls from tests (#108088) | 2023-09-07 03:04:34 +00:00 |
| pt2e | [quant][pt2e] Enable constant folding for quantize ops (#109343) | 2023-09-27 06:04:45 +00:00 |
| serialized | [ao] fix incorrect integer cast on histogram observer bounds (#90355) | 2022-12-12 20:30:44 +00:00 |
| __init__.py | | |