pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Kimish Patel	446afb5f9f	[On Device Quantization][pytorch]Make insert_quant_dequant support ondevice ptq (#83570 ) Summary: This diff adds a way to: - clone previously observed method - Add calls to observer's calculate_qparams methods - Extract the scale and zero point - Use them to insert quant dequant nodes Now for forward method we have - observe_forward - quantize_forward observe_forward is used post training to observer statistics. In the case of dynamic PTQ this requires just running that method once to update weight observer statistics. quantize_forward method will be used to use the observer statistics to calculate quantization parameters and apply that to quant dequant op. Subsequent diffs will replace dequant + op with their quantized op counter parts and replace quantize ops with relevant packed params class where possible Test Plan: To be written Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D38771419](https://our.internmc.facebook.com/intern/diff/D38771419) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83570 Approved by: https://github.com/jerryzh168	2022-08-29 17:51:00 +00:00
Kimish Patel	9189edb3b3	[Quantization][Pytorch] On device quantization support part 1 (#83568 ) Summary: TO support on device quantization this diff introduces observer insertion. Specifically observers are inserted by adding new method with prefix observ_. Intent is that post training, this method will be run to record statistics Test Plan: test_ondevice_quantization.py Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D38771417](https://our.internmc.facebook.com/intern/diff/D38771417) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83568 Approved by: https://github.com/jerryzh168	2022-08-29 17:22:30 +00:00
Zafar	0d020effab	[quant] Fix the parts that were missing after initial migration (#66058 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66058 After the initial migration from `torch.quantization` to `torch.ao.quantization`, some of the files did not change. This happened because the migration was done in parallel, and some of the files were landed while the others were still in the original location. This is the last fix in the AO migration phase 1, which completely enables the ao.quantization namespace. Test Plan: `python test/test_quantization.py` Reviewed By: vkuzo Differential Revision: D31366066 Pulled By: z-a-f fbshipit-source-id: bf4a74885be89d098df2d87e685795a2a64026c5	2021-10-05 11:45:37 -07:00
Supriya Rao	3d976d9ceb	torch.ao migration: quantize_jit.py phase1 (#64860 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64860 ghstack-source-id: 137885395 Test Plan: buck test mode/dev //caffe2/test:quantization Reviewed By: jerryzh168 Differential Revision: D30880574 fbshipit-source-id: 9629027dd3b00bb8d45633e1564fc03a866f8c31	2021-09-13 08:41:48 -07:00

4 Commits