pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Edward Z. Yang	0b8fbfe9de	automatic_dynamic_shapes is on by default (#106188 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106188 Approved by: https://github.com/albanD	2023-07-28 13:26:54 +00:00
Mark Saroufim	c759a57003	Skip deterministic mode for SAM (#105615 ) SAM uses cumsum which doesnt have a deterministic mode enabled so this the onl way I can work around this https://github.com/pytorch/pytorch/issues/89492 Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/105615 Approved by: https://github.com/eellison, https://github.com/cpuhrsch	2023-07-21 01:52:08 +00:00
Elias Ellison	024d26208c	Add Freezing Option to Benchmarking (#105616 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105616 Approved by: https://github.com/desertfire	2023-07-20 22:50:51 +00:00
Michael Lazos	690ea933ca	Enable more e2e foreach optimizer compilation tests (#105438 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/105438 Approved by: https://github.com/jansel	2023-07-20 02:41:19 +00:00
Justin Chu	5ef023b05a	[BE] Enable ruff's UP rules and autoformat benchmarks/ (#105429 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105429 Approved by: https://github.com/malfet	2023-07-19 04:46:37 +00:00
Bin Bao	b10de43c0a	Add aot_inductor as a test backend for benchmarking (#105221 ) Summary: Original PR at https://github.com/pytorch/pytorch/pull/104977. Landing from fbcode instead. Add an aot_inductor backend (Export+AOTInductor) in the benchmarking harness. Note it is not a dynamo backend. Moved files from torch/_inductor/aot_inductor_include to torch/csrc/inductor as a more standard way for exposing headers Created a caching function in benchmarks/dynamo/common.py for compiling, loading and caching the .so file, as a proxy for a pure C++ deployment, but easier for benchmarking. Differential Revision: D47452591 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105221 Approved by: https://github.com/jansel	2023-07-18 13:16:36 +00:00
Edward Z. Yang	10cbc9a063	Enable cuda graphs for dynamic shapes (#105064 ) The general idea is to do a separate CUDA graph for each size. Because of cuda graph trees, these graphs will all share the same memory pool, so your memory usage will only be the worst case memory usage of the biggest dynamic size you want. This requires an extra dispatch in the cudagraphified callable. You must pay for a CUDA graph recording for every dynamic size you encounter, but this is MUCH cheaper than running the entire PT2 compile stack, so I expect you to still see benefits. This was surprisingly easy to do. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105064 Approved by: https://github.com/voznesenskym	2023-07-14 16:13:50 +00:00
Yukio Siraichi	6abe0b2ee8	Disable translation validation on performance runs. (#104887 ) This PR disables translation validation (TV) when running the benchmark suits on performance workflows: inductor with A100s. In summary, the changes are: - Add flag for turning TV on and off on _benchmarks/dynamo/common.py_ - Turn TV on only on CI accuracy builds - Add `--no-translation-validation` target flag to _.ci/pytorch/test.sh_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/104887 Approved by: https://github.com/ezyang	2023-07-11 17:30:40 +00:00
Yukio Siraichi	85cbe7e6fd	Add timeout for translation validation instances. (#104654 ) As of now, translation validation runs to its completion. However, Z3 is time consuming. PR #104464, for example, disables translation validation for a few benchmarks. Instead, this PR introduces a timeout for translation validation. In that case, Z3 will return `unknown`, since it wasn't able to prove or disprove the assertions. Then, we log it as a warning, but don't stop execution. Here's a summary of the changes: - Added an environment variable for turning translation validation on and off - Added an environment variable for setting the translation validation timeout - Possibly reverts the changes in #104464 - ~~Move from "QF_NRA" to "QF_NIRA" logic~~ - ~~It makes more sense, given the nature of the problems~~ - "QF_NRA" seems to solve more instances of _dynamo/test_dynamic_shapes.py_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/104654 Approved by: https://github.com/ezyang	2023-07-08 19:19:00 +00:00
willfengg	202fb95c68	[benchmark][export] Add torch.export passrate for TB/TIMM benchmarks (#104382 ) issues resolved: https://github.com/pytorch/pytorch/issues/104294 local test on TB and TIMM * python benchmarks/dynamo/torchbench.py -d cuda --inference --accuracy --progress --export --print-dataframe-summary * python benchmarks/dynamo/timm_models.py -d cuda --inference --accuracy --progress --export --print-dataframe-summary why not HF * huggingface use kwargs (dict) to torch.nn.module * we will need to support kwargs in torch._export.export, which is in progress local test result timm 95% pass rate (58 ouf of 61 passed) P781702926 * 1 x [export specific]1 x ERROR:common:Mutating module attribute rel_indices during export * 1 x[not relevant to export] Unknown model (SelecSls42b) * 1 x [not relevant to export] Failed to load model: HTTP Error 409: Public access is not permitted on this storage account torchbench 54% pass rate (41 out of 75 passed) P781690552 * 7 x ERROR:common:Dynamo input and output is a strict subset of traced input/output * 3 x ERROR:common:call_method NNModuleVariable() / UserDefinedObjectVariable * 3 x ERROR:common:Mutating module attribute {xx} during export. * 2 x ERROR:common:inline in skipfiles * 2 x ERROR:common:Consider annotating your code using constrain_as_(). It appears that you're trying 1 x ERROR:common:guard on data-dependent symbolic int/float * 1 x ERROR:common:Tensor.tolist * 1 x ERROR:common:Tensor.numpy. Turn on config.numpy_ndarray_as_tensor and install torch_np to support tensor.numpy(). [may be dev * env?] * 1 x ERROR:common:missing: BUILD_SET * 1 x ERROR:common:whole graph export entails exactly one guard export * 1 x ERROR:common:call_function BuiltinVariable(str) [GetAttrVariable(UserMethodVariable(<function * 1 x ERROR:common:Dynamic slicing on data-dependent value is not supported * 1 x ERROR:common:Failed running call_function <function interpolate at 0x7f60a8361ea0>((FakeTensor(..., device='cuda:0', size=(1, 3, 427, * 1 x ERROR:common:Dynamo attempts to add additional input during export: value=0.6177528500556946, source=RandomValueSource(random_call_index=0) * 1 x Found following user inputs located at [16, 17, 18, 19, 20, 21, 22] are mutated. This is currently banned in the aot_export workflow. * 1 x RuntimeError: cumsum_cuda_kernel does not have a deterministic implementation * 4 x pass_due_to_skip * 1 x eager_2nd_run_OOM * 1 x fail_accuracy Pull Request resolved: https://github.com/pytorch/pytorch/pull/104382 Approved by: https://github.com/zhxchen17	2023-07-06 17:16:07 +00:00
Yukio Siraichi	0cee4e3c32	Turn translation validation off on timeouts. (#104464 ) Follow-up to PR: #97964 After the introduction of translation validation, (TV) a few TIMM and TorchBench benchmarks started failing due to TIMEOUT. This PR turns TV off for them. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104464 Approved by: https://github.com/malfet	2023-07-05 19:01:50 +00:00
Yukio Siraichi	40b8d10d5e	Re-land: Turn translation validation on for tests and accuracy runs by default. (#104467 ) Re-landing: #103611 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104467 Approved by: https://github.com/malfet	2023-07-05 19:01:50 +00:00
Edward Z. Yang	2385dad4b3	Enable automatic_dynamic_shapes by default (#103623 ) Some notes: * I now manually turn off `_generate` jobs from running with cudagraphs, as it is unrealistic to expect to cudagraph autoregressive generation up to max sequence length, this would imply compiling the entire unrolled sequence generation. Concretely, cm3leon_generate was timing out post this change, likely due to the compile time slowdown of dynamic shapes ON TOP OF accidentally unrolling all the loops * A few torch._dynamo.reset tactically inserted to force recompiles on tests that expected it * expectedFailureAutomaticDynamic flip into patching automatic_dynamic_shapes=False Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103623 Approved by: https://github.com/voznesenskym	2023-07-05 00:25:02 +00:00
PyTorch MergeBot	a2a8b4d415	Revert "Turn translation validation on for tests and accuracy runs by default. (#103611 )" This reverts commit `e311bed2a8`. Reverted https://github.com/pytorch/pytorch/pull/103611 on behalf of https://github.com/malfet due to Broke inductor tests ([comment](https://github.com/pytorch/pytorch/pull/103611#issuecomment-1614850276))	2023-06-30 15:54:18 +00:00
Yukio Siraichi	e311bed2a8	Turn translation validation on for tests and accuracy runs by default. (#103611 ) This PR turns translation validation on by default for tests and accuracy benchmark runs. It also installs Z3 on CI. The main changes are: - Add `--no-translation-validation` as an option in _test/run_tests.py_ - Set `PYTORCH_TEST_WITH_TV` environment variable - Add `TEST_WITH_TV` variable in _torch/testing/_internal/common_utils.py_ - Turn translation validation on for accuracy benchmarks in _benchmarks/dynamo/common.py_ - Add Z3 installation on CI scripts Pull Request resolved: https://github.com/pytorch/pytorch/pull/103611 Approved by: https://github.com/ezyang	2023-06-30 01:32:21 +00:00
BowenBao	c1a49823cd	[ONNX] Bench torch.onnx.dynamo_export and torch.onnx.export under dynamo bench (#103135 ) - Extend dynamo bench interface with '--compilers onnx' and '--compilers dynamo-onnx' - ONNX bench exports model to onnx and runs in ONNX Runtime. - Introduce error aggregation and report. - Scripts to build ONNX deps and running ONNX bench. - Huggingface accuracy check workaround for ONNX. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103135 Approved by: https://github.com/thiagocrepaldi, https://github.com/jansel	2023-06-22 01:21:09 +00:00
Bin Bao	a2988c9e6a	[CI] Switch inference accuracy and performance tests to bfloat16 (#103535 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103535 Approved by: https://github.com/eellison	2023-06-17 00:24:37 +00:00
Edward Z. Yang	bc6ec97e02	Switch dynamic_shapes to True by default (#103597 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103597 Approved by: https://github.com/voznesenskym	2023-06-15 15:16:20 +00:00
Edward Z. Yang	5211fad738	cm3leon_generate is at edge of timeout, so bump it up (#103607 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103607 Approved by: https://github.com/malfet	2023-06-15 03:40:42 +00:00
PyTorch MergeBot	a60f6dbe69	Revert "Add groups to dynamo benchmarking output data (#103268 )" This reverts commit `455f542ed9`. Reverted https://github.com/pytorch/pytorch/pull/103268 on behalf of https://github.com/drisspg due to no longer needed ([comment](https://github.com/pytorch/pytorch/pull/103268#issuecomment-1591732331))	2023-06-14 17:50:34 +00:00
chuanqiw	3c5ac4baa4	[CI] Enable inductor dynamic accuracy test on cpu device (#103387 ) Enable inductor dynamic accuracy test on cpu in ci workflow to capture issue early. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103387 Approved by: https://github.com/EikanWang, https://github.com/jgong5, https://github.com/desertfire	2023-06-14 06:12:41 +00:00
BowenBao	45104cb67f	Different csv headers by bench mode on infra error (#103134 ) As title. The headers are different for distinct bench mode. This PR is a supplement to https://github.com/pytorch/pytorch/pull/100372 to respect `performance` mode where numerical speedup is expected instead of status text. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103134 Approved by: https://github.com/thiagocrepaldi, https://github.com/ezyang	2023-06-13 03:40:22 +00:00
Driss Guessous	455f542ed9	Add groups to dynamo benchmarking output data (#103268 ) # Summary Ads the required information to enable this issue: https://github.com/pytorch/test-infra/issues/4268 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103268 Approved by: https://github.com/huydhn	2023-06-12 21:09:42 +00:00
Edward Z. Yang	54daf870bc	CUDA graphs overrides dynamic shapes and forces specialization (#103290 ) Previously, cudagraphs and dynamic_shapes were incompatible and enabling dynamic shapes would forcibly disable cudagraphs. This new strategy I think is better. The idea is essentially that cudagraphs is an "optimization" that happens to guard on every input. When cudagraphs is on, we force everything static, and this automatically does the right thing because we will force a recompile if sizes change. This obsoletes https://github.com/pytorch/pytorch/pull/101813 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103290 Approved by: https://github.com/voznesenskym, https://github.com/eellison	2023-06-12 20:26:55 +00:00
Bin Bao	141828498c	[CI] Update inference accuracy test (#103361 ) Summary: 1) Switch inference accuracy test from fp32 to amp (consistent with dashboard run, https://github.com/pytorch/pytorch/pull/103220) 2) GoogleFnet fails in eager with amp or fp16, so fallback to always using fp32. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103361 Approved by: https://github.com/eellison	2023-06-12 19:34:18 +00:00
Edward Z. Yang	c3fdfca5da	Always create ShapeEnv, always apply unspec logic (#103302 ) Originally, my goal for this PR was to remove the `dynamic_shapes` tests in torch/_dynamo/variables/builder.py. However, one thing lead to another, and it turns out that it was easiest to do all of the following in one go: * Unconditionally allocate a ShapeEnv, no matter if dynamic_shapes is enabled or not (torch/_dynamo/output_graph.py). There is a small adjustment to export torch/_dynamo/eval_frame.py to account for the fact that a ShapeEnv always exists, even if you're not doing symbolic export. * Remove dynamic_shapes test from unspec logic (torch/_dynamo/variables/builder.py), the original goal * Specialize strides and storage offset if all sizes are dynamic (torch/fx/experimental/symbolic_shapes.py). This is required to deal with unconditional ShapeEnv: if a ShapeEnv exist, fake tensor-ification may choose to allocate symbols. The idea is that with `automatic_dynamic_shapes == False`, Dynamo should never request dynamic sizes, but this invariant was not upheld for nontrivial strides/offset. The rest are just auxiliary fixups from the above: * Workaround bug in FakeTensorProp where sometimes it doesn't return a FakeTensor (torch/fx/passes/fake_tensor_prop.py), see https://github.com/pytorch/pytorch/pull/103395 for follow up * Make ShapeProp correctly handle int inputs (torch/fx/passes/shape_prop.py) * Disable indexing strength reduction if `assume_static_by_default` is False (torch/_inductor/codegen/triton.py) * Fix hf_T5_generate to NOT toggle `assume_static_by_default` if dynamic shapes is not enabled (benchmarks/dynamo/common.py); technically this is not necessary anymore but it's in for safety. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103302 Approved by: https://github.com/voznesenskym	2023-06-12 12:48:28 +00:00
Edward Z. Yang	414ec6ce97	Turn off automatic_dynamic_shapes in prep for dynamic-by-default (#103320 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103320 Approved by: https://github.com/Skylion007	2023-06-10 02:49:59 +00:00
PyTorch MergeBot	d89dd05e4d	Revert "CUDA graphs overrides dynamic shapes and forces specialization (#103290 )" This reverts commit `c760f0e4dd`. Reverted https://github.com/pytorch/pytorch/pull/103290 on behalf of https://github.com/ezyang due to to handle the other cuda graphs case ([comment](https://github.com/pytorch/pytorch/pull/103290#issuecomment-1584977767))	2023-06-09 18:25:28 +00:00
Edward Z. Yang	c760f0e4dd	CUDA graphs overrides dynamic shapes and forces specialization (#103290 ) Previously, cudagraphs and dynamic_shapes were incompatible and enabling dynamic shapes would forcibly disable cudagraphs. This new strategy I think is better. The idea is essentially that cudagraphs is an "optimization" that happens to guard on every input. When cudagraphs is on, we force everything static, and this automatically does the right thing because we will force a recompile if sizes change. This obsoletes https://github.com/pytorch/pytorch/pull/101813 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103290 Approved by: https://github.com/voznesenskym	2023-06-09 17:43:47 +00:00
Will Constable	39201ce025	Make dynamo bench conditionally import DDP/FSDP (#103163 ) Avoids hitting importerror for singlenode benchmarks when running on a non-distributed build of pytorch. Fixes #102086 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103163 Approved by: https://github.com/lezcano, https://github.com/wanchaol	2023-06-08 19:10:49 +00:00
Elias Ellison	18e4a466db	fix amp in inference in benchmarking suite (#103220 ) Even if you passed in --amp we would run inference in float32. `AlbertForMaskedLM` goes from 1.305 float32 to 1.724x amp, and then again to 1.910x with freezing. Benchmark numbers for amp are about to go way up lol. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103220 Approved by: https://github.com/desertfire	2023-06-08 05:16:22 +00:00
Edward Z. Yang	eeb3c62117	Add Wav2Vec2 HuggingFace support (#103009 ) This is not actually enabled in the benchmark suite as you need https://github.com/pytorch/pytorch/pull/103001 and also training is broken per https://github.com/pytorch/pytorch/issues/101160 but might as well review this part first. Contains https://github.com/pytorch/pytorch/pull/102979 but I will probably rebase past that once it lands. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103009 Approved by: https://github.com/Skylion007	2023-06-06 13:25:06 +00:00
Edward Z. Yang	cca7b38564	Don't allow skipping deepcopy (#102973 ) We might mutate it afterwards! This could lead to hard to understand bugs. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/102973 Approved by: https://github.com/albanD	2023-06-05 20:01:16 +00:00
Vinay Kumar Burugu	8215468870	Feature:To add --tolerance option to benchmark scripts (#102218 ) The "tolerance" option evaluates the model on the baseline device in eager mode (default: CPU) compared to the test device (e.g., CUDA, XLA, etc.) and compares the output tensors to determine the absolute tolerance value based on the [formula](https://pytorch.org/docs/stable/generated/torch.allclose.html). It then saves the results in a CSV file. This comparison highlights the tolerance/accuracy difference between XLA and GPU/CPU devices and can also be used to evaluate newer accelerators. This feature aims to identify accuracy failures on the test device (e.g., XLA) and facilitate quick bug triaging. This feature enables the following capabilities: 1. Ability to monitor accuracy issues of backends 2. Provide more informative picture on accuracy beyond pass/ fail status 3. Having a dump of accuracy information will help triage models accordingly The data generated using this feature is in the [spreadsheet](https://docs.google.com/spreadsheets/d/1A8BAzSqfAw0Q5rgzK5Gk__Uy7qhuynh8tedxKnH-t94/edit#gid=0). The spreadsheet data can be used to compile the below summary table: \| Suite \| Max Tolerance \| \| No. of models with high inaccuracy(>=0.005) \| \| Mean Tolerance \| \| \|------------------ \|:-------------:\|:--------:\|:-------------------------------------------:\|:--------:\|:--------------:\|:--------:\| \| \| xla \| inductor \| xla \| inductor \| xla \| inductor \| \| huggingface \| 0.1169 \| 0.0032 \| 1 \| 0 \| 0.0022 \| 0.0005 \| \| timm_models \| 0.0373 \| 2.8892 \| 10 \| 8 \| 0.0028 \| 0.7044 \| \| torchbench \| 3.013 \| 3.0381 \| 6 \| 2 \| 0.0016 \| 0.0016 \| \| All models \| 3.013 \| 3.0381 \| 17 \| 10 \| 0.0028 \| 0.7044 \| I used PyTorch release/2.0 branch and corresponding [commit_pin](https://github.com/pytorch/pytorch/blob/release/2.0/.github/ci_commit_pins/xla.txt) for XLA to generate the above data. Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/102218 Approved by: https://github.com/jansel	2023-06-03 06:40:26 +00:00
Edward Z. Yang	624257890e	Reenable hf_T5_generate (#102818 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/102818 Approved by: https://github.com/albanD	2023-06-02 17:59:53 +00:00
Edward Z. Yang	7c00d45312	Reenable cm3leon_generate (#102793 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/102793 Approved by: https://github.com/albanD, https://github.com/awgu	2023-06-02 15:15:26 +00:00
Animesh Jain	65631d4515	[benchmarks] Use train mode for accuracy checks for HF models (#102578 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102578 Approved by: https://github.com/desertfire	2023-05-31 19:47:18 +00:00
Bin Bao	47b884a74c	[inductor] Revert a CI remedy for Triton compilation error (#102541 ) Summary: revert https://github.com/pytorch/pytorch/pull/91634 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102541 Approved by: https://github.com/ngimel	2023-05-31 13:13:51 +00:00
Animesh Jain	33a49eeae7	[benchmark] Flag to switch on activation checkpointing for HF models (#102557 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102557 Approved by: https://github.com/ngimel, https://github.com/Chillee	2023-05-30 23:46:14 +00:00
Horace He	e71ab21422	update triton pin (#101919 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101919 Approved by: https://github.com/ngimel	2023-05-30 17:16:05 +00:00
Animesh Jain	040d2cc969	[dynamo] Some torchrec_dlrm related fixes (#101953 ) Issue 1 of https://github.com/pytorch/pytorch/issues/101918 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101953 Approved by: https://github.com/jansel	2023-05-28 17:56:08 +00:00
Bin Bao	ee33bae5c7	Fix an issue where checking sameness throw an exception (#102279 ) Summary: currently the exception is caught by outside and marked as infra_error Pull Request resolved: https://github.com/pytorch/pytorch/pull/102279 Approved by: https://github.com/anijain2305	2023-05-25 19:49:23 +00:00
Jason Ansel	5ba16011d7	Suppress profiler spam in dynamo benchmarks (#101942 ) Makes this stuff go away: ``` STAGE:2023-05-20 20:49:34 63580:63580 ActivityProfilerController.cpp:311] Completed Stage: Warm Up STAGE:2023-05-20 20:49:34 63580:63580 ActivityProfilerController.cpp:317] Completed Stage: Collection STAGE:2023-05-20 20:49:34 63580:63580 ActivityProfilerController.cpp:321] Completed Stage: Post Processing ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101942 Approved by: https://github.com/shunting314, https://github.com/desertfire	2023-05-22 18:32:31 +00:00
Edward Z. Yang	22ca1a1124	Partially fix shape mismatch in vision_maskrcnn (#101477 ) The bulk of the heavy lifting is happening in https://github.com/pytorch/vision/pull/7592 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/101477 Approved by: https://github.com/voznesenskym	2023-05-21 05:20:08 +00:00
drisspg	6f13d6892a	Add meta support for multinomial (#101324 ) # Summary Found this when trying to compile the text gen loop of nanogpt here: `b33289942b/torchbenchmark/models/nanogpt_generate/model.py (L322)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101324 Approved by: https://github.com/ngimel	2023-05-19 00:04:26 +00:00
Animesh Jain	794cc3952e	adding moco to CI (#101098 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/101098 Approved by: https://github.com/desertfire	2023-05-18 10:01:49 +00:00
chuanqiw	b315c9b5ab	[CI] Enlarge memory for OOM models in inductor cpu HF accuracy test (#101395 ) Change the Inductor CPU HF accuracy test node from `linux.4xlarge` (32GB) to `linux.24xlarge` (192GB) to enlarge the node memory. Also add 3 HF models back to CI test. Fixes #101390 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101395 Approved by: https://github.com/EikanWang, https://github.com/desertfire, https://github.com/huydhn	2023-05-18 09:23:30 +00:00
Jason Ansel	403ce1a1c9	Fix benchmark model names printouts with tqdm (#101627 ) With the TQDM changes in #100969 -- the models names ended up getting hidden from the benchmark printouts. We would print the model name with no newline, then tqdm would print a `\r` and overwrite the name of the running model. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101627 Approved by: https://github.com/ezyang	2023-05-17 15:31:11 +00:00
PaliC	e0fc24cdc5	add retries to inductor benchmark suite (#101019 ) This pr accomplishes 1) Enables retries for downloading torchbenchmark and huggingface models in a similar method to how we do it for timm models right now. 2) creates a `_download_model` function for the hugging face and TIMM runners whose output I plan to use to preload the models somewhere if possible (please double check I'll be saving the right thing). Instead of retries, we plan to just add torchbench to a docker image as it is relatively small. <!-- copilot:poem --> ### <samp>🤖 Generated by Copilot at 3361a4c</samp> > _We're the brave and bold coders of the `common.py` module_ > _We've made a handy function for downloading models_ > _We've shared it with our mates in the other runners_ > _So pull and push and try again, we'll get them all in time_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/101019 Approved by: https://github.com/huydhn, https://github.com/desertfire	2023-05-16 21:41:50 +00:00
Edward Z. Yang	41468833fb	vision_maskrcnn is now deterministic (#101116 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/101116 Approved by: https://github.com/ngimel	2023-05-16 21:32:17 +00:00

1 2 3 4 5 ...

268 Commits