pytorch/benchmarks/gpt_fast
2024-10-11 18:30:26 +00:00
..
benchmark.py Run inductor micro benchmark on x86 metal runner (#135042) 2024-09-05 21:31:36 +00:00
generate.py [GPT-fast] Update compilation time target for Llama & Mixtral (#135817) 2024-09-12 07:13:44 +00:00
mixtral_moe_model.py Reduce the number of layers for mixtral moe model to adapt CI memory limitation (#125608) 2024-05-06 21:52:25 +00:00
mixtral_moe_quantize.py [BE] Format .ci/ / .github/ / benchmarks/ / functorch/ / tools/ / torchgen/ with ruff format (#132577) 2024-10-11 18:30:26 +00:00
model.py
quantize.py