ollama/llm
Jesse Gross ad6f6a1d29 llm: Change memory allocation backoff from exponential to incremental
If we create a memory layout that should fit based on report free VRAM
but allocation still fails, we start applying a backoff. This reduces
free VRAM by an exponential percentage (1%, 2%, 4%...). However, the
points chosen tend to be too dense at the beginning and too sparse at
the end. Therefore, this switches to an incremental backoff (10%, 20%,
30%...).
2025-10-23 12:58:31 -07:00
..
llm_darwin.go Optimize container images for startup (#6547) 2024-09-12 12:10:30 -07:00
llm_linux.go Optimize container images for startup (#6547) 2024-09-12 12:10:30 -07:00
llm_windows.go win: lint fix (#10571) 2025-05-05 11:08:12 -07:00
memory_test.go DRY out the runner lifecycle code (#12540) 2025-10-23 11:20:02 -07:00
memory.go DRY out the runner lifecycle code (#12540) 2025-10-23 11:20:02 -07:00
server_test.go DRY out the runner lifecycle code (#12540) 2025-10-23 11:20:02 -07:00
server.go llm: Change memory allocation backoff from exponential to incremental 2025-10-23 12:58:31 -07:00
status.go Improve crash reporting (#7728) 2024-11-19 16:26:57 -08:00