ollama

mirror of https://github.com/zebrajr/ollama.git synced 2025-12-06 00:19:51 +01:00

History

Jesse Gross ad6f6a1d29 llm: Change memory allocation backoff from exponential to incremental If we create a memory layout that should fit based on report free VRAM but allocation still fails, we start applying a backoff. This reduces free VRAM by an exponential percentage (1%, 2%, 4%...). However, the points chosen tend to be too dense at the beginning and too sparse at the end. Therefore, this switches to an incremental backoff (10%, 20%, 30%...).		2025-10-23 12:58:31 -07:00
..
llm_darwin.go	Optimize container images for startup (#6547 )	2024-09-12 12:10:30 -07:00
llm_linux.go	Optimize container images for startup (#6547 )	2024-09-12 12:10:30 -07:00
llm_windows.go	win: lint fix (#10571 )	2025-05-05 11:08:12 -07:00
memory_test.go	DRY out the runner lifecycle code (#12540 )	2025-10-23 11:20:02 -07:00
memory.go	DRY out the runner lifecycle code (#12540 )	2025-10-23 11:20:02 -07:00
server_test.go	DRY out the runner lifecycle code (#12540 )	2025-10-23 11:20:02 -07:00
server.go	llm: Change memory allocation backoff from exponential to incremental	2025-10-23 12:58:31 -07:00
status.go	Improve crash reporting (#7728 )	2024-11-19 16:26:57 -08:00