ollama/ml
Jesse Gross 79f6376f5b ggml: No-alloc mode
Callers can set a backend buffer type to be no-alloc, meaning that
it does not allocate memory for tensors or operations. This can
be used for calculating memory requirements. Tensors and graphs
must be recreated with no-alloc set to false before loading data.

Defaults to false for newly created backend buffer types.
2025-08-08 14:57:13 -07:00
..
backend ggml: No-alloc mode 2025-08-08 14:57:13 -07:00
nn gpt-oss (#11672) 2025-08-05 12:21:16 -07:00
backend.go ggml: Support closing backends 2025-08-08 14:57:13 -07:00