ollama/kvcache
Jesse Gross d3e9ca3eda kvcache: Account for source tensors in defrag operation count
Defragging the KV cache can generate a lot of operations, so we
need to be careful that we don't overflow the number that the graph
can support. We currently account for all of the nodes that we add
to the graph for each move, but we also need to include the original
cache tensors.

Fixes #9904
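
The budgeting described above can be sketched as follows. This is a minimal illustration, not the code in causal.go: the helper name maxMovesPerGraph, its parameters, and the figure of 6 graph nodes per move per layer are assumptions chosen for the example. The idea is to reserve nodes for the source K and V cache tensors of every layer before dividing the remaining budget among moves.

```go
package main

import "fmt"

// maxMovesPerGraph is a hypothetical helper (not part of ollama's API).
// It returns how many defrag moves fit in one graph when the graph holds
// at most maxGraphNodes nodes, the cache has `layers` layers, and each
// move adds nodesPerMove nodes per layer (an assumed figure).
func maxMovesPerGraph(maxGraphNodes, layers, nodesPerMove int) int {
	// Assumption: the original K and V cache tensors each occupy one
	// graph node per layer, regardless of how many moves are scheduled.
	sourceNodes := 2 * layers
	perMove := nodesPerMove * layers
	if perMove <= 0 || maxGraphNodes <= sourceNodes {
		return 0
	}
	// Subtract the fixed cost first, then split the rest among moves.
	return (maxGraphNodes - sourceNodes) / perMove
}

func main() {
	const maxNodes, layers, nodesPerMove = 1008, 4, 6

	// Counting only the nodes added per move overestimates the budget.
	naive := maxNodes / (nodesPerMove * layers)
	accounted := maxMovesPerGraph(maxNodes, layers, nodesPerMove)

	fmt.Println("moves without source tensors counted:", naive)
	fmt.Println("moves with source tensors counted:", accounted)
}
```

With these example numbers, the naive count allows 42 moves, which need 42 * 24 + 8 = 1016 nodes and exceed the limit of 1008. Reserving the 8 source nodes first gives 41 moves, which need 992 nodes in total.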
2025-03-21 10:42:19 -07:00
cache.go input: Rename Options to Batch 2025-03-20 13:28:13 -07:00
causal_test.go input: Rename Options to Batch 2025-03-20 13:28:13 -07:00
causal.go kvcache: Account for source tensors in defrag operation count 2025-03-21 10:42:19 -07:00
encoder.go input: Rename Options to Batch 2025-03-20 13:28:13 -07:00
wrapper.go input: Rename Options to Batch 2025-03-20 13:28:13 -07:00