Mirror of https://github.com/zebrajr/ollama.git, synced 2025-12-06 12:19:56 +01:00
Our default behavior today is to try to fit into a single GPU if possible. Some users would prefer the old behavior of always spreading across multiple GPUs even if the model can fit into one. This exposes that tunable behavior.

| File |
|---|
| auth.go |
| download.go |
| fixblobs_test.go |
| fixblobs.go |
| images.go |
| layer.go |
| manifest_test.go |
| manifest.go |
| model.go |
| modelpath_test.go |
| modelpath.go |
| prompt_test.go |
| prompt.go |
| routes_create_test.go |
| routes_delete_test.go |
| routes_list_test.go |
| routes_test.go |
| routes.go |
| sched_test.go |
| sched.go |
| upload.go |
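
The placement policy described in the commit message above (prefer a single GPU, with an opt-in to always spread) can be sketched roughly as follows. This is a minimal illustration, not the code in `sched.go`: the `GPU` type, the `pickGPUs` function, and the `spread` flag are hypothetical names, and memory is simplified to whole GiB.

```go
package main

import "fmt"

// GPU is a hypothetical, simplified view of a device:
// an identifier and its free memory in GiB.
type GPU struct {
	ID      string
	FreeGiB uint64
}

// pickGPUs returns the devices a model needing needGiB would be loaded on.
// With spread=false it returns the first single GPU with enough free memory
// and falls back to all GPUs only when no single device fits.
// With spread=true it always returns every GPU.
// It returns nil when even the combined free memory is too small.
func pickGPUs(gpus []GPU, needGiB uint64, spread bool) []GPU {
	if !spread {
		for _, g := range gpus {
			if g.FreeGiB >= needGiB {
				return []GPU{g}
			}
		}
	}
	var total uint64
	for _, g := range gpus {
		total += g.FreeGiB
	}
	if total < needGiB {
		return nil
	}
	return gpus
}

func main() {
	gpus := []GPU{{"gpu0", 8}, {"gpu1", 24}}

	fmt.Println(len(pickGPUs(gpus, 6, false)))  // default: fits on one GPU
	fmt.Println(len(pickGPUs(gpus, 6, true)))   // tunable on: always spread
	fmt.Println(len(pickGPUs(gpus, 30, false))) // too big for one GPU: spread
}
```

In this sketch the default path and the spread path share the same fallback, so turning the flag on only changes the outcome for models that would otherwise fit on one device.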