ollama

mirror of https://github.com/zebrajr/ollama.git synced 2025-12-06 12:19:56 +01:00

History

Daniel Hiltgen 5e8ff556cb Support forced spreading for multi GPU Our default behavior today is to try to fit into a single GPU if possible. Some users would prefer the old behavior of always spreading across multiple GPUs even if the model can fit into one. This exposes that tunable behavior.		2024-06-14 14:51:40 -07:00
..
auth.go	Revert "use post token"	2024-05-11 22:19:14 -07:00
download.go	server: skip blob verification for already verified blobs	2024-06-05 16:39:11 -07:00
fixblobs_test.go	server: replace blob prefix separator from ':' to '-' (#3146 )	2024-03-14 20:18:06 -07:00
fixblobs.go	server: replace blob prefix separator from ':' to '-' (#3146 )	2024-03-14 20:18:06 -07:00
images.go	server: remove jwt decoding error (#5027 )	2024-06-13 11:21:15 -07:00
layer.go	Merge pull request #3718 from ollama/mxyng/modelname-3	2024-05-29 12:02:07 -07:00
manifest_test.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
manifest.go	fix: skip removing layers that no longer exist	2024-06-10 11:32:19 -07:00
model.go	fix: multiple templates when creating from model	2024-06-12 13:35:49 -07:00
modelpath_test.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
modelpath.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
prompt_test.go	change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347 )	2024-03-26 13:04:17 -07:00
prompt.go	change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347 )	2024-03-26 13:04:17 -07:00
routes_create_test.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
routes_delete_test.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
routes_list_test.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
routes_test.go	add OLLAMA_MODELS to envconfig (#5029 )	2024-06-13 12:52:03 -07:00
routes.go	API app/browser access (#4879 )	2024-06-06 15:19:03 -07:00
sched_test.go	Improve multi-gpu handling at the limit	2024-06-14 14:51:40 -07:00
sched.go	Support forced spreading for multi GPU	2024-06-14 14:51:40 -07:00
upload.go	lint	2024-06-04 11:13:30 -07:00