ollama

mirror of https://github.com/zebrajr/ollama.git synced 2025-12-06 12:19:56 +01:00

History

Jesse Gross 6cd566872b sched: Lift parallel restriction for multimodal models except mllama The Go runner does not have a problem with supporting parallel requests for most multimodal models. Now that we won't be potentially falling back to server.cpp, this restriction can be lifted. However, the new mllama model can't support parallel requests, so we will need to keep a restriction for that.		2024-11-06 13:32:18 -08:00
..
imageproc	add more tests for getting the optimal tiled canvas (#7411 )	2024-10-29 16:28:02 -07:00
testdata/tools	server: add tool parsing support for nemotron-mini (#6849 )	2024-09-17 18:06:16 -07:00
auth.go	fix nil deref in auth.go	2024-07-26 14:14:48 -07:00
download.go	server: fix blob download when receiving a 200 response (#6656 )	2024-09-05 10:48:26 -07:00
fixblobs_test.go	server: replace blob prefix separator from ':' to '-' (#3146 )	2024-03-14 20:18:06 -07:00
fixblobs.go	server: replace blob prefix separator from ':' to '-' (#3146 )	2024-03-14 20:18:06 -07:00
images.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
layer.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
manifest_test.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
manifest.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
model_test.go	server: add tool parsing support for nemotron-mini (#6849 )	2024-09-17 18:06:16 -07:00
model.go	image processing for llama3.2 (#6963 )	2024-10-18 16:12:35 -07:00
modelpath_test.go	validate model path	2024-08-28 09:32:57 -07:00
modelpath.go	validate model path	2024-08-28 09:32:57 -07:00
prompt_test.go	runner.go: Better abstract vision model integration	2024-10-30 14:53:43 -07:00
prompt.go	prompt: Use a single token when estimating mllama context size	2024-11-05 10:11:50 -08:00
routes_create_test.go	Merge pull request #6534 from ollama/mxyng/messages	2024-08-30 09:39:59 -07:00
routes_delete_test.go	server: clean up route names for consistency (#6524 )	2024-08-26 19:36:11 -07:00
routes_generate_test.go	image processing for llama3.2 (#6963 )	2024-10-18 16:12:35 -07:00
routes_list_test.go	server: clean up route names for consistency (#6524 )	2024-08-26 19:36:11 -07:00
routes_test.go	image processing for llama3.2 (#6963 )	2024-10-18 16:12:35 -07:00
routes.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
sched_test.go	Rename gpu package discover (#7143 )	2024-10-16 17:45:00 -07:00
sched.go	sched: Lift parallel restriction for multimodal models except mllama	2024-11-06 13:32:18 -08:00
sparse_common.go	Don't hard fail on sparse setup error	2024-08-09 12:16:19 -07:00
sparse_windows.go	Don't hard fail on sparse setup error	2024-08-09 12:16:19 -07:00
upload.go	server: limit upload parts to 16 (#6411 )	2024-08-19 09:20:52 -07:00