ollama/model
Michael Yang 9f3a37fd36
fix: model load for unsupported embedding models (#12311)
with #12181, there's now support for embeddings in ollama engine.
this is done by mutating the architecture and adding _embed when it
detects an embedding model. however this introduced a bug where if
an embedding model was run based on an existing ollama engine model
without an embedding implementation, e.g. llama4, it will pass the
initial arch support check but fail when actually loaded.

there's currently two entrypoints to creating a model. previously this
second entrypoint was necessary because calling model.New would also
load the model. since #11818, this is no longer th case so merge them
to reduce complexity
2025-09-18 16:11:08 -07:00
..
imageproc imageproc mllama refactor (#7537) 2024-12-14 19:50:15 -08:00
input batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
models feat: qwen3 embed (#12301) 2025-09-18 15:50:32 -07:00
parsers address comments 2025-09-15 11:46:25 -07:00
renderers address comments 2025-09-15 11:46:25 -07:00
testdata gemma2 impl 2025-03-11 14:35:08 -07:00
bytepairencoding_test.go model: add bpe roundtripping tests 2025-08-19 22:05:48 -07:00
bytepairencoding.go embedding gemma model (#12181) 2025-09-04 09:09:07 -07:00
model_test.go fix: model load for unsupported embedding models (#12311) 2025-09-18 16:11:08 -07:00
model.go fix: model load for unsupported embedding models (#12311) 2025-09-18 16:11:08 -07:00
sentencepiece_test.go model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00
sentencepiece.go model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00
textprocessor.go model: handle multiple eos tokens (#10577) 2025-05-16 13:40:23 -07:00
vocabulary_test.go model: treat 'user defined' tokens as special tokens (#11077) 2025-06-16 16:03:16 -07:00
vocabulary.go embedding gemma model (#12181) 2025-09-04 09:09:07 -07:00
wordpiece_test.go model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00
wordpiece.go model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00