Model List
The available models change over time as new models are added and older ones are deprecated. For tasks that only require a general LLM or embedding model, use the provided tags instead of hard-coding specific model names. Tags are fully supported as model identifiers in the API and offer a more stable interface, even as the underlying models are updated. The "best-*" tag is a special case that always points to the best model for a given task.
Models are provided on a best-effort basis, which means they will usually be available, but occasional interruptions may occur.
Large Language Models
| Name | Model | Documentation | License | Tags |
|---|---|---|---|---|
| best-vision | gemma-3-27b-it | - | Apache 2.0 | best-vision vision chat |
| best-code | Qwen/Qwen3-Coder-Next | lmstudio-community/Qwen3-Coder-Next-MLX-4bit | Apache 2.0 | best-code code chat |
| best-chat | openai/gpt-oss-120b | https://huggingface.co/openai/gpt-oss-120b | Apache 2.0 | best-chat chat |
| granite-docling-258m-mlx | granite-docling-258m-mlx | https://huggingface.co/ibm-granite/granite-docling-258M-mlx | Apache 2.0 | chat vision |
| gemma-3-27b-it | gemma-3-27b-it | - | Apache 2.0 | best-vision vision chat |
| Qwen/Qwen3-Coder-Next | Qwen/Qwen3-Coder-Next | lmstudio-community/Qwen3-Coder-Next-MLX-4bit | Apache 2.0 | best-code code chat |
| lnc | openai/gpt-oss-20b | - | Apache-2.0 | chat |
| Qwen3.5-122B-A10B | qwen3.5-122b-a10b | https://huggingface.co/mlx-community/Qwen3.5-122B-A10B-4bit | Apache-2.0 | chat reasoning vision |
| gpt-oss-120b | openai/gpt-oss-120b | https://huggingface.co/openai/gpt-oss-120b | Apache 2.0 | best-chat chat |
Embedding Models
| Name | Model | Documentation | License | Tags |
|---|---|---|---|---|
| best-embedding | intfloat/multilingual-e5-large-instruct | https://huggingface.co/intfloat/multilingual-e5-large-instruct | mit | best-embedding embedding |
| multilingual-e5-large-instruct | intfloat/multilingual-e5-large-instruct | https://huggingface.co/intfloat/multilingual-e5-large-instruct | mit | best-embedding embedding |
| bge-reranker-v2-m3 | text-embedding-bge-reranker-v2-m3 | https://huggingface.co/BAAI/bge-reranker-v2-m3#model-list | Apache 2.0 | embedding |
| nomic-embed-text-v1.5 | nomic-ai/nomic-embed-text-v1.5 | https://huggingface.co/nomic-ai/nomic-embed-text-v1.5 | Apache 2.0 | embedding |
Audio Models
| Name | Model | Documentation | License | Tags |
|---|---|---|---|---|
| whisper-large-v3-turbo-swiss-german | Flurin17/whisper-large-v3-turbo-swiss-german | https://huggingface.co/openai/whisper-large-v3 | Apache 2.0 | stt |
| whisper-large-v3 | openai/whisper-large-v3 | - | Apache 2.0 | stt |
