Configuring a provider
Enter your API key
Paste your API key into the key field. For OAuth-based providers (GitHub Copilot, ChatGPT), click Sign in instead.
Selecting a model
Use the model selector dropdown in the chat input area to choose any configured model. The change takes effect immediately — even mid-session. The model you select becomes your default for all new sessions going forward; you can change it at any time.Provider categories
Local Models
Run Ollama models entirely on your own hardware. No API key, no data sent to the cloud, complete privacy.
Cloud Providers
Access frontier models from Anthropic, OpenAI, Google Gemini, GitHub Copilot, OpenRouter, and 10+ more cloud providers.
Custom Endpoints
Point Luminy at any OpenAI-compatible or Anthropic-compatible API — LM Studio, vLLM, Azure OpenAI, private deployments, and more.
Model ID format
Every model in Luminy is identified by aprovider-prefix:model-name string. You will see these IDs in the model selector and in any exported session data.
Ollama is the one exception to the prefix rule. Luminy displays local Ollama models by their bare model name — for example,
qwen2.5-coder:7b — without any extra provider prefix in the chat UI.All supported providers
| Provider | Model ID prefix | API key required | Notes |
|---|---|---|---|
| Ollama | (bare model name) | No | Local inference on your machine |
| Anthropic | anthropic: | Yes | Prompt caching for lower costs on long sessions |
| OpenAI | openai: | Yes | Automatic prefix caching |
| Google Gemini | gemini: | Yes | Automatic prefix caching |
| GitHub Copilot | copilot: | OAuth | Sign in with your GitHub account |
| ChatGPT | chatgpt: | OAuth | Uses chatgpt.com |
| OpenRouter | openrouter: | Yes | Access 100+ models with one key |
| xAI (Grok) | xai: | Yes | Grok model family |
| Mistral | mistral: | Yes | Mistral & Mixtral models |
| Groq | groq: | Yes | High-speed LPU inference |
| DeepInfra | deepinfra: | Yes | Hosted open-source models |
| Cerebras | cerebras: | Yes | Ultra-fast Cerebras inference |
| Cohere | cohere: | Yes | Command R+ and other Cohere models |
| TogetherAI | togetherai: | Yes | Broad open-source model catalog |
| Perplexity | perplexity: | Yes | Online-search-augmented models |
| Alibaba DashScope | alibaba: | Yes | Qwen model family |
| Venice AI | venice: | Yes | Privacy-focused inference |
| Fireworks | fireworks: | Yes | Fast serverless open-source models |
| OpenCode Zen | opencode-zen: | No | Free models, no key needed |
| Custom (OpenAI-compat) | openai-compat: | Optional | Any OpenAI-compatible API |
| Custom (Anthropic-compat) | anthropic-compat: | Optional | Any Anthropic Messages API-compatible server |
