Adding a cloud provider API key
Get your API key
Visit the provider’s developer console and create an API key (see the per-provider instructions below).
Select the provider section
Scroll to the provider you want to configure — for example, Anthropic or OpenAI.
API keys are stored locally on your device and are never transmitted to Luminy’s servers.
Provider details
Anthropic
Anthropic
Anthropic’s Claude models are best-in-class for code understanding, long-context reasoning, and instruction following.Get your key: console.anthropic.comRecommended models:
Prompt caching: Luminy takes full advantage of Anthropic’s prompt caching, which significantly reduces costs on long conversations and large codebase sessions. Repeated context — system prompts, file contents, project rules — is served from cache rather than counted as new input tokens each turn.
| Model ID | Notes |
|---|---|
anthropic:claude-opus-4-5 | Most capable, best for complex tasks |
anthropic:claude-sonnet-4-5 | Balanced speed and capability |
anthropic:claude-haiku-3-5 | Fastest and most cost-efficient |
OpenAI
OpenAI
OpenAI’s GPT and o-series models offer strong all-around coding performance and the broadest ecosystem support.Get your key: platform.openai.comRecommended models:
Automatic prefix caching: OpenAI automatically caches common prefixes server-side, so repeated context in long sessions is handled without any extra configuration.
| Model ID | Notes |
|---|---|
openai:gpt-4o | Fastest GPT-4-class model |
openai:gpt-4o-mini | Lightweight and cost-efficient |
openai:o3-mini | Strong reasoning for complex problems |
Google Gemini
Google Gemini
Gemini models from Google DeepMind excel at long-context tasks and multimodal inputs.Get your key: aistudio.google.comRecommended models:
Automatic prefix caching: like OpenAI, Gemini handles prefix caching automatically on the server side.
| Model ID | Notes |
|---|---|
gemini:gemini-2.0-flash | Fast and capable for everyday tasks |
gemini:gemini-2.5-pro | Top-tier reasoning and code quality |
GitHub Copilot
GitHub Copilot
GitHub Copilot integration uses OAuth — no API key is needed. Luminy authenticates using your GitHub account, so your Copilot subscription covers usage.How to connect:Model ID format:
Sign in with GitHub
Click Sign in with GitHub. A browser window opens for the OAuth authorization flow.
copilot:gpt-4o, copilot:claude-3.5-sonnet, etc. — Luminy fetches the list of models available to your Copilot subscription automatically after sign-in.ChatGPT
ChatGPT
The ChatGPT provider connects to
chatgpt.com via OAuth, giving access to models through your ChatGPT account rather than the OpenAI API.How to connect:Navigate to Settings → ChatGPT and click Sign in. Follow the OAuth flow in the browser. No API key is required — your ChatGPT account subscription is used.Model ID format: chatgpt:<model-id>OpenRouter
OpenRouter
OpenRouter provides a single API key that routes to 100+ models from dozens of providers — a great option if you want to experiment with many models without managing multiple keys.Get your key: openrouter.ai/keysModel ID format:
openrouter:anthropic/claude-3.5-sonnet, openrouter:google/gemini-flash-1.5, etc. — model IDs follow OpenRouter’s vendor/model naming convention.xAI (Grok)
xAI (Grok)
xAI’s Grok models are configured with a standard API key.Get your key: console.x.aiModel ID format:
xai:grok-3, xai:grok-3-mini, etc.Mistral
Mistral
Mistral AI offers the Mistral and Mixtral model families, known for strong performance at efficient parameter counts.Get your key: console.mistral.aiModel ID format:
mistral:mistral-large-latest, mistral:codestral-latest, etc.Groq
Groq
Groq’s LPU inference hardware delivers some of the fastest token generation speeds available in the cloud.Get your key: console.groq.comModel ID format:
groq:llama-3.1-70b-versatile, groq:mixtral-8x7b-32768, etc.Other providers
Other providers
Luminy also supports the following providers — all configured the same way (Settings → provider section → paste API key → Save):
| Provider | Get your key | Model ID prefix |
|---|---|---|
| DeepInfra | deepinfra.com | deepinfra: |
| Cerebras | cloud.cerebras.ai | cerebras: |
| Cohere | dashboard.cohere.com | cohere: |
| TogetherAI | api.together.xyz | togetherai: |
| Perplexity | perplexity.ai/settings/api | perplexity: |
| Alibaba DashScope | dashscope.aliyun.com | alibaba: |
| Venice AI | venice.ai | venice: |
| Fireworks | fireworks.ai | fireworks: |
OpenCode Zen (no key required)
OpenCode Zen (no key required)
OpenCode Zen provides a set of free models that require no API key at all. They are always available in the model selector as soon as you launch Luminy.Available models:
| Model ID | Display name |
|---|---|
opencode-zen:big-pickle | Big Pickle |
opencode-zen:mimo-v2.5-free | MiMo-V2.5 Free |
opencode-zen:nemotron-3-ultra-free | Nemotron 3 Ultra Free |
opencode-zen:deepseek-v4-flash-free | DeepSeek V4 Flash Free |
Security note
Your API keys are stored locally on your device and used only for direct requests to the respective provider APIs. Luminy never proxies your keys through its own servers.
