Model setup quickstart
Setup
In most cases, you can simply run mini-extra config setup to set up your default model and API keys.
This should be run the first time you run mini.
Setting API keys
There are several ways to set your API keys:
- Recommended: Run our setup script: mini-extra config setup. This should also run automatically the first time you run mini.
- Use mini-extra config set ANTHROPIC_API_KEY <your-api-key> to put the key in the mini config file.
- Export your key as an environment variable: export ANTHROPIC_API_KEY=<your-api-key> (this is not persistent if you restart your shell, unless you add it to your shell config, like ~/.bashrc or ~/.zshrc).
- If you only use a single model, you can also set MSWEA_MODEL_API_KEY (as an environment variable or in the config file). This takes precedence over all other keys.
- If you run several agents in parallel, see our note about rotating Anthropic keys here.
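The precedence rule for MSWEA_MODEL_API_KEY can be pictured with a small Python sketch. It only illustrates the behavior described above and is not mini's actual code: the helper name resolve_api_key is hypothetical, it looks at environment variables only (not the config file), and treating an empty value as unset is an assumption.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(provider_key_name: str,
                    env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return the key that would be used, or None if nothing is set.

    Hypothetical helper: mirrors the documented rule that
    MSWEA_MODEL_API_KEY takes precedence over provider-specific keys.
    """
    env = os.environ if env is None else env
    # Assumption: an empty string counts as "not set".
    return env.get("MSWEA_MODEL_API_KEY") or env.get(provider_key_name) or None


if __name__ == "__main__":
    demo_env = {
        "ANTHROPIC_API_KEY": "provider-key",
        "MSWEA_MODEL_API_KEY": "override-key",
    }
    print(resolve_api_key("ANTHROPIC_API_KEY", demo_env))  # override-key
```

Passing a plain dict instead of os.environ keeps the example easy to try without touching your real keys.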
All the API key names
We use litellm to support most models.
Here's a list of all the API key names available in litellm:
ALEPH_ALPHA_API_KEY
ALEPHALPHA_API_KEY
ANTHROPIC_API_KEY
ANYSCALE_API_KEY
AZURE_AI_API_KEY
AZURE_API_KEY
AZURE_OPENAI_API_KEY
BASETEN_API_KEY
CEREBRAS_API_KEY
CLARIFAI_API_KEY
CLOUDFLARE_API_KEY
CO_API_KEY
CODESTRAL_API_KEY
COHERE_API_KEY
DATABRICKS_API_KEY
DEEPINFRA_API_KEY
DEEPSEEK_API_KEY
FEATHERLESS_AI_API_KEY
FIREWORKS_AI_API_KEY
FIREWORKS_API_KEY
FIREWORKSAI_API_KEY
GEMINI_API_KEY
GROQ_API_KEY
HUGGINGFACE_API_KEY
INFINITY_API_KEY
MARITALK_API_KEY
MISTRAL_API_KEY
NEBIUS_API_KEY
NLP_CLOUD_API_KEY
NOVITA_API_KEY
NVIDIA_NIM_API_KEY
OLLAMA_API_KEY
OPENAI_API_KEY
OPENAI_LIKE_API_KEY
OPENROUTER_API_KEY
OR_API_KEY
PALM_API_KEY
PERPLEXITYAI_API_KEY
PREDIBASE_API_KEY
PROVIDER_API_KEY
REPLICATE_API_KEY
TOGETHERAI_API_KEY
VOLCENGINE_API_KEY
VOYAGE_API_KEY
WATSONX_API_KEY
WX_API_KEY
XAI_API_KEY
XINFERENCE_API_KEY
Selecting a model
Model names and providers.
We support most models using litellm.
You can find a list of their supported models here.
Please always include the provider in the model name, e.g., anthropic/claude-... .
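To make the provider/model convention concrete, here is a minimal Python sketch that checks a model name for a provider prefix. The helper split_provider is hypothetical and not part of mini or litellm. It only checks the shape of the string, not whether the provider or model actually exists.

```python
from typing import Tuple


def split_provider(model_name: str) -> Tuple[str, str]:
    """Split 'provider/model' at the first slash.

    Hypothetical helper for illustration only. Everything after the
    first slash is kept as the model part, so names with several
    slashes stay intact.
    """
    provider, sep, model = model_name.partition("/")
    if not sep or not provider or not model:
        raise ValueError(
            f"Model name {model_name!r} should look like 'provider/model'"
        )
    return provider, model


if __name__ == "__main__":
    print(split_provider("anthropic/claude-sonnet-4-20250514"))
    print(split_provider("deepinfra/Qwen/Qwen3-14B"))
```

A bare name such as gpt-5 raises ValueError here, which matches the advice to spell it as openai/gpt-5.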
- Recommended: mini-extra config setup (should be run the first time you run mini) can set the default model for you.
- All command line interfaces allow you to set the model name with -m or --model.
- In addition, you can set the default model with mini-extra config set MSWEA_MODEL_NAME <model-name>, by editing the global config file (shortcut: mini-extra config edit), or by setting the MSWEA_MODEL_NAME environment variable.
- You can also set your model in a config file (key model_name under model).
- If you want to use local models, please check this guide.
Popular models
Here are a few examples of popular models:
anthropic/claude-sonnet-4-20250514
openai/gpt-5
openai/gpt-5-mini
gemini/gemini-2.5-pro
deepseek/deepseek-chat
List of all supported models
Here's a list of all model names supported by litellm as of Aug 29th 2025.
For even more recent models, check the model_prices_and_context_window.json file from litellm.
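If you have a local copy of model_prices_and_context_window.json, a short Python sketch like the following can filter it by provider. It assumes the file is a JSON object keyed by model name whose entries carry litellm_provider and mode fields. Verify this against the current file, since the layout may change. The function name names_for_provider and the three-entry sample are illustrative stand-ins, not real file contents.

```python
import json
from typing import Any, Dict, List


def names_for_provider(catalog: Dict[str, Any], provider: str,
                       mode: str = "chat") -> List[str]:
    """Return sorted model names for one provider and mode.

    Assumes each entry is a dict with 'litellm_provider' and 'mode'
    keys; entries without them are skipped.
    """
    names = []
    for name, info in catalog.items():
        if not isinstance(info, dict):
            continue
        if info.get("litellm_provider") == provider and info.get("mode") == mode:
            names.append(name)
    return sorted(names)


if __name__ == "__main__":
    # Tiny stand-in for the real file, using the assumed layout.
    sample = json.loads("""
    {
        "deepseek/deepseek-chat": {"litellm_provider": "deepseek", "mode": "chat"},
        "gemini/gemini-2.5-pro": {"litellm_provider": "gemini", "mode": "chat"},
        "mistral/mistral-embed": {"litellm_provider": "mistral", "mode": "embedding"}
    }
    """)
    print(names_for_provider(sample, "deepseek"))  # ['deepseek/deepseek-chat']
```

To use it on the real file, load the file with json.load and pass the resulting dict as catalog.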
1024-x-1024/50-steps/bedrock/amazon.nova-canvas-v1:0
1024-x-1024/50-steps/stability.stable-diffusion-xl-v1
1024-x-1024/dall-e-2
1024-x-1024/max-steps/stability.stable-diffusion-xl-v1
256-x-256/dall-e-2
512-x-512/50-steps/stability.stable-diffusion-xl-v0
512-x-512/dall-e-2
512-x-512/max-steps/stability.stable-diffusion-xl-v0
ai21.j2-mid-v1
ai21.j2-ultra-v1
ai21.jamba-1-5-large-v1:0
ai21.jamba-1-5-mini-v1:0
ai21.jamba-instruct-v1:0
aiml/dall-e-2
aiml/dall-e-3
aiml/flux-pro
aiml/flux-pro/v1.1
aiml/flux-pro/v1.1-ultra
aiml/flux-realism
aiml/flux/dev
aiml/flux/kontext-max/text-to-image
aiml/flux/kontext-pro/text-to-image
aiml/flux/schnell
amazon.nova-lite-v1:0
amazon.nova-micro-v1:0
amazon.nova-pro-v1:0
amazon.rerank-v1:0
amazon.titan-embed-image-v1
amazon.titan-embed-text-v1
amazon.titan-embed-text-v2:0
amazon.titan-text-express-v1
amazon.titan-text-lite-v1
amazon.titan-text-premier-v1:0
anthropic.claude-3-5-haiku-20241022-v1:0
anthropic.claude-3-5-sonnet-20240620-v1:0
anthropic.claude-3-5-sonnet-20241022-v2:0
anthropic.claude-3-7-sonnet-20250219-v1:0
anthropic.claude-3-haiku-20240307-v1:0
anthropic.claude-3-opus-20240229-v1:0
anthropic.claude-3-sonnet-20240229-v1:0
anthropic.claude-instant-v1
anthropic.claude-opus-4-1-20250805-v1:0
anthropic.claude-opus-4-20250514-v1:0
anthropic.claude-sonnet-4-20250514-v1:0
anthropic.claude-v1
anthropic.claude-v2
anthropic.claude-v2:1
anyscale/HuggingFaceH4/zephyr-7b-beta
anyscale/codellama/CodeLlama-34b-Instruct-hf
anyscale/codellama/CodeLlama-70b-Instruct-hf
anyscale/google/gemma-7b-it
anyscale/meta-llama/Llama-2-13b-chat-hf
anyscale/meta-llama/Llama-2-70b-chat-hf
anyscale/meta-llama/Llama-2-7b-chat-hf
anyscale/meta-llama/Meta-Llama-3-70B-Instruct
anyscale/meta-llama/Meta-Llama-3-8B-Instruct
anyscale/mistralai/Mistral-7B-Instruct-v0.1
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1
anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1
apac.amazon.nova-lite-v1:0
apac.amazon.nova-micro-v1:0
apac.amazon.nova-pro-v1:0
apac.anthropic.claude-3-5-sonnet-20240620-v1:0
apac.anthropic.claude-3-5-sonnet-20241022-v2:0
apac.anthropic.claude-3-haiku-20240307-v1:0
apac.anthropic.claude-3-sonnet-20240229-v1:0
apac.anthropic.claude-sonnet-4-20250514-v1:0
assemblyai/best
assemblyai/nano
azure/ada
azure/codex-mini
azure/command-r-plus
azure/computer-use-preview
azure/eu/gpt-4o-2024-08-06
azure/eu/gpt-4o-2024-11-20
azure/eu/gpt-4o-mini-2024-07-18
azure/eu/gpt-4o-mini-realtime-preview-2024-12-17
azure/eu/gpt-4o-realtime-preview-2024-10-01
azure/eu/gpt-4o-realtime-preview-2024-12-17
azure/eu/o1-2024-12-17
azure/eu/o1-mini-2024-09-12
azure/eu/o1-preview-2024-09-12
azure/eu/o3-mini-2025-01-31
azure/global-standard/gpt-4o-2024-08-06
azure/global-standard/gpt-4o-2024-11-20
azure/global-standard/gpt-4o-mini
azure/global/gpt-4o-2024-08-06
azure/global/gpt-4o-2024-11-20
azure/gpt-3.5-turbo
azure/gpt-3.5-turbo-0125
azure/gpt-3.5-turbo-instruct-0914
azure/gpt-35-turbo
azure/gpt-35-turbo-0125
azure/gpt-35-turbo-0301
azure/gpt-35-turbo-0613
azure/gpt-35-turbo-1106
azure/gpt-35-turbo-16k
azure/gpt-35-turbo-16k-0613
azure/gpt-35-turbo-instruct
azure/gpt-35-turbo-instruct-0914
azure/gpt-4
azure/gpt-4-0125-preview
azure/gpt-4-0613
azure/gpt-4-1106-preview
azure/gpt-4-32k
azure/gpt-4-32k-0613
azure/gpt-4-turbo
azure/gpt-4-turbo-2024-04-09
azure/gpt-4-turbo-vision-preview
azure/gpt-4.1
azure/gpt-4.1-2025-04-14
azure/gpt-4.1-mini
azure/gpt-4.1-mini-2025-04-14
azure/gpt-4.1-nano
azure/gpt-4.1-nano-2025-04-14
azure/gpt-4.5-preview
azure/gpt-4o
azure/gpt-4o-2024-05-13
azure/gpt-4o-2024-08-06
azure/gpt-4o-2024-11-20
azure/gpt-4o-audio-preview-2024-12-17
azure/gpt-4o-mini
azure/gpt-4o-mini-2024-07-18
azure/gpt-4o-mini-audio-preview-2024-12-17
azure/gpt-4o-mini-realtime-preview-2024-12-17
azure/gpt-4o-mini-transcribe
azure/gpt-4o-mini-tts
azure/gpt-4o-realtime-preview-2024-10-01
azure/gpt-4o-realtime-preview-2024-12-17
azure/gpt-4o-transcribe
azure/gpt-5
azure/gpt-5-2025-08-07
azure/gpt-5-chat
azure/gpt-5-chat-latest
azure/gpt-5-mini
azure/gpt-5-mini-2025-08-07
azure/gpt-5-nano
azure/gpt-5-nano-2025-08-07
azure/gpt-image-1
azure/hd/1024-x-1024/dall-e-3
azure/hd/1024-x-1792/dall-e-3
azure/hd/1792-x-1024/dall-e-3
azure/high/1024-x-1024/gpt-image-1
azure/high/1024-x-1536/gpt-image-1
azure/high/1536-x-1024/gpt-image-1
azure/low/1024-x-1024/gpt-image-1
azure/low/1024-x-1536/gpt-image-1
azure/low/1536-x-1024/gpt-image-1
azure/medium/1024-x-1024/gpt-image-1
azure/medium/1024-x-1536/gpt-image-1
azure/medium/1536-x-1024/gpt-image-1
azure/mistral-large-2402
azure/mistral-large-latest
azure/o1
azure/o1-2024-12-17
azure/o1-mini
azure/o1-mini-2024-09-12
azure/o1-preview
azure/o1-preview-2024-09-12
azure/o3
azure/o3-2025-04-16
azure/o3-deep-research
azure/o3-mini
azure/o3-mini-2025-01-31
azure/o3-pro
azure/o3-pro-2025-06-10
azure/o4-mini
azure/o4-mini-2025-04-16
azure/standard/1024-x-1024/dall-e-2
azure/standard/1024-x-1024/dall-e-3
azure/standard/1024-x-1792/dall-e-3
azure/standard/1792-x-1024/dall-e-3
azure/text-embedding-3-large
azure/text-embedding-3-small
azure/text-embedding-ada-002
azure/tts-1
azure/tts-1-hd
azure/us/gpt-4o-2024-08-06
azure/us/gpt-4o-2024-11-20
azure/us/gpt-4o-mini-2024-07-18
azure/us/gpt-4o-mini-realtime-preview-2024-12-17
azure/us/gpt-4o-realtime-preview-2024-10-01
azure/us/gpt-4o-realtime-preview-2024-12-17
azure/us/o1-2024-12-17
azure/us/o1-mini-2024-09-12
azure/us/o1-preview-2024-09-12
azure/us/o3-mini-2025-01-31
azure/whisper-1
azure_ai/Cohere-embed-v3-english
azure_ai/Cohere-embed-v3-multilingual
azure_ai/FLUX-1.1-pro
azure_ai/FLUX.1-Kontext-pro
azure_ai/Llama-3.2-11B-Vision-Instruct
azure_ai/Llama-3.2-90B-Vision-Instruct
azure_ai/Llama-3.3-70B-Instruct
azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8
azure_ai/Llama-4-Scout-17B-16E-Instruct
azure_ai/Meta-Llama-3-70B-Instruct
azure_ai/Meta-Llama-3.1-405B-Instruct
azure_ai/Meta-Llama-3.1-70B-Instruct
azure_ai/Meta-Llama-3.1-8B-Instruct
azure_ai/Phi-3-medium-128k-instruct
azure_ai/Phi-3-medium-4k-instruct
azure_ai/Phi-3-mini-128k-instruct
azure_ai/Phi-3-mini-4k-instruct
azure_ai/Phi-3-small-128k-instruct
azure_ai/Phi-3-small-8k-instruct
azure_ai/Phi-3.5-MoE-instruct
azure_ai/Phi-3.5-mini-instruct
azure_ai/Phi-3.5-vision-instruct
azure_ai/Phi-4
azure_ai/Phi-4-mini-instruct
azure_ai/Phi-4-multimodal-instruct
azure_ai/cohere-rerank-v3-english
azure_ai/cohere-rerank-v3-multilingual
azure_ai/cohere-rerank-v3.5
azure_ai/deepseek-r1
azure_ai/deepseek-v3
azure_ai/deepseek-v3-0324
azure_ai/embed-v-4-0
azure_ai/global/grok-3
azure_ai/global/grok-3-mini
azure_ai/grok-3
azure_ai/grok-3-mini
azure_ai/jais-30b-chat
azure_ai/jamba-instruct
azure_ai/ministral-3b
azure_ai/mistral-large
azure_ai/mistral-large-2407
azure_ai/mistral-large-latest
azure_ai/mistral-medium-2505
azure_ai/mistral-nemo
azure_ai/mistral-small
azure_ai/mistral-small-2503
babbage-002
bedrock/*/1-month-commitment/cohere.command-light-text-v14
bedrock/*/1-month-commitment/cohere.command-text-v14
bedrock/*/6-month-commitment/cohere.command-light-text-v14
bedrock/*/6-month-commitment/cohere.command-text-v14
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-instant-v1
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v1
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2:1
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-instant-v1
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v1
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2:1
bedrock/ap-northeast-1/anthropic.claude-instant-v1
bedrock/ap-northeast-1/anthropic.claude-v1
bedrock/ap-northeast-1/anthropic.claude-v2
bedrock/ap-northeast-1/anthropic.claude-v2:1
bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0
bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0
bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0
bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0
bedrock/eu-central-1/1-month-commitment/anthropic.claude-instant-v1
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v1
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2:1
bedrock/eu-central-1/6-month-commitment/anthropic.claude-instant-v1
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v1
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2:1
bedrock/eu-central-1/anthropic.claude-instant-v1
bedrock/eu-central-1/anthropic.claude-v1
bedrock/eu-central-1/anthropic.claude-v2
bedrock/eu-central-1/anthropic.claude-v2:1
bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0
bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0
bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0
bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0
bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2
bedrock/eu-west-3/mistral.mistral-large-2402-v1:0
bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1
bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0
bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0
bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0
bedrock/us-east-1/1-month-commitment/anthropic.claude-instant-v1
bedrock/us-east-1/1-month-commitment/anthropic.claude-v1
bedrock/us-east-1/1-month-commitment/anthropic.claude-v2
bedrock/us-east-1/1-month-commitment/anthropic.claude-v2:1
bedrock/us-east-1/6-month-commitment/anthropic.claude-instant-v1
bedrock/us-east-1/6-month-commitment/anthropic.claude-v1
bedrock/us-east-1/6-month-commitment/anthropic.claude-v2
bedrock/us-east-1/6-month-commitment/anthropic.claude-v2:1
bedrock/us-east-1/anthropic.claude-instant-v1
bedrock/us-east-1/anthropic.claude-v1
bedrock/us-east-1/anthropic.claude-v2
bedrock/us-east-1/anthropic.claude-v2:1
bedrock/us-east-1/meta.llama3-70b-instruct-v1:0
bedrock/us-east-1/meta.llama3-8b-instruct-v1:0
bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2
bedrock/us-east-1/mistral.mistral-large-2402-v1:0
bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1
bedrock/us-gov-east-1/amazon.nova-pro-v1:0
bedrock/us-gov-east-1/amazon.titan-embed-text-v1
bedrock/us-gov-east-1/amazon.titan-embed-text-v2:0
bedrock/us-gov-east-1/amazon.titan-text-express-v1
bedrock/us-gov-east-1/amazon.titan-text-lite-v1
bedrock/us-gov-east-1/amazon.titan-text-premier-v1:0
bedrock/us-gov-east-1/anthropic.claude-3-5-sonnet-20240620-v1:0
bedrock/us-gov-east-1/anthropic.claude-3-haiku-20240307-v1:0
bedrock/us-gov-east-1/meta.llama3-70b-instruct-v1:0
bedrock/us-gov-east-1/meta.llama3-8b-instruct-v1:0
bedrock/us-gov-west-1/amazon.nova-pro-v1:0
bedrock/us-gov-west-1/amazon.titan-embed-text-v1
bedrock/us-gov-west-1/amazon.titan-embed-text-v2:0
bedrock/us-gov-west-1/amazon.titan-text-express-v1
bedrock/us-gov-west-1/amazon.titan-text-lite-v1
bedrock/us-gov-west-1/amazon.titan-text-premier-v1:0
bedrock/us-gov-west-1/anthropic.claude-3-5-sonnet-20240620-v1:0
bedrock/us-gov-west-1/anthropic.claude-3-haiku-20240307-v1:0
bedrock/us-gov-west-1/meta.llama3-70b-instruct-v1:0
bedrock/us-gov-west-1/meta.llama3-8b-instruct-v1:0
bedrock/us-west-1/meta.llama3-70b-instruct-v1:0
bedrock/us-west-1/meta.llama3-8b-instruct-v1:0
bedrock/us-west-2/1-month-commitment/anthropic.claude-instant-v1
bedrock/us-west-2/1-month-commitment/anthropic.claude-v1
bedrock/us-west-2/1-month-commitment/anthropic.claude-v2
bedrock/us-west-2/1-month-commitment/anthropic.claude-v2:1
bedrock/us-west-2/6-month-commitment/anthropic.claude-instant-v1
bedrock/us-west-2/6-month-commitment/anthropic.claude-v1
bedrock/us-west-2/6-month-commitment/anthropic.claude-v2
bedrock/us-west-2/6-month-commitment/anthropic.claude-v2:1
bedrock/us-west-2/anthropic.claude-instant-v1
bedrock/us-west-2/anthropic.claude-v1
bedrock/us-west-2/anthropic.claude-v2
bedrock/us-west-2/anthropic.claude-v2:1
bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2
bedrock/us-west-2/mistral.mistral-large-2402-v1:0
bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1
cerebras/llama-3.3-70b
cerebras/llama3.1-70b
cerebras/llama3.1-8b
cerebras/openai/gpt-oss-120b
cerebras/openai/gpt-oss-20b
cerebras/qwen-3-32b
chat-bison
chat-bison-32k
chat-bison-32k@002
chat-bison@001
chat-bison@002
chatdolphin
chatgpt-4o-latest
claude-3-5-haiku-20241022
claude-3-5-haiku-latest
claude-3-5-sonnet-20240620
claude-3-5-sonnet-20241022
claude-3-5-sonnet-latest
claude-3-7-sonnet-20250219
claude-3-7-sonnet-latest
claude-3-haiku-20240307
claude-3-opus-20240229
claude-3-opus-latest
claude-4-opus-20250514
claude-4-sonnet-20250514
claude-opus-4-1
claude-opus-4-1-20250805
claude-opus-4-20250514
claude-sonnet-4-20250514
cloudflare/@cf/meta/llama-2-7b-chat-fp16
cloudflare/@cf/meta/llama-2-7b-chat-int8
cloudflare/@cf/mistral/mistral-7b-instruct-v0.1
cloudflare/@hf/thebloke/codellama-7b-instruct-awq
code-bison
code-bison-32k@002
code-bison32k
code-bison@001
code-bison@002
code-gecko
code-gecko-latest
code-gecko@001
code-gecko@002
codechat-bison
codechat-bison-32k
codechat-bison-32k@002
codechat-bison@001
codechat-bison@002
codechat-bison@latest
codestral/codestral-2405
codestral/codestral-latest
codex-mini-latest
cohere.command-light-text-v14
cohere.command-r-plus-v1:0
cohere.command-r-v1:0
cohere.command-text-v14
cohere.embed-english-v3
cohere.embed-multilingual-v3
cohere.rerank-v3-5:0
command
command-a-03-2025
command-light
command-nightly
command-r
command-r-08-2024
command-r-plus
command-r-plus-08-2024
command-r7b-12-2024
computer-use-preview
dashscope/qwen-max
dashscope/qwen-plus-latest
dashscope/qwen-turbo-latest
dashscope/qwen3-30b-a3b
databricks/databricks-bge-large-en
databricks/databricks-claude-3-7-sonnet
databricks/databricks-gte-large-en
databricks/databricks-llama-2-70b-chat
databricks/databricks-llama-4-maverick
databricks/databricks-meta-llama-3-1-405b-instruct
databricks/databricks-meta-llama-3-3-70b-instruct
databricks/databricks-meta-llama-3-70b-instruct
databricks/databricks-mixtral-8x7b-instruct
databricks/databricks-mpt-30b-instruct
databricks/databricks-mpt-7b-instruct
davinci-002
deepgram/base
deepgram/base-conversationalai
deepgram/base-finance
deepgram/base-general
deepgram/base-meeting
deepgram/base-phonecall
deepgram/base-video
deepgram/base-voicemail
deepgram/enhanced
deepgram/enhanced-finance
deepgram/enhanced-general
deepgram/enhanced-meeting
deepgram/enhanced-phonecall
deepgram/nova
deepgram/nova-2
deepgram/nova-2-atc
deepgram/nova-2-automotive
deepgram/nova-2-conversationalai
deepgram/nova-2-drivethru
deepgram/nova-2-finance
deepgram/nova-2-general
deepgram/nova-2-meeting
deepgram/nova-2-phonecall
deepgram/nova-2-video
deepgram/nova-2-voicemail
deepgram/nova-3
deepgram/nova-3-general
deepgram/nova-3-medical
deepgram/nova-general
deepgram/nova-phonecall
deepgram/whisper
deepgram/whisper-base
deepgram/whisper-large
deepgram/whisper-medium
deepgram/whisper-small
deepgram/whisper-tiny
deepinfra/Austism/chronos-hermes-13b-v2
deepinfra/Gryphe/MythoMax-L2-13b
deepinfra/Gryphe/MythoMax-L2-13b-turbo
deepinfra/KoboldAI/LLaMA2-13B-Tiefighter
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
deepinfra/NovaSky-AI/Sky-T1-32B-Preview
deepinfra/Phind/Phind-CodeLlama-34B-v2
deepinfra/Qwen/QVQ-72B-Preview
deepinfra/Qwen/QwQ-32B
deepinfra/Qwen/QwQ-32B-Preview
deepinfra/Qwen/Qwen2-72B-Instruct
deepinfra/Qwen/Qwen2-7B-Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
deepinfra/Qwen/Qwen2.5-7B-Instruct
deepinfra/Qwen/Qwen2.5-Coder-32B-Instruct
deepinfra/Qwen/Qwen2.5-Coder-7B
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
deepinfra/Qwen/Qwen3-14B
deepinfra/Qwen/Qwen3-235B-A22B
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
deepinfra/Qwen/Qwen3-30B-A3B
deepinfra/Qwen/Qwen3-32B
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
deepinfra/Sao10K/L3-70B-Euryale-v2.1
deepinfra/Sao10K/L3-8B-Lunaris-v1
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
deepinfra/allenai/olmOCR-7B-0725-FP8
deepinfra/anthropic/claude-3-7-sonnet-latest
deepinfra/anthropic/claude-4-opus
deepinfra/anthropic/claude-4-sonnet
deepinfra/bigcode/starcoder2-15b-instruct-v0.1
deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b
deepinfra/cognitivecomputations/dolphin-2.9.1-llama-3-70b
deepinfra/deepinfra/airoboros-70b
deepinfra/deepseek-ai/DeepSeek-Prover-V2-671B
deepinfra/deepseek-ai/DeepSeek-R1
deepinfra/deepseek-ai/DeepSeek-R1-0528
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
deepinfra/deepseek-ai/DeepSeek-V3
deepinfra/deepseek-ai/DeepSeek-V3-0324
deepinfra/deepseek-ai/DeepSeek-V3-0324-Turbo
deepinfra/deepseek-ai/DeepSeek-V3.1
deepinfra/google/codegemma-7b-it
deepinfra/google/gemini-1.5-flash
deepinfra/google/gemini-1.5-flash-8b
deepinfra/google/gemini-2.0-flash-001
deepinfra/google/gemini-2.5-flash
deepinfra/google/gemini-2.5-pro
deepinfra/google/gemma-1.1-7b-it
deepinfra/google/gemma-2-27b-it
deepinfra/google/gemma-2-9b-it
deepinfra/google/gemma-3-12b-it
deepinfra/google/gemma-3-27b-it
deepinfra/google/gemma-3-4b-it
deepinfra/lizpreciatior/lzlv_70b_fp16_hf
deepinfra/mattshumer/Reflection-Llama-3.1-70B
deepinfra/meta-llama/Llama-2-13b-chat-hf
deepinfra/meta-llama/Llama-2-70b-chat-hf
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
deepinfra/meta-llama/Llama-3.2-1B-Instruct
deepinfra/meta-llama/Llama-3.2-3B-Instruct
deepinfra/meta-llama/Llama-3.2-90B-Vision-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-Turbo
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
deepinfra/meta-llama/Llama-Guard-3-8B
deepinfra/meta-llama/Llama-Guard-4-12B
deepinfra/meta-llama/Meta-Llama-3-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
deepinfra/microsoft/Phi-3-medium-4k-instruct
deepinfra/microsoft/Phi-4-multimodal-instruct
deepinfra/microsoft/WizardLM-2-7B
deepinfra/microsoft/WizardLM-2-8x22B
deepinfra/microsoft/phi-4
deepinfra/microsoft/phi-4-reasoning-plus
deepinfra/mistralai/Devstral-Small-2505
deepinfra/mistralai/Devstral-Small-2507
deepinfra/mistralai/Mistral-7B-Instruct-v0.1
deepinfra/mistralai/Mistral-7B-Instruct-v0.2
deepinfra/mistralai/Mistral-7B-Instruct-v0.3
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
deepinfra/mistralai/Mistral-Small-3.1-24B-Instruct-2503
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
deepinfra/mistralai/Mixtral-8x22B-Instruct-v0.1
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
deepinfra/moonshotai/Kimi-K2-Instruct
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
deepinfra/nvidia/Nemotron-4-340B-Instruct
deepinfra/openai/gpt-oss-120b
deepinfra/openai/gpt-oss-20b
deepinfra/openbmb/MiniCPM-Llama3-V-2_5
deepinfra/openchat/openchat-3.6-8b
deepinfra/openchat/openchat_3.5
deepinfra/zai-org/GLM-4.5
deepinfra/zai-org/GLM-4.5-Air
deepseek/deepseek-chat
deepseek/deepseek-coder
deepseek/deepseek-r1
deepseek/deepseek-reasoner
deepseek/deepseek-v3
dolphin
elevenlabs/scribe_v1
elevenlabs/scribe_v1_experimental
embed-english-light-v2.0
embed-english-light-v3.0
embed-english-v2.0
embed-english-v3.0
embed-multilingual-v2.0
embed-multilingual-v3.0
eu.amazon.nova-lite-v1:0
eu.amazon.nova-micro-v1:0
eu.amazon.nova-pro-v1:0
eu.anthropic.claude-3-5-haiku-20241022-v1:0
eu.anthropic.claude-3-5-sonnet-20240620-v1:0
eu.anthropic.claude-3-5-sonnet-20241022-v2:0
eu.anthropic.claude-3-7-sonnet-20250219-v1:0
eu.anthropic.claude-3-haiku-20240307-v1:0
eu.anthropic.claude-3-opus-20240229-v1:0
eu.anthropic.claude-3-sonnet-20240229-v1:0
eu.anthropic.claude-opus-4-1-20250805-v1:0
eu.anthropic.claude-opus-4-20250514-v1:0
eu.anthropic.claude-sonnet-4-20250514-v1:0
eu.meta.llama3-2-1b-instruct-v1:0
eu.meta.llama3-2-3b-instruct-v1:0
eu.mistral.pixtral-large-2502-v1:0
featherless_ai/featherless-ai/Qwerky-72B
featherless_ai/featherless-ai/Qwerky-QwQ-32B
fireworks-ai-4.1b-to-16b
fireworks-ai-56b-to-176b
fireworks-ai-above-16b
fireworks-ai-default
fireworks-ai-embedding-150m-to-350m
fireworks-ai-embedding-up-to-150m
fireworks-ai-moe-up-to-56b
fireworks-ai-up-to-4b
fireworks_ai/WhereIsAI/UAE-Large-V1
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct
fireworks_ai/accounts/fireworks/models/deepseek-r1
fireworks_ai/accounts/fireworks/models/deepseek-r1-0528
fireworks_ai/accounts/fireworks/models/deepseek-r1-basic
fireworks_ai/accounts/fireworks/models/deepseek-v3
fireworks_ai/accounts/fireworks/models/deepseek-v3-0324
fireworks_ai/accounts/fireworks/models/deepseek-v3p1
fireworks_ai/accounts/fireworks/models/firefunction-v2
fireworks_ai/accounts/fireworks/models/glm-4p5
fireworks_ai/accounts/fireworks/models/glm-4p5-air
fireworks_ai/accounts/fireworks/models/gpt-oss-120b
fireworks_ai/accounts/fireworks/models/gpt-oss-20b
fireworks_ai/accounts/fireworks/models/kimi-k2-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct
fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic
fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf
fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct
fireworks_ai/accounts/fireworks/models/yi-large
fireworks_ai/nomic-ai/nomic-embed-text-v1
fireworks_ai/nomic-ai/nomic-embed-text-v1.5
fireworks_ai/thenlper/gte-base
fireworks_ai/thenlper/gte-large
friendliai/meta-llama-3.1-70b-instruct
friendliai/meta-llama-3.1-8b-instruct
ft:babbage-002
ft:davinci-002
ft:gpt-3.5-turbo
ft:gpt-3.5-turbo-0125
ft:gpt-3.5-turbo-0613
ft:gpt-3.5-turbo-1106
ft:gpt-4-0613
ft:gpt-4o-2024-08-06
ft:gpt-4o-2024-11-20
ft:gpt-4o-mini-2024-07-18
gemini-1.0-pro
gemini-1.0-pro-001
gemini-1.0-pro-002
gemini-1.0-pro-vision
gemini-1.0-pro-vision-001
gemini-1.0-ultra
gemini-1.0-ultra-001
gemini-1.5-flash
gemini-1.5-flash-001
gemini-1.5-flash-002
gemini-1.5-flash-exp-0827
gemini-1.5-flash-preview-0514
gemini-1.5-pro
gemini-1.5-pro-001
gemini-1.5-pro-002
gemini-1.5-pro-preview-0215
gemini-1.5-pro-preview-0409
gemini-1.5-pro-preview-0514
gemini-2.0-flash
gemini-2.0-flash-001
gemini-2.0-flash-exp
gemini-2.0-flash-lite
gemini-2.0-flash-lite-001
gemini-2.0-flash-live-preview-04-09
gemini-2.0-flash-preview-image-generation
gemini-2.0-flash-thinking-exp
gemini-2.0-flash-thinking-exp-01-21
gemini-2.0-pro-exp-02-05
gemini-2.5-flash
gemini-2.5-flash-image-preview
gemini-2.5-flash-lite
gemini-2.5-flash-lite-preview-06-17
gemini-2.5-flash-preview-04-17
gemini-2.5-flash-preview-05-20
gemini-2.5-pro
gemini-2.5-pro-exp-03-25
gemini-2.5-pro-preview-03-25
gemini-2.5-pro-preview-05-06
gemini-2.5-pro-preview-06-05
gemini-2.5-pro-preview-tts
gemini-embedding-001
gemini-flash-experimental
gemini-pro
gemini-pro-experimental
gemini-pro-vision
gemini/gemini-1.5-flash
gemini/gemini-1.5-flash-001
gemini/gemini-1.5-flash-002
gemini/gemini-1.5-flash-8b
gemini/gemini-1.5-flash-8b-exp-0827
gemini/gemini-1.5-flash-8b-exp-0924
gemini/gemini-1.5-flash-exp-0827
gemini/gemini-1.5-flash-latest
gemini/gemini-1.5-pro
gemini/gemini-1.5-pro-001
gemini/gemini-1.5-pro-002
gemini/gemini-1.5-pro-exp-0801
gemini/gemini-1.5-pro-exp-0827
gemini/gemini-1.5-pro-latest
gemini/gemini-2.0-flash
gemini/gemini-2.0-flash-001
gemini/gemini-2.0-flash-exp
gemini/gemini-2.0-flash-lite
gemini/gemini-2.0-flash-lite-preview-02-05
gemini/gemini-2.0-flash-live-001
gemini/gemini-2.0-flash-preview-image-generation
gemini/gemini-2.0-flash-thinking-exp
gemini/gemini-2.0-flash-thinking-exp-01-21
gemini/gemini-2.0-pro-exp-02-05
gemini/gemini-2.5-flash
gemini/gemini-2.5-flash-image-preview
gemini/gemini-2.5-flash-lite
gemini/gemini-2.5-flash-lite-preview-06-17
gemini/gemini-2.5-flash-preview-04-17
gemini/gemini-2.5-flash-preview-05-20
gemini/gemini-2.5-flash-preview-tts
gemini/gemini-2.5-pro
gemini/gemini-2.5-pro-exp-03-25
gemini/gemini-2.5-pro-preview-03-25
gemini/gemini-2.5-pro-preview-05-06
gemini/gemini-2.5-pro-preview-06-05
gemini/gemini-2.5-pro-preview-tts
gemini/gemini-exp-1114
gemini/gemini-exp-1206
gemini/gemini-gemma-2-27b-it
gemini/gemini-gemma-2-9b-it
gemini/gemini-pro
gemini/gemini-pro-vision
gemini/gemma-3-27b-it
gemini/imagen-3.0-fast-generate-001
gemini/imagen-3.0-generate-001
gemini/imagen-3.0-generate-002
gemini/imagen-4.0-fast-generate-001
gemini/imagen-4.0-generate-001
gemini/imagen-4.0-ultra-generate-001
gemini/learnlm-1.5-pro-experimental
gpt-3.5-turbo
gpt-3.5-turbo-0125
gpt-3.5-turbo-0301
gpt-3.5-turbo-0613
gpt-3.5-turbo-1106
gpt-3.5-turbo-16k
gpt-3.5-turbo-16k-0613
gpt-3.5-turbo-instruct
gpt-3.5-turbo-instruct-0914
gpt-4
gpt-4-0125-preview
gpt-4-0314
gpt-4-0613
gpt-4-1106-preview
gpt-4-1106-vision-preview
gpt-4-32k
gpt-4-32k-0314
gpt-4-32k-0613
gpt-4-turbo
gpt-4-turbo-2024-04-09
gpt-4-turbo-preview
gpt-4-vision-preview
gpt-4.1
gpt-4.1-2025-04-14
gpt-4.1-mini
gpt-4.1-mini-2025-04-14
gpt-4.1-nano
gpt-4.1-nano-2025-04-14
gpt-4.5-preview
gpt-4.5-preview-2025-02-27
gpt-4o
gpt-4o-2024-05-13
gpt-4o-2024-08-06
gpt-4o-2024-11-20
gpt-4o-audio-preview
gpt-4o-audio-preview-2024-10-01
gpt-4o-audio-preview-2024-12-17
gpt-4o-audio-preview-2025-06-03
gpt-4o-mini
gpt-4o-mini-2024-07-18
gpt-4o-mini-audio-preview
gpt-4o-mini-audio-preview-2024-12-17
gpt-4o-mini-realtime-preview
gpt-4o-mini-realtime-preview-2024-12-17
gpt-4o-mini-search-preview
gpt-4o-mini-search-preview-2025-03-11
gpt-4o-mini-transcribe
gpt-4o-mini-tts
gpt-4o-realtime-preview
gpt-4o-realtime-preview-2024-10-01
gpt-4o-realtime-preview-2024-12-17
gpt-4o-realtime-preview-2025-06-03
gpt-4o-search-preview
gpt-4o-search-preview-2025-03-11
gpt-4o-transcribe
gpt-5
gpt-5-2025-08-07
gpt-5-chat
gpt-5-chat-latest
gpt-5-mini
gpt-5-mini-2025-08-07
gpt-5-nano
gpt-5-nano-2025-08-07
gpt-image-1
gradient_ai/alibaba-qwen3-32b
gradient_ai/anthropic-claude-3-opus
gradient_ai/anthropic-claude-3.5-haiku
gradient_ai/anthropic-claude-3.5-sonnet
gradient_ai/anthropic-claude-3.7-sonnet
gradient_ai/deepseek-r1-distill-llama-70b
gradient_ai/llama3-8b-instruct
gradient_ai/llama3.3-70b-instruct
gradient_ai/mistral-nemo-instruct-2407
gradient_ai/openai-gpt-4o
gradient_ai/openai-gpt-4o-mini
gradient_ai/openai-o3
gradient_ai/openai-o3-mini
groq/deepseek-r1-distill-llama-70b
groq/distil-whisper-large-v3-en
groq/gemma-7b-it
groq/gemma2-9b-it
groq/llama-3.1-405b-reasoning
groq/llama-3.1-70b-versatile
groq/llama-3.1-8b-instant
groq/llama-3.2-11b-text-preview
groq/llama-3.2-11b-vision-preview
groq/llama-3.2-1b-preview
groq/llama-3.2-3b-preview
groq/llama-3.2-90b-text-preview
groq/llama-3.2-90b-vision-preview
groq/llama-3.3-70b-specdec
groq/llama-3.3-70b-versatile
groq/llama-guard-3-8b
groq/llama2-70b-4096
groq/llama3-70b-8192
groq/llama3-8b-8192
groq/llama3-groq-70b-8192-tool-use-preview
groq/llama3-groq-8b-8192-tool-use-preview
groq/meta-llama/llama-4-maverick-17b-128e-instruct
groq/meta-llama/llama-4-scout-17b-16e-instruct
groq/mistral-saba-24b
groq/mixtral-8x7b-32768
groq/moonshotai/kimi-k2-instruct
groq/openai/gpt-oss-120b
groq/openai/gpt-oss-20b
groq/playai-tts
groq/qwen/qwen3-32b
groq/whisper-large-v3
groq/whisper-large-v3-turbo
hd/1024-x-1024/dall-e-3
hd/1024-x-1792/dall-e-3
hd/1792-x-1024/dall-e-3
high/1024-x-1024/gpt-image-1
high/1024-x-1536/gpt-image-1
high/1536-x-1024/gpt-image-1
hyperbolic/NousResearch/Hermes-3-Llama-3.1-70B
hyperbolic/Qwen/QwQ-32B
hyperbolic/Qwen/Qwen2.5-72B-Instruct
hyperbolic/Qwen/Qwen2.5-Coder-32B-Instruct
hyperbolic/Qwen/Qwen3-235B-A22B
hyperbolic/deepseek-ai/DeepSeek-R1
hyperbolic/deepseek-ai/DeepSeek-R1-0528
hyperbolic/deepseek-ai/DeepSeek-V3
hyperbolic/deepseek-ai/DeepSeek-V3-0324
hyperbolic/meta-llama/Llama-3.2-3B-Instruct
hyperbolic/meta-llama/Llama-3.3-70B-Instruct
hyperbolic/meta-llama/Meta-Llama-3-70B-Instruct
hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct
hyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct
hyperbolic/moonshotai/Kimi-K2-Instruct
j2-light
j2-mid
j2-ultra
jamba-1.5
jamba-1.5-large
jamba-1.5-large@001
jamba-1.5-mini
jamba-1.5-mini@001
jamba-large-1.6
jamba-large-1.7
jamba-mini-1.6
jamba-mini-1.7
jina-reranker-v2-base-multilingual
lambda_ai/deepseek-llama3.3-70b
lambda_ai/deepseek-r1-0528
lambda_ai/deepseek-r1-671b
lambda_ai/deepseek-v3-0324
lambda_ai/hermes3-405b
lambda_ai/hermes3-70b
lambda_ai/hermes3-8b
lambda_ai/lfm-40b
lambda_ai/lfm-7b
lambda_ai/llama-4-maverick-17b-128e-instruct-fp8
lambda_ai/llama-4-scout-17b-16e-instruct
lambda_ai/llama3.1-405b-instruct-fp8
lambda_ai/llama3.1-70b-instruct-fp8
lambda_ai/llama3.1-8b-instruct
lambda_ai/llama3.1-nemotron-70b-instruct-fp8
lambda_ai/llama3.2-11b-vision-instruct
lambda_ai/llama3.2-3b-instruct
lambda_ai/llama3.3-70b-instruct-fp8
lambda_ai/qwen25-coder-32b-instruct
lambda_ai/qwen3-32b-fp8
low/1024-x-1024/gpt-image-1
low/1024-x-1536/gpt-image-1
low/1536-x-1024/gpt-image-1
luminous-base
luminous-base-control
luminous-extended
luminous-extended-control
luminous-supreme
luminous-supreme-control
max-x-max/50-steps/stability.stable-diffusion-xl-v0
max-x-max/max-steps/stability.stable-diffusion-xl-v0
medium/1024-x-1024/gpt-image-1
medium/1024-x-1536/gpt-image-1
medium/1536-x-1024/gpt-image-1
medlm-large
medlm-medium
meta.llama2-13b-chat-v1
meta.llama2-70b-chat-v1
meta.llama3-1-405b-instruct-v1:0
meta.llama3-1-70b-instruct-v1:0
meta.llama3-1-8b-instruct-v1:0
meta.llama3-2-11b-instruct-v1:0
meta.llama3-2-1b-instruct-v1:0
meta.llama3-2-3b-instruct-v1:0
meta.llama3-2-90b-instruct-v1:0
meta.llama3-3-70b-instruct-v1:0
meta.llama3-70b-instruct-v1:0
meta.llama3-8b-instruct-v1:0
meta.llama4-maverick-17b-instruct-v1:0
meta.llama4-scout-17b-instruct-v1:0
meta_llama/Llama-3.3-70B-Instruct
meta_llama/Llama-3.3-8B-Instruct
meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8
meta_llama/Llama-4-Scout-17B-16E-Instruct-FP8
mistral.mistral-7b-instruct-v0:2
mistral.mistral-large-2402-v1:0
mistral.mistral-large-2407-v1:0
mistral.mistral-small-2402-v1:0
mistral.mixtral-8x7b-instruct-v0:1
mistral/codestral-2405
mistral/codestral-latest
mistral/codestral-mamba-latest
mistral/devstral-medium-2507
mistral/devstral-small-2505
mistral/devstral-small-2507
mistral/magistral-medium-2506
mistral/magistral-medium-latest
mistral/magistral-small-2506
mistral/magistral-small-latest
mistral/mistral-embed
mistral/mistral-large-2402
mistral/mistral-large-2407
mistral/mistral-large-2411
mistral/mistral-large-latest
mistral/mistral-medium
mistral/mistral-medium-2312
mistral/mistral-medium-2505
mistral/mistral-medium-latest
mistral/mistral-small
mistral/mistral-small-latest
mistral/mistral-tiny
mistral/open-codestral-mamba
mistral/open-mistral-7b
mistral/open-mistral-nemo
mistral/open-mistral-nemo-2407
mistral/open-mixtral-8x22b
mistral/open-mixtral-8x7b
mistral/pixtral-12b-2409
mistral/pixtral-large-2411
mistral/pixtral-large-latest
moonshot/kimi-k2-0711-preview
moonshot/kimi-latest
moonshot/kimi-latest-128k
moonshot/kimi-latest-32k
moonshot/kimi-latest-8k
moonshot/kimi-thinking-preview
moonshot/moonshot-v1-128k
moonshot/moonshot-v1-128k-0430
moonshot/moonshot-v1-128k-vision-preview
moonshot/moonshot-v1-32k
moonshot/moonshot-v1-32k-0430
moonshot/moonshot-v1-32k-vision-preview
moonshot/moonshot-v1-8k
moonshot/moonshot-v1-8k-0430
moonshot/moonshot-v1-8k-vision-preview
moonshot/moonshot-v1-auto
morph/morph-v3-fast
morph/morph-v3-large
multimodalembedding
multimodalembedding@001
nscale/Qwen/QwQ-32B
nscale/Qwen/Qwen2.5-Coder-32B-Instruct
nscale/Qwen/Qwen2.5-Coder-3B-Instruct
nscale/Qwen/Qwen2.5-Coder-7B-Instruct
nscale/black-forest-labs/FLUX.1-schnell
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
nscale/meta-llama/Llama-3.1-8B-Instruct
nscale/meta-llama/Llama-3.3-70B-Instruct
nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct
nscale/mistralai/mixtral-8x22b-instruct-v0.1
nscale/stabilityai/stable-diffusion-xl-base-1.0
o1
o1-2024-12-17
o1-mini
o1-mini-2024-09-12
o1-preview
o1-preview-2024-09-12
o1-pro
o1-pro-2025-03-19
o3
o3-2025-04-16
o3-deep-research
o3-deep-research-2025-06-26
o3-mini
o3-mini-2025-01-31
o3-pro
o3-pro-2025-06-10
o4-mini
o4-mini-2025-04-16
o4-mini-deep-research
o4-mini-deep-research-2025-06-26
oci/meta.llama-3.1-405b-instruct
oci/meta.llama-3.2-90b-vision-instruct
oci/meta.llama-3.3-70b-instruct
oci/meta.llama-4-maverick-17b-128e-instruct-fp8
oci/meta.llama-4-scout-17b-16e-instruct
oci/xai.grok-3
oci/xai.grok-3-fast
oci/xai.grok-3-mini
oci/xai.grok-3-mini-fast
oci/xai.grok-4
ollama/codegeex4
ollama/codegemma
ollama/codellama
ollama/deepseek-coder-v2-base
ollama/deepseek-coder-v2-instruct
ollama/deepseek-coder-v2-lite-base
ollama/deepseek-coder-v2-lite-instruct
ollama/internlm2_5-20b-chat
ollama/llama2
ollama/llama2-uncensored
ollama/llama2:13b
ollama/llama2:70b
ollama/llama2:7b
ollama/llama3
ollama/llama3.1
ollama/llama3:70b
ollama/llama3:8b
ollama/mistral
ollama/mistral-7B-Instruct-v0.1
ollama/mistral-7B-Instruct-v0.2
ollama/mistral-large-instruct-2407
ollama/mixtral-8x22B-Instruct-v0.1
ollama/mixtral-8x7B-Instruct-v0.1
ollama/orca-mini
ollama/vicuna
omni-moderation-2024-09-26
omni-moderation-latest
omni-moderation-latest-intents
openai.gpt-oss-120b-1:0
openai.gpt-oss-20b-1:0
openrouter/anthropic/claude-2
openrouter/anthropic/claude-3-5-haiku
openrouter/anthropic/claude-3-5-haiku-20241022
openrouter/anthropic/claude-3-haiku
openrouter/anthropic/claude-3-haiku-20240307
openrouter/anthropic/claude-3-opus
openrouter/anthropic/claude-3-sonnet
openrouter/anthropic/claude-3.5-sonnet
openrouter/anthropic/claude-3.5-sonnet:beta
openrouter/anthropic/claude-3.7-sonnet
openrouter/anthropic/claude-3.7-sonnet:beta
openrouter/anthropic/claude-instant-v1
openrouter/anthropic/claude-opus-4
openrouter/anthropic/claude-opus-4.1
openrouter/anthropic/claude-sonnet-4
openrouter/bytedance/ui-tars-1.5-7b
openrouter/cognitivecomputations/dolphin-mixtral-8x7b
openrouter/cohere/command-r-plus
openrouter/databricks/dbrx-instruct
openrouter/deepseek/deepseek-chat
openrouter/deepseek/deepseek-chat-v3-0324
openrouter/deepseek/deepseek-chat-v3.1
openrouter/deepseek/deepseek-coder
openrouter/deepseek/deepseek-r1
openrouter/deepseek/deepseek-r1-0528
openrouter/fireworks/firellava-13b
openrouter/google/gemini-2.0-flash-001
openrouter/google/gemini-2.5-flash
openrouter/google/gemini-2.5-pro
openrouter/google/gemini-pro-1.5
openrouter/google/gemini-pro-vision
openrouter/google/palm-2-chat-bison
openrouter/google/palm-2-codechat-bison
openrouter/gryphe/mythomax-l2-13b
openrouter/jondurbin/airoboros-l2-70b-2.1
openrouter/mancer/weaver
openrouter/meta-llama/codellama-34b-instruct
openrouter/meta-llama/llama-2-13b-chat
openrouter/meta-llama/llama-2-70b-chat
openrouter/meta-llama/llama-3-70b-instruct
openrouter/meta-llama/llama-3-70b-instruct:nitro
openrouter/meta-llama/llama-3-8b-instruct:extended
openrouter/meta-llama/llama-3-8b-instruct:free
openrouter/microsoft/wizardlm-2-8x22b:nitro
openrouter/mistralai/mistral-7b-instruct
openrouter/mistralai/mistral-7b-instruct:free
openrouter/mistralai/mistral-large
openrouter/mistralai/mistral-small-3.1-24b-instruct
openrouter/mistralai/mistral-small-3.2-24b-instruct
openrouter/mistralai/mixtral-8x22b-instruct
openrouter/nousresearch/nous-hermes-llama2-13b
openrouter/openai/gpt-3.5-turbo
openrouter/openai/gpt-3.5-turbo-16k
openrouter/openai/gpt-4
openrouter/openai/gpt-4-vision-preview
openrouter/openai/gpt-4o
openrouter/openai/gpt-4o-2024-05-13
openrouter/openai/gpt-5-chat
openrouter/openai/gpt-5-mini
openrouter/openai/gpt-5-nano
openrouter/openai/gpt-oss-120b
openrouter/openai/gpt-oss-20b
openrouter/openai/o1
openrouter/openai/o1-mini
openrouter/openai/o1-mini-2024-09-12
openrouter/openai/o1-preview
openrouter/openai/o1-preview-2024-09-12
openrouter/openai/o3-mini
openrouter/openai/o3-mini-high
openrouter/pygmalionai/mythalion-13b
openrouter/qwen/qwen-2.5-coder-32b-instruct
openrouter/qwen/qwen-vl-plus
openrouter/qwen/qwen3-coder
openrouter/switchpoint/router
openrouter/undi95/remm-slerp-l2-13b
openrouter/x-ai/grok-4
palm/chat-bison
palm/chat-bison-001
palm/text-bison
palm/text-bison-001
palm/text-bison-safety-off
palm/text-bison-safety-recitation-off
perplexity/codellama-34b-instruct
perplexity/codellama-70b-instruct
perplexity/llama-2-70b-chat
perplexity/llama-3.1-70b-instruct
perplexity/llama-3.1-8b-instruct
perplexity/llama-3.1-sonar-huge-128k-online
perplexity/llama-3.1-sonar-large-128k-chat
perplexity/llama-3.1-sonar-large-128k-online
perplexity/llama-3.1-sonar-small-128k-chat
perplexity/llama-3.1-sonar-small-128k-online
perplexity/mistral-7b-instruct
perplexity/mixtral-8x7b-instruct
perplexity/pplx-70b-chat
perplexity/pplx-70b-online
perplexity/pplx-7b-chat
perplexity/pplx-7b-online
perplexity/sonar
perplexity/sonar-deep-research
perplexity/sonar-medium-chat
perplexity/sonar-medium-online
perplexity/sonar-pro
perplexity/sonar-reasoning
perplexity/sonar-reasoning-pro
perplexity/sonar-small-chat
perplexity/sonar-small-online
recraft/recraftv2
recraft/recraftv3
replicate/meta/llama-2-13b
replicate/meta/llama-2-13b-chat
replicate/meta/llama-2-70b
replicate/meta/llama-2-70b-chat
replicate/meta/llama-2-7b
replicate/meta/llama-2-7b-chat
replicate/meta/llama-3-70b
replicate/meta/llama-3-70b-instruct
replicate/meta/llama-3-8b
replicate/meta/llama-3-8b-instruct
replicate/mistralai/mistral-7b-instruct-v0.2
replicate/mistralai/mistral-7b-v0.1
replicate/mistralai/mixtral-8x7b-instruct-v0.1
rerank-english-v2.0
rerank-english-v3.0
rerank-multilingual-v2.0
rerank-multilingual-v3.0
rerank-v3.5
sagemaker/meta-textgeneration-llama-2-13b
sagemaker/meta-textgeneration-llama-2-13b-f
sagemaker/meta-textgeneration-llama-2-70b
sagemaker/meta-textgeneration-llama-2-70b-b-f
sagemaker/meta-textgeneration-llama-2-7b
sagemaker/meta-textgeneration-llama-2-7b-f
sambanova/DeepSeek-R1
sambanova/DeepSeek-R1-Distill-Llama-70B
sambanova/DeepSeek-V3-0324
sambanova/Llama-4-Maverick-17B-128E-Instruct
sambanova/Llama-4-Scout-17B-16E-Instruct
sambanova/Meta-Llama-3.1-405B-Instruct
sambanova/Meta-Llama-3.1-8B-Instruct
sambanova/Meta-Llama-3.2-1B-Instruct
sambanova/Meta-Llama-3.2-3B-Instruct
sambanova/Meta-Llama-3.3-70B-Instruct
sambanova/Meta-Llama-Guard-3-8B
sambanova/QwQ-32B
sambanova/Qwen2-Audio-7B-Instruct
sambanova/Qwen3-32B
snowflake/claude-3-5-sonnet
snowflake/deepseek-r1
snowflake/gemma-7b
snowflake/jamba-1.5-large
snowflake/jamba-1.5-mini
snowflake/jamba-instruct
snowflake/llama2-70b-chat
snowflake/llama3-70b
snowflake/llama3-8b
snowflake/llama3.1-405b
snowflake/llama3.1-70b
snowflake/llama3.1-8b
snowflake/llama3.2-1b
snowflake/llama3.2-3b
snowflake/llama3.3-70b
snowflake/mistral-7b
snowflake/mistral-large
snowflake/mistral-large2
snowflake/mixtral-8x7b
snowflake/reka-core
snowflake/reka-flash
snowflake/snowflake-arctic
snowflake/snowflake-llama-3.1-405b
snowflake/snowflake-llama-3.3-70b
stability.sd3-5-large-v1:0
stability.sd3-large-v1:0
stability.stable-image-core-v1:0
stability.stable-image-core-v1:1
stability.stable-image-ultra-v1:0
stability.stable-image-ultra-v1:1
standard/1024-x-1024/dall-e-3
standard/1024-x-1792/dall-e-3
standard/1792-x-1024/dall-e-3
text-bison
text-bison32k
text-bison32k@002
text-bison@001
text-bison@002
text-completion-codestral/codestral-2405
text-completion-codestral/codestral-latest
text-embedding-004
text-embedding-005
text-embedding-3-large
text-embedding-3-small
text-embedding-ada-002
text-embedding-ada-002-v2
text-embedding-large-exp-03-07
text-embedding-preview-0409
text-moderation-007
text-moderation-latest
text-moderation-stable
text-multilingual-embedding-002
text-multilingual-embedding-preview-0409
text-unicorn
text-unicorn@001
textembedding-gecko
textembedding-gecko-multilingual
textembedding-gecko-multilingual@001
textembedding-gecko@001
textembedding-gecko@003
together-ai-21.1b-41b
together-ai-4.1b-8b
together-ai-41.1b-80b
together-ai-8.1b-21b
together-ai-81.1b-110b
together-ai-embedding-151m-to-350m
together-ai-embedding-up-to-150m
together-ai-up-to-4b
together_ai/OpenAI/gpt-oss-20B
together_ai/Qwen/Qwen2.5-72B-Instruct-Turbo
together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo
together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput
together_ai/Qwen/Qwen3-235B-A22B-Thinking-2507
together_ai/Qwen/Qwen3-235B-A22B-fp8-tput
together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
together_ai/deepseek-ai/DeepSeek-R1
together_ai/deepseek-ai/DeepSeek-R1-0528-tput
together_ai/deepseek-ai/DeepSeek-V3
together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free
together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
together_ai/mistralai/Mistral-7B-Instruct-v0.1
together_ai/mistralai/Mistral-Small-24B-Instruct-2501
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1
together_ai/moonshotai/Kimi-K2-Instruct
together_ai/openai/gpt-oss-120b
together_ai/togethercomputer/CodeLlama-34b-Instruct
together_ai/zai-org/GLM-4.5-Air-FP8
tts-1
tts-1-hd
us.amazon.nova-lite-v1:0
us.amazon.nova-micro-v1:0
us.amazon.nova-premier-v1:0
us.amazon.nova-pro-v1:0
us.anthropic.claude-3-5-haiku-20241022-v1:0
us.anthropic.claude-3-5-sonnet-20240620-v1:0
us.anthropic.claude-3-5-sonnet-20241022-v2:0
us.anthropic.claude-3-7-sonnet-20250219-v1:0
us.anthropic.claude-3-haiku-20240307-v1:0
us.anthropic.claude-3-opus-20240229-v1:0
us.anthropic.claude-3-sonnet-20240229-v1:0
us.anthropic.claude-opus-4-1-20250805-v1:0
us.anthropic.claude-opus-4-20250514-v1:0
us.anthropic.claude-sonnet-4-20250514-v1:0
us.deepseek.r1-v1:0
us.meta.llama3-1-405b-instruct-v1:0
us.meta.llama3-1-70b-instruct-v1:0
us.meta.llama3-1-8b-instruct-v1:0
us.meta.llama3-2-11b-instruct-v1:0
us.meta.llama3-2-1b-instruct-v1:0
us.meta.llama3-2-3b-instruct-v1:0
us.meta.llama3-2-90b-instruct-v1:0
us.meta.llama3-3-70b-instruct-v1:0
us.meta.llama4-maverick-17b-instruct-v1:0
us.meta.llama4-scout-17b-instruct-v1:0
us.mistral.pixtral-large-2502-v1:0
v0/v0-1.0-md
v0/v0-1.5-lg
v0/v0-1.5-md
vertex_ai/claude-3-5-haiku
vertex_ai/claude-3-5-haiku@20241022
vertex_ai/claude-3-5-sonnet
vertex_ai/claude-3-5-sonnet-v2
vertex_ai/claude-3-5-sonnet-v2@20241022
vertex_ai/claude-3-5-sonnet@20240620
vertex_ai/claude-3-7-sonnet@20250219
vertex_ai/claude-3-haiku
vertex_ai/claude-3-haiku@20240307
vertex_ai/claude-3-opus
vertex_ai/claude-3-opus@20240229
vertex_ai/claude-3-sonnet
vertex_ai/claude-3-sonnet@20240229
vertex_ai/claude-opus-4
vertex_ai/claude-opus-4-1
vertex_ai/claude-opus-4-1@20250805
vertex_ai/claude-opus-4@20250514
vertex_ai/claude-sonnet-4
vertex_ai/claude-sonnet-4@20250514
vertex_ai/codestral-2501
vertex_ai/codestral@2405
vertex_ai/codestral@latest
vertex_ai/deepseek-ai/deepseek-r1-0528-maas
vertex_ai/imagegeneration@006
vertex_ai/imagen-3.0-fast-generate-001
vertex_ai/imagen-3.0-generate-001
vertex_ai/imagen-3.0-generate-002
vertex_ai/imagen-4.0-fast-generate-001
vertex_ai/imagen-4.0-generate-001
vertex_ai/imagen-4.0-ultra-generate-001
vertex_ai/jamba-1.5
vertex_ai/jamba-1.5-large
vertex_ai/jamba-1.5-large@001
vertex_ai/jamba-1.5-mini
vertex_ai/jamba-1.5-mini@001
vertex_ai/meta/llama-3.1-405b-instruct-maas
vertex_ai/meta/llama-3.1-70b-instruct-maas
vertex_ai/meta/llama-3.1-8b-instruct-maas
vertex_ai/meta/llama-3.2-90b-vision-instruct-maas
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas
vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas
vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas
vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas
vertex_ai/meta/llama3-405b-instruct-maas
vertex_ai/meta/llama3-70b-instruct-maas
vertex_ai/meta/llama3-8b-instruct-maas
vertex_ai/mistral-large-2411
vertex_ai/mistral-large@2407
vertex_ai/mistral-large@2411-001
vertex_ai/mistral-large@latest
vertex_ai/mistral-nemo@2407
vertex_ai/mistral-nemo@latest
vertex_ai/mistral-small-2503
vertex_ai/mistral-small-2503@001
vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas
vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas
voyage/rerank-2
voyage/rerank-2-lite
voyage/voyage-2
voyage/voyage-3
voyage/voyage-3-large
voyage/voyage-3-lite
voyage/voyage-code-2
voyage/voyage-code-3
voyage/voyage-context-3
voyage/voyage-finance-2
voyage/voyage-large-2
voyage/voyage-law-2
voyage/voyage-lite-01
voyage/voyage-lite-02-instruct
voyage/voyage-multimodal-3
watsonx/ibm/granite-3-8b-instruct
watsonx/mistralai/mistral-large
whisper-1
xai/grok-2
xai/grok-2-1212
xai/grok-2-latest
xai/grok-2-vision
xai/grok-2-vision-1212
xai/grok-2-vision-latest
xai/grok-3
xai/grok-3-beta
xai/grok-3-fast-beta
xai/grok-3-fast-latest
xai/grok-3-latest
xai/grok-3-mini
xai/grok-3-mini-beta
xai/grok-3-mini-fast
xai/grok-3-mini-fast-beta
xai/grok-3-mini-fast-latest
xai/grok-3-mini-latest
xai/grok-4
xai/grok-4-0709
xai/grok-4-latest
xai/grok-beta
xai/grok-code-fast
xai/grok-code-fast-1
xai/grok-code-fast-1-0825
xai/grok-vision-beta
To find the API key name for a model's provider, check the list of API key names in the previous section.
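As a rough illustration of how a model name relates to a key name, the sketch below takes the provider prefix (the part before the first `/`) and tries a few spellings against a handful of key names copied from the API key list. The function name `guess_api_key_name` and the matching rules are assumptions made for this example; this is not how `litellm` or `mini` actually resolves keys, and models without a provider prefix (such as `o3`) are simply not handled.

```python
# A subset of the key names from the "All the API key names" list,
# used here only for illustration.
KNOWN_KEYS = {
    "GROQ_API_KEY",
    "MISTRAL_API_KEY",
    "OPENROUTER_API_KEY",
    "PERPLEXITYAI_API_KEY",
    "REPLICATE_API_KEY",
    "TOGETHERAI_API_KEY",
    "VOYAGE_API_KEY",
    "XAI_API_KEY",
}


def guess_api_key_name(model):
    """Heuristically guess the API key name for a prefixed model name.

    Hypothetical helper: returns None when there is no provider prefix
    or when no spelling of the prefix matches a known key name.
    """
    prefix, sep, _ = model.partition("/")
    if not sep:
        # No "/" in the name, so there is no provider prefix to work with.
        return None
    upper = prefix.upper()
    compact = upper.replace("_", "")
    # Try "<PREFIX>_API_KEY", then without underscores, then with an "AI" suffix.
    for candidate in (f"{upper}_API_KEY", f"{compact}_API_KEY", f"{compact}AI_API_KEY"):
        if candidate in KNOWN_KEYS:
            return candidate
    return None


if __name__ == "__main__":
    for name in ("groq/llama-3.3-70b-versatile", "together_ai/deepseek-ai/DeepSeek-R1", "o3"):
        print(name, "->", guess_api_key_name(name))
```

Once you know the key name, you can store the key with `mini-extra config set <KEY_NAME> <your-api-key>` as described in the section on setting API keys. Prefixes such as `hd/` or `low/` in the list are image-quality tiers rather than providers, which is why the sketch only accepts names that match a known key.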