Skip to content

Model setup quickstart

Setup

In most cases, you can simply run mini-extra config setup to set up your default model and API keys. This should be run the first time you run mini.

Setting API keys

There are several ways to set your API keys:

  • Recommended: Run our setup script: mini-extra config setup. This should also run automatically the first time you run mini.
  • Use mini-extra config set ANTHROPIC_API_KEY <your-api-key> to put the key in the mini config file.
  • Export your key as an environment variable: export ANTHROPIC_API_KEY=<your-api-key> (this is not persistent if you restart your shell, unless you add it to your shell config, like ~/.bashrc or ~/.zshrc).
  • If you only use a single model, you can also set MSWEA_MODEL_API_KEY (as environment variable or in the config file). This takes precedence over all other keys.
  • If you run several agents in parallel, see our note about rotating anthropic keys here.
All the API key names

We use litellm to support most models. Here's a list of all the API key names available in litellm:

ALEPH_ALPHA_API_KEY
ALEPHALPHA_API_KEY
ANTHROPIC_API_KEY
ANYSCALE_API_KEY
AZURE_AI_API_KEY
AZURE_API_KEY
AZURE_OPENAI_API_KEY
BASETEN_API_KEY
CEREBRAS_API_KEY
CLARIFAI_API_KEY
CLOUDFLARE_API_KEY
CO_API_KEY
CODESTRAL_API_KEY
COHERE_API_KEY
DATABRICKS_API_KEY
DEEPINFRA_API_KEY
DEEPSEEK_API_KEY
FEATHERLESS_AI_API_KEY
FIREWORKS_AI_API_KEY
FIREWORKS_API_KEY
FIREWORKSAI_API_KEY
GEMINI_API_KEY
GROQ_API_KEY
HUGGINGFACE_API_KEY
INFINITY_API_KEY
MARITALK_API_KEY
MISTRAL_API_KEY
NEBIUS_API_KEY
NLP_CLOUD_API_KEY
NOVITA_API_KEY
NVIDIA_NIM_API_KEY
OLLAMA_API_KEY
OPENAI_API_KEY
OPENAI_LIKE_API_KEY
OPENROUTER_API_KEY
OR_API_KEY
PALM_API_KEY
PERPLEXITYAI_API_KEY
PREDIBASE_API_KEY
PROVIDER_API_KEY
REPLICATE_API_KEY
TOGETHERAI_API_KEY
VOLCENGINE_API_KEY
VOYAGE_API_KEY
WATSONX_API_KEY
WX_API_KEY
XAI_API_KEY
XINFERENCE_API_KEY

Selecting a model

Model names and providers.

We support most models using litellm. You can find a list of their supported models here. Please always include the provider in the model name, e.g., anthropic/claude-....

  • Recommended: mini-extra config setup (should be run the first time you run mini) can set the default model for you
  • All command line interfaces allow you to set the model name with -m or --model.
  • In addition, you can set the default model with mini-extra config set MSWEA_MODEL_NAME <model-name>, by editing the global config file (shortcut: mini-extra config edit), or by setting the MSWEA_MODEL_NAME environment variable.
  • You can also set your model in a config file (key model_name under model).
  • If you want to use local models, please check this guide.

Popular models

Here's a few examples of popular models:

anthropic/claude-sonnet-4-20250514
openai/gpt-5
openai/gpt-5-mini
gemini/gemini-2.5-pro
deepseek/deepseek-chat
List of all supported models

Here's a list of all model names supported by litellm as of Aug 29th 2025. For even more recent models, check the model_prices_and_context_window.json file from litellm.

1024-x-1024/50-steps/bedrock/amazon.nova-canvas-v1:0
1024-x-1024/50-steps/stability.stable-diffusion-xl-v1
1024-x-1024/dall-e-2
1024-x-1024/max-steps/stability.stable-diffusion-xl-v1
256-x-256/dall-e-2
512-x-512/50-steps/stability.stable-diffusion-xl-v0
512-x-512/dall-e-2
512-x-512/max-steps/stability.stable-diffusion-xl-v0
ai21.j2-mid-v1
ai21.j2-ultra-v1
ai21.jamba-1-5-large-v1:0
ai21.jamba-1-5-mini-v1:0
ai21.jamba-instruct-v1:0
aiml/dall-e-2
aiml/dall-e-3
aiml/flux-pro
aiml/flux-pro/v1.1
aiml/flux-pro/v1.1-ultra
aiml/flux-realism
aiml/flux/dev
aiml/flux/kontext-max/text-to-image
aiml/flux/kontext-pro/text-to-image
aiml/flux/schnell
amazon.nova-lite-v1:0
amazon.nova-micro-v1:0
amazon.nova-pro-v1:0
amazon.rerank-v1:0
amazon.titan-embed-image-v1
amazon.titan-embed-text-v1
amazon.titan-embed-text-v2:0
amazon.titan-text-express-v1
amazon.titan-text-lite-v1
amazon.titan-text-premier-v1:0
anthropic.claude-3-5-haiku-20241022-v1:0
anthropic.claude-3-5-sonnet-20240620-v1:0
anthropic.claude-3-5-sonnet-20241022-v2:0
anthropic.claude-3-7-sonnet-20250219-v1:0
anthropic.claude-3-haiku-20240307-v1:0
anthropic.claude-3-opus-20240229-v1:0
anthropic.claude-3-sonnet-20240229-v1:0
anthropic.claude-instant-v1
anthropic.claude-opus-4-1-20250805-v1:0
anthropic.claude-opus-4-20250514-v1:0
anthropic.claude-sonnet-4-20250514-v1:0
anthropic.claude-v1
anthropic.claude-v2
anthropic.claude-v2:1
anyscale/HuggingFaceH4/zephyr-7b-beta
anyscale/codellama/CodeLlama-34b-Instruct-hf
anyscale/codellama/CodeLlama-70b-Instruct-hf
anyscale/google/gemma-7b-it
anyscale/meta-llama/Llama-2-13b-chat-hf
anyscale/meta-llama/Llama-2-70b-chat-hf
anyscale/meta-llama/Llama-2-7b-chat-hf
anyscale/meta-llama/Meta-Llama-3-70B-Instruct
anyscale/meta-llama/Meta-Llama-3-8B-Instruct
anyscale/mistralai/Mistral-7B-Instruct-v0.1
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1
anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1
apac.amazon.nova-lite-v1:0
apac.amazon.nova-micro-v1:0
apac.amazon.nova-pro-v1:0
apac.anthropic.claude-3-5-sonnet-20240620-v1:0
apac.anthropic.claude-3-5-sonnet-20241022-v2:0
apac.anthropic.claude-3-haiku-20240307-v1:0
apac.anthropic.claude-3-sonnet-20240229-v1:0
apac.anthropic.claude-sonnet-4-20250514-v1:0
assemblyai/best
assemblyai/nano
azure/ada
azure/codex-mini
azure/command-r-plus
azure/computer-use-preview
azure/eu/gpt-4o-2024-08-06
azure/eu/gpt-4o-2024-11-20
azure/eu/gpt-4o-mini-2024-07-18
azure/eu/gpt-4o-mini-realtime-preview-2024-12-17
azure/eu/gpt-4o-realtime-preview-2024-10-01
azure/eu/gpt-4o-realtime-preview-2024-12-17
azure/eu/o1-2024-12-17
azure/eu/o1-mini-2024-09-12
azure/eu/o1-preview-2024-09-12
azure/eu/o3-mini-2025-01-31
azure/global-standard/gpt-4o-2024-08-06
azure/global-standard/gpt-4o-2024-11-20
azure/global-standard/gpt-4o-mini
azure/global/gpt-4o-2024-08-06
azure/global/gpt-4o-2024-11-20
azure/gpt-3.5-turbo
azure/gpt-3.5-turbo-0125
azure/gpt-3.5-turbo-instruct-0914
azure/gpt-35-turbo
azure/gpt-35-turbo-0125
azure/gpt-35-turbo-0301
azure/gpt-35-turbo-0613
azure/gpt-35-turbo-1106
azure/gpt-35-turbo-16k
azure/gpt-35-turbo-16k-0613
azure/gpt-35-turbo-instruct
azure/gpt-35-turbo-instruct-0914
azure/gpt-4
azure/gpt-4-0125-preview
azure/gpt-4-0613
azure/gpt-4-1106-preview
azure/gpt-4-32k
azure/gpt-4-32k-0613
azure/gpt-4-turbo
azure/gpt-4-turbo-2024-04-09
azure/gpt-4-turbo-vision-preview
azure/gpt-4.1
azure/gpt-4.1-2025-04-14
azure/gpt-4.1-mini
azure/gpt-4.1-mini-2025-04-14
azure/gpt-4.1-nano
azure/gpt-4.1-nano-2025-04-14
azure/gpt-4.5-preview
azure/gpt-4o
azure/gpt-4o-2024-05-13
azure/gpt-4o-2024-08-06
azure/gpt-4o-2024-11-20
azure/gpt-4o-audio-preview-2024-12-17
azure/gpt-4o-mini
azure/gpt-4o-mini-2024-07-18
azure/gpt-4o-mini-audio-preview-2024-12-17
azure/gpt-4o-mini-realtime-preview-2024-12-17
azure/gpt-4o-mini-transcribe
azure/gpt-4o-mini-tts
azure/gpt-4o-realtime-preview-2024-10-01
azure/gpt-4o-realtime-preview-2024-12-17
azure/gpt-4o-transcribe
azure/gpt-5
azure/gpt-5-2025-08-07
azure/gpt-5-chat
azure/gpt-5-chat-latest
azure/gpt-5-mini
azure/gpt-5-mini-2025-08-07
azure/gpt-5-nano
azure/gpt-5-nano-2025-08-07
azure/gpt-image-1
azure/hd/1024-x-1024/dall-e-3
azure/hd/1024-x-1792/dall-e-3
azure/hd/1792-x-1024/dall-e-3
azure/high/1024-x-1024/gpt-image-1
azure/high/1024-x-1536/gpt-image-1
azure/high/1536-x-1024/gpt-image-1
azure/low/1024-x-1024/gpt-image-1
azure/low/1024-x-1536/gpt-image-1
azure/low/1536-x-1024/gpt-image-1
azure/medium/1024-x-1024/gpt-image-1
azure/medium/1024-x-1536/gpt-image-1
azure/medium/1536-x-1024/gpt-image-1
azure/mistral-large-2402
azure/mistral-large-latest
azure/o1
azure/o1-2024-12-17
azure/o1-mini
azure/o1-mini-2024-09-12
azure/o1-preview
azure/o1-preview-2024-09-12
azure/o3
azure/o3-2025-04-16
azure/o3-deep-research
azure/o3-mini
azure/o3-mini-2025-01-31
azure/o3-pro
azure/o3-pro-2025-06-10
azure/o4-mini
azure/o4-mini-2025-04-16
azure/standard/1024-x-1024/dall-e-2
azure/standard/1024-x-1024/dall-e-3
azure/standard/1024-x-1792/dall-e-3
azure/standard/1792-x-1024/dall-e-3
azure/text-embedding-3-large
azure/text-embedding-3-small
azure/text-embedding-ada-002
azure/tts-1
azure/tts-1-hd
azure/us/gpt-4o-2024-08-06
azure/us/gpt-4o-2024-11-20
azure/us/gpt-4o-mini-2024-07-18
azure/us/gpt-4o-mini-realtime-preview-2024-12-17
azure/us/gpt-4o-realtime-preview-2024-10-01
azure/us/gpt-4o-realtime-preview-2024-12-17
azure/us/o1-2024-12-17
azure/us/o1-mini-2024-09-12
azure/us/o1-preview-2024-09-12
azure/us/o3-mini-2025-01-31
azure/whisper-1
azure_ai/Cohere-embed-v3-english
azure_ai/Cohere-embed-v3-multilingual
azure_ai/FLUX-1.1-pro
azure_ai/FLUX.1-Kontext-pro
azure_ai/Llama-3.2-11B-Vision-Instruct
azure_ai/Llama-3.2-90B-Vision-Instruct
azure_ai/Llama-3.3-70B-Instruct
azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8
azure_ai/Llama-4-Scout-17B-16E-Instruct
azure_ai/Meta-Llama-3-70B-Instruct
azure_ai/Meta-Llama-3.1-405B-Instruct
azure_ai/Meta-Llama-3.1-70B-Instruct
azure_ai/Meta-Llama-3.1-8B-Instruct
azure_ai/Phi-3-medium-128k-instruct
azure_ai/Phi-3-medium-4k-instruct
azure_ai/Phi-3-mini-128k-instruct
azure_ai/Phi-3-mini-4k-instruct
azure_ai/Phi-3-small-128k-instruct
azure_ai/Phi-3-small-8k-instruct
azure_ai/Phi-3.5-MoE-instruct
azure_ai/Phi-3.5-mini-instruct
azure_ai/Phi-3.5-vision-instruct
azure_ai/Phi-4
azure_ai/Phi-4-mini-instruct
azure_ai/Phi-4-multimodal-instruct
azure_ai/cohere-rerank-v3-english
azure_ai/cohere-rerank-v3-multilingual
azure_ai/cohere-rerank-v3.5
azure_ai/deepseek-r1
azure_ai/deepseek-v3
azure_ai/deepseek-v3-0324
azure_ai/embed-v-4-0
azure_ai/global/grok-3
azure_ai/global/grok-3-mini
azure_ai/grok-3
azure_ai/grok-3-mini
azure_ai/jais-30b-chat
azure_ai/jamba-instruct
azure_ai/ministral-3b
azure_ai/mistral-large
azure_ai/mistral-large-2407
azure_ai/mistral-large-latest
azure_ai/mistral-medium-2505
azure_ai/mistral-nemo
azure_ai/mistral-small
azure_ai/mistral-small-2503
babbage-002
bedrock/*/1-month-commitment/cohere.command-light-text-v14
bedrock/*/1-month-commitment/cohere.command-text-v14
bedrock/*/6-month-commitment/cohere.command-light-text-v14
bedrock/*/6-month-commitment/cohere.command-text-v14
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-instant-v1
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v1
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2
bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2:1
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-instant-v1
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v1
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2
bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2:1
bedrock/ap-northeast-1/anthropic.claude-instant-v1
bedrock/ap-northeast-1/anthropic.claude-v1
bedrock/ap-northeast-1/anthropic.claude-v2
bedrock/ap-northeast-1/anthropic.claude-v2:1
bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0
bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0
bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0
bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0
bedrock/eu-central-1/1-month-commitment/anthropic.claude-instant-v1
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v1
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2
bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2:1
bedrock/eu-central-1/6-month-commitment/anthropic.claude-instant-v1
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v1
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2
bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2:1
bedrock/eu-central-1/anthropic.claude-instant-v1
bedrock/eu-central-1/anthropic.claude-v1
bedrock/eu-central-1/anthropic.claude-v2
bedrock/eu-central-1/anthropic.claude-v2:1
bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0
bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0
bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0
bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0
bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2
bedrock/eu-west-3/mistral.mistral-large-2402-v1:0
bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1
bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0
bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0
bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0
bedrock/us-east-1/1-month-commitment/anthropic.claude-instant-v1
bedrock/us-east-1/1-month-commitment/anthropic.claude-v1
bedrock/us-east-1/1-month-commitment/anthropic.claude-v2
bedrock/us-east-1/1-month-commitment/anthropic.claude-v2:1
bedrock/us-east-1/6-month-commitment/anthropic.claude-instant-v1
bedrock/us-east-1/6-month-commitment/anthropic.claude-v1
bedrock/us-east-1/6-month-commitment/anthropic.claude-v2
bedrock/us-east-1/6-month-commitment/anthropic.claude-v2:1
bedrock/us-east-1/anthropic.claude-instant-v1
bedrock/us-east-1/anthropic.claude-v1
bedrock/us-east-1/anthropic.claude-v2
bedrock/us-east-1/anthropic.claude-v2:1
bedrock/us-east-1/meta.llama3-70b-instruct-v1:0
bedrock/us-east-1/meta.llama3-8b-instruct-v1:0
bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2
bedrock/us-east-1/mistral.mistral-large-2402-v1:0
bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1
bedrock/us-gov-east-1/amazon.nova-pro-v1:0
bedrock/us-gov-east-1/amazon.titan-embed-text-v1
bedrock/us-gov-east-1/amazon.titan-embed-text-v2:0
bedrock/us-gov-east-1/amazon.titan-text-express-v1
bedrock/us-gov-east-1/amazon.titan-text-lite-v1
bedrock/us-gov-east-1/amazon.titan-text-premier-v1:0
bedrock/us-gov-east-1/anthropic.claude-3-5-sonnet-20240620-v1:0
bedrock/us-gov-east-1/anthropic.claude-3-haiku-20240307-v1:0
bedrock/us-gov-east-1/meta.llama3-70b-instruct-v1:0
bedrock/us-gov-east-1/meta.llama3-8b-instruct-v1:0
bedrock/us-gov-west-1/amazon.nova-pro-v1:0
bedrock/us-gov-west-1/amazon.titan-embed-text-v1
bedrock/us-gov-west-1/amazon.titan-embed-text-v2:0
bedrock/us-gov-west-1/amazon.titan-text-express-v1
bedrock/us-gov-west-1/amazon.titan-text-lite-v1
bedrock/us-gov-west-1/amazon.titan-text-premier-v1:0
bedrock/us-gov-west-1/anthropic.claude-3-5-sonnet-20240620-v1:0
bedrock/us-gov-west-1/anthropic.claude-3-haiku-20240307-v1:0
bedrock/us-gov-west-1/meta.llama3-70b-instruct-v1:0
bedrock/us-gov-west-1/meta.llama3-8b-instruct-v1:0
bedrock/us-west-1/meta.llama3-70b-instruct-v1:0
bedrock/us-west-1/meta.llama3-8b-instruct-v1:0
bedrock/us-west-2/1-month-commitment/anthropic.claude-instant-v1
bedrock/us-west-2/1-month-commitment/anthropic.claude-v1
bedrock/us-west-2/1-month-commitment/anthropic.claude-v2
bedrock/us-west-2/1-month-commitment/anthropic.claude-v2:1
bedrock/us-west-2/6-month-commitment/anthropic.claude-instant-v1
bedrock/us-west-2/6-month-commitment/anthropic.claude-v1
bedrock/us-west-2/6-month-commitment/anthropic.claude-v2
bedrock/us-west-2/6-month-commitment/anthropic.claude-v2:1
bedrock/us-west-2/anthropic.claude-instant-v1
bedrock/us-west-2/anthropic.claude-v1
bedrock/us-west-2/anthropic.claude-v2
bedrock/us-west-2/anthropic.claude-v2:1
bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2
bedrock/us-west-2/mistral.mistral-large-2402-v1:0
bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1
cerebras/llama-3.3-70b
cerebras/llama3.1-70b
cerebras/llama3.1-8b
cerebras/openai/gpt-oss-120b
cerebras/openai/gpt-oss-20b
cerebras/qwen-3-32b
chat-bison
chat-bison-32k
chat-bison-32k@002
chat-bison@001
chat-bison@002
chatdolphin
chatgpt-4o-latest
claude-3-5-haiku-20241022
claude-3-5-haiku-latest
claude-3-5-sonnet-20240620
claude-3-5-sonnet-20241022
claude-3-5-sonnet-latest
claude-3-7-sonnet-20250219
claude-3-7-sonnet-latest
claude-3-haiku-20240307
claude-3-opus-20240229
claude-3-opus-latest
claude-4-opus-20250514
claude-4-sonnet-20250514
claude-opus-4-1
claude-opus-4-1-20250805
claude-opus-4-20250514
claude-sonnet-4-20250514
cloudflare/@cf/meta/llama-2-7b-chat-fp16
cloudflare/@cf/meta/llama-2-7b-chat-int8
cloudflare/@cf/mistral/mistral-7b-instruct-v0.1
cloudflare/@hf/thebloke/codellama-7b-instruct-awq
code-bison
code-bison-32k@002
code-bison32k
code-bison@001
code-bison@002
code-gecko
code-gecko-latest
code-gecko@001
code-gecko@002
codechat-bison
codechat-bison-32k
codechat-bison-32k@002
codechat-bison@001
codechat-bison@002
codechat-bison@latest
codestral/codestral-2405
codestral/codestral-latest
codex-mini-latest
cohere.command-light-text-v14
cohere.command-r-plus-v1:0
cohere.command-r-v1:0
cohere.command-text-v14
cohere.embed-english-v3
cohere.embed-multilingual-v3
cohere.rerank-v3-5:0
command
command-a-03-2025
command-light
command-nightly
command-r
command-r-08-2024
command-r-plus
command-r-plus-08-2024
command-r7b-12-2024
computer-use-preview
dashscope/qwen-max
dashscope/qwen-plus-latest
dashscope/qwen-turbo-latest
dashscope/qwen3-30b-a3b
databricks/databricks-bge-large-en
databricks/databricks-claude-3-7-sonnet
databricks/databricks-gte-large-en
databricks/databricks-llama-2-70b-chat
databricks/databricks-llama-4-maverick
databricks/databricks-meta-llama-3-1-405b-instruct
databricks/databricks-meta-llama-3-3-70b-instruct
databricks/databricks-meta-llama-3-70b-instruct
databricks/databricks-mixtral-8x7b-instruct
databricks/databricks-mpt-30b-instruct
databricks/databricks-mpt-7b-instruct
davinci-002
deepgram/base
deepgram/base-conversationalai
deepgram/base-finance
deepgram/base-general
deepgram/base-meeting
deepgram/base-phonecall
deepgram/base-video
deepgram/base-voicemail
deepgram/enhanced
deepgram/enhanced-finance
deepgram/enhanced-general
deepgram/enhanced-meeting
deepgram/enhanced-phonecall
deepgram/nova
deepgram/nova-2
deepgram/nova-2-atc
deepgram/nova-2-automotive
deepgram/nova-2-conversationalai
deepgram/nova-2-drivethru
deepgram/nova-2-finance
deepgram/nova-2-general
deepgram/nova-2-meeting
deepgram/nova-2-phonecall
deepgram/nova-2-video
deepgram/nova-2-voicemail
deepgram/nova-3
deepgram/nova-3-general
deepgram/nova-3-medical
deepgram/nova-general
deepgram/nova-phonecall
deepgram/whisper
deepgram/whisper-base
deepgram/whisper-large
deepgram/whisper-medium
deepgram/whisper-small
deepgram/whisper-tiny
deepinfra/Austism/chronos-hermes-13b-v2
deepinfra/Gryphe/MythoMax-L2-13b
deepinfra/Gryphe/MythoMax-L2-13b-turbo
deepinfra/KoboldAI/LLaMA2-13B-Tiefighter
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
deepinfra/NovaSky-AI/Sky-T1-32B-Preview
deepinfra/Phind/Phind-CodeLlama-34B-v2
deepinfra/Qwen/QVQ-72B-Preview
deepinfra/Qwen/QwQ-32B
deepinfra/Qwen/QwQ-32B-Preview
deepinfra/Qwen/Qwen2-72B-Instruct
deepinfra/Qwen/Qwen2-7B-Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
deepinfra/Qwen/Qwen2.5-7B-Instruct
deepinfra/Qwen/Qwen2.5-Coder-32B-Instruct
deepinfra/Qwen/Qwen2.5-Coder-7B
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
deepinfra/Qwen/Qwen3-14B
deepinfra/Qwen/Qwen3-235B-A22B
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
deepinfra/Qwen/Qwen3-30B-A3B
deepinfra/Qwen/Qwen3-32B
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
deepinfra/Sao10K/L3-70B-Euryale-v2.1
deepinfra/Sao10K/L3-8B-Lunaris-v1
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
deepinfra/allenai/olmOCR-7B-0725-FP8
deepinfra/anthropic/claude-3-7-sonnet-latest
deepinfra/anthropic/claude-4-opus
deepinfra/anthropic/claude-4-sonnet
deepinfra/bigcode/starcoder2-15b-instruct-v0.1
deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b
deepinfra/cognitivecomputations/dolphin-2.9.1-llama-3-70b
deepinfra/deepinfra/airoboros-70b
deepinfra/deepseek-ai/DeepSeek-Prover-V2-671B
deepinfra/deepseek-ai/DeepSeek-R1
deepinfra/deepseek-ai/DeepSeek-R1-0528
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
deepinfra/deepseek-ai/DeepSeek-V3
deepinfra/deepseek-ai/DeepSeek-V3-0324
deepinfra/deepseek-ai/DeepSeek-V3-0324-Turbo
deepinfra/deepseek-ai/DeepSeek-V3.1
deepinfra/google/codegemma-7b-it
deepinfra/google/gemini-1.5-flash
deepinfra/google/gemini-1.5-flash-8b
deepinfra/google/gemini-2.0-flash-001
deepinfra/google/gemini-2.5-flash
deepinfra/google/gemini-2.5-pro
deepinfra/google/gemma-1.1-7b-it
deepinfra/google/gemma-2-27b-it
deepinfra/google/gemma-2-9b-it
deepinfra/google/gemma-3-12b-it
deepinfra/google/gemma-3-27b-it
deepinfra/google/gemma-3-4b-it
deepinfra/lizpreciatior/lzlv_70b_fp16_hf
deepinfra/mattshumer/Reflection-Llama-3.1-70B
deepinfra/meta-llama/Llama-2-13b-chat-hf
deepinfra/meta-llama/Llama-2-70b-chat-hf
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
deepinfra/meta-llama/Llama-3.2-1B-Instruct
deepinfra/meta-llama/Llama-3.2-3B-Instruct
deepinfra/meta-llama/Llama-3.2-90B-Vision-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-Turbo
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
deepinfra/meta-llama/Llama-Guard-3-8B
deepinfra/meta-llama/Llama-Guard-4-12B
deepinfra/meta-llama/Meta-Llama-3-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
deepinfra/microsoft/Phi-3-medium-4k-instruct
deepinfra/microsoft/Phi-4-multimodal-instruct
deepinfra/microsoft/WizardLM-2-7B
deepinfra/microsoft/WizardLM-2-8x22B
deepinfra/microsoft/phi-4
deepinfra/microsoft/phi-4-reasoning-plus
deepinfra/mistralai/Devstral-Small-2505
deepinfra/mistralai/Devstral-Small-2507
deepinfra/mistralai/Mistral-7B-Instruct-v0.1
deepinfra/mistralai/Mistral-7B-Instruct-v0.2
deepinfra/mistralai/Mistral-7B-Instruct-v0.3
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
deepinfra/mistralai/Mistral-Small-3.1-24B-Instruct-2503
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
deepinfra/mistralai/Mixtral-8x22B-Instruct-v0.1
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
deepinfra/moonshotai/Kimi-K2-Instruct
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
deepinfra/nvidia/Nemotron-4-340B-Instruct
deepinfra/openai/gpt-oss-120b
deepinfra/openai/gpt-oss-20b
deepinfra/openbmb/MiniCPM-Llama3-V-2_5
deepinfra/openchat/openchat-3.6-8b
deepinfra/openchat/openchat_3.5
deepinfra/zai-org/GLM-4.5
deepinfra/zai-org/GLM-4.5-Air
deepseek/deepseek-chat
deepseek/deepseek-coder
deepseek/deepseek-r1
deepseek/deepseek-reasoner
deepseek/deepseek-v3
dolphin
elevenlabs/scribe_v1
elevenlabs/scribe_v1_experimental
embed-english-light-v2.0
embed-english-light-v3.0
embed-english-v2.0
embed-english-v3.0
embed-multilingual-v2.0
embed-multilingual-v3.0
eu.amazon.nova-lite-v1:0
eu.amazon.nova-micro-v1:0
eu.amazon.nova-pro-v1:0
eu.anthropic.claude-3-5-haiku-20241022-v1:0
eu.anthropic.claude-3-5-sonnet-20240620-v1:0
eu.anthropic.claude-3-5-sonnet-20241022-v2:0
eu.anthropic.claude-3-7-sonnet-20250219-v1:0
eu.anthropic.claude-3-haiku-20240307-v1:0
eu.anthropic.claude-3-opus-20240229-v1:0
eu.anthropic.claude-3-sonnet-20240229-v1:0
eu.anthropic.claude-opus-4-1-20250805-v1:0
eu.anthropic.claude-opus-4-20250514-v1:0
eu.anthropic.claude-sonnet-4-20250514-v1:0
eu.meta.llama3-2-1b-instruct-v1:0
eu.meta.llama3-2-3b-instruct-v1:0
eu.mistral.pixtral-large-2502-v1:0
featherless_ai/featherless-ai/Qwerky-72B
featherless_ai/featherless-ai/Qwerky-QwQ-32B
fireworks-ai-4.1b-to-16b
fireworks-ai-56b-to-176b
fireworks-ai-above-16b
fireworks-ai-default
fireworks-ai-embedding-150m-to-350m
fireworks-ai-embedding-up-to-150m
fireworks-ai-moe-up-to-56b
fireworks-ai-up-to-4b
fireworks_ai/WhereIsAI/UAE-Large-V1
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct
fireworks_ai/accounts/fireworks/models/deepseek-r1
fireworks_ai/accounts/fireworks/models/deepseek-r1-0528
fireworks_ai/accounts/fireworks/models/deepseek-r1-basic
fireworks_ai/accounts/fireworks/models/deepseek-v3
fireworks_ai/accounts/fireworks/models/deepseek-v3-0324
fireworks_ai/accounts/fireworks/models/deepseek-v3p1
fireworks_ai/accounts/fireworks/models/firefunction-v2
fireworks_ai/accounts/fireworks/models/glm-4p5
fireworks_ai/accounts/fireworks/models/glm-4p5-air
fireworks_ai/accounts/fireworks/models/gpt-oss-120b
fireworks_ai/accounts/fireworks/models/gpt-oss-20b
fireworks_ai/accounts/fireworks/models/kimi-k2-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct
fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic
fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf
fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct
fireworks_ai/accounts/fireworks/models/yi-large
fireworks_ai/nomic-ai/nomic-embed-text-v1
fireworks_ai/nomic-ai/nomic-embed-text-v1.5
fireworks_ai/thenlper/gte-base
fireworks_ai/thenlper/gte-large
friendliai/meta-llama-3.1-70b-instruct
friendliai/meta-llama-3.1-8b-instruct
ft:babbage-002
ft:davinci-002
ft:gpt-3.5-turbo
ft:gpt-3.5-turbo-0125
ft:gpt-3.5-turbo-0613
ft:gpt-3.5-turbo-1106
ft:gpt-4-0613
ft:gpt-4o-2024-08-06
ft:gpt-4o-2024-11-20
ft:gpt-4o-mini-2024-07-18
gemini-1.0-pro
gemini-1.0-pro-001
gemini-1.0-pro-002
gemini-1.0-pro-vision
gemini-1.0-pro-vision-001
gemini-1.0-ultra
gemini-1.0-ultra-001
gemini-1.5-flash
gemini-1.5-flash-001
gemini-1.5-flash-002
gemini-1.5-flash-exp-0827
gemini-1.5-flash-preview-0514
gemini-1.5-pro
gemini-1.5-pro-001
gemini-1.5-pro-002
gemini-1.5-pro-preview-0215
gemini-1.5-pro-preview-0409
gemini-1.5-pro-preview-0514
gemini-2.0-flash
gemini-2.0-flash-001
gemini-2.0-flash-exp
gemini-2.0-flash-lite
gemini-2.0-flash-lite-001
gemini-2.0-flash-live-preview-04-09
gemini-2.0-flash-preview-image-generation
gemini-2.0-flash-thinking-exp
gemini-2.0-flash-thinking-exp-01-21
gemini-2.0-pro-exp-02-05
gemini-2.5-flash
gemini-2.5-flash-image-preview
gemini-2.5-flash-lite
gemini-2.5-flash-lite-preview-06-17
gemini-2.5-flash-preview-04-17
gemini-2.5-flash-preview-05-20
gemini-2.5-pro
gemini-2.5-pro-exp-03-25
gemini-2.5-pro-preview-03-25
gemini-2.5-pro-preview-05-06
gemini-2.5-pro-preview-06-05
gemini-2.5-pro-preview-tts
gemini-embedding-001
gemini-flash-experimental
gemini-pro
gemini-pro-experimental
gemini-pro-vision
gemini/gemini-1.5-flash
gemini/gemini-1.5-flash-001
gemini/gemini-1.5-flash-002
gemini/gemini-1.5-flash-8b
gemini/gemini-1.5-flash-8b-exp-0827
gemini/gemini-1.5-flash-8b-exp-0924
gemini/gemini-1.5-flash-exp-0827
gemini/gemini-1.5-flash-latest
gemini/gemini-1.5-pro
gemini/gemini-1.5-pro-001
gemini/gemini-1.5-pro-002
gemini/gemini-1.5-pro-exp-0801
gemini/gemini-1.5-pro-exp-0827
gemini/gemini-1.5-pro-latest
gemini/gemini-2.0-flash
gemini/gemini-2.0-flash-001
gemini/gemini-2.0-flash-exp
gemini/gemini-2.0-flash-lite
gemini/gemini-2.0-flash-lite-preview-02-05
gemini/gemini-2.0-flash-live-001
gemini/gemini-2.0-flash-preview-image-generation
gemini/gemini-2.0-flash-thinking-exp
gemini/gemini-2.0-flash-thinking-exp-01-21
gemini/gemini-2.0-pro-exp-02-05
gemini/gemini-2.5-flash
gemini/gemini-2.5-flash-image-preview
gemini/gemini-2.5-flash-lite
gemini/gemini-2.5-flash-lite-preview-06-17
gemini/gemini-2.5-flash-preview-04-17
gemini/gemini-2.5-flash-preview-05-20
gemini/gemini-2.5-flash-preview-tts
gemini/gemini-2.5-pro
gemini/gemini-2.5-pro-exp-03-25
gemini/gemini-2.5-pro-preview-03-25
gemini/gemini-2.5-pro-preview-05-06
gemini/gemini-2.5-pro-preview-06-05
gemini/gemini-2.5-pro-preview-tts
gemini/gemini-exp-1114
gemini/gemini-exp-1206
gemini/gemini-gemma-2-27b-it
gemini/gemini-gemma-2-9b-it
gemini/gemini-pro
gemini/gemini-pro-vision
gemini/gemma-3-27b-it
gemini/imagen-3.0-fast-generate-001
gemini/imagen-3.0-generate-001
gemini/imagen-3.0-generate-002
gemini/imagen-4.0-fast-generate-001
gemini/imagen-4.0-generate-001
gemini/imagen-4.0-ultra-generate-001
gemini/learnlm-1.5-pro-experimental
gpt-3.5-turbo
gpt-3.5-turbo-0125
gpt-3.5-turbo-0301
gpt-3.5-turbo-0613
gpt-3.5-turbo-1106
gpt-3.5-turbo-16k
gpt-3.5-turbo-16k-0613
gpt-3.5-turbo-instruct
gpt-3.5-turbo-instruct-0914
gpt-4
gpt-4-0125-preview
gpt-4-0314
gpt-4-0613
gpt-4-1106-preview
gpt-4-1106-vision-preview
gpt-4-32k
gpt-4-32k-0314
gpt-4-32k-0613
gpt-4-turbo
gpt-4-turbo-2024-04-09
gpt-4-turbo-preview
gpt-4-vision-preview
gpt-4.1
gpt-4.1-2025-04-14
gpt-4.1-mini
gpt-4.1-mini-2025-04-14
gpt-4.1-nano
gpt-4.1-nano-2025-04-14
gpt-4.5-preview
gpt-4.5-preview-2025-02-27
gpt-4o
gpt-4o-2024-05-13
gpt-4o-2024-08-06
gpt-4o-2024-11-20
gpt-4o-audio-preview
gpt-4o-audio-preview-2024-10-01
gpt-4o-audio-preview-2024-12-17
gpt-4o-audio-preview-2025-06-03
gpt-4o-mini
gpt-4o-mini-2024-07-18
gpt-4o-mini-audio-preview
gpt-4o-mini-audio-preview-2024-12-17
gpt-4o-mini-realtime-preview
gpt-4o-mini-realtime-preview-2024-12-17
gpt-4o-mini-search-preview
gpt-4o-mini-search-preview-2025-03-11
gpt-4o-mini-transcribe
gpt-4o-mini-tts
gpt-4o-realtime-preview
gpt-4o-realtime-preview-2024-10-01
gpt-4o-realtime-preview-2024-12-17
gpt-4o-realtime-preview-2025-06-03
gpt-4o-search-preview
gpt-4o-search-preview-2025-03-11
gpt-4o-transcribe
gpt-5
gpt-5-2025-08-07
gpt-5-chat
gpt-5-chat-latest
gpt-5-mini
gpt-5-mini-2025-08-07
gpt-5-nano
gpt-5-nano-2025-08-07
gpt-image-1
gradient_ai/alibaba-qwen3-32b
gradient_ai/anthropic-claude-3-opus
gradient_ai/anthropic-claude-3.5-haiku
gradient_ai/anthropic-claude-3.5-sonnet
gradient_ai/anthropic-claude-3.7-sonnet
gradient_ai/deepseek-r1-distill-llama-70b
gradient_ai/llama3-8b-instruct
gradient_ai/llama3.3-70b-instruct
gradient_ai/mistral-nemo-instruct-2407
gradient_ai/openai-gpt-4o
gradient_ai/openai-gpt-4o-mini
gradient_ai/openai-o3
gradient_ai/openai-o3-mini
groq/deepseek-r1-distill-llama-70b
groq/distil-whisper-large-v3-en
groq/gemma-7b-it
groq/gemma2-9b-it
groq/llama-3.1-405b-reasoning
groq/llama-3.1-70b-versatile
groq/llama-3.1-8b-instant
groq/llama-3.2-11b-text-preview
groq/llama-3.2-11b-vision-preview
groq/llama-3.2-1b-preview
groq/llama-3.2-3b-preview
groq/llama-3.2-90b-text-preview
groq/llama-3.2-90b-vision-preview
groq/llama-3.3-70b-specdec
groq/llama-3.3-70b-versatile
groq/llama-guard-3-8b
groq/llama2-70b-4096
groq/llama3-70b-8192
groq/llama3-8b-8192
groq/llama3-groq-70b-8192-tool-use-preview
groq/llama3-groq-8b-8192-tool-use-preview
groq/meta-llama/llama-4-maverick-17b-128e-instruct
groq/meta-llama/llama-4-scout-17b-16e-instruct
groq/mistral-saba-24b
groq/mixtral-8x7b-32768
groq/moonshotai/kimi-k2-instruct
groq/openai/gpt-oss-120b
groq/openai/gpt-oss-20b
groq/playai-tts
groq/qwen/qwen3-32b
groq/whisper-large-v3
groq/whisper-large-v3-turbo
hd/1024-x-1024/dall-e-3
hd/1024-x-1792/dall-e-3
hd/1792-x-1024/dall-e-3
high/1024-x-1024/gpt-image-1
high/1024-x-1536/gpt-image-1
high/1536-x-1024/gpt-image-1
hyperbolic/NousResearch/Hermes-3-Llama-3.1-70B
hyperbolic/Qwen/QwQ-32B
hyperbolic/Qwen/Qwen2.5-72B-Instruct
hyperbolic/Qwen/Qwen2.5-Coder-32B-Instruct
hyperbolic/Qwen/Qwen3-235B-A22B
hyperbolic/deepseek-ai/DeepSeek-R1
hyperbolic/deepseek-ai/DeepSeek-R1-0528
hyperbolic/deepseek-ai/DeepSeek-V3
hyperbolic/deepseek-ai/DeepSeek-V3-0324
hyperbolic/meta-llama/Llama-3.2-3B-Instruct
hyperbolic/meta-llama/Llama-3.3-70B-Instruct
hyperbolic/meta-llama/Meta-Llama-3-70B-Instruct
hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct
hyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct
hyperbolic/moonshotai/Kimi-K2-Instruct
j2-light
j2-mid
j2-ultra
jamba-1.5
jamba-1.5-large
jamba-1.5-large@001
jamba-1.5-mini
jamba-1.5-mini@001
jamba-large-1.6
jamba-large-1.7
jamba-mini-1.6
jamba-mini-1.7
jina-reranker-v2-base-multilingual
lambda_ai/deepseek-llama3.3-70b
lambda_ai/deepseek-r1-0528
lambda_ai/deepseek-r1-671b
lambda_ai/deepseek-v3-0324
lambda_ai/hermes3-405b
lambda_ai/hermes3-70b
lambda_ai/hermes3-8b
lambda_ai/lfm-40b
lambda_ai/lfm-7b
lambda_ai/llama-4-maverick-17b-128e-instruct-fp8
lambda_ai/llama-4-scout-17b-16e-instruct
lambda_ai/llama3.1-405b-instruct-fp8
lambda_ai/llama3.1-70b-instruct-fp8
lambda_ai/llama3.1-8b-instruct
lambda_ai/llama3.1-nemotron-70b-instruct-fp8
lambda_ai/llama3.2-11b-vision-instruct
lambda_ai/llama3.2-3b-instruct
lambda_ai/llama3.3-70b-instruct-fp8
lambda_ai/qwen25-coder-32b-instruct
lambda_ai/qwen3-32b-fp8
low/1024-x-1024/gpt-image-1
low/1024-x-1536/gpt-image-1
low/1536-x-1024/gpt-image-1
luminous-base
luminous-base-control
luminous-extended
luminous-extended-control
luminous-supreme
luminous-supreme-control
max-x-max/50-steps/stability.stable-diffusion-xl-v0
max-x-max/max-steps/stability.stable-diffusion-xl-v0
medium/1024-x-1024/gpt-image-1
medium/1024-x-1536/gpt-image-1
medium/1536-x-1024/gpt-image-1
medlm-large
medlm-medium
meta.llama2-13b-chat-v1
meta.llama2-70b-chat-v1
meta.llama3-1-405b-instruct-v1:0
meta.llama3-1-70b-instruct-v1:0
meta.llama3-1-8b-instruct-v1:0
meta.llama3-2-11b-instruct-v1:0
meta.llama3-2-1b-instruct-v1:0
meta.llama3-2-3b-instruct-v1:0
meta.llama3-2-90b-instruct-v1:0
meta.llama3-3-70b-instruct-v1:0
meta.llama3-70b-instruct-v1:0
meta.llama3-8b-instruct-v1:0
meta.llama4-maverick-17b-instruct-v1:0
meta.llama4-scout-17b-instruct-v1:0
meta_llama/Llama-3.3-70B-Instruct
meta_llama/Llama-3.3-8B-Instruct
meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8
meta_llama/Llama-4-Scout-17B-16E-Instruct-FP8
mistral.mistral-7b-instruct-v0:2
mistral.mistral-large-2402-v1:0
mistral.mistral-large-2407-v1:0
mistral.mistral-small-2402-v1:0
mistral.mixtral-8x7b-instruct-v0:1
mistral/codestral-2405
mistral/codestral-latest
mistral/codestral-mamba-latest
mistral/devstral-medium-2507
mistral/devstral-small-2505
mistral/devstral-small-2507
mistral/magistral-medium-2506
mistral/magistral-medium-latest
mistral/magistral-small-2506
mistral/magistral-small-latest
mistral/mistral-embed
mistral/mistral-large-2402
mistral/mistral-large-2407
mistral/mistral-large-2411
mistral/mistral-large-latest
mistral/mistral-medium
mistral/mistral-medium-2312
mistral/mistral-medium-2505
mistral/mistral-medium-latest
mistral/mistral-small
mistral/mistral-small-latest
mistral/mistral-tiny
mistral/open-codestral-mamba
mistral/open-mistral-7b
mistral/open-mistral-nemo
mistral/open-mistral-nemo-2407
mistral/open-mixtral-8x22b
mistral/open-mixtral-8x7b
mistral/pixtral-12b-2409
mistral/pixtral-large-2411
mistral/pixtral-large-latest
moonshot/kimi-k2-0711-preview
moonshot/kimi-latest
moonshot/kimi-latest-128k
moonshot/kimi-latest-32k
moonshot/kimi-latest-8k
moonshot/kimi-thinking-preview
moonshot/moonshot-v1-128k
moonshot/moonshot-v1-128k-0430
moonshot/moonshot-v1-128k-vision-preview
moonshot/moonshot-v1-32k
moonshot/moonshot-v1-32k-0430
moonshot/moonshot-v1-32k-vision-preview
moonshot/moonshot-v1-8k
moonshot/moonshot-v1-8k-0430
moonshot/moonshot-v1-8k-vision-preview
moonshot/moonshot-v1-auto
morph/morph-v3-fast
morph/morph-v3-large
multimodalembedding
multimodalembedding@001
nscale/Qwen/QwQ-32B
nscale/Qwen/Qwen2.5-Coder-32B-Instruct
nscale/Qwen/Qwen2.5-Coder-3B-Instruct
nscale/Qwen/Qwen2.5-Coder-7B-Instruct
nscale/black-forest-labs/FLUX.1-schnell
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
nscale/meta-llama/Llama-3.1-8B-Instruct
nscale/meta-llama/Llama-3.3-70B-Instruct
nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct
nscale/mistralai/mixtral-8x22b-instruct-v0.1
nscale/stabilityai/stable-diffusion-xl-base-1.0
o1
o1-2024-12-17
o1-mini
o1-mini-2024-09-12
o1-preview
o1-preview-2024-09-12
o1-pro
o1-pro-2025-03-19
o3
o3-2025-04-16
o3-deep-research
o3-deep-research-2025-06-26
o3-mini
o3-mini-2025-01-31
o3-pro
o3-pro-2025-06-10
o4-mini
o4-mini-2025-04-16
o4-mini-deep-research
o4-mini-deep-research-2025-06-26
oci/meta.llama-3.1-405b-instruct
oci/meta.llama-3.2-90b-vision-instruct
oci/meta.llama-3.3-70b-instruct
oci/meta.llama-4-maverick-17b-128e-instruct-fp8
oci/meta.llama-4-scout-17b-16e-instruct
oci/xai.grok-3
oci/xai.grok-3-fast
oci/xai.grok-3-mini
oci/xai.grok-3-mini-fast
oci/xai.grok-4
ollama/codegeex4
ollama/codegemma
ollama/codellama
ollama/deepseek-coder-v2-base
ollama/deepseek-coder-v2-instruct
ollama/deepseek-coder-v2-lite-base
ollama/deepseek-coder-v2-lite-instruct
ollama/internlm2_5-20b-chat
ollama/llama2
ollama/llama2-uncensored
ollama/llama2:13b
ollama/llama2:70b
ollama/llama2:7b
ollama/llama3
ollama/llama3.1
ollama/llama3:70b
ollama/llama3:8b
ollama/mistral
ollama/mistral-7B-Instruct-v0.1
ollama/mistral-7B-Instruct-v0.2
ollama/mistral-large-instruct-2407
ollama/mixtral-8x22B-Instruct-v0.1
ollama/mixtral-8x7B-Instruct-v0.1
ollama/orca-mini
ollama/vicuna
omni-moderation-2024-09-26
omni-moderation-latest
omni-moderation-latest-intents
openai.gpt-oss-120b-1:0
openai.gpt-oss-20b-1:0
openrouter/anthropic/claude-2
openrouter/anthropic/claude-3-5-haiku
openrouter/anthropic/claude-3-5-haiku-20241022
openrouter/anthropic/claude-3-haiku
openrouter/anthropic/claude-3-haiku-20240307
openrouter/anthropic/claude-3-opus
openrouter/anthropic/claude-3-sonnet
openrouter/anthropic/claude-3.5-sonnet
openrouter/anthropic/claude-3.5-sonnet:beta
openrouter/anthropic/claude-3.7-sonnet
openrouter/anthropic/claude-3.7-sonnet:beta
openrouter/anthropic/claude-instant-v1
openrouter/anthropic/claude-opus-4
openrouter/anthropic/claude-opus-4.1
openrouter/anthropic/claude-sonnet-4
openrouter/bytedance/ui-tars-1.5-7b
openrouter/cognitivecomputations/dolphin-mixtral-8x7b
openrouter/cohere/command-r-plus
openrouter/databricks/dbrx-instruct
openrouter/deepseek/deepseek-chat
openrouter/deepseek/deepseek-chat-v3-0324
openrouter/deepseek/deepseek-chat-v3.1
openrouter/deepseek/deepseek-coder
openrouter/deepseek/deepseek-r1
openrouter/deepseek/deepseek-r1-0528
openrouter/fireworks/firellava-13b
openrouter/google/gemini-2.0-flash-001
openrouter/google/gemini-2.5-flash
openrouter/google/gemini-2.5-pro
openrouter/google/gemini-pro-1.5
openrouter/google/gemini-pro-vision
openrouter/google/palm-2-chat-bison
openrouter/google/palm-2-codechat-bison
openrouter/gryphe/mythomax-l2-13b
openrouter/jondurbin/airoboros-l2-70b-2.1
openrouter/mancer/weaver
openrouter/meta-llama/codellama-34b-instruct
openrouter/meta-llama/llama-2-13b-chat
openrouter/meta-llama/llama-2-70b-chat
openrouter/meta-llama/llama-3-70b-instruct
openrouter/meta-llama/llama-3-70b-instruct:nitro
openrouter/meta-llama/llama-3-8b-instruct:extended
openrouter/meta-llama/llama-3-8b-instruct:free
openrouter/microsoft/wizardlm-2-8x22b:nitro
openrouter/mistralai/mistral-7b-instruct
openrouter/mistralai/mistral-7b-instruct:free
openrouter/mistralai/mistral-large
openrouter/mistralai/mistral-small-3.1-24b-instruct
openrouter/mistralai/mistral-small-3.2-24b-instruct
openrouter/mistralai/mixtral-8x22b-instruct
openrouter/nousresearch/nous-hermes-llama2-13b
openrouter/openai/gpt-3.5-turbo
openrouter/openai/gpt-3.5-turbo-16k
openrouter/openai/gpt-4
openrouter/openai/gpt-4-vision-preview
openrouter/openai/gpt-4o
openrouter/openai/gpt-4o-2024-05-13
openrouter/openai/gpt-5-chat
openrouter/openai/gpt-5-mini
openrouter/openai/gpt-5-nano
openrouter/openai/gpt-oss-120b
openrouter/openai/gpt-oss-20b
openrouter/openai/o1
openrouter/openai/o1-mini
openrouter/openai/o1-mini-2024-09-12
openrouter/openai/o1-preview
openrouter/openai/o1-preview-2024-09-12
openrouter/openai/o3-mini
openrouter/openai/o3-mini-high
openrouter/pygmalionai/mythalion-13b
openrouter/qwen/qwen-2.5-coder-32b-instruct
openrouter/qwen/qwen-vl-plus
openrouter/qwen/qwen3-coder
openrouter/switchpoint/router
openrouter/undi95/remm-slerp-l2-13b
openrouter/x-ai/grok-4
palm/chat-bison
palm/chat-bison-001
palm/text-bison
palm/text-bison-001
palm/text-bison-safety-off
palm/text-bison-safety-recitation-off
perplexity/codellama-34b-instruct
perplexity/codellama-70b-instruct
perplexity/llama-2-70b-chat
perplexity/llama-3.1-70b-instruct
perplexity/llama-3.1-8b-instruct
perplexity/llama-3.1-sonar-huge-128k-online
perplexity/llama-3.1-sonar-large-128k-chat
perplexity/llama-3.1-sonar-large-128k-online
perplexity/llama-3.1-sonar-small-128k-chat
perplexity/llama-3.1-sonar-small-128k-online
perplexity/mistral-7b-instruct
perplexity/mixtral-8x7b-instruct
perplexity/pplx-70b-chat
perplexity/pplx-70b-online
perplexity/pplx-7b-chat
perplexity/pplx-7b-online
perplexity/sonar
perplexity/sonar-deep-research
perplexity/sonar-medium-chat
perplexity/sonar-medium-online
perplexity/sonar-pro
perplexity/sonar-reasoning
perplexity/sonar-reasoning-pro
perplexity/sonar-small-chat
perplexity/sonar-small-online
recraft/recraftv2
recraft/recraftv3
replicate/meta/llama-2-13b
replicate/meta/llama-2-13b-chat
replicate/meta/llama-2-70b
replicate/meta/llama-2-70b-chat
replicate/meta/llama-2-7b
replicate/meta/llama-2-7b-chat
replicate/meta/llama-3-70b
replicate/meta/llama-3-70b-instruct
replicate/meta/llama-3-8b
replicate/meta/llama-3-8b-instruct
replicate/mistralai/mistral-7b-instruct-v0.2
replicate/mistralai/mistral-7b-v0.1
replicate/mistralai/mixtral-8x7b-instruct-v0.1
rerank-english-v2.0
rerank-english-v3.0
rerank-multilingual-v2.0
rerank-multilingual-v3.0
rerank-v3.5
sagemaker/meta-textgeneration-llama-2-13b
sagemaker/meta-textgeneration-llama-2-13b-f
sagemaker/meta-textgeneration-llama-2-70b
sagemaker/meta-textgeneration-llama-2-70b-b-f
sagemaker/meta-textgeneration-llama-2-7b
sagemaker/meta-textgeneration-llama-2-7b-f
sambanova/DeepSeek-R1
sambanova/DeepSeek-R1-Distill-Llama-70B
sambanova/DeepSeek-V3-0324
sambanova/Llama-4-Maverick-17B-128E-Instruct
sambanova/Llama-4-Scout-17B-16E-Instruct
sambanova/Meta-Llama-3.1-405B-Instruct
sambanova/Meta-Llama-3.1-8B-Instruct
sambanova/Meta-Llama-3.2-1B-Instruct
sambanova/Meta-Llama-3.2-3B-Instruct
sambanova/Meta-Llama-3.3-70B-Instruct
sambanova/Meta-Llama-Guard-3-8B
sambanova/QwQ-32B
sambanova/Qwen2-Audio-7B-Instruct
sambanova/Qwen3-32B
sample_spec
snowflake/claude-3-5-sonnet
snowflake/deepseek-r1
snowflake/gemma-7b
snowflake/jamba-1.5-large
snowflake/jamba-1.5-mini
snowflake/jamba-instruct
snowflake/llama2-70b-chat
snowflake/llama3-70b
snowflake/llama3-8b
snowflake/llama3.1-405b
snowflake/llama3.1-70b
snowflake/llama3.1-8b
snowflake/llama3.2-1b
snowflake/llama3.2-3b
snowflake/llama3.3-70b
snowflake/mistral-7b
snowflake/mistral-large
snowflake/mistral-large2
snowflake/mixtral-8x7b
snowflake/reka-core
snowflake/reka-flash
snowflake/snowflake-arctic
snowflake/snowflake-llama-3.1-405b
snowflake/snowflake-llama-3.3-70b
stability.sd3-5-large-v1:0
stability.sd3-large-v1:0
stability.stable-image-core-v1:0
stability.stable-image-core-v1:1
stability.stable-image-ultra-v1:0
stability.stable-image-ultra-v1:1
standard/1024-x-1024/dall-e-3
standard/1024-x-1792/dall-e-3
standard/1792-x-1024/dall-e-3
text-bison
text-bison32k
text-bison32k@002
text-bison@001
text-bison@002
text-completion-codestral/codestral-2405
text-completion-codestral/codestral-latest
text-embedding-004
text-embedding-005
text-embedding-3-large
text-embedding-3-small
text-embedding-ada-002
text-embedding-ada-002-v2
text-embedding-large-exp-03-07
text-embedding-preview-0409
text-moderation-007
text-moderation-latest
text-moderation-stable
text-multilingual-embedding-002
text-multilingual-embedding-preview-0409
text-unicorn
text-unicorn@001
textembedding-gecko
textembedding-gecko-multilingual
textembedding-gecko-multilingual@001
textembedding-gecko@001
textembedding-gecko@003
together-ai-21.1b-41b
together-ai-4.1b-8b
together-ai-41.1b-80b
together-ai-8.1b-21b
together-ai-81.1b-110b
together-ai-embedding-151m-to-350m
together-ai-embedding-up-to-150m
together-ai-up-to-4b
together_ai/OpenAI/gpt-oss-20B
together_ai/Qwen/Qwen2.5-72B-Instruct-Turbo
together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo
together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput
together_ai/Qwen/Qwen3-235B-A22B-Thinking-2507
together_ai/Qwen/Qwen3-235B-A22B-fp8-tput
together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
together_ai/deepseek-ai/DeepSeek-R1
together_ai/deepseek-ai/DeepSeek-R1-0528-tput
together_ai/deepseek-ai/DeepSeek-V3
together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free
together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
together_ai/mistralai/Mistral-7B-Instruct-v0.1
together_ai/mistralai/Mistral-Small-24B-Instruct-2501
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1
together_ai/moonshotai/Kimi-K2-Instruct
together_ai/openai/gpt-oss-120b
together_ai/togethercomputer/CodeLlama-34b-Instruct
together_ai/zai-org/GLM-4.5-Air-FP8
tts-1
tts-1-hd
us.amazon.nova-lite-v1:0
us.amazon.nova-micro-v1:0
us.amazon.nova-premier-v1:0
us.amazon.nova-pro-v1:0
us.anthropic.claude-3-5-haiku-20241022-v1:0
us.anthropic.claude-3-5-sonnet-20240620-v1:0
us.anthropic.claude-3-5-sonnet-20241022-v2:0
us.anthropic.claude-3-7-sonnet-20250219-v1:0
us.anthropic.claude-3-haiku-20240307-v1:0
us.anthropic.claude-3-opus-20240229-v1:0
us.anthropic.claude-3-sonnet-20240229-v1:0
us.anthropic.claude-opus-4-1-20250805-v1:0
us.anthropic.claude-opus-4-20250514-v1:0
us.anthropic.claude-sonnet-4-20250514-v1:0
us.deepseek.r1-v1:0
us.meta.llama3-1-405b-instruct-v1:0
us.meta.llama3-1-70b-instruct-v1:0
us.meta.llama3-1-8b-instruct-v1:0
us.meta.llama3-2-11b-instruct-v1:0
us.meta.llama3-2-1b-instruct-v1:0
us.meta.llama3-2-3b-instruct-v1:0
us.meta.llama3-2-90b-instruct-v1:0
us.meta.llama3-3-70b-instruct-v1:0
us.meta.llama4-maverick-17b-instruct-v1:0
us.meta.llama4-scout-17b-instruct-v1:0
us.mistral.pixtral-large-2502-v1:0
v0/v0-1.0-md
v0/v0-1.5-lg
v0/v0-1.5-md
vertex_ai/claude-3-5-haiku
vertex_ai/claude-3-5-haiku@20241022
vertex_ai/claude-3-5-sonnet
vertex_ai/claude-3-5-sonnet-v2
vertex_ai/claude-3-5-sonnet-v2@20241022
vertex_ai/claude-3-5-sonnet@20240620
vertex_ai/claude-3-7-sonnet@20250219
vertex_ai/claude-3-haiku
vertex_ai/claude-3-haiku@20240307
vertex_ai/claude-3-opus
vertex_ai/claude-3-opus@20240229
vertex_ai/claude-3-sonnet
vertex_ai/claude-3-sonnet@20240229
vertex_ai/claude-opus-4
vertex_ai/claude-opus-4-1
vertex_ai/claude-opus-4-1@20250805
vertex_ai/claude-opus-4@20250514
vertex_ai/claude-sonnet-4
vertex_ai/claude-sonnet-4@20250514
vertex_ai/codestral-2501
vertex_ai/codestral@2405
vertex_ai/codestral@latest
vertex_ai/deepseek-ai/deepseek-r1-0528-maas
vertex_ai/imagegeneration@006
vertex_ai/imagen-3.0-fast-generate-001
vertex_ai/imagen-3.0-generate-001
vertex_ai/imagen-3.0-generate-002
vertex_ai/imagen-4.0-fast-generate-001
vertex_ai/imagen-4.0-generate-001
vertex_ai/imagen-4.0-ultra-generate-001
vertex_ai/jamba-1.5
vertex_ai/jamba-1.5-large
vertex_ai/jamba-1.5-large@001
vertex_ai/jamba-1.5-mini
vertex_ai/jamba-1.5-mini@001
vertex_ai/meta/llama-3.1-405b-instruct-maas
vertex_ai/meta/llama-3.1-70b-instruct-maas
vertex_ai/meta/llama-3.1-8b-instruct-maas
vertex_ai/meta/llama-3.2-90b-vision-instruct-maas
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas
vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas
vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas
vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas
vertex_ai/meta/llama3-405b-instruct-maas
vertex_ai/meta/llama3-70b-instruct-maas
vertex_ai/meta/llama3-8b-instruct-maas
vertex_ai/mistral-large-2411
vertex_ai/mistral-large@2407
vertex_ai/mistral-large@2411-001
vertex_ai/mistral-large@latest
vertex_ai/mistral-nemo@2407
vertex_ai/mistral-nemo@latest
vertex_ai/mistral-small-2503
vertex_ai/mistral-small-2503@001
vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas
vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas
voyage/rerank-2
voyage/rerank-2-lite
voyage/voyage-2
voyage/voyage-3
voyage/voyage-3-large
voyage/voyage-3-lite
voyage/voyage-code-2
voyage/voyage-code-3
voyage/voyage-context-3
voyage/voyage-finance-2
voyage/voyage-large-2
voyage/voyage-law-2
voyage/voyage-lite-01
voyage/voyage-lite-02-instruct
voyage/voyage-multimodal-3
watsonx/ibm/granite-3-8b-instruct
watsonx/mistralai/mistral-large
whisper-1
xai/grok-2
xai/grok-2-1212
xai/grok-2-latest
xai/grok-2-vision
xai/grok-2-vision-1212
xai/grok-2-vision-latest
xai/grok-3
xai/grok-3-beta
xai/grok-3-fast-beta
xai/grok-3-fast-latest
xai/grok-3-latest
xai/grok-3-mini
xai/grok-3-mini-beta
xai/grok-3-mini-fast
xai/grok-3-mini-fast-beta
xai/grok-3-mini-fast-latest
xai/grok-3-mini-latest
xai/grok-4
xai/grok-4-0709
xai/grok-4-latest
xai/grok-beta
xai/grok-code-fast
xai/grok-code-fast-1
xai/grok-code-fast-1-0825
xai/grok-vision-beta

To find the corresponding API key, check the previous section.