SOC 2 Type II LLM models
Models served from providers with a current SOC 2 Type II report — the de-facto baseline for B2B procurement.
226 models match this tag · sorted by lowest total cost.
| # | Model | Provider | Context | In $/1M | Out $/1M |
|---|---|---|---|---|---|
| 1 | Gemma 4 26B A4B (free) | 262k | 0.0000 | 0.0000 | |
| 2 | Gemma 4 31B (free) | 262k | 0.0000 | 0.0000 | |
| 3 | Lyria 3 Clip Preview | 1049k | 0.0000 | 0.0000 | |
| 4 | Lyria 3 Pro Preview | 1049k | 0.0000 | 0.0000 | |
| 5 | Nemotron 3 Nano 30B A3B (free) | NVIDIA | 256k | 0.0000 | 0.0000 |
| 6 | Nemotron 3 Nano Omni (free) | NVIDIA | 256k | 0.0000 | 0.0000 |
| 7 | Nemotron 3 Super (free) | NVIDIA | 262k | 0.0000 | 0.0000 |
| 8 | Nemotron Nano 12B 2 VL (free) | NVIDIA | 128k | 0.0000 | 0.0000 |
| 9 | Nemotron Nano 9B V2 (free) | NVIDIA | 128k | 0.0000 | 0.0000 |
| 10 | Qwen3 Coder 480B A35B (free) | Qwen | 262k | 0.0000 | 0.0000 |
| 11 | Qwen3 Next 80B A3B Instruct (free) | Qwen | 262k | 0.0000 | 0.0000 |
| 12 | gpt-oss-120b (free) | OpenAI | 131k | 0.0000 | 0.0000 |
| 13 | gpt-oss-20b (free) | OpenAI | 131k | 0.0000 | 0.0000 |
| 14 | Mistral Nemo | Mistral | 131k | 0.0200 | 0.0300 |
| 15 | Gemma 3 4B | 131k | 0.0400 | 0.0800 | |
| 16 | Mistral Small 3 | Mistral | 33k | 0.0500 | 0.0800 |
| 17 | Qwen2.5 7B Instruct | Qwen | 33k | 0.0400 | 0.1000 |
| 18 | Qwen-Turbo | Qwen | 131k | 0.0325 | 0.1300 |
| 19 | Gemma 3 12B | 131k | 0.0400 | 0.1300 | |
| 20 | gpt-oss-20b | OpenAI | 131k | 0.0300 | 0.1400 |
| 21 | Qwen3 235B A22B Instruct 2507 | Qwen | 262k | 0.0710 | 0.1000 |
| 22 | Nova Micro 1.0 | Amazon | 128k | 0.0350 | 0.1400 |
| 23 | Gemma 3n 4B | 33k | 0.0600 | 0.1200 | |
| 24 | Command R7B (12-2024) | Cohere | 128k | 0.0375 | 0.1500 |
| 25 | Qwen3.5-9B | Qwen | 262k | 0.0400 | 0.1500 |
| 26 | Ministral 3 3B 2512 | Mistral | 131k | 0.1000 | 0.1000 |
| 27 | Nemotron Nano 9B V2 | NVIDIA | 131k | 0.0400 | 0.1600 |
| 28 | Phi 4 | Microsoft | 16k | 0.0650 | 0.1400 |
| 29 | gpt-oss-120b | OpenAI | 131k | 0.0390 | 0.1800 |
| 30 | Gemma 3 27B | 131k | 0.0800 | 0.1600 | |
| 31 | Nemotron 3 Nano 30B A3B | NVIDIA | 262k | 0.0500 | 0.2000 |
| 32 | Mistral Small 3.2 24B | Mistral | 128k | 0.0750 | 0.2000 |
| 33 | Ministral 3 8B 2512 | Mistral | 262k | 0.1500 | 0.1500 |
| 34 | Mistral 7B Instruct v0.1 | Mistral | 3k | 0.1100 | 0.1900 |
| 35 | Nova Lite 1.0 | Amazon | 300k | 0.0600 | 0.2400 |
| 36 | Qwen3.5-Flash | Qwen | 1000k | 0.0650 | 0.2600 |
| 37 | Qwen3 14B | Qwen | 41k | 0.1000 | 0.2400 |
| 38 | Qwen3 Coder 30B A3B Instruct | Qwen | 160k | 0.0700 | 0.2700 |
| 39 | Qwen3 32B | Qwen | 41k | 0.0800 | 0.2800 |
| 40 | Gemini 2.0 Flash Lite | 1049k | 0.0750 | 0.3000 | |
| 41 | gpt-oss-safeguard-20b | OpenAI | 131k | 0.0750 | 0.3000 |
| 42 | Gemma 4 26B A4B | 262k | 0.0600 | 0.3300 | |
| 43 | Qwen3 30B A3B Instruct 2507 | Qwen | 262k | 0.0900 | 0.3000 |
| 44 | Devstral Small 1.1 | Mistral | 131k | 0.1000 | 0.3000 |
| 45 | Ministral 3 14B 2512 | Mistral | 262k | 0.2000 | 0.2000 |
| 46 | Voxtral Small 24B 2507 | Mistral | 32k | 0.1000 | 0.3000 |
| 47 | Phi 4 Mini Instruct | Microsoft | 128k | 0.0800 | 0.3500 |
| 48 | GPT-5 Nano | OpenAI | 400k | 0.0500 | 0.4000 |
| 49 | Qwen3 8B | Qwen | 41k | 0.0500 | 0.4000 |
| 50 | Qwen3 30B A3B Thinking 2507 | Qwen | 131k | 0.0800 | 0.4000 |
| 51 | Gemma 4 31B | 262k | 0.1200 | 0.3700 | |
| 52 | GPT-4.1 Nano | OpenAI | 1048k | 0.1000 | 0.4000 |
| 53 | Gemini 2.0 Flash | 1049k | 0.1000 | 0.4000 | |
| 54 | Gemini 2.5 Flash Lite | 1049k | 0.1000 | 0.4000 | |
| 55 | Gemini 2.5 Flash Lite Preview 09-2025 | 1049k | 0.1000 | 0.4000 | |
| 56 | Llama 3.3 Nemotron Super 49B V1.5 | NVIDIA | 131k | 0.1000 | 0.4000 |
| 57 | Qwen3 VL 32B Instruct | Qwen | 131k | 0.1040 | 0.4160 |
| 58 | Nemotron 3 Super | NVIDIA | 262k | 0.0900 | 0.4500 |
| 59 | Qwen3 30B A3B | Qwen | 41k | 0.0900 | 0.4500 |
| 60 | Qwen VL Plus | Qwen | 131k | 0.1365 | 0.4095 |
| 61 | Qwen3 VL 8B Instruct | Qwen | 131k | 0.0800 | 0.5000 |
| 62 | Qwen3 VL 30B A3B Instruct | Qwen | 131k | 0.1300 | 0.5200 |
| 63 | Grok 4 Fast | xAI | 2000k | 0.2000 | 0.5000 |
| 64 | Grok 4.1 Fast | xAI | 2000k | 0.2000 | 0.5000 |
| 65 | Command R (08-2024) | Cohere | 128k | 0.1500 | 0.6000 |
| 66 | GPT-4o-mini | OpenAI | 128k | 0.1500 | 0.6000 |
| 67 | GPT-4o-mini (2024-07-18) | OpenAI | 128k | 0.1500 | 0.6000 |
| 68 | GPT-4o-mini Search Preview | OpenAI | 128k | 0.1500 | 0.6000 |
| 69 | Mistral Small 4 | Mistral | 262k | 0.1500 | 0.6000 |
| 70 | Qwen2.5 72B Instruct | Qwen | 33k | 0.3600 | 0.4000 |
| 71 | Grok 3 Mini | xAI | 131k | 0.3000 | 0.5000 |
| 72 | Grok 3 Mini Beta | xAI | 131k | 0.3000 | 0.5000 |
| 73 | Saba | Mistral | 33k | 0.2000 | 0.6000 |
| 74 | Qwen3 Next 80B A3B Thinking | Qwen | 131k | 0.0975 | 0.7800 |
| 75 | Mistral Small 3.1 24B | Mistral | 128k | 0.3500 | 0.5600 |
| 76 | Qwen3 Coder Next | Qwen | 262k | 0.1100 | 0.8000 |
| 77 | Qwen2.5 VL 72B Instruct | Qwen | 32k | 0.2500 | 0.7500 |
| 78 | Qwen Plus 0728 | Qwen | 1000k | 0.2600 | 0.7800 |
| 79 | Qwen Plus 0728 (thinking) | Qwen | 1000k | 0.2600 | 0.7800 |
| 80 | Qwen-Plus | Qwen | 1000k | 0.2600 | 0.7800 |
| 81 | Qwen3 VL 235B A22B Instruct | Qwen | 262k | 0.2000 | 0.8800 |
| 82 | Qwen3.5-35B-A3B | Qwen | 262k | 0.1400 | 1.0000 |
| 83 | Qwen3.6 35B A3B | Qwen | 262k | 0.1500 | 1.0000 |
| 84 | Qwen3 Coder Flash | Qwen | 1000k | 0.1950 | 0.9750 |
| 85 | Qwen3 Next 80B A3B Instruct | Qwen | 262k | 0.0900 | 1.1000 |
| 86 | Codestral 2508 | Mistral | 256k | 0.3000 | 0.9000 |
| 87 | WizardLM-2 8x22B | Microsoft | 66k | 0.6200 | 0.6200 |
| 88 | Gemma 2 27B | 8k | 0.6500 | 0.6500 | |
| 89 | Qwen3.6 Flash | Qwen | 1000k | 0.1875 | 1.1250 |
| 90 | GPT-5.4 Nano | OpenAI | 400k | 0.2000 | 1.2500 |
| 91 | Qwen3 VL 8B Thinking | Qwen | 131k | 0.1170 | 1.3650 |
| 92 | Claude 3 Haiku | Anthropic | 200k | 0.2500 | 1.2500 |
| 93 | Qwen3 235B A22B Thinking 2507 | Qwen | 131k | 0.1495 | 1.4950 |
| 94 | Qwen2.5 Coder 32B Instruct | Qwen | 33k | 0.6600 | 1.0000 |
| 95 | Qwen3 VL 30B A3B Thinking | Qwen | 131k | 0.1300 | 1.5600 |
| 96 | Grok Code Fast 1 | xAI | 256k | 0.2000 | 1.5000 |
| 97 | Gemini 3.1 Flash Lite | 1049k | 0.2500 | 1.5000 | |
| 98 | Gemini 3.1 Flash Lite Preview | 1049k | 0.2500 | 1.5000 | |
| 99 | Qwen3.5-27B | Qwen | 262k | 0.1950 | 1.5600 |
| 100 | Qwen3.5 Plus 2026-02-15 | Qwen | 1000k | 0.2600 | 1.5600 |
| 101 | GPT-3.5 Turbo | OpenAI | 16k | 0.5000 | 1.5000 |
| 102 | GPT-4.1 Mini | OpenAI | 1048k | 0.4000 | 1.6000 |
| 103 | Mistral Large 3 2512 | Mistral | 262k | 0.5000 | 1.5000 |
| 104 | Sonar | Perplexity | 127k | 1.0000 | 1.0000 |
| 105 | Qwen3 Coder 480B A35B | Qwen | 262k | 0.2200 | 1.8000 |
| 106 | Qwen3.5 Plus 2026-04-20 | Qwen | 1000k | 0.3000 | 1.8000 |
| 107 | GPT-5 Mini | OpenAI | 400k | 0.2500 | 2.0000 |
| 108 | GPT-5.1-Codex-Mini | OpenAI | 400k | 0.2500 | 2.0000 |
| 109 | Qwen3 235B A22B | Qwen | 131k | 0.4550 | 1.8200 |
| 110 | Qwen3.6 Plus | Qwen | 1000k | 0.3250 | 1.9500 |
| 111 | Qwen3.5-122B-A10B | Qwen | 262k | 0.2600 | 2.0800 |
| 112 | Devstral 2 2512 | Mistral | 262k | 0.4000 | 2.0000 |
| 113 | Devstral Medium | Mistral | 131k | 0.4000 | 2.0000 |
| 114 | Mistral Medium 3 | Mistral | 131k | 0.4000 | 2.0000 |
| 115 | Mistral Medium 3.1 | Mistral | 131k | 0.4000 | 2.0000 |
| 116 | Qwen VL Max | Qwen | 131k | 0.5200 | 2.0800 |
| 117 | Qwen3.5 397B A17B | Qwen | 262k | 0.3900 | 2.3400 |
| 118 | Gemini 2.5 Flash | 1049k | 0.3000 | 2.5000 | |
| 119 | Nano Banana (Gemini 2.5 Flash Image) | 33k | 0.3000 | 2.5000 | |
| 120 | Nova 2 Lite | Amazon | 1000k | 0.3000 | 2.5000 |
| 121 | Qwen3 VL 235B A22B Thinking | Qwen | 131k | 0.2600 | 2.6000 |
| 122 | GPT Audio Mini | OpenAI | 128k | 0.6000 | 2.4000 |
| 123 | GPT-3.5 Turbo (older v0613) | OpenAI | 4k | 1.0000 | 2.0000 |
| 124 | GPT-3.5 Turbo Instruct | OpenAI | 4k | 1.5000 | 2.0000 |
| 125 | Gemini 3 Flash Preview | 1049k | 0.5000 | 3.0000 | |
| 126 | Google Gemini Flash Latest | 1049k | 0.5000 | 3.0000 | |
| 127 | Nano Banana 2 (Gemini 3.1 Flash Image Preview) | 66k | 0.5000 | 3.0000 | |
| 128 | Qwen3.6 27B | Qwen | 262k | 0.3200 | 3.2000 |
| 129 | Grok 4.20 | xAI | 2000k | 1.2500 | 2.5000 |
| 130 | Grok 4.3 | xAI | 1000k | 1.2500 | 2.5000 |
| 131 | Qwen3 Coder Plus | Qwen | 1000k | 0.6500 | 3.2500 |
| 132 | Nova Pro 1.0 | Amazon | 300k | 0.8000 | 3.2000 |
| 133 | GPT-5 Image Mini | OpenAI | 400k | 2.5000 | 2.0000 |
| 134 | Qwen3 Max | Qwen | 262k | 0.7800 | 3.9000 |
| 135 | Qwen3 Max Thinking | Qwen | 262k | 0.7800 | 3.9000 |
| 136 | Claude 3.5 Haiku | Anthropic | 200k | 0.8000 | 4.0000 |
| 137 | Qwen-Max | Qwen | 33k | 1.0400 | 4.1600 |
| 138 | GPT-5.4 Mini | OpenAI | 400k | 0.7500 | 4.5000 |
| 139 | OpenAI GPT Mini Latest | OpenAI | 400k | 0.7500 | 4.5000 |
| 140 | o3 Mini | OpenAI | 200k | 1.1000 | 4.4000 |
| 141 | o3 Mini High | OpenAI | 200k | 1.1000 | 4.4000 |
| 142 | o4 Mini | OpenAI | 200k | 1.1000 | 4.4000 |
| 143 | o4 Mini High | OpenAI | 200k | 1.1000 | 4.4000 |
| 144 | Anthropic Claude Haiku Latest | Anthropic | 200k | 1.0000 | 5.0000 |
| 145 | Claude Haiku 4.5 | Anthropic | 200k | 1.0000 | 5.0000 |
| 146 | GPT-3.5 Turbo 16k | OpenAI | 16k | 3.0000 | 4.0000 |
| 147 | Qwen3.6 Max Preview | Qwen | 262k | 1.0400 | 6.2400 |
| 148 | Grok 4.20 Multi-Agent | xAI | 2000k | 2.0000 | 6.0000 |
| 149 | Mistral Large | Mistral | 128k | 2.0000 | 6.0000 |
| 150 | Mistral Large 2407 | Mistral | 131k | 2.0000 | 6.0000 |
| 151 | Mistral Large 2411 | Mistral | 131k | 2.0000 | 6.0000 |
| 152 | Mixtral 8x22B Instruct | Mistral | 66k | 2.0000 | 6.0000 |
| 153 | Pixtral Large 2411 | Mistral | 131k | 2.0000 | 6.0000 |
| 154 | Mistral Medium 3.5 | Mistral | 262k | 1.5000 | 7.5000 |
| 155 | GPT-4.1 | OpenAI | 1048k | 2.0000 | 8.0000 |
| 156 | Sonar Deep Research | Perplexity | 128k | 2.0000 | 8.0000 |
| 157 | Sonar Reasoning Pro | Perplexity | 128k | 2.0000 | 8.0000 |
| 158 | o3 | OpenAI | 200k | 2.0000 | 8.0000 |
| 159 | o4 Mini Deep Research | OpenAI | 200k | 2.0000 | 8.0000 |
| 160 | GPT-5 | OpenAI | 400k | 1.2500 | 10.0000 |
| 161 | GPT-5 Chat | OpenAI | 128k | 1.2500 | 10.0000 |
| 162 | GPT-5 Codex | OpenAI | 400k | 1.2500 | 10.0000 |
| 163 | GPT-5.1 | OpenAI | 400k | 1.2500 | 10.0000 |
| 164 | GPT-5.1 Chat | OpenAI | 128k | 1.2500 | 10.0000 |
| 165 | GPT-5.1-Codex | OpenAI | 400k | 1.2500 | 10.0000 |
| 166 | GPT-5.1-Codex-Max | OpenAI | 400k | 1.2500 | 10.0000 |
| 167 | Gemini 2.5 Pro | 1049k | 1.2500 | 10.0000 | |
| 168 | Gemini 2.5 Pro Preview 05-06 | 1049k | 1.2500 | 10.0000 | |
| 169 | Gemini 2.5 Pro Preview 06-05 | 1049k | 1.2500 | 10.0000 | |
| 170 | Command A | Cohere | 256k | 2.5000 | 10.0000 |
| 171 | Command R+ (08-2024) | Cohere | 128k | 2.5000 | 10.0000 |
| 172 | GPT Audio | OpenAI | 128k | 2.5000 | 10.0000 |
| 173 | GPT-4o | OpenAI | 128k | 2.5000 | 10.0000 |
| 174 | GPT-4o (2024-08-06) | OpenAI | 128k | 2.5000 | 10.0000 |
| 175 | GPT-4o (2024-11-20) | OpenAI | 128k | 2.5000 | 10.0000 |
| 176 | GPT-4o Audio | OpenAI | 128k | 2.5000 | 10.0000 |
| 177 | GPT-4o Search Preview | OpenAI | 128k | 2.5000 | 10.0000 |
| 178 | Gemini 3.1 Pro Preview | 1049k | 2.0000 | 12.0000 | |
| 179 | Gemini 3.1 Pro Preview Custom Tools | 1049k | 2.0000 | 12.0000 | |
| 180 | Google Gemini Pro Latest | 1049k | 2.0000 | 12.0000 | |
| 181 | Nano Banana Pro (Gemini 3 Pro Image Preview) | 66k | 2.0000 | 12.0000 | |
| 182 | Nova Premier 1.0 | Amazon | 1000k | 2.5000 | 12.5000 |
| 183 | GPT-5.2 | OpenAI | 400k | 1.7500 | 14.0000 |
| 184 | GPT-5.2 Chat | OpenAI | 128k | 1.7500 | 14.0000 |
| 185 | GPT-5.2-Codex | OpenAI | 400k | 1.7500 | 14.0000 |
| 186 | GPT-5.3 Chat | OpenAI | 128k | 1.7500 | 14.0000 |
| 187 | GPT-5.3-Codex | OpenAI | 400k | 1.7500 | 14.0000 |
| 188 | GPT-5.4 | OpenAI | 1050k | 2.5000 | 15.0000 |
| 189 | Anthropic Claude Sonnet Latest | Anthropic | 1000k | 3.0000 | 15.0000 |
| 190 | Claude 3.7 Sonnet | Anthropic | 200k | 3.0000 | 15.0000 |
| 191 | Claude 3.7 Sonnet (thinking) | Anthropic | 200k | 3.0000 | 15.0000 |
| 192 | Claude Sonnet 4 | Anthropic | 1000k | 3.0000 | 15.0000 |
| 193 | Claude Sonnet 4.5 | Anthropic | 1000k | 3.0000 | 15.0000 |
| 194 | Claude Sonnet 4.6 | Anthropic | 1000k | 3.0000 | 15.0000 |
| 195 | Grok 3 | xAI | 131k | 3.0000 | 15.0000 |
| 196 | Grok 3 Beta | xAI | 131k | 3.0000 | 15.0000 |
| 197 | Grok 4 | xAI | 256k | 3.0000 | 15.0000 |
| 198 | Sonar Pro | Perplexity | 200k | 3.0000 | 15.0000 |
| 199 | Sonar Pro Search | Perplexity | 200k | 3.0000 | 15.0000 |
| 200 | GPT-4o (2024-05-13) | OpenAI | 128k | 5.0000 | 15.0000 |
| 201 | GPT-5 Image | OpenAI | 400k | 10.0000 | 10.0000 |
| 202 | GPT-5.4 Image 2 | OpenAI | 272k | 8.0000 | 15.0000 |
| 203 | Claude Opus 4.5 | Anthropic | 200k | 5.0000 | 25.0000 |
| 204 | Claude Opus 4.6 | Anthropic | 1000k | 5.0000 | 25.0000 |
| 205 | Claude Opus 4.7 | Anthropic | 1000k | 5.0000 | 25.0000 |
| 206 | Claude Opus Latest | Anthropic | 1000k | 5.0000 | 25.0000 |
| 207 | GPT Chat Latest | OpenAI | 400k | 5.0000 | 30.0000 |
| 208 | GPT-5.5 | OpenAI | 1050k | 5.0000 | 30.0000 |
| 209 | OpenAI GPT Latest | OpenAI | 1050k | 5.0000 | 30.0000 |
| 210 | GPT-4 Turbo | OpenAI | 128k | 10.0000 | 30.0000 |
| 211 | GPT-4 Turbo (older v1106) | OpenAI | 128k | 10.0000 | 30.0000 |
| 212 | GPT-4 Turbo Preview | OpenAI | 128k | 10.0000 | 30.0000 |
| 213 | o3 Deep Research | OpenAI | 200k | 10.0000 | 40.0000 |
| 214 | o1 | OpenAI | 200k | 15.0000 | 60.0000 |
| 215 | Claude Opus 4 | Anthropic | 200k | 15.0000 | 75.0000 |
| 216 | Claude Opus 4.1 | Anthropic | 200k | 15.0000 | 75.0000 |
| 217 | GPT-4 | OpenAI | 8k | 30.0000 | 60.0000 |
| 218 | GPT-4 (older v0314) | OpenAI | 8k | 30.0000 | 60.0000 |
| 219 | o3 Pro | OpenAI | 200k | 20.0000 | 80.0000 |
| 220 | GPT-5 Pro | OpenAI | 400k | 15.0000 | 120.0000 |
| 221 | Claude Opus 4.6 (Fast) | Anthropic | 1000k | 30.0000 | 150.0000 |
| 222 | Claude Opus 4.7 (Fast) | Anthropic | 1000k | 30.0000 | 150.0000 |
| 223 | GPT-5.2 Pro | OpenAI | 400k | 21.0000 | 168.0000 |
| 224 | GPT-5.4 Pro | OpenAI | 1050k | 30.0000 | 180.0000 |
| 225 | GPT-5.5 Pro | OpenAI | 1050k | 30.0000 | 180.0000 |
| 226 | o1-pro | OpenAI | 200k | 150.0000 | 600.0000 |