L LLM Cloud Hub

Vendor comparison

Meta vs NVIDIA

Every Meta and NVIDIA LLM model side by side: pricing per million tokens, context windows, and capabilities. Refreshed nightly from upstream.

Meta

14 models

Model	Context	In $/1M	Out $/1M
Llama 3 70B Instruct	8k	0.5100	0.7400
Llama 3 8B Instruct	8k	0.0400	0.0400
Llama 3.1 70B Instruct tools, json_mode	131k	0.4000	0.4000
Llama 3.1 8B Instruct tools, json_mode	16k	0.0200	0.0500
Llama 3.2 11B Vision Instruct vision, json_mode	131k	0.2450	0.2450
Llama 3.2 1B Instruct	60k	0.0270	0.2000
Llama 3.2 3B Instruct	80k	0.0510	0.3400
Llama 3.2 3B Instruct (free)	131k	0.0000	0.0000
Llama 3.3 70B Instruct tools, json_mode	131k	0.1000	0.3200
Llama 3.3 70B Instruct (free) tools	66k	0.0000	0.0000
Llama 4 Maverick vision, json_mode	1049k	0.1500	0.6000
Llama 4 Scout vision, tools, json_mode	328k	0.0800	0.3000
Llama Guard 3 8B	131k	0.4800	0.0300
Llama Guard 4 12B vision, json_mode	164k	0.1800	0.1800

NVIDIA

9 models

Model	Context	In $/1M	Out $/1M
Llama 3.3 Nemotron Super 49B V1.5 tools, json_mode	131k	0.1000	0.4000
Nemotron 3 Nano 30B A3B tools, json_mode	262k	0.0500	0.2000
Nemotron 3 Nano 30B A3B (free) tools	256k	0.0000	0.0000
Nemotron 3 Nano Omni (free) vision, tools	256k	0.0000	0.0000
Nemotron 3 Super tools, json_mode	262k	0.0900	0.4500
Nemotron 3 Super (free) tools, json_mode	262k	0.0000	0.0000
Nemotron Nano 12B 2 VL (free) vision, tools	128k	0.0000	0.0000
Nemotron Nano 9B V2 tools, json_mode	131k	0.0400	0.1600
Nemotron Nano 9B V2 (free) tools, json_mode	128k	0.0000	0.0000

Going deeper

Keyboard shortcuts

?: Show this overlay
/: Focus the first form field
g h: Go to / (home)
g b: Go to /best-llm-for
g c: Go to /cost
g s: Go to /self-hosted
g x: Go to /compliance
Esc: Close any overlay

Inspired by Linear and GitHub conventions. The two-key sequences (g then h) work within ~1 second.