Self-host vs API
For each open-weight model below: rental GPU options, single-stream throughput from public benchmarks, estimated monthly cost at your workload — and the cheapest API alternative for the same weights.
For each open-weight model below: rental GPU options, single-stream throughput from public benchmarks, estimated monthly cost at your workload — and the cheapest API alternative for the same weights.