Financial Analysis of AI Deployment

An interactive tool to compare costs and performance between buying hardware, renting server power, or using an API.

Cost Table for API Models

Compare costs between different API models based on your scenario. Click column headers to sort.

Input tokens per call

Output tokens per call

API calls per day

Number of days

Model	Company	Input Cost	Output Cost	Cost / call	Total Daily Cost	Total (30 days)
Llama 3.1 8B Instant 128k	Groq	$0.000050	$0.000080	$0.000130	$0.0130	$0.3900
Llama 3 8B 8k	Groq	$0.000050	$0.000080	$0.000130	$0.0130	$0.3900
Mistral Small 3.2 24B	openrouter	$0.000050	$0.000100	$0.000150	$0.0150	$0.4500
Gemma 3 12B	openrouter	$0.000050	$0.000100	$0.000150	$0.0150	$0.4500
Qwen3 8B	openrouter	$0.000035	$0.000138	$0.000173	$0.0173	$0.5190
Phi 4 14B	openrouter	$0.000070	$0.000140	$0.000210	$0.0210	$0.6300
Llama 3.3 70B	openrouter	$0.000050	$0.000190	$0.000240	$0.0240	$0.7200
Gemma 3 27B	openrouter	$0.000100	$0.000180	$0.000280	$0.0280	$0.8400
Qwen3 14B	openrouter	$0.000060	$0.000240	$0.000300	$0.0300	$0.9000
DeepSeek: R1 Distill Qwen 32B	openrouter	$0.000120	$0.000180	$0.000300	$0.0300	$0.9000
Mistral 7B	AWS Bedrock	$0.000150	$0.000200	$0.000350	$0.0350	$1.05
gemini 1.5 flash-8B > 128k	Google	$0.000100	$0.000300	$0.000400	$0.0400	$1.20
gemini 2.0 flash-lite	Google	$0.000100	$0.000300	$0.000400	$0.0400	$1.20
Qwen3 32B	openrouter	$0.000100	$0.000300	$0.000400	$0.0400	$1.20
Gemma 2 9B 8k	Groq	$0.000200	$0.000200	$0.000400	$0.0400	$1.20
Llama 4 Scout (17Bx16E)	Groq	$0.000110	$0.000340	$0.000450	$0.0450	$1.35
gpt-4.1-nano	OpenAI	$0.000100	$0.000400	$0.000500	$0.0500	$1.50
gemini 2.0 flash text	Google	$0.000100	$0.000400	$0.000500	$0.0500	$1.50
GPT-4.1-nano (2025-04-14)	Azure	$0.000100	$0.000400	$0.000500	$0.0500	$1.50
DeepSeek-V3-0324 （UTC 16:30-00:30）	DeepSeek	$0.000035	$0.000550	$0.000585	$0.0585	$1.76
DeepSeek-R1-0528 （UTC 16:30-00:30）	DeepSeek	$0.000035	$0.000550	$0.000585	$0.0585	$1.76
Qwen QwQ 32B (Preview) 128k	Groq	$0.000290	$0.000390	$0.000680	$0.0680	$2.04
gemini 1.5 flash > 128k	Google	$0.000100	$0.000600	$0.000700	$0.0700	$2.10
Qwen3 235B	openrouter	$0.000130	$0.000600	$0.000730	$0.0730	$2.19
gpt-4o-mini	OpenAI	$0.000200	$0.000600	$0.000800	$0.0800	$2.40
grok-3-mini	Grok (xAI)	$0.000300	$0.000500	$0.000800	$0.0800	$2.40
Llama 4 Maverick (17Bx128E)	Groq	$0.000200	$0.000600	$0.000800	$0.0800	$2.40
Mistral 7B Instruct v0.3	openrouter	$0.000280	$0.000540	$0.000820	$0.0820	$2.46
Scout 17B	AWS Bedrock	$0.000170	$0.000660	$0.000830	$0.0830	$2.49
Qwen3 32B 131k	Groq	$0.000290	$0.000590	$0.000880	$0.0880	$2.64
Mixtral 8×7B	AWS Bedrock	$0.000450	$0.000700	$0.001150	$0.1150	$3.45
DeepSeek-V3-0324 （UTC 00:30-16:30）	DeepSeek	$0.000070	$0.001100	$0.001170	$0.1170	$3.51
Maverick 17B	AWS Bedrock	$0.000240	$0.000970	$0.001210	$0.1210	$3.63
Llama 3.3 70B Versatile 128k	Groq	$0.000590	$0.000790	$0.001380	$0.1380	$4.14
Llama 3 70B 8k	Groq	$0.000590	$0.000790	$0.001380	$0.1380	$4.14
Claude Haiku 3	Anthropic	$0.000300	$0.001300	$0.001600	$0.1600	$4.80
Llama 3.1 405B	openrouter	$0.000800	$0.000800	$0.001600	$0.1600	$4.80
DeepSeek R1 Distill Llama 70B	Groq	$0.000750	$0.000990	$0.001740	$0.1740	$5.22
gemini 2.0 flash live text	Google	$0.000300	$0.001500	$0.001800	$0.1800	$5.40
Dolphin 2.9.2 Mixtral 8x22B	openrouter	$0.000900	$0.000900	$0.001800	$0.1800	$5.40
gpt-4.1-mini	OpenAI	$0.000400	$0.001600	$0.002000	$0.2000	$6.00
GPT-3.5-Turbo-0125 (16k)	Azure	$0.000500	$0.001500	$0.002000	$0.2000	$6.00
GPT-4.1-mini (2025-04-14)	Azure	$0.000400	$0.001600	$0.002000	$0.2000	$6.00
DeepSeek-R1-0528 （UTC 00:30-16:30）	DeepSeek	$0.000140	$0.002190	$0.002330	$0.2330	$6.99
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1	openrouter	$0.000600	$0.001800	$0.002400	$0.2400	$7.20
gemini 2.5 flash live text	Google	$0.000500	$0.002000	$0.002500	$0.2500	$7.50
DeepSeek: R1 0528 671B	openrouter	$0.000500	$0.002150	$0.002650	$0.2650	$7.95
gemini 2.5 flash text	Google	$0.000300	$0.002500	$0.002800	$0.2800	$8.40
GPT-3.5-Turbo-1106 (16k)	Azure	$0.001000	$0.002000	$0.003000	$0.3000	$9.00
Mistral Small (24.02)	AWS Bedrock	$0.001000	$0.003000	$0.004000	$0.4000	$12.00
grok-3-mini-fast	Grok (xAI)	$0.000600	$0.004000	$0.004600	$0.4600	$13.80
Claude Haiku 3.5	Anthropic	$0.000800	$0.004000	$0.004800	$0.4800	$14.40
o4-mini	OpenAI	$0.001100	$0.004400	$0.005500	$0.5500	$16.50
o3-mini	OpenAI	$0.001100	$0.004400	$0.005500	$0.5500	$16.50
o1-mini	OpenAI	$0.001100	$0.004400	$0.005500	$0.5500	$16.50
o4-mini (2025-04-16)	Azure	$0.001100	$0.004400	$0.005500	$0.5500	$16.50
gemini 1.5 pro ≤ 128k	Google	$0.001300	$0.005000	$0.006300	$0.6300	$18.90
DeepSeek-R1	AWS Bedrock	$0.001350	$0.005400	$0.006750	$0.6750	$20.25
gpt-4.1	OpenAI	$0.002000	$0.008000	$0.0100	$1.00	$30.00
o3	OpenAI	$0.002000	$0.008000	$0.0100	$1.00	$30.00
GPT-4.1 (2025-04-14)	Azure	$0.002000	$0.008000	$0.0100	$1.00	$30.00
gemini 2.5 pro ≤ 200k	Google	$0.001300	$0.0100	$0.0113	$1.13	$33.90
grok-2-1212	Grok (xAI)	$0.002000	$0.0100	$0.0120	$1.20	$36.00
gpt-4o	OpenAI	$0.002500	$0.0100	$0.0125	$1.25	$37.50
gemini 1.5 pro > 128k	Google	$0.002500	$0.0100	$0.0125	$1.25	$37.50
Mistral Large (24.02)	AWS Bedrock	$0.004000	$0.0120	$0.0160	$1.60	$48.00
gemini 2.5 pro > 200k	Google	$0.002500	$0.0150	$0.0175	$1.75	$52.50
Claude Sonnet 4	Anthropic	$0.003000	$0.0150	$0.0180	$1.80	$54.00
Claude Sonnet 3.7	Anthropic	$0.003000	$0.0150	$0.0180	$1.80	$54.00
Claude Sonnet 3.5	Anthropic	$0.003000	$0.0150	$0.0180	$1.80	$54.00
grok-3	Grok (xAI)	$0.003000	$0.0150	$0.0180	$1.80	$54.00
Claude Sonnet 4	AWS Bedrock	$0.003000	$0.0150	$0.0180	$1.80	$54.00
Claude 3.7 Sonnet	AWS Bedrock	$0.003000	$0.0150	$0.0180	$1.80	$54.00
Claude 3.5 Sonnet v2	AWS Bedrock	$0.003000	$0.0150	$0.0180	$1.80	$54.00
grok-3-fast	Grok (xAI)	$0.005000	$0.0250	$0.0300	$3.00	$90.00
GPT-4-Turbo (128k)	Azure	$0.0100	$0.0300	$0.0400	$4.00	$120.00
o3 (2025-04-16)	Azure	$0.0100	$0.0400	$0.0500	$5.00	$150.00
o1	OpenAI	$0.0150	$0.0600	$0.0750	$7.50	$225.00
Claude Opus 4	Anthropic	$0.0150	$0.0750	$0.0900	$9.00	$270.00
Claude Opus 3	Anthropic	$0.0150	$0.0750	$0.0900	$9.00	$270.00
GPT-4 (8k)	Azure	$0.0300	$0.0600	$0.0900	$9.00	$270.00
o3-pro	OpenAI	$0.0200	$0.0800	$0.1000	$10.00	$300.00
GPT-4 (32k)	Azure	$0.0600	$0.1200	$0.1800	$18.00	$540.00
gpt-4.5-preview	OpenAI	$0.0750	$0.1500	$0.2250	$22.50	$675.00
o1-pro	OpenAI	$0.1500	$0.6000	$0.7500	$75.00	$2250.00

GPU Server Rental Analysis

Compare and analyze costs and performance for renting dedicated GPU hardware.

GPU Server Prices

Hourly prices for renting dedicated GPU servers from various providers. Use the controls to search and sort.

Company	GPU Model	VRAM (GB)	Price ($/h)
Vast.ai	RTX A2000	12	$0.0400
Vast.ai	GTX 1080	8	$0.0400
Vast.ai	GTX 1070 Ti	8	$0.0400
Vast.ai	RTX 3070	8	$0.0500
Vast.ai	RTX 2060	6	$0.0500
Vast.ai	GTX 1660 S	6	$0.0500
Vast.ai	RTX 3060	12	$0.0600
Vast.ai	GTX 1070	8	$0.0600
Vast.ai	GTX 1080 Ti	11	$0.0600
Vast.ai	RTX 3060 Ti	8	$0.0700
Vast.ai	RTX 4060	8	$0.0700
Vast.ai	RTX 2080 Ti	11	$0.0800
Vast.ai	RTX 3070 Ti	8	$0.0800
Vast.ai	RTX 5070	12	$0.0900
Vast.ai	RTX A4000	16	$0.1000
Vast.ai	RTX 3080	10	$0.1000
Vast.ai	RTX 2060S	8	$0.1000
Vast.ai	RTX 4060 Ti	16	$0.1200
Vast.ai	RTX 5060 Ti	16	$0.1200
Vast.ai	RTX 4070	12	$0.1200
Vast.ai	RTX 3080 Ti	12	$0.1200
Vast.ai	RTX 5070 Ti	16	$0.1300
Vast.ai	RTX 4070 Ti	12	$0.1300
Vast.ai	RTX 4070S Ti	16	$0.1600
Vast.ai	RTX 5080	16	$0.1600
Vast.ai	RTX 4070S	12	$0.1600
Vast.ai	RTX 3090	24	$0.1700
Vast.ai	RTX A5000	24	$0.2000
Vast.ai	RTX 4080S	16	$0.2000
Vast.ai	RTX 3090 Ti	24	$0.2000
Vast.ai	Tesla V100	32	$0.2300
RunPod	RTX A5000	24	$0.2600
Vast.ai	Q RTX 8000	48	$0.3200
Vast.ai	RTX 4090	24	$0.3500
RunPod	A40	48	$0.4000
RunPod	L4	24	$0.4300
Vast.ai	RTX 5090	32	$0.4300
Vast.ai	RTX A6000	48	$0.4500
RunPod	RTX 3090	24	$0.4600
Vast.ai	A40	48	$0.4600
RunPod	RTX A6000	48	$0.4900
Vast.ai	RTX 6000ADA	48	$0.6500
Vast.ai	L40S	48	$0.6800
RunPod	RTX 4090	24	$0.6900
RunPod	RTX 6000 Ada	48	$0.7700
Vast.ai	A100 SXM4	80	$0.8200
RunPod	L40S	48	$0.8600
Vast.ai	A100 PCIE	40	$0.8700
RunPod	RTX 5090	32	$0.8900
RunPod	L40	48	$0.9900
RunPod	A100 PCIe	80	$1.6400
RunPod	A100 SXM	80	$1.7400
Vast.ai	H100 SXM	80	$1.8700
RunPod	H100 PCIe	80	$2.1900
Vast.ai	H100 NVL	188	$2.2700
RunPod	H100 NVL	94	$2.7900
RunPod	H100 SXM	80	$2.9900
Vast.ai	H200	141	$3.0600
RunPod	H200 SXM	141	$3.9900
RunPod	B200	180	$6.3900

GPU Purchase Analysis

Compare and analyze costs and performance for purchasing dedicated GPU hardware.

GPU Purchase Prices

Recommended MSRP prices for various graphics cards and enterprise servers.

Subcategory	GPU Model	VRAM (GB)	MSRP ($)
Lokala GPU	GeForce GTX 1660 Super	6	$229
Lokala GPU	GeForce RTX 3050 8GB	8	$249
Lokala GPU	GeForce RTX 5050	8	$249
Lokala GPU	GeForce RTX 4060 8GB	8	$299
Lokala GPU	GeForce RTX 5060	8	$299
Lokala GPU	GeForce RTX 3060 8GB	8	$300
Lokala GPU	GeForce RTX 3060 12GB	12	$329
Lokala GPU	GeForce RTX 5060 Ti 8GB	8	$379
Lokala GPU	GeForce RTX 2060 Super	8	$399
Lokala GPU	GeForce RTX 3060 Ti 8GB	8	$399
Lokala GPU	GeForce RTX 4060 Ti 8GB	8	$399
Lokala GPU	GeForce RTX 5060 Ti 16GB	16	$429
Lokala GPU	GeForce RTX 4060 Ti 16GB	16	$499
Lokala GPU	GeForce RTX 4070 12GB	12	$549
Lokala GPU	GeForce RTX 5070	12	$549
Lokala GPU	GeForce RTX 4070 Super	12	$599
Lokala GPU	GeForce RTX 5070 Super	18	$650
Lokala GPU	GeForce RTX 5070 Ti	16	$749
Lokala GPU	GeForce RTX 4070 Ti	12	$799
Lokala GPU	GeForce RTX 4070 Ti Super	16	$799
Lokala GPU	GeForce RTX 4080 Super	16	$999
Lokala GPU	GeForce RTX 5080	16	$999
Lokala GPU	GeForce RTX 3080 Ti	12	$1,199
Lokala GPU	GeForce RTX 4080	16	$1,199
Lokala GPU	RTX A4000	16	$1,200
Lokala GPU	GeForce RTX 3090	24	$1,499
Lokala GPU	GeForce RTX 5080 Super*	24	$1,500
Lokala GPU	GeForce RTX 4090	24	$1,599
Lokala GPU	GeForce RTX 3090 Ti	24	$1,999
Lokala GPU	GeForce RTX 5090	32	$1,999
Lokala GPU	RTX A5000	24	$2,300
Lokala GPU	AMD Radeon Pro W7900	32	$2,500
Lokala GPU	Quadro RTX 8000	48	$3,500
Lokala GPU	AMD Radeon Pro W7900	48	$3,500
Lokala GPU	RTX A6000	48	$5,100
Enterprise GPU	L40S	48	$7,500
Lokala GPU	RTX Pro 6000 Blackwell	48	$8,500
Enterprise GPU	A100 40GB PCIe	40	$10,000
Enterprise GPU	A100 80GB SXM4	80	$12,000
Enterprise GPU	Intel Gaudi 3	128	$15,500
Enterprise GPU	Tesla V100 32GB	32	$19,000
Enterprise GPU	A40	48	$19,000
Enterprise GPU	H100 80GB PCIe	80	$25,000
Enterprise GPU	H100 80GB SXM5	80	$30,000
Enterprise GPU	H200 141GB SXM	141	$35,000
Enterprise GPU-server	H100 NVL (2×H100 PCIe)	188	$65,000
Enterprise GPU-server	HGX H100 (8×H100 SXM5)	640	$350,000
Enterprise GPU-server	DGX B200 (8×B200)	1440	$400,000
Enterprise GPU-server	GB200 NVL72 (72×GB200)	13500	$3,000,000

Overall Investment Analysis

Analyze the cost of running a specific AI model by buying or renting hardware.

1. Define your AI model

Parameters (Billions)

Precision

2. Results & Cost Estimates

Estimated VRAM requirement

3.3 GB

Estimated Purchase Cost

$110

Estimated Rental Cost (per hour)

$0.017 / timme

Break-Even Point

It becomes more cost-effective to BUY the hardware instead of renting it 24/7 after about 276 days (0.76 years).

3. Analysis: Budget → Performance

Total Budget ($)

Time period (Days)

Option: Buy

Estimated VRAM for your budget

48.2 GB

This corresponds to a model size of:

Precision	Parameters (Billions)
INT4 (0.5 byte)	103.6B
INT8 (1 byte)	51.8B
FP16/BF16 (2 byte)	25.9B
FP32 (4 byte)	12.9B

Option: Rent 24/7

Estimated VRAM for your budget

40.9 GB

This corresponds to a model size of:

Precision	Parameters (Billions)
INT4 (0.5 byte)	87.8B
INT8 (1 byte)	43.9B
FP16/BF16 (2 byte)	22.0B
FP32 (4 byte)	11.0B

The Golden Analysis: Profitability Maps

These heatmaps provide an overview of how ''investment budget'' and ''daily token volume'' interact to affect the break-even time. Use the controls to define your analysis area and identify the most profitable zones for your specific situation.

Min Budget ($)

Max Budget ($)

Min Tokens/day

Max Tokens/day