DeepSeek V3
Active · 84.0 · Frontier
Open-weight frontier model competitive with GPT-4o and Claude Sonnet at a fraction of the training cost.
Intelligence Index
84.0 / 100
Frontier · weighted across 5 benchmarks
- Math: 97.1
- General knowledge: 82.2
- Coding: 72.7
Computed as the mean of per-category averages across MMLU, GPQA, SWE-bench, HumanEval, MATH, GSM8K, AIME, Aider Polyglot and more. See each benchmark for methodology.
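The aggregation described above can be sketched in a few lines. This is a minimal illustration, assuming equal weight per benchmark within a category and equal weight per category. The dictionary layout and the function name are hypothetical, and the General knowledge figure is taken directly from the category score on this page because its underlying benchmark scores are not listed here.

```python
from statistics import mean

# Benchmark scores listed on this page, grouped by category.
# The grouping structure and names are illustrative, not the site's actual code.
benchmark_scores = {
    "Math": {"GSM8K": 97.1},
    "Coding": {"Aider Polyglot": 55.1, "HumanEval": 90.2},
}

# Category average taken as published; its per-benchmark inputs are not shown here.
published_category_scores = {"General knowledge": 82.2}


def intelligence_index(benchmarks, published):
    """Mean of per-category averages (assumes equal weights throughout)."""
    category_avgs = {cat: mean(scores.values()) for cat, scores in benchmarks.items()}
    category_avgs.update(published)
    return mean(category_avgs.values()), category_avgs


index, per_category = intelligence_index(benchmark_scores, published_category_scores)
print(f"{index:.1f}")
```

Under these assumptions the result rounds to 84.0, which agrees with the published index.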
Overview
DeepSeek V3 is a 671B mixture-of-experts model (37B active parameters) trained for a reported $6M in compute — dramatically less than comparable frontier models. It matches GPT-4o and Claude Sonnet on most standard benchmarks.
History
DeepSeek V3 was released on 2024-12-26.
Training & availability
Weights are publicly available under the DeepSeek License (permissive), making this an open-weight model suitable for on-prem deployment and fine-tuning.
Capabilities
- Context window: 128K tokens.
- Max output: 8K tokens.
- Input modalities: text.
- Intelligence Index: 84.0/100.
- Strongest categories: Math (97), General knowledge (82), Coding (73).
- Recommended for: math, agentic, frontier, open-source, code.
Limitations
- Text-only — cannot process images, audio, or video inputs.
Pricing
- Input: $0.2700 per 1M tokens
- Output: $1.1000 per 1M tokens
Use the cost calculator above to estimate monthly spend for your workload.
Example interactions
Curated prompts showing the model's response style — not cherry-picked to impress, picked to show what typical output looks like.
Quick start
Minimal example using the OpenRouter API. Copy, paste, replace the key.
```python
from openai import OpenAI

# Point the OpenAI client at OpenRouter's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="sk-or-...",  # replace with your OpenRouter key
)

resp = client.chat.completions.create(
    model="deepseek/deepseek-v3",  # model slug as shown on this page
    messages=[{"role": "user", "content": "Explain quantum computing in one sentence."}],
)
print(resp.choices[0].message.content)
```
Cost calculator
Estimate your monthly bill. Presets are typical workload sizes.
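As a rough stand-in for the calculator, the estimate can be sketched directly from the per-token prices listed under Pricing. The function name and the example workload are hypothetical, and the sketch assumes a uniform workload over a 30-day month.

```python
# Per-token prices from the Pricing section of this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.27
OUTPUT_PRICE_PER_M = 1.10


def monthly_cost(requests_per_day, input_tokens, output_tokens, days=30):
    """Estimate monthly spend in USD for a uniform workload.

    Assumes every request uses the same token counts and a 30-day month;
    both are simplifications for illustration.
    """
    total_in = requests_per_day * input_tokens * days
    total_out = requests_per_day * output_tokens * days
    return (total_in * INPUT_PRICE_PER_M + total_out * OUTPUT_PRICE_PER_M) / 1_000_000


# Hypothetical workload: 1,000 requests/day, 2,000 input and 500 output tokens each.
print(f"${monthly_cost(1_000, 2_000, 500):.2f}")
```

For this hypothetical workload the estimate comes to $32.70 per month: $16.20 for input and $16.50 for output.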
Benchmarks
| Benchmark | Category | Score | Source |
|---|---|---|---|
| Aider Polyglot | Coding | 55.1% pass@2 | Third-party (Papers With Code) |
| GSM8K | Math | 97.1% accuracy | Self-reported (DeepSeek tech report) |
| HumanEval | Coding | 90.2% pass@1 | Self-reported (DeepSeek tech report) |
Integrations & tooling support
- Tool calling: Supported
- Structured outputs: Supported
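A tool-calling request might look like the sketch below. It only builds the request arguments in the OpenAI Chat Completions `tools` format, which the client from the Quick start accepts. The `get_weather` tool and the `build_request` helper are hypothetical, and whether a given provider honors tool calls for this model should be verified before relying on it.

```python
import json

# Hypothetical tool definition in the OpenAI Chat Completions "tools" format.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",  # illustrative name, not a real service
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_request(prompt, model="deepseek/deepseek-v3"):
    """Return keyword arguments for client.chat.completions.create(...)."""
    return {
        "model": model,  # slug as shown in the Quick start on this page
        "messages": [{"role": "user", "content": prompt}],
        "tools": [weather_tool],
    }


request = build_request("What's the weather in Paris?")
print(json.dumps(request, indent=2))
# To send it: resp = client.chat.completions.create(**request)
# Any tool calls would then appear in resp.choices[0].message.tool_calls.
```

Keeping the request as a plain dictionary makes it easy to inspect or log before it is sent.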
Price vs quality
Competent capability at a low price.
- Quality percentile: 65.7%
- Effective price: $0.892/1M
- Pricing breakdown: $0.27/1M in, $1.10/1M out