Grok 4 Fast
ActiveHigh-speed variant of Grok 4.
History
Grok 4 Fast became available via the xAI API on 2025-09-19.
Training & availability
Training data has a knowledge cutoff of 2025-09-30 — information about events after that date is unlikely to appear in the model's responses. xAI has not released the underlying model weights — access is via their hosted API only.
Capabilities
- Input modalities: text.
Recommended for: agentic.
Limitations
- Text-only — cannot process images, audio, or video inputs.
Quick start
Minimal example using the OpenRouter API. Copy, paste, replace the key.
from openai import OpenAI
client = OpenAI(
base_url="https://openrouter.ai/api/v1",
api_key="sk-or-...",
)
resp = client.chat.completions.create(
model="xai/grok-4-fast",
messages=[{"role": "user", "content": "Explain quantum computing in one sentence."}],
)
print(resp.choices[0].message.content)Cost calculator
Estimate your monthly bill. Presets are typical workload sizes.
Providers & performance
1 providerMulti-provider inference routes for this model — sorted by throughput. Latency is time-to-first-token; throughput is output tokens per second. Data from OpenRouter, measured over the last 30 minutes.
| Provider | Throughput | Latency (TTFT) | Input $ / 1M | Output $ / 1M | Context | Quant | Supports |
|---|---|---|---|---|---|---|---|
| xAI | 33tok/s | 504ms | $0.2 | $0.5 | 2.0M | — | tools · json |
Integrations & tooling support
- Tool calling
- Supported
- Structured outputs
- Not supported
Price vs quality
Priced low — good for high-volume tasks. Quality tier pending more benchmark coverage.
- Quality percentile
- —
- Effective price
- $0.425/1M
- Pricing breakdown
- $0.2/1M in
$0.5/1M out
Community ratings
Rate Grok 4 Fast
Sign in to rate and review.
Comments
Sign in to leave a comment.