Grok 4.20 Non-Reasoning

Grok 4.20 Non-Reasoning is xAI's non-reasoning model in the Grok 4.20 beta generation, optimized for speed and direct responses with low hallucination rates and strict prompt adherence.

Tool UseImplicit CachingVision (Image)File InputWeb Search

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'xai/grok-4.20-non-reasoning',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Latency Uptime Status Similar FAQ

Playground

Try out Grok 4.20 Non-Reasoning by xAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Grok 4.20 Non-Reasoning

Ask Grok 4.20 Non-Reasoning anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

xAI

0.4s

$1.25/M

$2.50/M

Read:

$0.2/M

Write:

—

$5/K

+ input costs

—

03/10/2026

Google Vertex AI

0.4s

$2.00/M

$6.00/M

Read:$0.2/M

Write:—

—

03/10/2026

More models by xAI

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date

xai/grok-build-0.1

256K

0.4s

169tps

$1.00/M

$2.00/M

Read:

$0.2/M

Write:

—

$5/K

+ input costs

—

05/20/2026

xai/grok-4.3

1.0s

152tps

$1.25/M

$2.50/M

Read:

$0.2/M

Write:

—

$5/K

+ input costs

—

04/30/2026

xai/grok-4.20-reasoning

0.5s

233tps

$1.25/M

$2.50/M

Read:

$0.2/M

Write:

—

$5/K

+ input costs

—

03/10/2026

xai/grok-4.20-multi-agent

1.3s

688tps

$1.25/M

$2.50/M

Read:

$0.2/M

Write:

—

$5/K

+ input costs

—

03/10/2026

xai/grok-4.1-fast-reasoning

0.7s

176tps

$0.20/M

$0.50/M

Read:$0.05/M

Write:—

—

11/19/2025

xai/grok-4.1-fast-non-reasoning

0.3s

120tps

$0.20/M

$0.50/M

Read:$0.05/M

Write:—

—

11/19/2025

About Grok 4.20 Non-Reasoning

Grok 4.20 Non-Reasoning was released March 10, 2026 as part of xAI's Grok 4.20 beta generation. It's optimized for speed and direct responses, producing answers without chain-of-thought reasoning overhead. The model features low hallucination rates and strict prompt adherence, making it suitable for production workloads that need precise, reliable output.

As a non-reasoning variant, Grok 4.20 Non-Reasoning skips intermediate reasoning traces and delivers answers directly. This reduces latency and output token cost per request, which is particularly valuable in high-throughput applications and agentic tool-calling loops where per-step speed compounds into overall workflow efficiency.

This model is currently in beta.

What To Consider When Choosing a Provider

Configuration: Grok 4.20 Non-Reasoning is in beta. Expect potential changes to behavior, pricing, or availability before general availability.
Configuration: This variant produces direct answers. If you need the model to reason through complex problems step by step, use the Grok 4.20 Reasoning variant instead.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Grok 4.20 Non-Reasoning

Best For

High-throughput production APIs: Direct, precise answers at low latency serve end users best
Agentic tool-calling workflows: That benefit from fast decision-making with low hallucination rates
Classification and routing pipelines: That need reliable, prompt-adherent output for downstream processing
Chat and conversational interfaces: Low-hallucination, prompt-adherent responses arrive quickly without chain-of-thought overhead
Content generation tasks: Where strict prompt adherence matters more than deep reasoning

Consider Alternatives When

Complex analytical tasks: Requiring multi-step reasoning. Use the Grok 4.20 Reasoning variant
Multi-agent orchestration: The Grok 4.20 Multi-Agent variant is purpose-built for agent collaboration
Stable production deployments: Beta models introduce unwanted risk. Use Grok 4.1 Fast Non-Reasoning instead
Maximum cost efficiency on simple tasks: Grok 3 Mini Fast offers lower per-token costs

Conclusion

Grok 4.20 Non-Reasoning brings Grok 4.20 generation capabilities to speed-focused workloads. It pairs direct responses with xAI's reported low hallucination rates and strict prompt adherence. Remember it's beta when you plan production deployments.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Grok 4.20 Non-Reasoning

Playground

Providers

More models by xAI

About Grok 4.20 Non-Reasoning

What To Consider When Choosing a Provider

When to Use Grok 4.20 Non-Reasoning

Best For

Consider Alternatives When

Conclusion