Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview advances software engineering and agentic workflows with targeted quality improvements for finance and spreadsheet applications, plus more efficient thinking that reduces token consumption while maintaining performance.

File InputTool UseReasoningVision (Image)Web Searchtiered-costImplicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3.1-pro-preview',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

Playground

Try out Gemini 3.1 Pro Preview by Google. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Gemini 3.1 Pro Preview

Ask Gemini 3.1 Pro Preview anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

Google

87.0s

112tps

$2.00/M

$12.00/M

Read:

$0.2/M

Write:

—

$14.00/K

+ input costs

—

02/19/2026

Google Vertex AI

4.3s

105tps

$2.00/M

$12.00/M

Read:

$0.2/M

Write:

—

$14.00/K

+ input costs

—

02/19/2026

More models by Google

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date

google/gemini-3.5-flash

2.1s

191tps

$1.50/M

$9.00/M

Read:$0.15/M

Write:—

$14.00/K

+ input costs

—

05/19/2026

google/gemma-4-26b-a4b-it

262K

0.6s

85tps

$0.13/M

$0.40/M

Read:$0.01/M

Write:—

—

04/02/2026

google/gemini-3.1-flash-lite-preview

0.5s

232tps

$0.25/M

$1.50/M

Read:$0.03/M

Write:—

$14.00/K

+ input costs

—

03/03/2026

google/gemini-3-flash

0.7s

160tps

$0.50/M

$3.00/M

Read:

$0.05/M

Write:

—

$14.00/K

+ input costs

—

12/17/2025

google/gemini-2.5-flash-lite

0.3s

271tps

$0.10/M

$0.40/M

Read:$0.01/M

Write:—

$35.00/K

+ input costs

—

06/17/2025

google/gemini-2.5-flash

0.4s

198tps

$0.30/M

$2.50/M

Read:$0.03/M

Write:—

$35.00/K

+ input costs

—

03/20/2025

About Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is the updated flagship in the Gemini 3.1 family, building directly on Gemini 3 Pro with quality improvements targeted at real-world production domains. Software engineering and agentic workflows are the primary areas of improvement, alongside enhanced usability for finance and spreadsheet applications.

The model introduces more efficient thinking. At the pro tier, input tokens are a significant cost factor in long agentic sessions. Better reasoning efficiency means the model works through complex tasks without generating proportionally more tokens, directly reducing per-task operating cost compared to Gemini 3 Pro.

For teams building developer tooling, code review automation, or financial analysis pipelines, Gemini 3.1 Pro Preview is designed for the level of structured reasoning those applications require.

What To Consider When Choosing a Provider

Configuration: This model supports the medium thinking level via thinking_level in providerOptions.google, giving you control over the cost, performance, and speed tradeoff.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Gemini 3.1 Pro Preview

Best For

Automated code review: Security vulnerability analysis and test suite generation
Agentic software engineering: Tasks requiring planning across multiple code changes
Financial and spreadsheet analysis: Modeling, formula generation, and structured numerical work
Pro-tier reasoning tasks: The efficient thinking improvement reduces per-task token cost
Long agentic sessions: Accumulated token consumption from reasoning steps is a cost concern

Consider Alternatives When

Initial Gemini 3 Pro baseline: You need capabilities without the 3.1 updates (consider google/gemini-3-pro-preview)
Speed and cost at volume: Your primary need is efficiency at high throughput (consider google/gemini-3-flash or google/gemini-3.1-flash-lite-preview)
Image generation alongside reasoning: Your task requires native image output (consider google/gemini-3-pro-image)
Flash-tier reasoning is enough: Pro-tier capability is not required (consider google/gemini-3-flash)

Conclusion

Gemini 3.1 Pro Preview addresses the most common reasons teams hit limits with Gemini 3 Pro: token-heavy reasoning sessions in agentic workflows, and the need for more reliable performance in finance and structured data applications. The efficiency improvements in thinking make extended pro-tier reasoning more economically viable for sustained production deployments.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Gemini 3.1 Pro Preview

Playground

Providers

More models by Google

About Gemini 3.1 Pro Preview

What To Consider When Choosing a Provider

When to Use Gemini 3.1 Pro Preview

Best For

Consider Alternatives When

Conclusion