Documentation Index
Fetch the complete documentation index at: https://docs.apiyi.com/llms.txt
Use this file to discover all available pages before exploring further.
Highlights
- Version-less alias: Renamed from
gpt-5.3-chat-latestto the cleanerchat-latest— no more version-number coupling - Always current: Auto-tracks ChatGPT’s active Instant model snapshot (currently GPT-5.5 Instant)
- Big context: 400K tokens input, 128K tokens max output, knowledge cutoff 2025-08-31
- Multimodal input: Text + image input, text-only output (no audio/video)
- Official pricing: Input $5, output $30, cached input $0.50 per 1M tokens — identical to OpenAI’s rates
- Recharge promo: APIYI’s stackable recharge bonus brings effective cost down to 79-86% of OpenAI’s list
Background
In early May 2026, OpenAI rolled out a generational swap for ChatGPT’s default Instant model — GPT-5.5 Instant replaced the previous-gen Instant model as the new default across every ChatGPT tier (Free through Enterprise). At the same time, OpenAI renamed the “always latest” API alias on the API side: from the versionedgpt-5.x-chat-latest family to the version-less chat-latest.
The design intent behind the rename is clear: keep the alias stable, let the pointer roll. The old naming (gpt-5.2-chat-latest, gpt-5.3-chat-latest) misled developers into thinking the model snapshot was bound to the version number — but in reality OpenAI had been quietly rolling the underlying snapshot all along; the version was just a label. The new chat-latest makes that semantic explicit: it is the API mirror of ChatGPT’s default model, and it ships when ChatGPT ships.
OpenAI notified developers on 2026/5/8 (UTC+8): gpt-5.2-chat-latest and gpt-5.3-chat-latest are deprecated and will be removed from the API. New integrations should use chat-latest.
Sources: OpenAI official API docs
developers.openai.com/api/docs/models/chat-latest, OpenAI Changelog deprecation notice 2026/5/8, TechCrunch report on GPT-5.5 Instant launch 2026/5/5. Data retrieved 2026/5/21 (UTC+8).Detailed Breakdown
Key Features
Version-less alias
chat-latest always points to ChatGPT’s current default Instant model — no need to manually track version numbers.Rolling auto-update
OpenAI rolls the underlying snapshot without an announcement — once integrated, you don’t need to watch upgrade timing.
400K context
400K tokens input + 128K tokens max output — comfortable for long documents, long conversations, long RAG contexts.
Multimodal input
Native image input (documents, screenshots, charts); text-only output. Audio and video modalities are not supported.
Currently Points to: GPT-5.5 Instant
As of 2026/5/21 (UTC+8),chat-latest points to GPT-5.5 Instant. Improvements over the previous-gen Instant model include:
- ~50%+ fewer hallucinations on factual QA tasks
- More concise replies: default behavior leans “short and precise,” cuts redundant scaffolding
- Better instruction following on complex multi-step instructions
- AIME 2025: 81.2 (vs 65.4 previous generation)
- Multimodal reasoning: measurable gains on visual data interpretation and clinical accuracy
- Cross-conversation memory: can reference past conversations, files, and Gmail (full experience in ChatGPT; API uses call-time context)
Technical Specs
| Spec | Value |
|---|---|
| Model ID | chat-latest |
| Currently points to | GPT-5.5 Instant |
| Input context | 400,000 tokens |
| Max output | 128,000 tokens |
| Knowledge cutoff | 2025-08-31 |
| Input modalities | Text, image |
| Output modality | Text |
| Streaming | Supported |
| Function calling | Supported |
| Structured Outputs | Supported |
| Fine-tuning | Not supported |
| Predicted Outputs | Not supported |
chat-latest can also enable built-in tools: Web Search, File Search, Image Generation, Code Interpreter, MCP Tools.
Practical Use
Recommended Scenarios
ChatGPT experience parity
Want API output style that matches the ChatGPT web/desktop experience your end users are already familiar with.
Long-context conversational products
400K input window suits multi-turn long dialogues, long RAG, document QA; multimodal input handles screenshot/chart understanding.
Low-maintenance integration
Integrate once, benefit long-term — no code changes when OpenAI rolls the underlying Instant model.
Production API alternative
OpenAI officially recommends
gpt-5.5 for production, but chat-latest is the answer if you want to “roll with ChatGPT.”Code Example
Best Practices
- Production stability first → use pinned snapshots: If your business is highly sensitive to model behavior and you don’t want auto-upgrades, use
gpt-5.5or a specific snapshot;chat-latestis for the “roll with ChatGPT” scenario. - Cached input saves money:
chat-latest’s cached input is $0.50/M tokens — 1/10 of regular input. Long system prompts and long RAG contexts should actively leverage caching. - Multimodal cost: Image input is tiled and converted to tokens for billing — complex images can consume more than you’d expect. Estimate with OpenAI’s Tokenizer before going to production.
- Don’t assume fine-tuning:
chat-latestdoes not support fine-tuning — pick another snapshot if you need it.
Pricing & Availability
Side-by-side Pricing
| Item | OpenAI list | APIYI list |
|---|---|---|
| Input | $5.00 / 1M tokens | $5.00 / 1M tokens |
| Cached input | $0.50 / 1M tokens | $0.50 / 1M tokens |
| Output | $30.00 / 1M tokens | $30.00 / 1M tokens |
chat-latest switches in sync.
Effective Price (with Recharge Promo)
APIYI runs an always-on recharge bonus promo: the more you top up, the higher the bonus percentage, credited directly to your spendable balance. Stacking the bonus:- Top up $100 → ~86% of list (effective input ~$4.30, output ~$25.80)
- Top up $300+ → as low as 79% of list (depending on the tier; see Recharge Promotions FAQ)
Available Groups
| Group | Open | Notes |
|---|---|---|
Default | ✅ | Direct calls work on the default group |
SVIP | ✅ | High-priority queue, no extra multiplier |
Enterprise | ✅ | Enterprise group, suited for high-concurrency production |
Summary & Recommendation
chat-latest is not a brand-new model — it’s OpenAI’s naming correction for the “always latest” semantic. For API integrators, it solves three things:
- Clear naming: version numbers no longer mislead — the alias literally means “same as ChatGPT”
- Stable integration: integrate once, benefit long-term — no code changes when OpenAI rolls the Instant model
- Official semantics: currently points to GPT-5.5 Instant, will auto-follow future ChatGPT default upgrades
chat-latest is the most direct official answer today — APIYI’s official-proxy channel runs at list price, and stackable recharge bonuses bring it down further.
Sources: OpenAI official API docs
developers.openai.com/api/docs/models/chat-latest, OpenAI Changelog deprecation notice (2026/5/8), TechCrunch (2026/5/5), OpenRouter chat-latest model page openrouter.ai/openai/gpt-chat-latest. Data retrieved 2026/5/21 (UTC+8).