API Yi now supports GPT-5.3-chat-latest via official direct connection, compatible with OpenAI format, plug and play.
Key Advantages
Massive Context
400K token context window, 3x the previous generation, easily handles ultra-long documents
Lower Hallucination
26.8% reduction in hallucination rate (with web search), 19.7% (knowledge-only)
Natural Conversation
Fewer unnecessary refusals and preachy responses, smoother and more natural dialogue
Official Direct Connection
OpenAI official API transparent forwarding, same quality and stability as official
Model Information
| Parameter | Value |
|---|---|
| Model Name | gpt-5.3-chat-latest |
| Context Window | 128,000 tokens |
| Max Output | 16,384 tokens |
| Knowledge Cutoff | August 31, 2025 |
| Input Format | Text + Image |
| Output Format | Text |
| Streaming | ✅ Supported |
| Function Calling | ✅ Supported |
| Structured Output | ✅ Supported |
Pricing
| Item | Price (per million tokens) |
|---|---|
| Input | $1.75 |
| Cached Input | $0.175 |
| Output | $14.00 |
Pricing matches OpenAI official rates. Top-up bonuses available for better value. See Pricing for details.
How to Use
Endpoint
Basic Call
Streaming Output
Multimodal Input (Vision)
GPT-5.3-chat-latest supports image input for understanding and analyzing visual content:Function Calling
Comparison with Other GPT Models
| Feature | GPT-5.3 Chat | GPT-5.2 | GPT-5 |
|---|---|---|---|
| Context Window | 128K | 128K | 400K |
| Max Output | 16K | 16K | 128K |
| Input Price | $1.75/M | $1.75/M | $1.25/M |
| Output Price | $14.00/M | $14.00/M | $10.00/M |
| Hallucination Improvement | ✅ 26.8% reduction | - | - |
| Conversation Naturalness | ✅ Significantly improved | Good | Good |
| Image Input | ✅ | ✅ | ✅ |
| Function Calling | ✅ | ✅ | ✅ |
FAQ
What's the difference between GPT-5.3-chat-latest and GPT-5.2?
What's the difference between GPT-5.3-chat-latest and GPT-5.2?
GPT-5.3 Chat is the latest Instant model used in ChatGPT. Compared to GPT-5.2, it reduces hallucination rates by 26.8% (with web search), features more natural conversation style, and eliminates unnecessary refusals and preachy responses.
What scenarios is this model best for?
What scenarios is this model best for?
GPT-5.3 Chat is ideal for everyday conversations, customer service, content creation, and Q&A scenarios requiring natural and fluent dialogue. For coding tasks, consider using the GPT-5.3-Codex model.
Does it support image input?
Does it support image input?
Yes, GPT-5.3-chat-latest supports text and image input for understanding and analyzing visual content. However, audio and video inputs are not supported.
What is the maximum output length?
What is the maximum output length?
The maximum output is 16,384 tokens. If output is truncated, check the
finish_reason field and adjust the max_tokens parameter accordingly.How to reduce costs with cached input?
How to reduce costs with cached input?
When your requests contain large repeated prefix content (such as system prompts), OpenAI automatically caches these. Cached input is billed at $0.175/M tokens, just 1/10 of the regular input price.
Is API Yi's GPT-5.3 Chat officially connected?
Is API Yi's GPT-5.3 Chat officially connected?
Yes, API Yi uses OpenAI’s official API transparent forwarding, ensuring the same service quality and stability as the official API.