Website Announcements - API易文档中心

Welcome to the APIYI changelog page. Here you can find our latest model launches, pricing adjustments, feature updates and other important information. We are committed to providing you with the highest quality and most cost-effective AI services.

Subscribe to Updates: Follow our announcements to get model launches and promotional information first, don’t miss any important updates!

🔥 Latest Updates

Recent important updates:

7/25 Claude Opus 5 Launch · Fable 5-Class Intelligence at Half the Price Anthropic’s next-gen Opus released 7/24! Scores 43.3% on the internal Frontier Bench v0.1, ahead of Fable 5’s 33.7%, with a 1M context window (default and maximum), 128K output, thinking on by default and five effort levels. APIYI has launched claude-opus-5 / claude-opus-5-thinking at $5/$25 per 1M tokens — identical to Opus 4.8, effectively flagship-class intelligence at half price, even cheaper with recharge bonuses! 📖 Read details
7/22 Gemini 3.6 Flash / 3.5 Flash-Lite Dual Launch · Official Price Parity Google’s two July stable text models are live! 30+ test cases per model across both endpoints — Search grounding, code execution, URL context and other native tools all verified working, with 1M context and full multimodal input. 3.6 Flash at $1.50/$7.50 thinks by default (four controllable tiers); Lite at $0.30/$2.50 ships zero-thinking with ~2s responses. Down to ~17% off with recharge bonuses. 📖 Read details | 🔗 3.6 Flash docs | 🔗 Lite docs
7/21 Seed 2.1 Turbo Launch · Both Endpoints Verified 15/15 ByteDance’s high-frequency production text model dola-seed-2-1-turbo-260628 is live! $0.50 input / $2.50 output per 1M tokens (officially positioned at half the Pro price), with Chat Completions and Responses both fully verified — thinking toggle/tiers plus implicit + explicit two-layer caching all measured. Note that deep thinking is ON by default; disable it explicitly for high-frequency short Q&A to save cost and latency. 📖 Read details | 🔗 Onboarding docs
7/21 HappyHorse Video Series Now on Sale · 98% of Official Price Alibaba’s video series flagship upgraded to 1.1, four models in one release: happyhorse-1.1-t2v / happyhorse-1.1-i2v / happyhorse-1.1-r2v (up to 9 reference images) / happyhorse-1.0-video-edit. Shares the Wan&HappyHorse group and endpoint with Wan — just switch the model name; pricing is built into the system, the 0.14x rate equals 98% of official, down to ~81% with recharge bonuses. 📖 Read details | 🔗 Onboarding docs
7/19 Seedream 5.0 Pro Launch · $0.12 per Request ByteDance’s Seedream pro tier seedream-5-0-pro-260628 is live! Strongest image quality and instruction following in the family, with interactive editing (coordinates / selection boxes / arrows), up to 10 reference images for fusion, and png output. Fixed $0.12 per request (1 image each), ~2 minutes per image, 1K/2K plus exact pixel sizes; batch sequence and streaming are not supported (passing those params returns 400). For everyday work, 5.0-lite ($0.035/image) remains the recommendation. 🔗 Model comparison & onboarding
7/18 Kimi K3 Launch · Official Price Parity Moonshot AI’s 2.8-trillion-parameter open-source flagship released 7/16 — the largest open-source model ever! Kimi Delta Attention + 1M context with flat pricing across the whole range, #1 on LMArena Frontend Code Arena, GPQA Diamond 93.5%, Terminal-Bench 2.1 88.3%, native image/video input. APIYI has launched kimi-k3 at $3/$15 per 1M tokens — identical to official rates (cache-hit input as low as $0.30), an easy ~9% off via recharge bonus (recharge $100, get 10%), up to ~17% off at higher tiers! 📖 Read details
7/12 Seedance 2.0 mini Launch · About Half the Standard Price ByteDance’s SD2 lite model doubao-seedance-2-0-mini-260615 is live — SD2 now runs a three-model lineup (standard / fast / mini)! 720p/5s nominal ≈ ¥3.16 (official reference ¥2.50; about on par with official after recharge bonuses), faster generation (measured ~1.5-2.5 min for 5 s @720p), capped at 720p, all other capabilities matching the family — same SeeDance2 group, just switch the model name. 📖 Read details | 🔗 Three-model price comparison
7/10 GPT-5.6 Series Launch · Official Relay + CodexReverse Dual Groups OpenAI’s next-gen three-tier family released 7/9! Flagship gpt-5.6-sol (Terminal-Bench 2.1 88.8%), balanced gpt-5.6-terra (GPT-5.5-class at half the price), and lightweight gpt-5.6-luna are all live. Priced identical to official rates — $5/$30, $2.5/$15, $1/$6 per 1M tokens; the default official-relay group gets up to a 20% recharge bonus (~17% off effective), and the CodexReverse group carries a default 0.7x discount! 📖 Read details
7/10 Grok 4.5 Launch · Official Price Parity xAI’s (SpaceXAI) new flagship — Musk calls it “an Opus-class model, but faster, more token-efficient and lower cost”! Co-trained with Cursor, it scores 54 on the Artificial Analysis Intelligence Index (rank #4) and tops agentic tool use, with Terminal Bench 2.1 at 83.3% and ~4.2x the output-token efficiency of Opus 4.8. APIYI has launched grok-4.5 at $2/$6 per 1M tokens — identical to official rates, open on default / svip groups, even cheaper with recharge bonuses stacked! 📖 Read details
7/1 Nano Banana 2 Lite Launch · ~40% of Official on Metered Google’s Lite of Nano Banana 2, gemini-3.1-flash-lite-image (released 6/30)! ~4s per image, ~2.7x faster than Nano Banana 2, focused on 1K canvas + 14 aspect ratios for high-concurrency low-cost work. Live on APIYI: metered ~40% of official ($0.10 input / $12 output per 1M tokens), per-call $0.025. Just launched — price is provisional and may change; we’ll announce! 📖 Read details
7/1 Claude Sonnet 5 Launch · Priced Same as Official Anthropic’s newest Sonnet! Billed as “the most agentic Sonnet model yet,” with performance close to flagship Opus 4.8 — SWE-bench Verified 85.2% (up 5.6pp from Sonnet 4.6’s 79.6%), Terminal-Bench 2.1 80.4%. APIYI has launched claude-sonnet-5 at exactly the official price (intro $2/$10, then $3/$15 per 1M tokens after Aug 31), on high-cache-hit pure AWS Claude official-relay resources! 📖 Read details
6/18 GLM-5.2 Launch · Alibaba Cloud Official Channel Zhipu Z.AI’s new flagship! 1M context + 744B MoE, 51 on the Artificial Analysis Intelligence Index (top open-weight), 81.0 on Terminal-Bench 2.1 and 62.1 on SWE-bench Pro — leading open-weight long-horizon coding. Live via Alibaba Cloud’s official authorized channel at official-aligned pricing: $1.142/$3.997 per 1M tokens (converted from 8/28 RMB at our fixed 1:7 rate), ~85% of official pricing with recharge bonuses stacked! 📖 Read details
6/10 Claude Fable 5 Launch · 30-Day Data Retention Compliance Anthropic’s new Mythos-class flagship! Near-universal SOTA across software engineering and vision, with long async execution and self-updating skills. APIYI has launched claude-fable-5 at official pricing ($10/$50 per 1M tokens). Important: because of these advanced models’ capabilities, Fable 5 and Mythos 5 retain inputs/outputs for 30 days for serious-abuse detection — accessed only by automated safety systems by default, with Anthropic human review only when systems flag potential harm. 📖 Read details
6/10 Seedance 2.0 Launch · Pricing Pegged to Official ByteDance’s new-generation video model! Served on official Volcengine Mainland China resources, with standard doubao-seedance-2-0 + fast doubao-seedance-2-0-fast (tiered pricing — fast is cheaper) — supports text-to-video / first+last frame / first frame / multi-modal reference-to-video (0-9 images + 0-3 videos + 0-3 audios), synchronized audio by default, every ratio in a tier at the same price. Token-based, tier-by-tier pegged to the official site: general discount price ~1.1× official, landing close to official after recharge bonuses. Companion visual testing tool icover.ai/zh. 📖 Overview | 🔗 API Reference
6/5 MiniMax-M3 Launch · Limited-Time 50% Off The first open-weight flagship combining frontier agentic coding, a 1M-token context, and native multimodality! SWE-Bench Pro 59.0 beats GPT-5.5 and Gemini 3.1 Pro; BrowseComp 83.5 tops Opus 4.7. APIYI matches the official limited-time 50% off: $0.30/$1.20 per 1M tokens (0-512K tier), down to ~41% of list with recharge bonuses, ending June 8, 00:00 (UTC+8)! 📖 Read details
5/29 Nano Banana stable names (no -preview) Google updated its official docs and released stable model names gemini-3.1-flash-image (Nano Banana 2) and gemini-3-pro-image (Nano Banana Pro). APIYI already supports them. The original -preview names still work, pricing unchanged — both old and new names run fine, no code changes needed. 📖 Nano Banana 2 | 🔗 Nano Banana Pro
5/28 Claude Opus 4.8 Launch Anthropic’s flagship is here! Agentic coding rises from 64.3% to 69.2%, it’s 4x less likely to miss code flaws, more honest with lower deception, and adds dynamic workflows (hundreds of parallel subagents) plus effort control. APIYI has launched claude-opus-4-8 at the same price as Opus 4.7 ($5/$25 per 1M tokens) — an upgrade at no extra cost! 📖 Read details
5/26 claude-jupiter-v1-p Launch Claude Opus 4.8 preview! Faithfully relayed straight from the official channel, priced identically to Opus 4.7 ($5/$25 per 1M tokens), and usable directly as an Opus model. Free to test in the preview phase, but it may be unstable — not recommended for production. 📖 Read details
5/21 Qwen3.7-Max Launch Alibaba’s Qwen team releases its next-gen flagship! Artificial Analysis Intelligence Index 56.6 (5th globally, #1 Chinese, ahead of Gemini 3.5 Flash 55.3), Terminal-Bench 2.0 69.7, 1M context + 35-hour autonomous agent runs. Listed at $1.7140/$5.1420 per 1M tokens (fixed 1:7 conversion from Alibaba’s ¥12/¥36) — identical to official rates, stackable recharge promo brings effective cost to 79-86% of list!
5/21 chat-latest Launch OpenAI introduces the version-less alias chat-latest (replacing gpt-5.3-chat-latest et al.), auto-tracking ChatGPT’s current default Instant model (currently GPT-5.5 Instant). 400K context + multimodal input. Listed at $5/$30 per 1M tokens — matches OpenAI exactly, stackable recharge bonus brings effective cost to 79-86% of list!
5/21 VEO 3.1 Official Launch AIStudio’s official VEO 3.1 models now available via official proxy! Two models: veo-3.1-fast-generate-preview $0.3/req and veo-3.1-generate-preview $1.2/req, per-request billing, flexible 4/6/8 sec durations, 720p / 1080p / 4k resolution tiers, native synchronized audio. Default group + pay-per-request just works — existing Tokens drop in without config changes; coexists with the Reverse channel. 📖 Read details | 🔗 Official Overview
5/20 Gemini 3.5 Flash Launch Announced at Google I/O 2026! Beats Gemini 3.1 Pro on Terminal-Bench 2.1 (76.2%), MCP Atlas (83.6%), GDPval-AA (1656 Elo), runs ~4× faster and costs ~half. Listed at $1.50/$9.00 per 1M tokens — matches Google’s official rate, up to 20% off via recharge bonus!
5/19 Stripe Online Recharge Launched New Stripe online recharge channel with foreign credit card support and real-time crediting, backed by 3D Secure + strict risk controls. Use a real-name card from your company / yourself / your school — unsafe cards will be rejected, malicious-payment accounts will be banned!
5/18 Referral Rebate 5% Policy + Workflow FAQ Published Default referral rebate is now 5% continuous payout (adjusted from 10% starting 5/15). Grab your invite code / link from the 推广信息 (Referral Info) section in Account Center; rewards are manually transferable to your main balance, and you can filter the logs by 奖励金划转 (Reward Transfer) to see every payout!
5/18 Email Registration Support List FAQ Published Accepts Gmail / Outlook / Hotmail / Foxmail / global university addresses (.edu / .edu.cn); QQ / 126 / 163 not supported; corporate custom-domain emails are off by default and require contacting support for whitelist approval!
5/8 Gemini 3.1 Flash Lite Goes GA Google’s lightweight model is officially GA! Model name moves from gemini-3.1-flash-lite-preview to gemini-3.1-flash-lite, output speed 64% faster than 2.5 Flash with 2.5× faster TTFT, GPQA Diamond 86.9% / MMMU Pro 76.8%. Listed at $0.25/$1.50 per 1M tokens — identical to Google’s rates, stackable top-up promo down to ~79–85% of list!
5/3 Grok 4.3 Launch xAI’s new flagship model! Intelligence Index 53, GDPval-AA ELO 1500 (+321), τ²-Bench 98%, IFBench 81%, 1M context + multimodal + 159 t/s. Listed at $1.25/$2.50 per 1M tokens (input down 37.5%, output down 58.3% vs Grok 4.20), tiered pricing: 0-200K standard, 200K-∞ 2x. Default, SVIP and other common groups all open, ~85% of upstream after 10% recharge bonus!
5/3 GPT-5.5 Pro Launch OpenAI’s strongest reasoning model today! Terminal-Bench 2.0 82.7%, Expert-SWE 73.1%, GDPval 84.9%, 1.05M context + 128K output. Tiered pricing: 0-272K $30/$180, 272K-∞ $60/$270 per 1M tokens. SVIP group only, NOT exposed on the Default group — single calls can cost several dollars, restricted to prevent misuse!
4/28 Qwen3.6 Open-Weight Duo Launch APIYI-hosted, no GPU rental! qwen3.6-27b (27B dense, coding rivals 397B-class) + qwen3.6-35b-a3b (same lineage as closed-source Flash MoE) open-weights now on APIYI: pay-as-you-go Chat flat pricing, 27b $0.42/$2.52, 35b-a3b $0.26/$1.56 per 1M tokens. Skip the GPU rental, deploy, and ops!
4/27 Qwen3.6-Max-Preview / Flash Launch Aliyun official-relay dual launch! Max-Preview tops 6 coding benchmarks (SWE-bench Pro, Terminal-Bench 2.0, …), AIME 2025 93%, GPQA 86%; Flash is a 35B-A3B MoE with 1M multimodal context. Listed at Max $1.28/$7.68 and Flash $0.17/$1.02 per 1M tokens — list-price-matched, ~15% off via recharge promo!
4/25 GPT-5.5 Official Relay Launch OpenAI’s newest frontier model! SWE-bench Verified 88.7%, 60% fewer hallucinations vs. GPT-5.4, reasoning.effort adds the xhigh tier (five levels: none/low/medium/high/xhigh), 1.05M context + 128K output. Pricing matches OpenAI’s official rate: $5 input / $30 output / $0.50 cached input per 1M tokens!
4/25 Kimi K2.6 Launch Moonshot’s open-source flagship! 1T MoE / 32B active, 256K context, SWE-Bench Pro 58.6 beats GPT-5.4 and Claude Opus 4.6. Huawei Cloud official relay at $0.60 in / $2.40 out per 1M tokens — ~60% of the ¥6.5 / ¥27 list price, with Function Call + Prefix Continuation!
4/24 DeepSeek V4 Dual-Model Launch Pro (1.6T/49B active) and Flash (284B/13B active)! 1M context + Hybrid Attention, open-source SOTA on Agentic Coding, SWE-Verified 80.6 rivaling Claude/Gemini, Flash just $0.14/M input, official price matched with ~15% off via recharge!
4/23 gpt-image-2 Launch OpenAI’s flagship image generation model! Native 2K/4K (up to 3840×2160) any-resolution output, auto high-fidelity on reference images, 20-30% cheaper than 1.5, zero-code OpenAI SDK migration!
4/22 gpt-image-2-all Launch GPT image generation reverse-engineered model! Flat $0.03/image per-call, ~30s generation, supports text-to-image, multi-image fusion, natural-language editing, high text-rendering fidelity, native Chinese prompts!
4/20 Nano Banana 2 Token-Based Pricing Update Input $0.14→$0.18/M tokens, output $16.80→$21.6/M tokens, official discount raised to 36%, stackable with 20% deposit bonus!
4/17 Claude Opus 4.7 Launch Anthropic’s latest flagship! 93-task coding benchmark +13% over Opus 4.6, 3× production tasks on Rakuten-SWE-Bench, tool errors cut to 1/3, new xhigh effort tier, same pricing!
4/14 Nano Banana Pro Price Adjustment + Enterprise HA Channel Due to Google resource policy changes, Nano Banana Pro adjusted to $0.09/call, new NanoBananaEnterprise HA fallback channel (1.4x rate)!
4/9 GLM-5.1 Launch Zhipu’s most powerful open-source coding Agent! SWE-Bench Pro 58.4 beats GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro. 744B MoE, 200K context, autonomous 8-hour long-horizon tasks, MIT licensed!
4/6 Qwen3.6-Plus Launch Alibaba’s most powerful coding Agent model! MoE 72B params with only 18B effective compute, Terminal-Bench 61.6 surpassing Claude Opus 4.5, 1M context, always-on CoT!
3/31 Gemini Embedding 2 Preview Launch Google’s first native multimodal embedding model! MTEB English #1 at 68.32, unified text/image/video/audio/PDF vectors, text just $0.20/M tokens!
3/30 Seed 2.0 Pro Launch ByteDance’s flagship reasoning model! AIME 2025 at 98.3, Codeforces 3020, strong Agent capabilities, ~4-6x cheaper than GPT-5.2!
3/24 MiniMax-M2.7 Series Launch Smallest Tier-1 model! 10B params, SWE-bench Pro 56.22%, self-evolving, standard $0.3 / highspeed $0.6 per million input tokens!
3/22 MiMo-V2 Series Launch Xiaomi’s trillion-param models! Pro approaches Opus 4.6 at 1/6th price, Omni supports text/image/video/audio multimodal!
3/20 Grok 4.20 Beta Series Launch xAI’s latest 4 models! Reasoning, multi-agent, base & non-reasoning variants, unified pricing $2/$6 per million tokens!
3/18 GPT-5.4 Mini / Nano Launch OpenAI’s most capable small models! Mini 2x faster approaching flagship, Nano just $0.20/M input tokens, great for OpenClaw!
3/8 Seed 2.0 Lite Launch ByteDance’s cost-efficient enterprise model! Outperforms Seed 1.8, multimodal input (image/video/text), ideal for high-QPS production!
3/6 GPT-5.4 / GPT-5.4 Pro Launch OpenAI’s most powerful models! Native computer use + deep research, official pricing, 10%+ deposit bonus!
3/5 Gemini 3.1 Flash Lite Preview Launch Google’s latest lightweight model! Optimized for agent tasks, 1M context, multimodal input, official direct connection!
3/4 GPT-5.3 Chat Launch OpenAI’s latest chat model! 26.8% fewer hallucinations, more natural conversations, official direct connection!
3/1 Model Name Suffix -c Explained Models with -c suffix are per-call billing versions, identical in capability to non-suffixed versions
3/1 Nano Banana 2 Pricing Adjustment Per-call $0.055/image, new token-based billing (as low as 28% of Google’s pricing), 512px from $0.025/image!
2/27 Nano Banana 2 Google’s latest image generation model! Pro-level quality + Flash-tier speed, 14 aspect ratios, Image Search Grounding!
1/25 Sora 2 Official Channel OpenAI official API transparent proxy is live! 99.99% stability, per-second billing (from $0.1/sec), supports image-to-video!
12/18 Gemini 3 Flash Preview Google’s latest high-performance fast model! SWE-bench 78% surpasses 3 Pro, 3x faster, MMMU-Pro 81.2% beats all competitors!
12/17 GPT Image 1.5 OpenAI’s latest image generation model! 4x faster, precision editing with consistent details, enhanced text rendering!
12/11 GPT-5.2 Series OpenAI strikes back at Gemini 3 with three variants! ARC-AGI-1 reaches 90%, first model to cross this threshold, 390× cost reduction!
12/04 SeeDream 4.5 BytePlus Volcano Engine’s most powerful 4K image generation model! 1.2B parameters, enhanced text rendering, supports 10 reference images, $0.035/image!
11/25 Claude Opus 4.5 Anthropic’s latest flagship model! SWE-bench 80.9% surpasses all competitors, price reduced to 1/3 of predecessor, supports Claude Code!
11/20 Nano Banana Pro (Gemini 3 Pro Image Preview) Google’s latest image generation model! 4K high-res support, industry-best text rendering, powerful local editing!
11/20 Gemini 3 Pro Preview Google’s latest multimodal AI model! LMArena 1501 Elo #1, 1M context, chain-of-thought support!
11/14 GPT-5.1 Series OpenAI’s latest iteration! Balance of intelligence and speed, SWE-bench 76.3%, 24-hour caching!

📅 June 2026

🌟 GLM-5.2 Launch: Zhipu’s 1M-Context Flagship Coding Model

June 18, 2026 Zhipu Z.AI’s new flagship glm-5.2 is live on APIYI via Alibaba Cloud’s official authorized channel (official relay partnership). The context window jumps from 200K to 1M tokens, on a 744B MoE architecture. It scores 51 on the Artificial Analysis Intelligence Index (top open-weight), 81.0 on Terminal-Bench 2.1 and 62.1 on SWE-bench Pro, leading open-weight long-horizon coding. Pricing aligns to Alibaba Cloud’s official rates (8/28 RMB per million input/output tokens), converted to $1.142/$3.997 per million tokens at our fixed 1:7 rate — easily ~85% of official pricing with recharge bonuses stacked. 📖 Read details

🌟 Claude Fable 5 Launch: Mythos-Class Power, 30-Day Data Retention Compliance

June 10, 2026 Anthropic’s new Mythos-class flagship claude-fable-5 is live on APIYI! Near-universal SOTA across software engineering and vision, with long async execution, self-updating skills, and self-built harnesses, at official pricing ($10/$50 per 1M tokens). Important compliance note: because of these advanced models’ capabilities, Bedrock and Anthropic enforce additional checks for high-risk misuse — Fable 5 and Mythos 5 retain inputs/outputs for 30 days for serious-abuse detection. By default only automated safety systems access it; Anthropic human review happens only when systems flag potential harm, and only to the extent needed; traffic from multiple accounts flagged for the same prohibited activity may be reviewed together. Note: retention happens on AWS Bedrock and Anthropic’s official side — APIYI itself retains no data (pure transparent proxy); only Fable 5 and Mythos 5 are affected, other Claude models are not. 📖 Read details

🌟 Seedance 2.0 Launch: ByteDance’s New Video Model, Pricing Pegged to Official

June 10, 2026 ByteDance Seedance 2.0 is live on APIYI, served on official Volcengine Mainland China resources (with upstream content safety built in)! Standard doubao-seedance-2-0 and fast doubao-seedance-2-0-fast both available (tiered pricing — fast is cheaper than standard), supporting text-to-video / first+last frame / first frame / multi-modal reference-to-video (0-9 reference images + 0-3 reference videos + 0-3 reference audios), with audio synchronized to the visuals by default, every ratio in a tier at the same price, and 4-15 s controllable duration. Token-based, tier-by-tier pegged to the official site: the general discount price is ~1.1× official, landing close to official after recharge bonuses (near parity for large-deposit customers). A companion visual testing tool icover.ai/zh lets you quickly validate results. 📖 Seedance 2.0 Overview | 🔗 API Reference

🌟 MiniMax-M3 Launch: 1M-Context Open Flagship at 50％ Off

June 5, 2026 MiniMax’s new open-weight flagship MiniMax-M3 is live on APIYI! SWE-Bench Pro 59.0 beats GPT-5.5 and Gemini 3.1 Pro, with MSA sparse attention powering a 1M-token context plus native multimodality. APIYI matches the official limited-time 50% discount: $0.30 input / $1.20 output per 1M tokens (0-512K tier, tiered billing), down to ~41% of list with recharge bonuses. Offer ends June 8, 00:00 (UTC+8); subsequent pricing TBD. 📖 Read details

📅 May 2026

🔄 Nano Banana Stable Model Names (no `-preview`)

May 29, 2026 Google updated its official docs and released stable model names gemini-3.1-flash-image (Nano Banana 2) and gemini-3-pro-image (Nano Banana Pro). APIYI already supports them. The original -preview names still work as usual at unchanged pricing, no code changes needed — both old and new names run fine. Google hasn’t clarified whether the stable version differs from the preview in output quality or safety behavior, so feel free to test and share feedback. 📖 Nano Banana 2 Docs | 🔗 Nano Banana Pro Docs

🌟 Claude Opus 4.8 Launches: Stronger Coding, More Honest, Same Price

May 28, 2026 Anthropic’s flagship Claude Opus 4.8 is officially here! Agentic coding rises from 64.3% to 69.2%, it’s about 4x less likely to overlook code flaws than its predecessor, flags uncertainty more readily with a lower deception rate, and adds dynamic workflows (hundreds of parallel subagents) plus effort control. APIYI has launched claude-opus-4-8 at the same price as Opus 4.7 ($5/$25 per 1M tokens) — effectively an upgrade at no extra cost. 📖 Read details | 🔗 Recharge Promotions

🌟 claude-jupiter-v1-p Launches: Claude Opus 4.8 Preview

May 26, 2026 APIYI launches claude-jupiter-v1-p — the preview build of Claude Opus 4.8, faithfully relayed straight from the official channel with inputs/outputs matching the official model. Priced identically to claude-opus-4-7 ($5 input / $25 output per 1M tokens) and usable directly as an Opus model — existing Opus workflows just swap the model name to try it. The preview may hit rate limits / timeouts and other instability, so it’s not recommended for production — keep critical workloads on claude-opus-4-7. 📖 Read details

🌟 Qwen3.7-Max Launches: #1 Chinese Model, Top-5 Globally on Intelligence Index

May 21, 2026 Alibaba’s Qwen team releases its next-gen flagship! Artificial Analysis Intelligence Index 56.6 (5th globally, top Chinese model — ahead of Gemini 3.5 Flash 55.3), Terminal-Bench 2.0 69.7, 1M context + 35-hour autonomous agent run (1,158 tool calls, 10× kernel speedup). APIYI’s official-proxy channel is listed at $1.7140/$5.1420 per 1M tokens (fixed 1:7 conversion from Alibaba’s ¥12/¥36) — identical to official rates, stackable recharge bonus brings effective cost to 79-86% of list. 📖 Read details | 🔗 Recharge Promo

🌟 chat-latest Launches: Always Tracks ChatGPT’s Default Instant Model

May 21, 2026 OpenAI introduces the version-less alias chat-latest, replacing gpt-5.2-chat-latest / gpt-5.3-chat-latest (both deprecated 5/8), auto-tracking ChatGPT’s current default Instant model (currently GPT-5.5 Instant). 400K context + 128K output + multimodal input, ~50%+ fewer hallucinations vs the previous generation. APIYI’s official-proxy channel is listed at $5/$30 per 1M tokens ($0.50 cached) — identical to OpenAI’s rates, stackable recharge bonus brings effective cost to 79-86% of list. 📖 Read details | 🔗 Recharge Promo

🌟 Gemini 3.5 Flash Launch: Flash Beats Gemini 3.1 Pro

May 20, 2026 Google announced Gemini 3.5 Flash at Google I/O 2026! Beats Gemini 3.1 Pro on Terminal-Bench 2.1 (76.2%), MCP Atlas (83.6%), Finance Agent v2 (57.9%) and GDPval-AA (1656 Elo), with ~289 t/s output (~4× faster than comparable frontier models). 1M context + 64K output + native multimodal. Listed at $1.50/$9.00 per 1M tokens ($0.15 cached) — identical to Google’s official rate, up to 20% off via recharge bonus. 📖 View Details | 🔗 Recharge Promotions

💳 Stripe Online Recharge Launched · Foreign Credit Cards Supported

May 19, 2026 New Stripe online recharge channel with foreign credit card support, real-time crediting, 3D Secure and strict fraud-prevention controls — smoother for overseas customers. Please use a real-name credit card issued to your company / yourself / your school — unsafe cards will be rejected by Stripe risk controls, and accounts attempting malicious payments (stolen cards, chargebacks, fraud testing, etc.) will be banned. Other payment methods (WeChat / Alipay / bank transfer) are unaffected. 📖 View Details | 🔗 Payment Methods

🌟 Referral Rebate Mechanism + Reward Transfer Workflow FAQ Published

May 18, 2026 APIYI has no separate “agent partnership” channel — the referral rebate program is open to all users by default: grab your invite code / invite link from the 推广信息 (Referral Info) section in the Account Center, and earn a continuous 5% credit rebate on every invitee’s recharges (adjusted from 10% to 5% starting 5/15). Rewards need to be manually transferred to your main balance before they can be spent — filter by 奖励金划转 (Reward Transfer) in the logs to see every payout. 📖 View Details | 🔗 Recharge Promotions

🌟 Email Registration Support List FAQ Published

May 18, 2026 Official clarification on which email suffixes are accepted for sign-up: international providers (Gmail / Outlook / Hotmail / Foxmail, etc.) and global university addresses (.edu / .edu.cn) all work; QQ / 126 / 163 are temporarily not supported (due to low deliverability for international senders). Corporate custom-domain emails are off by default and require contacting customer service for whitelist approval before use. 📖 View Details

🌟 Gemini 3.1 Flash Lite Goes GA: The Sweet Spot for Agents & Low Latency

May 8, 2026 Google announced GA on May 8 (UTC+8). The model name drops -preview and becomes gemini-3.1-flash-lite — contract is now production-stable. Output speed is 64% faster than 2.5 Flash with 2.5× faster TTFT; GPQA Diamond 86.9% / MMMU Pro 76.8%. Listed at $0.25/$1.50 per 1M tokens, identical to Google’s rates, stackable top-up promo brings effective price down to ~79–85% of list. 📖 View Details | 🔗 Deposit Promotions

🌟 Grok 4.3 Launch: xAI’s New Flagship, 37.5％ Input / 58.3％ Output Price Cuts

May 3, 2026 xAI’s new flagship grok-4.3 is live via the official direct-relay channel! Intelligence Index 53, GDPval-AA ELO 1500 (+321), τ²-Bench 98%, IFBench 81%, 1M context + multimodal + blazing 159 t/s output. Listed at $1.25/$2.50 per 1M tokens, tiered pricing doubles past 200K. Default, SVIP and other common groups all open — ~85% of upstream after the 10% recharge bonus. 📖 View Details | 🔗 Deposit Promotions

🌟 GPT-5.5 Pro Launch: OpenAI’s Strongest Reasoning Model

May 3, 2026 OpenAI’s flagship reasoning variant gpt-5.5-pro is live via the official direct-relay channel! Terminal-Bench 2.0 82.7%, Expert-SWE 73.1%, GDPval 84.9%, 1.05M context + 128K output. Tiered pricing: 0-272K range $30/$180, 272K-∞ range $60/$270 per 1M tokens. SVIP group only, NOT exposed on the Default group — single calls can cost several dollars, restricted to prevent accidental misuse. 📖 View Details | 🔗 Deposit Promotions

📅 April 2026

🌟 Qwen3.6 Open-Weight Duo Live: APIYI-Hosted, No GPU Rental

April 28, 2026 The Qwen3.6 series expands to 5 models! Two new open-weight releases — qwen3.6-27b (27B dense, coding rivals 397B-class) and qwen3.6-35b-a3b (same lineage as closed-source Flash MoE / 3B active) — are now hosted on APIYI’s official relay. Pay-as-you-go Chat flat pricing: 27b $0.42/$2.52 and 35b-a3b $0.26/$1.56 per 1M tokens. Skip the GPU rental, deploy, and ops. 📖 View Details | 🔗 Series Overview

🌟 Qwen3.6-Max-Preview / Flash Launch: Dual Aliyun Official Relay

April 27, 2026 Alibaba’s qwen3.6-max-preview and qwen3.6-flash are live via Aliyun’s official relay! Max-Preview tops 6 coding benchmarks including SWE-bench Pro and Terminal-Bench 2.0 (AIME 2025 93%, GPQA 86%); Flash is a 35B-A3B MoE with 1M multimodal context. Listed at Max $1.28/$7.68 and Flash $0.17/$1.02 per 1M tokens — matching Alibaba’s official rate, with the recharge promo stacking to ~15% off. 📖 View Details | 🔗 Deposit Promotions

🌟 GPT-5.5 Launch: New xhigh Reasoning Tier

April 25, 2026 OpenAI’s newest frontier model gpt-5.5 is live via the official direct-relay channel! SWE-bench Verified 88.7%, 60% fewer hallucinations vs. GPT-5.4, reasoning.effort adds the xhigh tier (five levels: none/low/medium/high/xhigh), 1.05M context + 128K max output. Pricing matches OpenAI’s official rate: $5 input / $30 output / $0.50 cached input per 1M tokens. 📖 View Details | 🔗 Deposit Promotions

🌟 Kimi K2.6 Launch: Open-Source Agent-Coding Leader

April 25, 2026 Moonshot’s Kimi K2.6 is live via Huawei Cloud official relay! 1T MoE / 32B active, 256K context — SWE-Bench Pro 58.6 beats GPT-5.4 and Claude Opus 4.6, with native Function Call and Prefix Continuation. Listed at $0.60 in / $2.40 out per 1M tokens, about 60% of the ¥6.5 / ¥27 RMB list price. 📖 View Details | 🔗 Deposit Promotions

🌟 DeepSeek V4-Pro / V4-Flash Launch: 1M Context + Open-Source SOTA

April 24, 2026 DeepSeek returns a year later with the V4 preview! Pro (1.6T/49B active) and Flash (284B/13B active), both MoE with 1M context + Hybrid Attention architecture. Agentic Coding open-source SOTA, SWE-Verified 80.6 rivaling Claude/Gemini. Official price matched: Flash just $0.14 in / $0.28 out per 1M tokens, with recharge stacking to ~15% off. 📖 View Details | 🔗 Deposit Promotions

🌟 gpt-image-2 Launch: Native 4K + 30％ Same-Tier Price Cut

April 23, 2026 OpenAI’s flagship image model gpt-image-2 is now live! Native support for any valid resolution (including 2K / 3840×2160 4K), auto high-fidelity on reference image edits, 20-30% lower cost than gpt-image-1.5 at same size and quality. Zero-code OpenAI SDK migration — just point base_url to api.apiyi.com/v1. Ideal for large-asset production and reference-driven character consistency. 📖 View Details | 🔗 Integration Doc

🌟 gpt-image-2-all Launch: $0.03/image GPT Reverse-Engineered Image Model

April 22, 2026 GPT image generation reverse-engineered model is live! Flat $0.03/image per-call, ~30s generation, supports text-to-image, multi-image fusion editing, and natural-language editing. Native Chinese prompt support, high text-rendering fidelity, size controlled via prompt (no size/n/quality parameters). Ideal for large-scale Chinese marketing materials and infographics. 📖 View Details | 🔗 Model Overview

💰 Nano Banana 2 Token-Based Pricing Update

April 20, 2026 Due to rising official API key costs and high-speed CDN bandwidth costs for image download/transfer, we have carefully considered a modest adjustment to Nano Banana 2 token-based pricing: input $0.14→$0.18/M tokens, output $16.80→$21.6/M tokens, with official discount raised to 36%. The increase is modest given the low base price. Stackable with deposit bonus up to 20%, ideal for daily usage. 🔗 Deposit Promotions

🌟 Claude Opus 4.7 Launch: Free Coding & Agent Upgrade

April 17, 2026 Anthropic’s new flagship Claude Opus 4.7 is now live! +13% over Opus 4.6 on a 93-task coding benchmark, 3× more production tasks on Rakuten-SWE-Bench, +14% on multi-step workflows with tool errors cut to 1/3, and a new xhigh effort tier — all at the same $5/$25 per 1M tokens. claude-opus-4-7-thinking variant available for deep reasoning. 📖 View Details | 🔗 Deposit Promotions

💰 Nano Banana Pro Price Adjustment & New Enterprise HA Channel

April 14, 2026 Due to Google’s resource policy changes, Nano Banana Pro (gemini-3-pro-image-preview) pricing adjusted from $0.05 to $0.09/call. New NanoBananaEnterprise enterprise HA channel (1.4x rate) added as stable fallback to ensure uninterrupted supply. Deposit $100+ for 10% bonus to further reduce costs. 📖 Best practice: Default channel + NanoBananaEnterprise fallback | 🔗 Deposit Promotions

🌟 GLM-5.1 Launch: Zhipu’s Most Powerful Open-Source Coding Agent

April 9, 2026 Zhipu Z.AI’s flagship open-source model is now available! SWE-Bench Pro 58.4 beats GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro — first open-source model to top this benchmark. 744B MoE / 256 experts / 8 active, 200K context, autonomous 8-hour long-horizon coding tasks, MIT licensed. 📖 Learn more

🌟 Qwen3.6-Plus Launch: Alibaba’s Most Powerful Coding Agent Model

April 6, 2026 The first model in Alibaba’s Qwen3.6 series is now available! MoE architecture with 72B params (only 18B effective compute), Terminal-Bench 61.6 surpassing Claude Opus 4.5, SWE-bench 78.8, 1M token context + always-on chain-of-thought reasoning — China’s top coding Agent model. 📖 Learn more

📅 March 2026

🌟 Gemini Embedding 2 Preview Now Available

March 31, 2026 Google’s first natively multimodal embedding model is live! Unified vector space for text/image/video/audio/PDF, MTEB English #1 at 68.32, 3072-dim MRL flexible truncation, text at just $0.20/M tokens. 📖 Learn more

🌟 Seed 2.0 Pro Flagship Model Now Available

March 30, 2026 ByteDance’s Seed 2.0 Pro flagship reasoning model is now live! AIME 2025 at 98.3, Codeforces 3020 (no tools), SWE-Bench 76.5%, powerful Agent workflow capabilities. Input ~$0.47/M tokens — approximately 4-6x cheaper than GPT-5.2. 📖 Learn more

🌟 MiniMax-M2.7 / M2.7-highspeed Now Available

March 24, 2026 MiniMax’s self-evolving agent model M2.7 series is now live! Just 10B active parameters achieving Tier-1 performance — SWE-bench Pro 56.22%, Artificial Analysis Intelligence Index 50. Standard: $0.30 input / $1.20 output, highspeed: $0.60 input / $2.40 output per million tokens. Both variants produce identical output quality. 📖 Learn more

🌟 MiMo-V2-Pro / MiMo-V2-Omni Now Available

March 22, 2026 Xiaomi’s MiMo-V2 series is live! MiMo-V2-Pro: trillion-parameter reasoning model, AA Intelligence Index 49 (#8 globally), ClawEval 61.5 approaching Opus 4.6, input just $1/M tokens (~1/6th of competitors). MiMo-V2-Omni: unified multimodal model supporting text/image/video/audio, input just $0.40/M tokens. 📖 Learn more

🌟 Grok 4.20 Beta Series — 4 New Models Available

March 20, 2026 xAI’s latest Grok 4.20 Beta series with all 4 models now live! Including grok-4.20-beta-0309-reasoning, grok-4.20-multi-agent-beta-0309, grok-4.20-beta, and grok-4.20-beta-0309-non-reasoning. Unified pricing at $2 input / $6 output per million tokens.

📖 Learn more

🌟 GPT-5.4 Mini / GPT-5.4 Nano Now Available

March 18, 2026 OpenAI’s lightweight, cost-effective GPT-5.4 Mini and GPT-5.4 Nano are now live! Mini runs 2x faster than GPT-5 Mini with 54.4% SWE-Bench Pro approaching flagship performance; Nano at just $0.20/M input tokens for ultimate cost efficiency. Official pricing with ~10% deposit bonus, also great for OpenClaw and similar use cases. 📖 Read More | 🔗 API Docs

🌟 ByteDance Seed 2.0 Lite Now Available

March 8, 2026 ByteDance’s cost-efficient enterprise model is live! Outperforms Seed 1.8 with multimodal input (image/video/text), long-context understanding, and reliable structured outputs. Ideal for high-QPS production workloads — call via OpenAI-compatible mode. 📖 Read More

🌟 GPT-5.4 / GPT-5.4 Pro Now Available

March 6, 2026 OpenAI’s most powerful flagship models GPT-5.4 and GPT-5.4 Pro are now live! Native support for computer use, deep research, and advanced tool calling. Priced same as official rates (GPT-5.4: $2.50/$15, Pro: $30/$180 per million tokens), with 10%+ bonus on deposits starting from $100. 📖 Read More | 🔗 API Docs

🌟 Gemini 3.1 Flash Lite Preview Now Available

March 5, 2026 Google’s latest lightweight model Gemini 3.1 Flash Lite Preview is now live! Optimized for high-throughput agent tasks and low-latency scenarios, with 1M+ context window, multimodal input (text/images/video/audio/PDF), input pricing at just $0.25/million tokens. Official direct connection, pricing matches Google’s rates. 📖 Read More | 🔗 API Docs

🌟 GPT-5.3 Chat Now Available

March 4, 2026 OpenAI’s latest GPT-5.3 Instant chat model is now live! 26.8% fewer hallucinations (with web search), more natural conversation style, fewer unnecessary refusals and preachy responses. Same price as GPT-5.2 with better performance, official direct connection. 📖 Read More | 🔗 API Docs

🏷️ Model Name Suffix -c Explained

March 1, 2026 Models with the -c suffix (e.g., gemini-3-pro-image-preview-c) are per-call billing versions. They are identical in capability to their non-suffixed counterparts, differing only in billing method. We recommend using “Token-first” tokens for automatic adaptation. 📖 Learn More | 🔗 Token Billing Modes

📅 February 2026

💰 Nano Banana 2 Pricing Adjustment

March 1, 2026 Nano Banana 2 per-call pricing adjusted to $0.055/image. New token-based billing available (as low as 28% of Google’s pricing), 512px images from just $0.025/image. Flexible pay-per-resolution pricing. 🔗 API Docs

🌟 Nano Banana 2 Launch

February 27, 2026 Google’s latest image generation model Nano Banana 2 (gemini-3.1-flash-image-preview) is now live! Pro-level quality at Flash-tier speed. Supports 4K output, 14 aspect ratios, Image Search Grounding, and more exclusive features. 📖 Read More | 🔗 API Docs

📅 January 2025

🌟 Sora 2 Official API Channel Launched

January 25, 2025 Sora 2 Official channel is now live! OpenAI official API transparent proxy with 99.99% stability, per-second billing (from $0.1/sec), supporting 4/8/12 second durations and image-to-video. A reliable alternative to the reverse-engineered channel for users who prioritize quality and stability. 🔗 API Docs

📅 December 2025

⚡ Gemini 3 Flash Preview Launch

December 18, 2025 Google’s latest high-performance fast model! SWE-bench Verified 78% surpasses Gemini 3 Pro, 3x faster than 2.5 Pro, MMMU-Pro 81.2% beats all competitors. API.YI launches 3 model variants (auto/forced/no reasoning), official pricing ($0.5/$3.0 per million tokens), additional recharge discounts available. 📖 Read More | 🔗 API Docs

🎨 OpenAI GPT Image 1.5 Launch

December 17, 2025 OpenAI’s latest image generation model GPT Image 1.5 is now live! 4x faster generation, precision editing with consistent details, enhanced text rendering. Official pricing (low $0.01/image, standard $0.04/image, HD $0.17/image), additional discounts available through recharge promotions. 📖 Read More | 🔗 API Docs

🚀 GPT-5.2 Series Launch

December 11, 2025 OpenAI releases Instant, Thinking, and Pro variants to compete! GPT-5.2 Pro achieves 90% on ARC-AGI-1 reasoning test, first model to cross this threshold with 390× cost reduction compared to last year. Beats or ties industry professionals 70.9% of the time on GDPval tasks, 400k context window, 128k tokens single output. 📖 Read More | 🔗 API Docs

🎨 SeeDream 4.5 Launch

December 4, 2025 BytePlus Volcano Engine’s latest image generation model SeeDream 4.5 is officially released! 1.2B parameter architecture, enhanced 4K image generation capability, significantly improved text rendering, supports up to 10 reference images. Priced at just $0.035/image, official provides 200 free images for testing. 📖 Read More | 🔗 API Docs

📅 November 2025

🌟 Gemini 3 Pro Preview Launch

November 20, 2025 Google’s latest multimodal AI model is now live! Ranked #1 globally on LMArena with 1501 Elo, 76.2% on SWE-bench Verified, supports 1M context and chain-of-thought output. Up to 20% discount with recharge bonuses. 📖 Read More

🍌 Nano Banana Pro Launch

November 20, 2025 Google’s latest image generation model Nano Banana Pro (Gemini 3 Pro Image) is officially released! Support for 4K high-res output, industry-best text rendering, powerful local editing features. Migration from old version only requires changing model name. Full 4K HD version priced at only $0.05/image, official 4K price is $0.24/image, approximately 1/5 of the official price~! 📖 Read More | 🔗 API Docs

📅 October 2025

🎥 Sora 2 Async API Officially Launched

October 22, 2025

Sora 2 Async Calling Mode - Flexible Video Generation Workflow

🚀 New Features:
- Async API endpoint: /v1/videos
- 3-step workflow: Submit request → Poll status → Download video
- Supports batch processing and background tasks
💰 Pricing Maintained:
- Same price as sync API
- Standard version: $0.15/call (10 seconds)
- 15-second version: $0.25/call
- Video duration: 10 or 15 seconds optional
🌟 Core Features:
- Independent Task Management: Query status anytime after getting video_id
- Flexible Polling Strategy: Customize polling intervals to avoid unnecessary requests
- Reliable Video Download: Independent download after completion, supports resume
- Batch Processing Support: Submit up to 50 concurrent tasks simultaneously
🛠️ Complete Workflow:
1. Submit Request: POST /v1/videos, returns video_id
2. Poll Status: GET /v1/videos/{video_id}, check progress
3. Download Video: GET /v1/videos/{video_id}/content, save MP4
📊 Typical Timeline:
- Request submission: Instant response (< 1 second)
- Video generation: Usually 3-5 minutes
- Video download: 10-30 seconds (depends on network speed)
🎯 Recommended Scenarios:
- Batch video generation tasks
- Background scheduled tasks
- Scenarios requiring fine-grained control
- Applications with long wait times
📖 Complete Documentation:
- English docs: Sora 2 Async API
- Includes Python, JavaScript, Bash and other multi-language code examples

Sora 2 Async API complements the sync streaming API. Sync API is suitable for real-time progress feedback, while async API is suitable for batch processing. Both modes have the same pricing, users can choose flexibly based on specific scenarios. See documentation for detailed comparison.

✨ Gemini 2.5 Flash Latest Dated Version Launch

October 12, 2025

Gemini 2.5 Flash Preview 09-2025 - Google’s Latest Version

🚀 New Model: gemini-2.5-flash-preview-09-2025
📅 Version Notes:
- Latest dated version of Gemini 2.5 Flash (September 2025 release)
- Google’s official latest iteration of Gemini 2.5 Flash series
- Includes latest performance optimizations and feature improvements
🌟 Core Features:
- Continues the powerful capabilities of Gemini 2.5 Flash series
- Supports ultra-long context processing
- Multimodal understanding (text, images, video)
- Google’s latest technical optimizations and performance enhancements
💰 Pricing Advantage:
- APIYI offers highly competitive pricing
- Even better with top-up bonus promotions
- Official latest version with stronger performance
🛠️ Calling Method:
- Use standard chat completion endpoint: /v1/chat/completions
- Fully compatible with OpenAI API format
- Simply replace model name to use

gemini-2.5-flash-preview-09-2025 is Google’s officially released latest dated version of Gemini 2.5 Flash. Recommend using the latest version for best performance and experience. More info: Google Official Docs

🍌 Gemini 2.5 Flash Image Official Release

October 3, 2025

Gemini 2.5 Flash Image - Google Image Generation Official Release

🚀 Model Update:
- New model name: gemini-2.5-flash-image
- Old model name: gemini-2.5-flash-image-preview (still available)
💰 Pricing Maintained:
- APIYI pricing: $0.02/image (official $0.039)
- Save about 36% compared to official
- Continues cost-effectiveness advantage
🌟 Core Upgrades:
- 10 Aspect Ratio Support: From cinematic landscape to social media portrait
- Custom Resolution: Specify output resolution
- Production Ready: Upgraded from preview to official release
- Image-only Output: Supports pure image output mode
🎨 Powerful Capabilities:
- Seamlessly blend multiple images
- Maintain character consistency, enrich storytelling
- Natural language precise editing
- Combined with Gemini’s rich world knowledge
🛠️ Calling Method:
- Recommended new model name: gemini-2.5-flash-image
- Old model name gemini-2.5-flash-image-preview still available
- Other calling methods remain unchanged
- Fully compatible with existing code

New version supports 10 aspect ratios and custom resolution features. Recommend using gemini-2.5-flash-image for best experience. Old model name still available.

🎬 Sora 2 Video Generation API Launch

October 1, 2025 Evening

Sora 2 - OpenAI’s Revolutionary Video Generation Model

🚀 New Models:
- sora_video2 - Portrait video (704×1280)
- sora_video2-landscape - Landscape video (1280×704)
- sora-2-pro - Pro HD version (1024×1792)
💰 Great Pricing:
- Standard version: $0.15/call (10 second video)
- Pro version: $1.0/call (supports 15 seconds, HD)
- With top-up bonuses, from ~¥0.8/call
🎥 Core Features:
- Audio-Video Sync: Industry’s first audio-video synchronized generation
- Physical Reality Enhancement: Significantly improved video realism
- Watermark-free Output: Generated videos have no watermark (official has watermarks)
- Long Video Support: Up to 16 seconds of coherent narrative
🛠️ Generation Methods:
- Text-to-Video: Generate videos from pure text descriptions
- Image-to-Video: Supports 1 image reference for generation
- Supports URL and Base64 image upload
- Complete streaming output progress feedback
🌟 Technical Highlights:
- Endpoint: /v1/chat/completions
- Standard OpenAI API format
- Supports streaming output to view progress
- No invitation code needed, use immediately

Sora 2 is OpenAI’s revolutionary video generation model released on October 1, positioned as the “GPT-3.5 moment” for video generation. APIYI integrated official API first, priced from only $0.15/call, no invitation code required. Detailed docs: Sora 2 User Guide

💻 GLM-4.6 Strong Release

October 1, 2025

GLM-4.6 - Zhipu AI Code and Reasoning Enhanced Edition

🚀 New Model: glm-4.6
💰 Pricing Advantage:
- Input price: $0.50 / 1M tokens
- Output price: $1.75 / 1M tokens
- Extremely cost-effective, only 1/7 of Claude’s price
🧠 Core Upgrades:
- 200K Ultra-long Context: Expanded from 128K to 200K tokens
- Code Capability Enhancement: Excellent performance in Claude Code, Cline, Roo Code IDEs
- Advanced Reasoning: Significantly improved reasoning with tool use during reasoning
- Agent Enhancement: Stronger performance in tool calling and search-based agents
🌟 Performance Highlights:
- Programming: 95.0% accuracy, ranked 95th percentile
- Math: 96.0% accuracy, ranked 98th percentile
- Close to Claude Sonnet 4 (48.6% win rate)
- Significantly better than open-source baseline models
⚡ Writing Optimization:
- More human-preferred style and readability
- More natural role-playing scenarios
- Better visual effects for frontend page generation

GLM-4.6 is the latest version released by Zhipu AI (now rebranded as Z.AI) on September 30, with significant improvements in code generation, reasoning capabilities, and agent framework integration compared to GLM-4.5.

📅 September 2025

🚀 Claude Sonnet 4.5 Spectacular Launch

September 30, 2025

Claude Sonnet 4.5 - Anthropic’s Strongest Coding Model

🚀 New Models:
- claude-sonnet-4-5-20250929 - Standard version
- claude-sonnet-4-5-20250929-thinking - Reasoning mode
💰 Pricing Advantage:
- Completely aligned with Anthropic official pricing
- Enjoy top-up bonus promotions for $100+ recharges
- With bonus discounts, about 20% off official pricing
- Pricing: $3 / $15 per 1M tokens (input/output)
🏆 World-class Coding Capability:
- SWE-bench Verified 77.2%: World’s strongest coding model
- OSWorld 61.4%: Leading in real computer task testing
- Supports Claude Code, excellent development experience
- Top choice for building complex agents
🧠 Core Upgrades:
- Autonomous 30-hour Operation: Massive improvement over Opus 4’s 7 hours
- Significantly Enhanced Reasoning and Math
- Supports sustained focus on complex multi-step tasks
- Comprehensive safety training upgrade
🌟 Real-world Applications:
- Devin planning performance improved 18%
- End-to-end evaluation score improved 12%
- Ideal for building autonomous software development systems
- Enterprise-grade complex business process automation
🛠️ Calling Methods:
- Standard version: Regular conversation and coding tasks
- Thinking mode: Complex problems requiring deep reasoning
- Fully compatible with Anthropic API format

Claude Sonnet 4.5 is Anthropic’s strongest AI model released on September 29, hailed as the “world’s best coding model”, capable of autonomously running for up to 30 hours handling complex tasks. APIYI same price as official, enjoy 20% off with top-ups.

🧠 DeepSeek-V3.2-exp Early Launch

September 29, 2025

DeepSeek-V3.2-exp - DeepSeek’s Latest Experimental Version

🚀 New Model: deepseek-v3.2-exp
💰 Pricing Advantage:
- Input price: $0.26 / 1M tokens (official $0.28)
- Output price: $0.39 / 1M tokens (official $0.42)
- Save about 7% compared to official
- Better pricing in CNY (official ¥2/¥3)
🌟 Core Features:
- Official direct connection version, stable and reliable
- Launched first to meet customer needs
- DeepSeek’s latest experimental capabilities
- Continuous iteration and optimization
🛠️ Calling Method:
- Use standard Chat endpoint: /v1/chat/completions
- Compatible with OpenAI API format
- Pay-as-you-go, simple and transparent

DeepSeek-V3.2-exp is DeepSeek’s latest experimental version released today. APIYI integrated the official direct connection version first, providing service at prices slightly lower than official.

🤖 GLM-4.5 Series Models Launch

September 26, 2025

GLM-4.5 - Zhipu AI High-Performance Language Model

🚀 New Models:
- glm-4.5 - Standard version
- glm-4.5-air - Lightweight version
- glm-4.5v - Multimodal version
💰 Pricing Advantage:
- All models priced lower than Zhipu official
- Better cost-effectiveness while ensuring stability
- Pay-as-you-go billing
🌟 Core Features:
- Comprehensive coverage of different performance scenarios
- Excellent Chinese understanding and generation
- Stable and reliable API service
- Compatible with standard OpenAI API format

GLM-4.5 series is Zhipu AI’s high-performance language model. APIYI offers pricing lower than official, continuously providing better choices in model coverage and cost-effectiveness.

🌐 Grok Web Search Models Official Launch

September 25, 2025

Grok-4-all / Grok-3-all - Smart Models with Built-in Web Search

🚀 New Models:
- grok-4-all - Grok-4 web search enhanced version
- grok-3-all - Grok-3 web search enhanced version
💰 Unified Pricing:
- Input price: $1.50 / 1M tokens
- Output price: $7.50 / 1M tokens
- Pay-as-you-go, simple and transparent
🌟 Core Features:
- Native Web Search: No tool calling needed, model has built-in web capability
- Real-time Information: Direct access to latest web data
- Stable and Reliable: Reverse-engineered but runs stably
- Plug and Play: No additional Web Search tool configuration needed
🛠️ Calling Method:
- Use standard Chat endpoint: /v1/chat/completions
- Compatible with OpenAI API format
- Model automatically handles web requests, no manual tool calling
⚡ Use Cases:
- Q&A scenarios requiring real-time information
- News and information applications
- Market dynamics analysis
- Fact-checking and verification
- Any tasks requiring latest web data

Grok-4-all and Grok-3-all are models with native web search capabilities. More convenient than traditional tool calling approaches, suitable for applications requiring frequent web information access.

⚡ Grok-4-fast Series Spectacular Launch

September 24, 2025

Grok-4-fast - xAI Ultra-long Context Reasoning Model

🚀 New Models:
- grok-4-fast-reasoning - Reasoning mode, shows complete thought process
- grok-4-fast-non-reasoning - Non-reasoning mode, quick response
- grok-4-fast-reasoning-latest - Latest reasoning mode version
- grok-4-fast-latest - Latest standard version
💰 Ultra-low Pricing:
- Input price: $0.20 / 1M tokens
- Output price: $0.50 / 1M tokens
- 93%+ price reduction compared to Grok-4 series
- Most cost-effective ultra-long context model ever
🧠 Core Features:
- 200K Ultra-long Context: Supports up to 200K tokens context window
- Dual-mode Architecture: Unified model weights, switch reasoning mode via system prompt
- Efficient Reasoning: Saves 40% thinking tokens on average compared to Grok-4
- SOTA Cost Efficiency: Industry-leading cost-effectiveness
🌟 Technical Highlights:
- End-to-end tool use reinforcement learning training
- Supports Web Search and X (Twitter) search capabilities
- Built-in code execution capability
- Seamless switching between reasoning and non-reasoning modes
🛠️ Calling Method:
- Use standard Chat endpoint: /v1/chat/completions
- Compatible with OpenAI API format
- Pay-as-you-go, simple and transparent

Grok-4-fast is xAI’s cost-effective long-context model that maintains Grok-4-level performance while dramatically reducing costs. Perfect for applications requiring large context processing.

💻 Grok Code Fast 1 Code-Specialized Model Launch

September 24, 2025

Grok Code Fast 1 - xAI Code Generation and Agent Programming Model

🚀 New Model: grok-code-fast-1
💰 Ultimate Cost-effectiveness:
- Input price: $0.20 / 1M tokens
- Output price: $1.50 / 1M tokens
- Cache price: $0.02 / 1M tokens
- Pricing optimized for coding scenarios
🧠 Core Features:
- 256K Ultra-long Context: Sufficient for large codebases
- SWE-Bench 70.8%: Outstanding performance on SWE-Bench Verified full set
- High-speed Generation: ~92 tokens/sec throughput
- MoE Architecture: ~314B parameter mixture-of-experts model
🌟 Technical Highlights:
- New architecture designed specifically for code tasks
- Multiple programming languages: TypeScript, Python, Java, Rust, C++, Go
- Optimized tool integration: grep, terminal operations, file editing, etc.
- Smart caching: 90%+ cache hit rate in partner workflows
🛠️ Recommended Scenarios:
- Building projects from scratch
- Codebase Q&A
- Bug fixing and code refactoring
- Agentic Coding
- IDE integration development
⚡ Visible Reasoning Process:
- Response includes reasoning traces
- Developers can guide Grok Code for high-quality workflows
- Enhances code generation controllability and transparency

Grok Code Fast 1 is xAI’s high-performance model built specifically for programming scenarios. Already integrated into mainstream IDEs and programming tools like GitHub Copilot, Cursor, Cline, and Windsurf.

💻 OpenAI Codex Series Models Launch

September 16, 2025

GPT-5 Codex - Code Generation Models Optimized for Programming

🚀 New Models:
- gpt-5-codex-high - High-performance version, benchmarked against GPT-5
- gpt-5-codex-medium - Medium-performance version
- gpt-5-codex-low - Lightweight version
💰 Dual Billing Modes: Pay-as-you-go (suitable for small token conversations):
- High: $1.25/1M input + $10.00/1M output
- Medium: $0.60/1M input + $4.80/1M output
- Low: $0.30/1M input + $2.40/1M output
Per-call Billing (suitable for large context scenarios):
- High: $0.025/call
- Medium: $0.020/call
- Low: $0.015/call
🛠️ Calling Method:
- Use Chat endpoint for calling
- Supports both pay-as-you-go and per-call billing modes
- Flexibly choose billing method based on context length
🌟 Core Features:
- Specifically optimized for programming tasks
- Code generation quality comparable to GPT-5
- Supports multiple programming languages and frameworks
- More affordable pricing, extremely cost-effective

Selection Advice: If your context tokens are large, per-call billing is more cost-effective; for short conversations, pay-as-you-go is more economical.

🎨 SeeDream 4.0 API Official Launch

September 11, 2025

SeeDream 4.0 - BytePlus Volcano Engine High-Quality Image Generation

🚀 New Model: seedream-4-0-250828
🤝 Strategic Partnership: From BytePlus Volcano Engine overseas, APIYI reached official strategic cooperation
💰 Pricing Advantage:
- APIYI pricing: $0.025/image (official $0.03/image)
- Save 35% compared to official
- With 15% top-up bonus: ~¥0.14/image
- Pricing on par with Nano Banana (gemini-2.5-flash-image-preview)
🛠️ Calling Method:
- Use standard OpenAI image generation endpoint: /v1/images/generations
- Fully compatible with OpenAI Image API format
- Per-call billing, simple and transparent
🌟 Core Features:
- High-quality image generation: HD 4K output
- Official partnership resources, reliable and stable
- Excellent visual consistency, ~15s per image generation

📅 August 2025

🎨 Gemini 2.5 Flash Image Generation Model Launch

August 27, 2025

Gemini 2.5 Flash Image Preview - Google’s Strongest Image Generation Model

🚀 New Model: gemini-2.5-flash-image-preview
🏷️ Model Code Name: Nano Banana model
💰 Pricing Advantage:
- APIYI pricing: $0.025/image (official $0.04/image)
- With top-up bonus + exchange rate benefits: ~¥0.14/image
- Save about 50% compared to official (50% off official price)
💸 Competitive Comparison:
- Better advantage than gpt-image-1
- Pricing on par with flux-kontext-pro
- Higher than reverse-engineered sora_image
⚡ Performance Advantages:
- Fast generation speed, average only 10 seconds
- Faster response time than OpenAI series
- Google’s latest and strongest image generation/editing technology
🛠️ Calling Method:
- Use chat completion endpoint, compatible with gpt-4o-image
- Simply replace model name to use
- Perfectly compatible with sora_image calling method

🧠 DeepSeek V3.1 Mixed Reasoning Mode Launch

August 26, 2025

DeepSeek V3.1-250821 - Revolutionary Mixed Reasoning Mode

🚀 New Model: deepseek-v3-1-250821
💰 Pricing Advantage:
- Input price: $0.50 / 1M tokens (official $0.56)
- Output price: $1.50 / 1M tokens (official $1.68)
- Save about 11% compared to official pricing
- Supports dual-mode calling with flexible billing
🧠 Core Features:
- Think Mode: Deep reasoning, shows complete thought process
- Non-Think Mode: Quick response for everyday tasks
- 128K ultra-long context window
- Supports Anthropic API format compatibility
🌟 Technical Highlights:
- 840B token continuous pre-training optimization
- New tokenizer with improved encoding efficiency
- Beta Function Calling tool support
- Enhanced agent and tool usage capabilities
⚡ Mode Switching:
- deepseek-chat - Non-Think quick mode
- deepseek-reasoner - Think reasoning mode
- Official supports “DeepThink” one-click switching

DeepSeek V3.1 is the first large model supporting mixed reasoning mode, allowing users to flexibly choose reasoning depth based on task complexity, balancing efficiency and quality.

🤖 Kimi K2 Official Release Integration

August 25, 2025

Kimi K2-250711 - Official Volcano Engine Partnership Version

🚀 New Model: kimi-k2-250711
🔄 Replaces Model: Replaces kimi-k2-0711-preview preview version
🤝 Official Partnership:
- Official Volcano Engine authorized partnership version
- Direct connection to official interface, significantly improved stability
- Goodbye to preview version instability
💰 Pricing Advantage:
- Continues 15% lower than official pricing
- With top-up bonuses, equivalent to 30% off official
- Official release performance at preview pricing
🌟 Core Features:
- Excellent Chinese understanding and generation
- Strong long-text processing capability
- More stable and reliable API response
- Fully compatible with original calling method

Kimi K2-250711 is the official release version in partnership with Volcano Engine. Compared to preview version, it has significant improvements in stability and performance. Users are recommended to switch promptly.

🚀 GPT-5 Full Series Spectacular Launch

August 8, 2025

GPT-5 Full Series Official Release - OpenAI’s Strongest Model

🎯 Full Series Models:
- gpt-5 - Flagship version
- gpt-5-2025-08-07 - Specific version
- gpt-5-chat-latest - Latest chat version
- gpt-5-mini / gpt-5-mini-2025-08-07 - Lightweight version
- gpt-5-nano / gpt-5-nano-2025-08-07 - Ultra-lightweight version
💰 Pricing Advantage:
- Completely aligned with OpenAI official pricing
- Top-up bonus + exchange rate benefits, about 20% off official
- Input price: $1.25 - $0.05 / 1M tokens
- Output price: $10.00 - $0.40 / 1M tokens
🛠️ Technical Features:
- Supports official /v1/responses endpoint calling
- Temperature parameter temperature must be set to 1 or omitted (official limitation)
- Use max_completion_tokens instead of max_tokens
- gpt-5-chat-latest called via /v1/chat/completions
⚡ Performance Highlights:
- Comprehensively exceeds GPT-4 series reasoning capabilities
- Stronger context understanding and long-text processing
- Significantly improved code generation quality
- Multilingual capabilities reach new heights

Important Note: GPT-5 series models have specific parameter requirements. Please configure according to the above technical features to ensure normal calling.

🎨 Claude Opus 4.1 Performance Upgrade Launch

August 7, 2025

Claude Opus 4.1 - Code Capability Breakthrough

🚀 New Model: claude-opus-4-1-20250805
💰 More for Same Price:
- Pricing completely aligned with Anthropic official
- Better cost-effectiveness with top-up bonuses
- Same price, enjoy stronger performance
🌟 Core Upgrades:
- SWE-bench Verified reaches 74.5%, significantly improved code capability
- Compared to Opus 4, greatly enhanced reasoning and code generation quality
- Maintains 200K ultra-long context window
- Further optimized multilingual understanding and generation
⚡ Performance Highlights:
- Industry-leading code debugging and fixing capabilities
- 15%+ improvement in complex reasoning task accuracy
- API response speed maintains industry-leading level
- Perfectly compatible with all Claude series features

Claude Opus 4.1 is an iterative optimization version based on Opus 4. While maintaining original powerful capabilities, it has undergone specialized optimization for code generation and reasoning tasks.

🎯 OpenAI Open Source Models Official Launch

August 6, 2025

OpenAI’s First Open Source Large Models Spectacular Release

🚀 New Models:
- gpt-oss-120b - 117B parameters, performance comparable to o4-mini
- gpt-oss-20b - 21B parameters, can run on edge devices
💰 Pricing Advantage:
- Significantly lower pricing than DeepSeek R1 and V3
- Extremely cost-effective with top-up bonuses
- Significantly reduced enterprise usage costs
🌟 Core Features:
- 128K ultra-long context window
- Supports low/medium/high three-level reasoning modes
- MoE architecture, extremely efficient inference
- Apache 2.0 open source license
⚡ Performance Highlights:
- gpt-oss-120b runs on single 80GB GPU
- gpt-oss-20b supports 16GB memory device deployment
- Outstanding performance in programming, math, tool calling tasks

This is OpenAI’s first open source model release since GPT-2, marking an important milestone in the AI open ecosystem.

📅 July 2025

⚠️ GPT-4.5-Preview Model Discontinued

July 14, 2025

GPT-4.5-Preview Officially Stops Service

📌 Discontinued model: gpt-4.5-preview
🔄 Recommended replacement: gpt-4.1
📅 Effective date: July 14, 2025
📖 Official announcement: OpenAI Deprecations

Developers please update code promptly, replace gpt-4.5-preview with gpt-4.1 to avoid service interruption.

🚀 Kimi K2 Model Launch

July 15, 2025

Kimi K2 Preview Version Official Launch

📌 Model name: kimi-k2-0711-preview
💰 Pricing advantage: 15% lower than official
🎁 With top-up bonus, equivalent to 30% off official
🌟 Features: Better Chinese understanding and generation capabilities

🤖 Grok-4 Model Launch

July 12, 2025

xAI Grok-4 Official Integration

📌 Calling name: grok-4
🧠 Special feature: Built-in reasoning mode
📖 Recommended to check official docs for detailed usage

💸 Claude Full Series 20％ Price Reduction

July 8, 2025

Claude Models Major Discount

🎉 Full series 20% price reduction
✅ No speed limit, no account ban
💱 Exchange rate advantage + top-up bonus
🚀 Providing more economical AI services for users

📅 June 2025

🌟 Gemini 2.5 Official Release

June 18, 2025

Gemini 2.5 Series Completely Upgraded

📌 Recommended models: gemini-2.5-pro / gemini-2.5-flash
⚡ More stable performance, recommended to replace preview or exp versions
🎯 2M ultra-long context support

🎊 O3 Model Major 80％ Price Reduction

June 13, 2025

O3 Model Biggest Price Drop Ever

💥 Price reduction: 80%
💰 New pricing:
- Input: $3/M Tokens
- Output: $12/M Tokens
🆕 New o3-pro model added
📍 Only available via /v1/responses endpoint

🖼️ GPT-Image-1 Price Reduction

June 7, 2025

More Affordable Image Generation

📌 Model: gpt-image-1
💸 Significant price reduction, more competitive image generation service
🎨 Maintain high-quality output while reducing costs

Join Community

Telegram community: t.me/apiyi_community

View Pricing

Check latest pricing information for all models

📊 Update Statistics

This Month Highlights (October 2025)

✨ Version Update: Gemini 2.5 Flash latest dated version preview-09-2025 launched, Google’s latest optimization
🍌 Image Upgrade: Gemini 2.5 Flash Image official release, 10 aspect ratios, custom resolution support
🎬 Video Revolution: Sora 2 video generation launched, audio-video sync, watermark-free output, from $0.15/call
💻 Intelligence Upgrade: GLM-4.6 strong release, 200K context, comprehensive code and reasoning improvements

Last Month Highlights (September 2025)

🚀 World’s Strongest: Claude Sonnet 4.5 spectacular launch, SWE-bench 77.2%, world’s best coding model
🧠 Quick Response: DeepSeek-V3.2-exp early launch, first to integrate latest experimental version
🤖 Domestic Power: GLM-4.5 series fully launched, priced below official
🌐 Web Capability: Grok-4-all / Grok-3-all launched, native web search, no tool calling needed
⚡ Ultra-long Context: Grok-4-fast series launched, 200K tokens context, 93%+ price reduction
💻 Code-Specialized: Grok Code Fast 1 released, SWE-Bench 70.8%, integrated into mainstream IDEs
💻 Programming Optimization: OpenAI Codex series launched, optimized for programming scenarios, dual billing modes
🎨 Image Generation: SeeDream 4.0 officially launched, BytePlus Volcano Engine strategic partnership, 35% off official

Historical Milestones (August 2025)

🎨 Image Generation: Gemini 2.5 Flash Image Preview launched, Google’s strongest image model, 30% off official
🧠 Innovation Breakthrough: DeepSeek V3.1 mixed reasoning mode, first large model supporting Think/Non-Think dual modes
🤖 Official Partnership: Kimi K2-250711 official release, official Volcano Engine authorization, significantly improved stability
🚀 New Models: GPT-5 full series officially launched, 20% off with official pricing
🎨 Performance Upgrade: Claude Opus 4.1 performance upgrade, more for same price
🎯 Major Release: OpenAI’s first open source large models (gpt-oss-120b, gpt-oss-20b)
💸 Pricing Advantage: New models priced lower than DeepSeek R1/V3
🚀 Performance Breakthrough: Open source models reach commercial-grade performance
🌐 Ecosystem Expansion: Support local deployment and edge computing

Historical Milestones

October 2025: Gemini 2.5 Flash latest dated version preview-09-2025 launched; Gemini 2.5 Flash Image official release; Sora 2 video generation launched, OpenAI’s revolutionary video model; GLM-4.6 strong release, Zhipu AI code and reasoning enhanced edition
September 2025: Claude Sonnet 4.5 spectacular launch, world’s strongest coding model; DeepSeek-V3.2-exp early launch; GLM-4.5 series fully launched; Grok web search models launched, native web capability; xAI Grok-4-fast series launched, 200K ultra-long context record low price; Grok Code Fast 1 code-specialized model released; OpenAI Codex series released, optimized for programming; SeeDream 4.0 officially launched, BytePlus Volcano Engine strategic partnership reached
August 2025: Gemini 2.5 Flash Image Preview Google’s strongest image model launched; DeepSeek V3.1 mixed reasoning mode launched; Kimi K2 official release integrated; GPT-5 full series spectacular launch; First integration of OpenAI open source models; Claude Opus 4.1 performance upgrade released
July 2025: Breakthrough 200+ model support
June 2025: O3 model biggest price reduction ever
May 2025: Support Gemini 2.5 series

Update Frequency: This page is regularly updated. Recommend bookmarking and checking frequently. All price adjustments and model updates are announced here first.

Pro Tip: New model launches typically have special offers initially. Recommend trying them out promptly. We continuously optimize pricing strategy based on market feedback.

Live Updates ⚡

​🔥 Latest Updates

​📅 June 2026

​🌟 GLM-5.2 Launch: Zhipu’s 1M-Context Flagship Coding Model

​🌟 Claude Fable 5 Launch: Mythos-Class Power, 30-Day Data Retention Compliance

​🌟 Seedance 2.0 Launch: ByteDance’s New Video Model, Pricing Pegged to Official

​🌟 MiniMax-M3 Launch: 1M-Context Open Flagship at 50％ Off

​📅 May 2026

​🔄 Nano Banana Stable Model Names (no -preview)

​🌟 Claude Opus 4.8 Launches: Stronger Coding, More Honest, Same Price

​🌟 claude-jupiter-v1-p Launches: Claude Opus 4.8 Preview

​🌟 Qwen3.7-Max Launches: #1 Chinese Model, Top-5 Globally on Intelligence Index

​🌟 chat-latest Launches: Always Tracks ChatGPT’s Default Instant Model

​🌟 Gemini 3.5 Flash Launch: Flash Beats Gemini 3.1 Pro

​💳 Stripe Online Recharge Launched · Foreign Credit Cards Supported

​🌟 Referral Rebate Mechanism + Reward Transfer Workflow FAQ Published

​🌟 Email Registration Support List FAQ Published

​🌟 Gemini 3.1 Flash Lite Goes GA: The Sweet Spot for Agents & Low Latency

​🌟 Grok 4.3 Launch: xAI’s New Flagship, 37.5％ Input / 58.3％ Output Price Cuts

​🌟 GPT-5.5 Pro Launch: OpenAI’s Strongest Reasoning Model

​📅 April 2026

​🌟 Qwen3.6 Open-Weight Duo Live: APIYI-Hosted, No GPU Rental

​🌟 Qwen3.6-Max-Preview / Flash Launch: Dual Aliyun Official Relay

​🌟 GPT-5.5 Launch: New xhigh Reasoning Tier

​🌟 Kimi K2.6 Launch: Open-Source Agent-Coding Leader

​🌟 DeepSeek V4-Pro / V4-Flash Launch: 1M Context + Open-Source SOTA

​🌟 gpt-image-2 Launch: Native 4K + 30％ Same-Tier Price Cut

​🌟 gpt-image-2-all Launch: $0.03/image GPT Reverse-Engineered Image Model

​💰 Nano Banana 2 Token-Based Pricing Update

​🌟 Claude Opus 4.7 Launch: Free Coding & Agent Upgrade

​💰 Nano Banana Pro Price Adjustment & New Enterprise HA Channel

​🌟 GLM-5.1 Launch: Zhipu’s Most Powerful Open-Source Coding Agent

​🌟 Qwen3.6-Plus Launch: Alibaba’s Most Powerful Coding Agent Model

​📅 March 2026

​🌟 Gemini Embedding 2 Preview Now Available

​🌟 Seed 2.0 Pro Flagship Model Now Available

​🌟 MiniMax-M2.7 / M2.7-highspeed Now Available

​🌟 MiMo-V2-Pro / MiMo-V2-Omni Now Available

​🌟 Grok 4.20 Beta Series — 4 New Models Available

​🌟 GPT-5.4 Mini / GPT-5.4 Nano Now Available

​🌟 ByteDance Seed 2.0 Lite Now Available

​🌟 GPT-5.4 / GPT-5.4 Pro Now Available

​🌟 Gemini 3.1 Flash Lite Preview Now Available

​🌟 GPT-5.3 Chat Now Available

​🏷️ Model Name Suffix -c Explained

​📅 February 2026

​💰 Nano Banana 2 Pricing Adjustment

​🌟 Nano Banana 2 Launch

​📅 January 2025

​🌟 Sora 2 Official API Channel Launched

​📅 December 2025

​⚡ Gemini 3 Flash Preview Launch

​🎨 OpenAI GPT Image 1.5 Launch

​🚀 GPT-5.2 Series Launch

​🎨 SeeDream 4.5 Launch

​📅 November 2025

​🌟 Gemini 3 Pro Preview Launch

​🍌 Nano Banana Pro Launch

​📅 October 2025

​🎥 Sora 2 Async API Officially Launched

​✨ Gemini 2.5 Flash Latest Dated Version Launch

​🍌 Gemini 2.5 Flash Image Official Release

​🎬 Sora 2 Video Generation API Launch

​💻 GLM-4.6 Strong Release

​📅 September 2025

​🚀 Claude Sonnet 4.5 Spectacular Launch

​🧠 DeepSeek-V3.2-exp Early Launch

​🤖 GLM-4.5 Series Models Launch

​🌐 Grok Web Search Models Official Launch

​⚡ Grok-4-fast Series Spectacular Launch

​💻 Grok Code Fast 1 Code-Specialized Model Launch

​💻 OpenAI Codex Series Models Launch

​🎨 SeeDream 4.0 API Official Launch

​📅 August 2025

​🎨 Gemini 2.5 Flash Image Generation Model Launch

​🧠 DeepSeek V3.1 Mixed Reasoning Mode Launch

​🤖 Kimi K2 Official Release Integration

​🚀 GPT-5 Full Series Spectacular Launch

​🎨 Claude Opus 4.1 Performance Upgrade Launch

​🎯 OpenAI Open Source Models Official Launch

​📅 July 2025

🔥 Latest Updates

📅 June 2026

🌟 GLM-5.2 Launch: Zhipu’s 1M-Context Flagship Coding Model

🌟 Claude Fable 5 Launch: Mythos-Class Power, 30-Day Data Retention Compliance

🌟 Seedance 2.0 Launch: ByteDance’s New Video Model, Pricing Pegged to Official

🌟 MiniMax-M3 Launch: 1M-Context Open Flagship at 50％ Off

📅 May 2026

🔄 Nano Banana Stable Model Names (no `-preview`)

🌟 Claude Opus 4.8 Launches: Stronger Coding, More Honest, Same Price

🌟 claude-jupiter-v1-p Launches: Claude Opus 4.8 Preview

🌟 Qwen3.7-Max Launches: #1 Chinese Model, Top-5 Globally on Intelligence Index

🌟 chat-latest Launches: Always Tracks ChatGPT’s Default Instant Model

🌟 Gemini 3.5 Flash Launch: Flash Beats Gemini 3.1 Pro

💳 Stripe Online Recharge Launched · Foreign Credit Cards Supported

🌟 Referral Rebate Mechanism + Reward Transfer Workflow FAQ Published

🌟 Email Registration Support List FAQ Published

🌟 Gemini 3.1 Flash Lite Goes GA: The Sweet Spot for Agents & Low Latency

🌟 Grok 4.3 Launch: xAI’s New Flagship, 37.5％ Input / 58.3％ Output Price Cuts

🌟 GPT-5.5 Pro Launch: OpenAI’s Strongest Reasoning Model

📅 April 2026

🌟 Qwen3.6 Open-Weight Duo Live: APIYI-Hosted, No GPU Rental

🌟 Qwen3.6-Max-Preview / Flash Launch: Dual Aliyun Official Relay

🌟 GPT-5.5 Launch: New xhigh Reasoning Tier

🌟 Kimi K2.6 Launch: Open-Source Agent-Coding Leader

🌟 DeepSeek V4-Pro / V4-Flash Launch: 1M Context + Open-Source SOTA

🌟 gpt-image-2 Launch: Native 4K + 30％ Same-Tier Price Cut

🌟 gpt-image-2-all Launch: $0.03/image GPT Reverse-Engineered Image Model

💰 Nano Banana 2 Token-Based Pricing Update

🌟 Claude Opus 4.7 Launch: Free Coding & Agent Upgrade

💰 Nano Banana Pro Price Adjustment & New Enterprise HA Channel

🌟 GLM-5.1 Launch: Zhipu’s Most Powerful Open-Source Coding Agent

🌟 Qwen3.6-Plus Launch: Alibaba’s Most Powerful Coding Agent Model

📅 March 2026

🌟 Gemini Embedding 2 Preview Now Available

🌟 Seed 2.0 Pro Flagship Model Now Available

🌟 MiniMax-M2.7 / M2.7-highspeed Now Available

🌟 MiMo-V2-Pro / MiMo-V2-Omni Now Available

🌟 Grok 4.20 Beta Series — 4 New Models Available

🌟 GPT-5.4 Mini / GPT-5.4 Nano Now Available

🌟 ByteDance Seed 2.0 Lite Now Available

🌟 GPT-5.4 / GPT-5.4 Pro Now Available

🌟 Gemini 3.1 Flash Lite Preview Now Available

🌟 GPT-5.3 Chat Now Available

🏷️ Model Name Suffix -c Explained

📅 February 2026

💰 Nano Banana 2 Pricing Adjustment

🌟 Nano Banana 2 Launch

📅 January 2025

🌟 Sora 2 Official API Channel Launched

📅 December 2025

⚡ Gemini 3 Flash Preview Launch

🎨 OpenAI GPT Image 1.5 Launch

🚀 GPT-5.2 Series Launch

🎨 SeeDream 4.5 Launch

📅 November 2025

🌟 Gemini 3 Pro Preview Launch

🍌 Nano Banana Pro Launch

📅 October 2025

🎥 Sora 2 Async API Officially Launched

✨ Gemini 2.5 Flash Latest Dated Version Launch

🍌 Gemini 2.5 Flash Image Official Release

🎬 Sora 2 Video Generation API Launch

💻 GLM-4.6 Strong Release

📅 September 2025

🚀 Claude Sonnet 4.5 Spectacular Launch

🧠 DeepSeek-V3.2-exp Early Launch

🤖 GLM-4.5 Series Models Launch

🌐 Grok Web Search Models Official Launch

⚡ Grok-4-fast Series Spectacular Launch

💻 Grok Code Fast 1 Code-Specialized Model Launch

💻 OpenAI Codex Series Models Launch

🎨 SeeDream 4.0 API Official Launch

📅 August 2025

🎨 Gemini 2.5 Flash Image Generation Model Launch

🧠 DeepSeek V3.1 Mixed Reasoning Mode Launch

🤖 Kimi K2 Official Release Integration

🚀 GPT-5 Full Series Spectacular Launch

🎨 Claude Opus 4.1 Performance Upgrade Launch

🎯 OpenAI Open Source Models Official Launch

📅 July 2025