Skip to main content
APIYI supports 200+ mainstream AI models. This page provides detailed model information, pricing, and usage instructions.
Enterprise-grade Professional and Stable AI Large Model API Hub All models are officially sourced and forwarded, with ~20% off pricing (combining top-up bonuses and exchange rate advantages), aggregating various excellent large models. No speed limits, no expiration, no account ban risks, pay-as-you-go billing, long-term reliable service.
The following are currently stably supplied popular models. For complete model list and real-time pricing, visit APIYI Console Pricing Page.

Model Categories

🤖 OpenAI Series

Reasoning Models

Model NameModel IDFeaturesRecommended Scenarios
GPT-5.2 Pro 🔥gpt-5.2-proFirst to break 90% on ARC-AGI-1, expert knowledge leaderComplex reasoning, scientific research
GPT-5.2 🔥gpt-5.2GDPval 70.9% surpassing professionals, 400K contextProgramming planning, structured tasks
GPT-5.2 Instant 🔥gpt-5.2-chat-latestFast response version, maintains top reasoningQuick writing, information retrieval
GPT-5gpt-5Flagship stable version, ultra-strong reasoningTop-tier reasoning, complex tasks
GPT-5 Minigpt-5-miniGPT-5 lightweight version, excellent performanceBalance performance and cost
GPT-5 Nanogpt-5-nanoGPT-5 ultra-lightweight versionLarge-scale batch processing
o3o3Latest reasoning model, significantly price-reduced, extremely cost-effectiveComplex reasoning, math, programming
o4-minio4-miniLightweight reasoning modelTop choice for programming tasks
GPT-5 Series Usage Notes:
  1. Temperature parameter temperature must be set to 1 (only supports 1)
  2. Use max_completion_tokens instead of max_tokens
  3. Do not pass top_p parameter

GPT Series

Model NameModel IDContext LengthFeaturesRecommended Scenarios
GPT-5 Chat Latestgpt-5-chat-latest128KBenchmarked against ChatGPT web GPT-5Need latest features
GPT-4.1gpt-4.1128KFast speed, one of the main modelsGeneral applications
GPT-4.1 Minigpt-4.1-mini128KCheaper lightweight versionCost-sensitive scenarios
GPT-4ogpt-4o128KBalanced comprehensive capabilities, multimodal supportGeneral scenarios
GPT-4o Minigpt-4o-mini128KLightweight fast versionQuick response

Codex Programming Series

Model NameModel IDBilling ModeFeaturesRecommended Scenarios
GPT-5 Codex Highgpt-5-codex-highPer-token/Per-callBenchmarked against GPT-5, strongest programmingComplex programming tasks
GPT-5 Codex Mediumgpt-5-codex-mediumPer-token/Per-callMedium performance, moderate priceRegular programming tasks
GPT-5 Codex Lowgpt-5-codex-lowPer-token/Per-callLightweight version, lowest costSimple code generation
Codex Series Dual Billing Modes:
  • Per-token billing: Suitable for small token conversation scenarios
  • Per-call billing: Suitable for large context programming scenarios, more cost-effective

Image Generation Models

Model NameModel IDSupported SizesFeaturesPrice
GPT Image 1.5 🔥gpt-image-1.5Low/Med/High4x speed boost, precise editing, enhanced text renderingLow $0.01, Med $0.04, High $0.17
Nano Banana Progemini-3-pro-image-preview1K/2K/4K4K HD support, best-in-class text rendering, powerful local editingUniform $0.05 (as low as 20% of official)
SeeDream 4.5 🔥seedream-4-5-2511284K HD1.2B parameters, 4K quality boost, best text rendering$0.035/image
Nano Bananagemini-2.5-flash-image-previewMultiple sizesGoogle’s strongest image model, fast speed$0.025/image
SeeDream 4.0seedream-4-0-2508284K HDBytePlus Volcano partnership, high-quality output$0.025/image
Sora Imagesora_imageMultiple sizesReverse-engineered model, simulates official conversation-based generationSee docs
GPT-4o Imagegpt-4o-imageMultiple sizesReverse-engineered model, conversation-style generationSee docs
DALL·E 3dall-e-31024×1024 etc.Classic image generation modelBilled by size
Image Generation Testing Tool Visit imagen.apiyi.com to experience various image generation models.Detailed Documentation:

🎭 Claude Series (Anthropic)

Claude 4 Series (Latest)

Model NameModel IDContext LengthFeaturesRecommended Scenarios
Claude Opus 4.5 🔥claude-opus-4-5-20251101200KSWE-bench 80.9% top rank, price reduced to 1/3Complex programming, top-tier reasoning
Claude Sonnet 4.5claude-sonnet-4-5-20250929200KWorld’s strongest coding model, SWE-bench 77.2%Code generation, agent development
Claude Sonnet 4.5 Thinkingclaude-sonnet-4-5-20250929-thinking200KChain-of-thought mode, deep reasoningComplex programming reasoning
Claude Haiku 4.5 🔥claude-haiku-4-5-20251001200KHigh cost-performance coding, SWE-bench 73.3%, 2x speedReal-time chat, pair programming
Claude 4 Sonnetclaude-sonnet-4-20250514200KStable version, top choice for programmingCode generation, analysis
Claude 4 Sonnet Thinkingclaude-sonnet-4-20250514-thinking200KChain-of-thought modeComplex reasoning
Claude Opus 4.1claude-opus-4-1-20250805200KIterative upgrade, programming-optimizedHigh-demand programming tasks
Claude Opus 4.1 Thinkingclaude-opus-4-1-20250805-thinking200KChain-of-thought mode, reasoning-enhancedTop-tier reasoning tasks
Latest Recommendation: Claude Opus 4.5 tops programming rankings with SWE-bench 80.9%, price reduced to 1/3 of predecessor. Sonnet 4.5 is second strongest coding model (SWE-bench 77.2%), Haiku 4.5 offers same-tier performance at lower cost with 2x speed boost.

🌟 Google Gemini Series

Model NameModel IDContext LengthFeaturesRecommended Scenarios
Gemini 3 Flash Preview 🔥gemini-3-flash-preview1MSWE-bench 78% surpassing 3 Pro, 3x faster, 1/4 priceProgramming top choice, cost-performance king
Gemini 3 Flash Thinking 🔥gemini-3-flash-preview-thinking1MForced reasoning mode, shows complete thought processComplex programming, deep reasoning
Gemini 3 Flash NoThinking 🔥gemini-3-flash-preview-nothinking1MFast response mode, minimum latencySimple tasks, real-time apps
Gemini 3 Pro Previewgemini-3-pro-preview1MLMArena 1501 Elo global #1, SWE-bench 76.2%Top multimodal, complex reasoning
Gemini 3 Pro Preview Thinkinggemini-3-pro-preview-thinking1MChain-of-thought mode, shows full reasoningDeep reasoning, complex programming
Gemini 2.5 Progemini-2.5-pro2MOfficial release, programming advantage, strong multimodalLong text, programming, multimodal
Gemini 2.5 Pro Previewgemini-2.5-pro-preview-06-052MPreview versionTest new features
Gemini 2.5 Flashgemini-2.5-flash1MFast speed, low costQuick response scenarios
Gemini 2.5 Flash Litegemini-2.5-flash-lite1MUltra-lightweight, faster and cheaperLarge-scale simple tasks
Latest Recommendation: Gemini 3 Flash Preview achieves SWE-bench 78% surpassing 3 Pro, 3x faster, 1/4 price - programming cost-performance king! Gemini 3 Pro Preview maintains LMArena 1501 Elo global #1, ideal for top multimodal tasks. Learn more

🚀 xAI Grok Series

Model NameModel IDFeaturesRecommended Scenarios
Grok 4grok-4Latest official versionGeneral tasks
Grok 3grok-3Official stable versionDaily use
Grok 3 Minigrok-3-miniSmall model with reasoningLightweight tasks

🔍 DeepSeek Series

Model NameModel IDContext LengthFeaturesRecommended Scenarios
DeepSeek V3.1deepseek-v3-1-250821128KMixed reasoning mode, Think/Non-Think dual modesIntelligent reasoning, programming
DeepSeek R1deepseek-r164KReasoning modelMath, reasoning
DeepSeek V3deepseek-v3128KStrong comprehensive capabilitiesGeneral scenarios

🐘 Chinese Model Series

Alibaba Qwen

Model NameModel IDContext LengthFeatures
Qwen Maxqwen-max32KStrongest version
Qwen Plusqwen-plus32KEnhanced version
Qwen Turboqwen-turbo32KFast version

Moonshot Kimi Series

Model NameModel IDContext LengthFeatures
Kimi K2 Official Releasekimi-k2-250711200KOfficial Volcano Engine partnership, strong stability

💰 Pricing Information

Billing Methods

  • Pay-as-you-go: Charged based on actual Token usage
  • No minimum charge: Use what you pay for, balance never expires
  • Real-time deduction: Fees deducted from balance immediately after each call

Pricing Advantages

  • Official source forwarding with slight price advantages
  • Bulk users can contact customer service for better pricing
  • New users get 3 million tokens testing credit upon registration

View Real-time Pricing

Visit APIYI Console Pricing Page to view latest pricing for all models.

🛠️ Usage Recommendations

Model Selection Guide

Programming Development
  • Top performance: Claude Opus 4.5 (SWE-bench 80.9% top rank), Gemini 3 Flash Preview (SWE-bench 78%), Claude Sonnet 4.5 (SWE-bench 77.2%)
  • High cost-performance: Gemini 3 Flash Preview (surpassing 3 Pro, 1/4 price, 3x faster), Claude Haiku 4.5, GPT-5.1 Codex Mini
  • Alternatives: GPT-5.2 series, GPT-5 Codex series, DeepSeek V3.1, o4-mini
Text Creation
  • Top choice: GPT-5.2 series, GPT-5, Gemini 3 Pro Preview, Claude Opus 4.5
  • Alternatives: Claude Sonnet 4.5, GPT-4.1, GPT-4o, GPT-5 Chat Latest, Kimi K2 Official Release
Quick Response
  • Top choice: Gemini 3 Flash NoThinking (extreme speed), Claude Haiku 4.5 (2x faster), GPT-4o Mini
  • Alternatives: Gemini 2.5 Flash, Gemini 2.5 Flash Lite, Grok 3 Mini, GPT-4.1 Mini
Image Generation
  • Latest recommendation: GPT Image 1.5 (4x speed boost, precise editing, from $0.01)
  • Professional design: SeeDream 4.5 (1.2B parameters, 4K quality, $0.035/image), Nano Banana Pro (4K HD, best text rendering)
  • High cost-performance: Nano Banana (0.025/image),SeeDream4.0(0.025/image), SeeDream 4.0 (0.025/image)
  • Reverse-engineered, cheapest: sora_image, gpt-4o-image
Long Text Processing
  • Top choice: Gemini 2.5 Pro (2M context)
  • Alternatives: Claude 4 series (200K context)

Cost Optimization Recommendations

  1. Tiered Usage: Use cheaper models for simple tasks, advanced models for complex tasks
  2. Test Optimization: Test with small models first, use large models after determining needs
  3. Batch Processing: Choose Nano or Mini versions for large volumes of similar tasks
  4. Cache Reuse: Cache results for repeated queries
Model list is continuously updated. We will promptly add newly released excellent models. For specific model needs or bulk requirements, please contact customer service.