Skip to main content
APIYI supports 200+ mainstream AI models. This page provides detailed model information, pricing, and usage instructions.
Enterprise-grade Professional and Stable AI Large Model API Hub All models are officially sourced and forwarded, with ~20% off pricing (combining top-up bonuses and exchange rate advantages), aggregating various excellent large models. No speed limits, no expiration, no account ban risks, pay-as-you-go billing, long-term reliable service.
The following are currently stably supplied popular models. For complete model list and real-time pricing, visit APIYI Console Pricing Page.

Model Categories

🤖 OpenAI Series

Reasoning Models

Model NameModel IDFeaturesRecommended Scenarios
GPT-5gpt-5Latest flagship model, ultra-strong reasoningTop-tier reasoning, complex tasks
GPT-5 Minigpt-5-miniGPT-5 lightweight version, excellent performanceBalance performance and cost
GPT-5 Nanogpt-5-nanoGPT-5 ultra-lightweight versionLarge-scale batch processing
o3o3Latest reasoning model, significantly price-reduced, extremely cost-effectiveComplex reasoning, math, programming
o4-minio4-miniLightweight reasoning modelTop choice for programming tasks
GPT-5 Series Usage Notes:
  1. Temperature parameter temperature must be set to 1 (only supports 1)
  2. Use max_completion_tokens instead of max_tokens
  3. Do not pass top_p parameter

GPT Series

Model NameModel IDContext LengthFeaturesRecommended Scenarios
GPT-5 Chat Latestgpt-5-chat-latest128KBenchmarked against ChatGPT web GPT-5Need latest features
GPT-4.1gpt-4.1128KFast speed, one of the main modelsGeneral applications
GPT-4.1 Minigpt-4.1-mini128KCheaper lightweight versionCost-sensitive scenarios
GPT-4ogpt-4o128KBalanced comprehensive capabilities, multimodal supportGeneral scenarios
GPT-4o Minigpt-4o-mini128KLightweight fast versionQuick response

Codex Programming Series

Model NameModel IDBilling ModeFeaturesRecommended Scenarios
GPT-5 Codex Highgpt-5-codex-highPer-token/Per-callBenchmarked against GPT-5, strongest programmingComplex programming tasks
GPT-5 Codex Mediumgpt-5-codex-mediumPer-token/Per-callMedium performance, moderate priceRegular programming tasks
GPT-5 Codex Lowgpt-5-codex-lowPer-token/Per-callLightweight version, lowest costSimple code generation
Codex Series Dual Billing Modes:
  • Per-token billing: Suitable for small token conversation scenarios
  • Per-call billing: Suitable for large context programming scenarios, more cost-effective

Image Generation Models

Model NameModel IDSupported SizesFeaturesPrice
Nano Bananagemini-2.5-flash-image-previewMultiple sizesGoogle’s strongest image model, fast speed$0.025/image
SeeDream 4.0seedream-4-0-2508284K HDBytePlus Volcano partnership, high-quality output$0.025/image
GPT-Image-1gpt-image-11024×1024 etc.Cost-effective image generationSee docs below
Sora Imagesora_imageMultiple sizesReverse-engineered model, simulates official conversation-based generationSee docs
GPT-4o Imagegpt-4o-imageMultiple sizesReverse-engineered model, conversation-style generationSee docs
DALL·E 3dall-e-31024×1024 etc.Classic image generation modelBilled by size
Image Generation Testing Tool Visit imagen.apiyi.com to experience various image generation models.Detailed Documentation:

🎭 Claude Series (Anthropic)

Claude 4 Series (Latest)

Model NameModel IDContext LengthFeaturesRecommended Scenarios
Claude 4 Sonnetclaude-sonnet-4-20250514200KLatest model, top choice for programmingCode generation, analysis
Claude 4 Sonnet Thinkingclaude-sonnet-4-20250514-thinking200KChain-of-thought modeComplex reasoning
Claude Opus 4.1claude-opus-4-1-20250805200KIterative upgrade, programming-optimizedHigh-demand programming tasks
Claude Opus 4.1 Thinkingclaude-opus-4-1-20250805-thinking200KChain-of-thought mode, reasoning-enhancedTop-tier reasoning tasks
Important Note: Opus 4 is no longer recommended. Please migrate to Opus 4.1 version for better performance and specific programming scenario optimization.

🌟 Google Gemini Series

Model NameModel IDContext LengthFeaturesRecommended Scenarios
Gemini 2.5 Progemini-2.5-pro2MOfficial release, programming advantage, strong multimodalLong text, programming, multimodal
Gemini 2.5 Pro Previewgemini-2.5-pro-preview-06-052MPreview versionTest new features
Gemini 2.5 Flashgemini-2.5-flash1MFast speed, low costQuick response scenarios
Gemini 2.5 Flash Litegemini-2.5-flash-lite1MUltra-lightweight, faster and cheaperLarge-scale simple tasks

🚀 xAI Grok Series

Model NameModel IDFeaturesRecommended Scenarios
Grok 4grok-4Latest official versionGeneral tasks
Grok 3grok-3Official stable versionDaily use
Grok 3 Minigrok-3-miniSmall model with reasoningLightweight tasks

🔍 DeepSeek Series

Model NameModel IDContext LengthFeaturesRecommended Scenarios
DeepSeek V3.1deepseek-v3-1-250821128KMixed reasoning mode, Think/Non-Think dual modesIntelligent reasoning, programming
DeepSeek R1deepseek-r164KReasoning modelMath, reasoning
DeepSeek V3deepseek-v3128KStrong comprehensive capabilitiesGeneral scenarios

🐘 Chinese Model Series

Alibaba Qwen

Model NameModel IDContext LengthFeatures
Qwen Maxqwen-max32KStrongest version
Qwen Plusqwen-plus32KEnhanced version
Qwen Turboqwen-turbo32KFast version

Moonshot Kimi Series

Model NameModel IDContext LengthFeatures
Kimi K2 Official Releasekimi-k2-250711200KOfficial Volcano Engine partnership, strong stability

💰 Pricing Information

Billing Methods

  • Pay-as-you-go: Charged based on actual Token usage
  • No minimum charge: Use what you pay for, balance never expires
  • Real-time deduction: Fees deducted from balance immediately after each call

Pricing Advantages

  • Official source forwarding with slight price advantages
  • Bulk users can contact customer service for better pricing
  • New users get 3 million tokens testing credit upon registration

View Real-time Pricing

Visit APIYI Console Pricing Page to view latest pricing for all models.

🛠️ Usage Recommendations

Model Selection Guide

Programming Development
  • Top choice: GPT-5 Codex series, Claude 4 Sonnet, Claude Opus 4.1, DeepSeek V3.1, o4-mini, Gemini 2.5 Pro
  • Alternatives: DeepSeek V3, Kimi K2 Official Release, GPT-5 (note parameter settings)
Text Creation
  • Top choice: GPT-5, Claude 4 Sonnet, GPT-4.1
  • Alternatives: GPT-4o, GPT-5 Chat Latest, Kimi K2 Official Release, Qwen Max
Quick Response
  • Top choice: GPT-4o Mini, Gemini 2.5 Flash
  • Alternatives: Gemini 2.5 Flash Lite, Grok 3 Mini, GPT-4.1 Mini
Image Generation
  • Currently popular: Nano Banana, SeeDream 4.0 (both $0.025/image)
  • Stable and reliable: GPT-Image-1 (high official pricing, ~20% off on our platform)
  • Reverse-engineered, cheapest: sora_image, gpt-4o-image
Long Text Processing
  • Top choice: Gemini 2.5 Pro (2M context)
  • Alternatives: Claude 4 series (200K context)

Cost Optimization Recommendations

  1. Tiered Usage: Use cheaper models for simple tasks, advanced models for complex tasks
  2. Test Optimization: Test with small models first, use large models after determining needs
  3. Batch Processing: Choose Nano or Mini versions for large volumes of similar tasks
  4. Cache Reuse: Cache results for repeated queries
Model list is continuously updated. We will promptly add newly released excellent models. For specific model needs or bulk requirements, please contact customer service.