Model Pricing – LLM API Price Comparison

kimi-k3

Moonshot

CONTEXT: 1.05M

INPUT: $3.0000/M

OUTPUT: $15.0000/M

Moonshot's flagship model, with native vision and a context window of up to 1M tokens. It currently runs at maximum reasoning effort and suits long-horizon coding, knowledge work, and complex tasks that coordinate terminal tools. For stable use, preserve the full reasoning history and start a new session in a compatible agent harness.

Reasoning, Tool Use, Long Context

gpt-5.6-terra

OpenAI

CONTEXT: 1.05M

INPUT: $2.5000/M

OUTPUT: $15.0000/M

The balanced workhorse tier of GPT-5.6, combining capability and cost. Well suited to production work involving image understanding, tool use, and large source material; choose Sol for the hardest quality-first work or Luna for cost-controlled high volume.

Reasoning, Function Calling, Structured Output, Long Context

gpt-5.6-sol

OpenAI

CONTEXT: 1.05M

INPUT: $5.0000/M

OUTPUT: $30.0000/M

The flagship GPT-5.6 tier for complex professional work and quality-first delivery. It supports image input, function calling, structured outputs, and an exceptionally large context window; consider Terra or Luna when latency or unit cost is the primary constraint.

Reasoning, Function Calling, Structured Output, Long Context

gpt-5.6-luna

OpenAI

CONTEXT: 1.05M

INPUT: $1.0000/M

OUTPUT: $6.0000/M

The cost-sensitive, high-volume tier of GPT-5.6. A good fit for summarization, classification, rewriting, and batch automation; choose the Sol sibling for complex professional judgment or quality-first multi-step work.

Reasoning, Function Calling, Structured Output, Long Context

grok-4.5

xAI

CONTEXT: 500K

INPUT: $2.1000/M

OUTPUT: $6.3000/M

A frontier reasoning model for coding, knowledge work, and STEM tasks. It supports image input, function calling, and structured outputs; its 500K-token context suits cross-file analysis and long source material. Prefer a lower-cost model when frontier reasoning is unnecessary.

Reasoning, Function Calling, Structured Output, Long Context

claude-fable-5

Anthropic

CONTEXT: 1M

INPUT: $12.5000/M

OUTPUT: $50.0000/M

Anthropic's most capable widely released model, built for the most demanding reasoning and long-horizon agentic work. It offers a 1M-token context window with a 128K-token maximum output and always-on adaptive thinking. It excels at autonomously exploring underspecified tasks, planning, and carrying long-running coding and multi-agent orchestration further before it needs human input.

Reasoning, Tool Use, Structured Output, Long Context

claude-sonnet-5

Anthropic

CONTEXT: 1M

INPUT: $2.0000/M

OUTPUT: $10.0000/M

A daily-driver Sonnet-tier model that aims to deliver near-frontier agentic, coding, and knowledge-work capability at a lower operating cost. Its default 1M-token context and adaptive thinking fit long documents, codebases, and multi-step tool workflows; for the deepest reasoning or restricted high-risk security work, evaluate higher-tier or specialized models.

Reasoning, Tool Use, Structured Output, Long Context

LongCat-2.0

Meituan (美团)

CONTEXT: 1M

INPUT: $0.3000/M

OUTPUT: $1.2000/M

A LongCat 2.0 workhorse for project-scale coding and long-running agent tasks, with native 1M-token context plus tool calling and multi-step reasoning. It is best when you need to keep large repositories, long documents, or automation workflows in scope; for lightweight Q&A or low-latency chat, a smaller Flash/Lite-style model is usually cheaper.

Reasoning, Tool Use, Function Calling, Long Context

doubao-seed-2-1-pro

Doubao

CONTEXT: 256K

INPUT: $0.9700/M

OUTPUT: $4.8800/M

Built for coding, agents, and complex productivity tasks, advancing beyond Doubao Seed 2.0 in long context, long output, and tool-oriented workflows. Compared with Qwen, GLM, and DeepSeek peers, it is a cost-effective option for Chinese office work, coding, and multi-step automation.

Reasoning, Tool Use, Function Calling, Structured Output, Long Context

happyhorse-1.1-t2v

Alibaba (阿里巴巴)

PER REQUEST: $0.1350

Built for text-to-video, with a stronger audio-native workflow than HappyHorse 1.0, generating dialogue, sound effects, and background music in one pass. Compared with Veo, Kling, and Runway, its differentiator is synchronized audiovisual generation and character-consistent workflows for short drama, ads, e-commerce, and brand marketing.

Multimodal Output
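
A rough way to compare the per-token prices listed above is to cost out a single request. The sketch below assumes flat billing per million tokens, with no caching, batch, or long-context tier adjustments (the listing does not say whether any apply). The `request_cost` and `rank_by_cost` helpers and the 200,000-input / 5,000-output workload are illustrative assumptions, not part of any provider API. The per-request video model is left out because it is not priced per token.

```python
from decimal import Decimal

# USD per million tokens as (input, output), copied from the listing above.
PRICES = {
    "kimi-k3": (Decimal("3.00"), Decimal("15.00")),
    "gpt-5.6-terra": (Decimal("2.50"), Decimal("15.00")),
    "gpt-5.6-sol": (Decimal("5.00"), Decimal("30.00")),
    "gpt-5.6-luna": (Decimal("1.00"), Decimal("6.00")),
    "grok-4.5": (Decimal("2.10"), Decimal("6.30")),
    "claude-fable-5": (Decimal("12.50"), Decimal("50.00")),
    "claude-sonnet-5": (Decimal("2.00"), Decimal("10.00")),
    "LongCat-2.0": (Decimal("0.30"), Decimal("1.20")),
    "doubao-seed-2-1-pro": (Decimal("0.97"), Decimal("4.88")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD of one request, assuming flat per-million-token pricing."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[tuple[str, Decimal]]:
    """All listed token-priced models, cheapest first, for the given workload."""
    costs = [(m, request_cost(m, input_tokens, output_tokens)) for m in PRICES]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: a long document in, a short summary out.
    for model, cost in rank_by_cost(200_000, 5_000):
        print(f"{model:22s} ${cost:.4f}")
```

Price is only one axis: the ranking ignores context limits and capability tags, so a model at the top of the list may still be the wrong fit for a given task.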
