OpenAI's high-intelligence flagship model for complex, multi-step tasks. GPT-4o is cheaper and faster than GPT-4 Turbo.
OpenAI's affordable and intelligent small model for fast, lightweight tasks; it is cheaper and more capable than GPT-3.5 Turbo.
OpenAI's previous high-intelligence model, optimized for chat but also well suited to traditional completions tasks.
OpenAI's cost-efficient reasoning model excels at STEM, especially math and coding, nearly matching the performance of OpenAI o1 on evaluation benchmarks such as AIME and Codeforces.
The latest GPT-3.5 Turbo model with higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
Claude 3.5 Sonnet is a high-speed, cost-effective model offering industry-leading performance in reasoning, knowledge, and coding. It operates twice as fast as its predecessor. Key features include enhanced humor and nuance understanding, advanced coding capabilities, and strong visual reasoning.
Claude 3.5 Haiku offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic tasks such as chat interactions and immediate coding suggestions.
Anthropic's Claude 3 Opus can handle complex analysis, longer tasks with multiple steps, and higher-order math and coding tasks. It provides top-level performance, intelligence, fluency, and understanding.
Anthropic's Claude 3 Haiku outperforms models in its intelligence category on performance, speed, and cost without the need for specialized fine-tuning. It is Anthropic's fastest and most compact model, built for near-instant responsiveness.
The Gemini 1.5 Pro is a cutting-edge multimodal AI model developed by Google DeepMind. It excels in processing and understanding text, images, audio, and video, featuring a breakthrough long context window of up to 1 million tokens. This model powers generative AI services across Google's platforms and supports third-party developers.
Gemini 1.5 Flash is a cutting-edge multimodal AI model known for its speed and efficiency. It excels in tasks like visual understanding and classification, featuring a long context window of up to one million tokens. This model is optimized for high-volume, high-speed applications, making it a significant advancement in AI technology.
Grok-2 is xAI's frontier language model with state-of-the-art reasoning capabilities.
Meta's Llama 3.1 is a collection of multilingual large language models (LLMs): pretrained and instruction-tuned generative models in 8B, 70B, and 405B sizes. The Llama 3.1 instruction-tuned, text-only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open-source and closed chat models on common industry benchmarks. The 405B model is the most capable in the Llama 3.1 family.
Meta's Llama 3.1 is a collection of multilingual large language models (LLMs): pretrained and instruction-tuned generative models in 8B, 70B, and 405B sizes. The Llama 3.1 instruction-tuned, text-only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open-source and closed chat models on common industry benchmarks.
Instruction-tuned image reasoning model with 90B parameters from Meta. Optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. The model can understand visual data such as charts and graphs, and it bridges the gap between vision and language by generating text that describes image details.
Mixtral 8x7B is a Sparse Mixture of Experts (SMoE) model developed by Mistral AI. It features a decoder-only architecture with 8 expert networks per MLP layer, enabling efficient processing of natural language tasks like text classification and generation. This innovative model excels in multilingual and domain-specific applications, offering cutting-edge performance in AI language modeling.
Mistral Large is a cutting-edge language model developed by Mistral AI, renowned for its advanced reasoning capabilities. It excels in multilingual tasks, code generation, and complex problem-solving, making it ideal for diverse text-based applications.
Pixtral Large is a 124B open-weights multimodal model built on top of Mistral Large 2. The model is able to understand documents, charts and natural images.
Gemma 2 is a state-of-the-art, lightweight open model developed by Google, available in 9 billion and 27 billion parameter sizes. It offers enhanced performance and efficiency, building on the technology used in the Gemini models. Designed for a wide range of applications, Gemma 2 excels in text-to-text tasks, making it a versatile tool for developers.
The Perplexity Sonar Online model is a state-of-the-art large language model developed by Perplexity AI. It offers real-time internet access, ensuring up-to-date information retrieval. Known for its cost-efficiency, speed, and enhanced performance, it surpasses previous models in the Sonar family, making it ideal for dynamic and accurate data processing.
The new Command R+ model delivers roughly 50% higher throughput and 25% lower latencies compared to the previous Command R+ version, while keeping the hardware footprint the same.
DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions.
Qwen2.5 is a model pretrained on a large-scale dataset of up to 18 trillion tokens, offering significant improvements in knowledge, coding, mathematics, and instruction following compared to its predecessor Qwen2. The model also features enhanced capabilities in generating long texts, understanding structured data, and generating structured outputs, while supporting multilingual capabilities for over 29 languages.
QwQ is an experimental research model developed by the Qwen Team, designed to advance AI reasoning capabilities. This model embodies the spirit of philosophical inquiry, approaching problems with genuine wonder and doubt. QwQ demonstrates impressive analytical abilities, achieving scores of 65.2% on GPQA, 50.0% on AIME, 90.6% on MATH-500, and 50.0% on LiveCodeBench, combining its contemplative approach with exceptional performance on complex problems.
The Yi Large model was designed by 01.AI with the following use cases in mind: knowledge search, data classification, human-like chatbots, and customer service. It stands out for its multilingual proficiency, particularly in Spanish, Chinese, Japanese, German, and French.
DBRX is a state-of-the-art, transformer-based, decoder-only large language model developed by Databricks. It features a Mixture-of-Experts (MoE) architecture with 132 billion parameters, designed for efficient next-token prediction. Released in 2024, it outperforms many open-source models on standard benchmarks.
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models.