Models
Blog
Download
Pricing
Sign in
AI Models
GPT-5.1
GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. The model produces clearer, more grounded explanations with reduced jargon, making it easier to follow even on technical or multi-step problems.
Advanced
OpenAI
Chat now
GPT-5.1 Thinking
GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. The model produces clearer, more grounded explanations with reduced jargon, making it easier to follow even on technical or multi-step problems.
Advanced
OpenAI
Chat now
GPT-5 mini
GPT-5 mini is a lightweight version of GPT-5. GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It handles complex coding tasks with minimal prompting, provides clear explanations, and introduces enhanced agentic capabilities, making it a powerful coding collaborator and intelligent assistant for all users.
Basic
OpenAI
Chat now
GPT-4.1
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.
Advanced
OpenAI
Chat now
GPT-4o
OpenAI's high-intelligence flagship model for complex, multi-step tasks. GPT-4o is cheaper and faster than GPT-4 Turbo.
Advanced
OpenAI
Chat now
o3
OpenAI o3 is a reflective generative pre-trained transformer (GPT) model developed by OpenAI as a successor to OpenAI o1.
Advanced
OpenAI
Chat now
gpt-oss-120b
gpt-oss-120b is a high-performance, open-weight language model designed for production-grade, general-purpose use cases. It fits on a single H100 GPU, making it accessible without requiring multi-GPU infrastructure. Trained on the Harmony response format, it excels at complex reasoning and supports configurable reasoning effort, full chain-of-thought transparency for easier debugging and trust, and native agentic capabilities for function calling, tool use, and structured outputs.
Basic
OpenAI
Chat now
Claude Sonnet 4.5
Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with improvements across system design, code security, and specification adherence. The model is designed for extended autonomous operation, maintaining task continuity across sessions and providing fact-based progress tracking.
Advanced
Anthropic
Chat now
Claude Sonnet 4.5 Thinking
Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with improvements across system design, code security, and specification adherence. The model is designed for extended autonomous operation, maintaining task continuity across sessions and providing fact-based progress tracking.
Advanced
Anthropic
Chat now
Claude Opus 4.1
Claude Opus 4.1 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in software engineering, achieving leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). Opus 4 supports extended, agentic workflows, handling thousands of task steps continuously for hours without degradation.
Advanced
Anthropic
Chat now
Claude Haiku 4.5
Claude Haiku 4.5 is Anthropic's fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4's performance across reasoning, coding, and computer-use tasks, Haiku 4.5 brings frontier-level capability to real-time and high-volume applications.
Advanced
Anthropic
Chat now
Gemini 2.5 Pro
Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.
Advanced
Google
Chat now
Gemini 2.5 Flash
Gemini 2.5 Flash is Google's first fully hybrid reasoning model that allows developers to toggle thinking capabilities on or off according to their needs, offering enhanced reasoning abilities while maintaining the speed and cost-effectiveness of its predecessor.
Basic
Google
Chat now
Grok 4
Grok 4 is the latest and greatest flagship model from xAI. Grok 4 displays significant improvements in reasoning, mathematics, coding, world knowledge, and instruction-following tasks.
Advanced
xAI
Chat now
Kimi K2 Thinking
Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, it activates 32 billion parameters per forward pass and supports 256 k-token context windows.
Basic
Moonshot AI
Chat now
MiniMax M2
MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows. With 10 billion activated parameters (230 billion in total), it delivers exceptional performance in multi-file edits, coding-run-fix loops, and test-validated repairs. The model supports a context window of 204,800 tokens and ranks among the top five globally in intelligence, achieving similar scores to Claude Sonnet 4.5 while being highly cost-effective.
Basic
MiniMax
Chat now
DeepSeek-R1
DeepSeek R1 is a cutting-edge AI model developed by DeepSeek that was released as a competitor to OpenAI's o1 model. This model emphasizes strong reasoning capabilities in areas such as complex math, coding, and logic. Designed to compete with leading AI models, it offers both transparency and competitive performance, making it a significant step forward in open-source AI development.
Basic
DeepSeek
Chat now
DeepSeek-V3.2
DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve training and inference efficiency in long-context scenarios while maintaining output quality.
Basic
DeepSeek
Chat now
Llama 4
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
Basic
Meta
Chat now
Mistral Large
Mistral Large is a cutting-edge language model developed by Mistral AI, renowned for its advanced reasoning capabilities. It excels in multilingual tasks, code generation, and complex problem-solving, making it ideal for diverse text-based applications.
Advanced
Mistral AI
Chat now
Mistral Medium 3.1
Mistral Medium 3.1 is a cutting-edge language model designed for enterprise use, offering state-of-the-art performance at 8 times lower cost compared to competitors. It excels in professional applications like coding and multimodal understanding, while providing flexible deployment options and seamless integration into enterprise systems.
Advanced
Mistral AI
Chat now
Perplexity Sonar
The Perplexity Sonar Online model is a state-of-the-art large language model developed by Perplexity AI. It offers real-time internet access, ensuring up-to-date information retrieval. Known for its cost-efficiency, speed, and enhanced performance, it surpasses previous models in the Sonar family, making it ideal for dynamic and accurate data processing.
Advanced
Perplexity
Chat now
Doubao Seed 1.6
Doubao Seed 1.6 is an advanced model from Doubao with enhanced reasoning capabilities and tool calling support.
Advanced
Doubao
Chat now
Qwen3 Plus
Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced performance, speed, and cost combination.
Basic
Alibaba
Chat now
Qwen3 Max
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling, though it does not include a dedicated "thinking" mode.
Advanced
Alibaba
Chat now
Qwen3 VL
Qwen3-VL is a powerful open-source vision-language model with 235B parameters. It delivers comprehensive capabilities across text, image, and video understanding with a native 256K token context window. Key features include Visual Agent functionality for operating computer and mobile GUIs, advanced OCR in 32 languages, enhanced spatial perception and 3D grounding, and visual coding that generates Draw.io/HTML/CSS/JS from images and videos. The model excels at long-video comprehension, object localization, and embodied AI tasks.
Advanced
Alibaba
Chat now
GLM 4.6
GLM-4.6 is the latest flagship foundation model from Zai, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.6 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment.
Basic
Zai
Chat now
Command A
Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases.
Advanced
Cohere
Chat now
Yi-Large
The Yi Large model was designed by 01.AI with the following usecases in mind: knowledge search, data classification, human-like chat bots, and customer service. It stands out for its multilingual proficiency, particularly in Spanish, Chinese, Japanese, German, and French.
Advanced
01.AI
Chat now
WizardLM 2
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art opensource models.
Advanced
Microsoft
Chat now
Amazon Nova Pro
Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December 2024, it achieves state-of-the-art performance on key benchmarks including visual question answering (TextVQA) and video understanding (VATEX).
Advanced
Amazon
Chat now