Best LLMs

Large language models for general-purpose text generation, reasoning, summarization, and conversation. These models can understand complex instructions and produce high-quality, coherent text responses.

Top Pick

Coming Soon

Best rated

MiniMax M3

by MiniMax

MiniMax M3 is MiniMax's new flagship open-weight model for coding, agentic execution, and multimodal reasoning. It is built around the MiniMax Sparse Attention architecture, supports up to 1 million tokens of context, accepts image and video input in addition to text, and is designed for long-running software, research, browsing, and desktop-operation workflows that need strong tool use and sustained multi-step performance.

Featured Models

Top-performing models in this category, recommended by our community and performance benchmarks.

API Only

Gemini 3.5 Flash

by Google

Gemini 3.5 Flash is Google’s most intelligent Flash-series multimodal model for sustained frontier performance on agentic and coding tasks. It accepts text, images, video, audio, and PDFs, and is designed for long-horizon workflows, sub-agent orchestration, complex coding loops, multimodal understanding, and high-speed reasoning at production scale.

API Only

Claude Opus 4.8

by Anthropic

Claude Opus 4.8 is Anthropic's highest-capability Claude model. It is built for demanding coding, agent orchestration, multimodal reasoning, and professional workflows that need strong instruction following, adaptive and extended thinking, high-resolution vision, and a 1M-token context window.

API Only

Grok 4.3

by xAI

Grok 4.3 is xAI's flagship language model for agentic reasoning, strong instruction following, and minimal hallucinations. It supports text and image input, a 1 million token context window, configurable reasoning effort including non-reasoning mode, function calling, and structured outputs for production assistants, coding workflows, and long-context analysis.

API Only

DeepSeek-V4-Flash

by DeepSeek

DeepSeek-V4-Flash is DeepSeek's fast, efficient, and cost-focused frontier language model for coding, reasoning, and agent workflows. It supports both thinking and non-thinking modes, a 1M token context window, up to 384K output tokens, tool calls, JSON output, and efficient long-context operation for software, research, and structured professional tasks.

API Only

Gemma 4 31B

by Google

Gemma 4 31B is Google's flagship dense open-weights model in the Gemma 4 family. It combines strong reasoning, coding performance, native function calling, multimodal understanding across text, image, and video, and a 256K context window in a 31B-parameter open model designed for local and cloud deployment.

API Only

Kimi K2.6

by Moonshot AI

Kimi K2.6 is Moonshot AI's latest flagship open model for coding, reasoning, multimodal understanding, and agentic execution. It is designed for long-horizon software tasks, reliable tool use, autonomous multi-step workflows, coordinated agent swarms, and visual understanding across image and video inputs in addition to text.

API Only

GPT-5.5

by OpenAI

GPT-5.5 is OpenAI's newest frontier model for complex professional work, with strong performance in coding, reasoning, and tool-using workflows. It supports a 1,050,000 token context window, 128,000 max output tokens, configurable reasoning effort, image input, and a broad tool stack including web search, file search, code interpreter, hosted shell, apply patch, skills, MCP, tool search, and computer use.

GLM-5.1

by Z.ai

GLM-5.1 is Z.ai’s flagship language model for agentic engineering, coding, reasoning, and tool-driven workflows. It supports a 200K token context window with up to 128K output tokens, deep thinking, function calling, structured output, and streaming tool calls, and is designed to stay effective over long multi-step sessions rather than only short-horizon tasks.

#10

Gemini 3.1 Pro

by Google

Gemini 3.1 Pro is Google’s flagship multimodal language model that processes text alongside images, audio, video, code, and documents. It offers high-performance reasoning, complex instruction following, and deep contextual understanding for a wide range of tasks across language, analysis, and problem solving.

#11

API Only

Claude Opus 4.7

by Anthropic

Claude Opus 4.7 is Anthropic's highest-capability generally available Claude model. It is designed for demanding coding, agent orchestration, multimodal reasoning, and high-stakes professional workflows, with stronger instruction following, better high-resolution vision, adaptive thinking, and a 1M-token context window.

#12

API Only

Claude Sonnet 4.6

by Anthropic

Claude Sonnet 4.6 is Anthropic's most capable Sonnet model, built for daily production use across coding, agent workflows, long-context reasoning, computer use, and professional knowledge work. It supports adaptive and extended thinking, strong instruction following, high-volume automation, and a 1M-token context window in beta.

#13

MiniMax M2.7

by MiniMax

MiniMax M2.7 is a long-context LLM designed for agentic workflows across software engineering, search and tool use, and high-value office productivity tasks. It’s built for multi-step execution, with strong instruction following and dependable task decomposition, making it a solid default for production assistants that write code, call tools, and handle complex document workflows.

#14

MiniMax M2.7 Highspeed

by MiniMax

MiniMax M2.7-Highspeed is the performance-tuned variant of M2.7, built for lower latency and higher throughput while keeping output behavior consistent with the standard model. It’s a strong fit for interactive coding agents, tool-calling pipelines, and office automation flows where responsiveness matters.

#15

API Only

Claude Haiku 4.5

by Anthropic

Claude Haiku 4.5 is Anthropic's fastest and most cost-efficient Claude model. It is built for latency-sensitive applications, high-volume agents, sub-agent orchestration, coding assistance, and budget-conscious deployments that still need strong reasoning and multimodal understanding.

#16

Gemini 3 Flash

by Google

Gemini 3 Flash is Google’s flagship multimodal language model that processes text alongside images, audio, video, code, and documents. It offers high-performance reasoning, complex instruction following, and deep contextual understanding for a wide range of tasks across language, analysis, and problem solving.

#17

MiniMax M2.5

by MiniMax

MiniMax-M2.5 is MiniMax’s latest frontier model, optimized for fast, low-cost agentic workflows across coding, search/tool use, and high-value office tasks. Trained with large-scale reinforcement learning in complex real-world environments, it delivers strong reasoning, efficient task decomposition, and high-quality outputs for production assistants and enterprise workflows.

#18

Gemini 3.1 Flash Lite

by Google

Gemini 3.1 Flash Lite is Google’s flagship multimodal language model that processes text alongside images, audio, video, code, and documents. It offers high-performance reasoning, complex instruction following, and deep contextual understanding for a wide range of tasks across language, analysis, and problem solving.

#19

GPT-5.4

by OpenAI

GPT-5.4 is OpenAI's flagship large language model, featuring a 1 million token context window, native computer use, and a 33% reduction in factual errors over GPT-5.2. It integrates coding capabilities from GPT-5.3-Codex, is 47% more token-efficient, and supports configurable reasoning effort for complex professional tasks.

#20

Coming Soon

GPT-5.4 Pro

by OpenAI

GPT-5.4 Pro is the high-performance variant of GPT-5.4, optimized for enterprise-grade professional tasks. It offers deeper reasoning, enhanced accuracy, and extended compute for complex multi-step workflows including document creation, spreadsheet analysis, and autonomous agent orchestration. It shares the 1 million token context window and native computer use capabilities of the standard GPT-5.4.

#21

GPT-5.4 Mini

by OpenAI

GPT-5.4 Mini is a compact, efficient variant of GPT-5.4 designed for coding assistants, sub-agent orchestration, and multimodal applications requiring faster responsiveness. It supports a 400K token context window and retains native computer use and configurable reasoning effort at a lower cost than the flagship model.

#22

GPT-5.4 Nano

by OpenAI

GPT-5.4 Nano is the smallest and fastest variant of GPT-5.4, designed for high-throughput, low-latency tasks such as classification, data extraction, ranking, and lightweight automation. It prioritizes speed and cost efficiency for simple, high-volume workloads and is available exclusively via the API.

#23

GLM-4.7

by Z.ai

GLM-4.7 is a 358 billion parameter Mixture-of-Experts language model from Z.ai optimized for agentic coding, complex reasoning, and long-horizon tasks. It features interleaved thinking, preserved thinking for multi-turn consistency, and turn-level thinking control. It supports a 200K token context window with 128K max output, tool calling, and achieves 73.8% on SWE-bench Verified.
