| Model | Developer | Release date | What it is | Strengths | Caveats | Access |
| --- | --- | --- | --- | --- | --- | --- |
| Claude Sonnet 4.6 | Anthropic | 2026-02-17 | Anthropic’s newer daily-use frontier tier for coding, agents, and professional work. | Very strong default for coding and agentic tasks. | Still premium versus many open-weight options. | Closed |
| Claude Opus 4.6 | Anthropic | 2026-02-05 | Anthropic’s premium reasoning tier for harder repo-scale and planning-heavy tasks. | High quality ceiling for complex reasoning and coding. | Easy to overspend if used as the default tier. | Closed |
| Grok 4.1 | xAI | 2025-11-17 | xAI’s later closed flagship focused on stronger reasoning and production API use. | Competitive frontier-style reasoning and agentic positioning. | Closed ecosystem and smaller enterprise footprint than some rivals. | Closed |
| SmolLM3 | Hugging Face | 2025-07-08 | Hugging Face’s compact multilingual long-context model in the 3B class. | Practical small-model deployment with strong long-context and multilingual positioning. | Not a direct substitute for frontier-scale reasoning models. | Open-weight |
| Gemini 2.5 Flash | Google | 2025-06-17 | Google’s faster and usually cheaper production-tier member of the Gemini 2.5 family. | Better speed and price/performance balance than the flagship tier for many workloads. | Lower absolute reasoning ceiling than Gemini 2.5 Pro. | Closed |
| Codex (codex-1) | OpenAI | 2025-05-16 | OpenAI’s software-engineering agent path built for coding tasks and cloud sandbox execution. | Strong OpenAI ecosystem fit and coding-agent workflow. | Access and pricing are product-path dependent, not one simple model line. | Closed |
| Qwen3 | Qwen | 2025-04-29 | Qwen’s newer family with hybrid thinking modes and larger MoE options. | Strong open-weight flexibility with modern reasoning features. | Deployment and benchmark interpretation still require hands-on evaluation. | Open-weight |
| GPT-4.1 | OpenAI | 2025-04-14 | OpenAI API-first family focused on coding, instruction following, and long context. | Very strong coding and long-context API workflows. | Closed model economics and vendor dependence remain. | Closed |
| Gemini 2.5 Pro | Google | 2025-03-25 | Google’s thinking-oriented Gemini flagship for advanced reasoning and coding. | Strong reasoning, coding, and multimodal reach. | Managed platform complexity is higher than simpler API products. | Closed |
| DeepSeek-V3-0324 | DeepSeek | 2025-03-24 | A stronger post-trained update to the DeepSeek-V3 line with notable coding and reasoning improvements. | Very competitive open-weight quality with stronger coding-oriented tuning than the original V3 launch. | Still a large model family that is costly to self-serve at scale. | Open-weight |
| Gemma 3 | Google | 2025-03-12 | Google’s open-weight model family built to give developers a lighter-weight alternative to Gemini. | Open-weight flexibility with strong small-to-mid-size deployment options. | Not intended to fully replace top closed-model capability. | Open-weight |
| DeepSeek-R1 | DeepSeek | 2025-01-20 | DeepSeek’s reasoning-focused open-weight family that became central to the reasoning-model conversation. | Strong reasoning, math, code, and open-weight mindshare. | Reasoning-style usage can still become expensive to serve at scale. | Open-weight |
| DeepSeek-V3 | DeepSeek | 2024-12-27 | DeepSeek’s flagship open-weight general model with strong frontier comparisons. | Very competitive open-weight quality and strong coding reputation. | Large-scale serving remains non-trivial for most teams. | Open-weight |
| Llama 3.3 70B | Meta | 2024-12-06 | A more practical high-quality Llama tier for strong open-weight deployment without 400B scale. | Good balance of quality and deployment practicality. | Still requires careful infra planning compared with closed APIs. | Open-weight |
| Qwen2.5-Coder | Qwen | 2024-11-12 | Qwen’s open-weight coding family designed specifically for software engineering workloads. | Strong coder specialization with broad size options and open deployment flexibility. | Still requires model-selection discipline and a choice between self-hosting and hosted open-model serving. | Open-weight |
| Qwen2.5 | Qwen | 2024-09-19 | Alibaba’s broad open-weight family spanning many sizes and specialist variants. | Very wide size range, multilingual coverage, strong coder variants. | Family breadth can create selection and governance complexity. | Open-weight |
| Llama 3.2 | Meta | 2024-09-25 | Meta’s extension of the Llama family into lighter text tiers and multimodal variants. | Broader deployment range across edge, lighter text, and multimodal use cases. | Not the strongest Llama option for highest-end reasoning. | Open-weight |
| Mistral Large 2 | Mistral | 2024-07-24 | Mistral’s flagship commercial model line for enterprise-grade reasoning and code. | Strong multilingual and coding capability with enterprise appeal. | Not a straightforward open-weight option despite Mistral’s open-model reputation. | Restricted / commercial |
| Llama 3.1 405B | Meta | 2024-07-23 | Meta’s flagship open-weight frontier-scale model in the Llama 3.1 family. | Strong open-weight quality at very large scale. | Inference and serving costs are high in practice. | Open-weight |
| Claude 3.5 Sonnet | Anthropic | 2024-06-20 | Anthropic’s widely adopted mid-tier model that became a coding and reasoning default. | Excellent repo reasoning, strong cost-to-quality balance. | Closed access and output costs still matter in heavy usage. | Closed |
| Codestral | Mistral | 2024-05-29 | Mistral’s code-generation-focused model family built specifically for software engineering tasks. | Clear coding specialization and strong relevance for code-heavy workflows. | Licensing and access are more constrained than a simple fully open-weight release. | Restricted / commercial |
| GPT-4o | OpenAI | 2024-05-13 | OpenAI’s omni flagship for text, image, audio, and real-time multimodal workflows. | Strong multimodal performance, fast responses, mature ecosystem. | Still premium-priced at scale and fully vendor-managed. | Closed |
| DeepSeek-V2 | DeepSeek | 2024-05-01 | DeepSeek’s MoE family that raised expectations around efficient open-weight inference. | Efficiency story and strong coding-oriented attention from developers. | Now mainly relevant as a historical milestone rather than a current default recommendation. | Open-weight |
| Cohere Command R+ | Cohere | 2024-04-04 | Cohere’s business and retrieval-oriented flagship built for enterprise RAG-style workloads. | Strong fit for enterprise retrieval and internal-data use cases. | Less general market mindshare than OpenAI, Anthropic, or Google. | Closed |
| Grok-1 (open release) | xAI | 2024-03-17 | xAI’s early Grok base model released as open weights under Apache 2.0. | Useful historical example of a major lab releasing an open-weight base model. | Not the practical default choice for most current production teams. | Open-weight |
| Gemini 1.5 Pro | Google | 2024-02-15 | Google’s long-context milestone model in the Gemini family. | Very large context and good multimodal platform integration. | Pricing and platform paths can be harder to model clearly. | Closed |
| Mixtral 8x7B | Mistral | 2023-12-11 | Mistral’s sparse MoE model that helped popularize open MoE deployment economics. | Strong efficiency and good quality for its active-parameter profile. | Older than newer flagship open reasoning families. | Open-weight |
| Zephyr 7B Beta | Hugging Face H4 | 2023-10-25 | A Hugging Face alignment-focused fine-tune that showed how much post-training could improve smaller open models. | Strong instruct behavior for its size and highly instructive as a post-training milestone. | Small compared with current flagship open-weight families. | Open-weight |
| Llama 2 | Meta | 2023-07-18 | Meta’s family that accelerated mainstream adoption of open-weight LLMs. | Major ecosystem impact and broad downstream adaptation. | Now clearly behind newer open-weight families. | Open-weight |
| BLOOM | Hugging Face / BigScience | 2022-07-12 | A major multilingual open model built through the BigScience collaboration and distributed through Hugging Face. | Important multilingual history and open collaboration milestone. | No longer competitive with newer open-weight families on raw quality. | Open-weight |
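A catalog like the one above becomes easier to query once each row is encoded as a record. The sketch below is a minimal illustration (the `ModelEntry` type, the `CATALOG` subset, and the `open_weight_since` helper are all illustrative names, and only a handful of rows from the table are included) that filters for open-weight entries released after a cutoff date, newest first:

```python
from dataclasses import dataclass
from datetime import date

@dataclass(frozen=True)
class ModelEntry:
    name: str
    vendor: str
    released: date
    access: str  # "Open-weight", "Closed", or "Restricted / commercial"

# A small subset of the rows from the table above, for illustration only.
CATALOG = [
    ModelEntry("Claude Sonnet 4.6", "Anthropic", date(2026, 2, 17), "Closed"),
    ModelEntry("Qwen3", "Qwen", date(2025, 4, 29), "Open-weight"),
    ModelEntry("DeepSeek-R1", "DeepSeek", date(2025, 1, 20), "Open-weight"),
    ModelEntry("Mistral Large 2", "Mistral", date(2024, 7, 24), "Restricted / commercial"),
    ModelEntry("Llama 2", "Meta", date(2023, 7, 18), "Open-weight"),
]

def open_weight_since(catalog, cutoff):
    """Open-weight entries released on or after `cutoff`, newest first."""
    hits = [m for m in catalog if m.access == "Open-weight" and m.released >= cutoff]
    return sorted(hits, key=lambda m: m.released, reverse=True)

recent_open = open_weight_since(CATALOG, date(2025, 1, 1))
print([m.name for m in recent_open])  # → ['Qwen3', 'DeepSeek-R1']
```

The same pattern extends naturally to other columns in the table, e.g. grouping by vendor or excluding restricted/commercial licenses before shortlisting deployment candidates.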