Qwen 3

Qwen 3 is the latest large language model in the Qwen family, featuring advanced capabilities in coding, math, and reasoning, outperforming previous models like Qwen2.5 and Gemini-2.5-Pro

Designed for academics, developers, and organizations to create innovative solutions in areas like coding, STEM, and multilingual tasks

Available on platforms like Hugging Face, ModelScope, and Kaggle, with tools like Ollama, LMStudio, and llama.cpp recommended for local use

Supports context lengths up to 128K tokens, enabling efficient processing of longer inputs and complex tasks

Qwen3-MoE models achieve comparable performance to larger Qwen2.5 models with only 10% of the active parameters, reducing training and inference costs

Optimized for tool-calling and agentic tasks, with Qwen-Agent simplifying tool integration and usage

Multilingual Support: Qwen 3 supports 119 languages and dialects, enabling global applications and accessibility

Supports two modes—Thinking Mode for complex reasoning and Non-Thinking Mode for quick responses—allowing task-specific optimization

Both MoE and dense models are open-weighted under the Apache 2.0 license, enabling broad accessibility for developers and researchers

Includes two MoE models (Qwen3-235B-A22B with 235 billion parameters and Qwen3-30B-A3B with 30 billion parameters) and six dense models (ranging from Qwen3-0.6B to Qwen3-32B)