Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. With groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support, Qwen3 introduces key innovations like the ability to seamlessly switch between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue) within a single model. This ensures optimal performance across various tasks and scenarios. Qwen3 significantly enhances its reasoning capabilities, surpassing previous models such as QwQ (in thinking mode) and Qwen2.5 (in non-thinking mode) in mathematics, code generation, and commonsense logical reasoning. The model excels in human preference alignment, particularly in creative writing, role-playing, multi-turn dialogues, and instruction following, delivering a more natural and engaging conversational experience. It also boasts expertise in agent capabilities, allowing for precise integration with external tools in both modes, achieving top-tier performance in complex agent-based tasks. With support for over 100 languages and dialects, Qwen3 offers strong multilingual instruction-following and translation capabilities, making it a powerful tool for global applications.
Provider
Context Size
Max Output
Cost
Speed
nebius_fast
128K
128K
€NaN/M
155.00 tps
nebius_fdt
128K
128K
€NaN/M
155.00 tps
nebius_slow
128K
128K
€NaN/M
155.00 tps
nebiusf
128K
128K
€NaN/M
155.00 tps
API Usage
Seamlessly integrate our API into your project by following these simple steps: