Qwen3 30B A3B

Instruct
Code
Reasoning
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. With groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support, Qwen3 introduces key innovations like the ability to seamlessly switch between thinking mode (for complex logical reasoning, math, and coding) and non-thinking mode (for efficient, general-purpose dialogue) within a single model. This ensures optimal performance across various tasks and scenarios. Qwen3 significantly enhances its reasoning capabilities, surpassing previous models such as QwQ (in thinking mode) and Qwen2.5 (in non-thinking mode) in mathematics, code generation, and commonsense logical reasoning. The model excels in human preference alignment, particularly in creative writing, role-playing, multi-turn dialogues, and instruction following, delivering a more natural and engaging conversational experience. It also boasts expertise in agent capabilities, allowing for precise integration with external tools in both modes, achieving top-tier performance in complex agent-based tasks. With support for over 100 languages and dialects, Qwen3 offers strong multilingual instruction-following and translation capabilities, making it a powerful tool for global applications.
Provider
Context Size
Max Output
Cost
Speed

nebius_fast

128K

128K

NaN/M

155.00 tps

nebius_fdt

128K

128K

NaN/M

155.00 tps

nebius_slow

128K

128K

NaN/M

155.00 tps

nebiusf

128K

128K

NaN/M

155.00 tps

API Usage

Seamlessly integrate our API into your project by following these simple steps:

  1. Generate your API key from your profile.
  2. Copy the example code and replace the placeholder with your API key or see our documentation.

You can choose from three automatic provider selection preferences:

  • speed – Prioritizes the provider with the fastest response time.
  • cost – Selects the most cost-efficient provider.
  • balanced – Offers an optimal mix of speed and cost.