Nebius AI Provider

Nebius AI Studio - OpenAI-compatible API for large language models

Available Models

Gemma 3 27B

google
gemma-3-27b

Providers

Nebius AI
nebius/gemma-3-27b
Context Size
128k
Stability
STABLE
Pricing
Input
$0.27/M
Cached
/M
Output
$0.27/M
Capabilities
Streaming
Vision
Try in Playground

Llama 3.1 8B Instruct

meta
llama-3.1-8b-instruct

Providers

Nebius AI
nebius/llama-3.1-8b-instruct
Context Size
128k
Stability
STABLE
Pricing
Input
$0.02/M
Cached
/M
Output
$0.06/M
Capabilities
Streaming
Try in Playground

Llama 3.1 Nemotron Ultra 253B

meta
llama-3.1-nemotron-ultra-253b

Providers

Nebius AI
nebius/llama-3.1-nemotron-ultra-253b
Context Size
128k
Stability
STABLE
Pricing
Input
$0.60/M
Cached
/M
Output
$1.80/M
Capabilities
Streaming
JSON Output
Try in Playground

Llama 3.3 70B Instruct

meta
llama-3.3-70b-instruct

Providers

Nebius AI
nebius/llama-3.3-70b-instruct
Context Size
128k
Stability
STABLE
Pricing
Input
$0.13/M
Cached
/M
Output
$0.40/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Llama 3.1 405B Instruct

meta
llama-3.1-405b-instruct

Providers

Nebius AI
nebius/llama-3.1-405b-instruct
Context Size
128k
Stability
STABLE
Pricing
Input
$1.00/M
Cached
/M
Output
$3.00/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

DeepSeek V3

deepseek
deepseek-v3

Providers

Nebius AI
nebius/deepseek-v3
Context Size
64k
Stability
unstable
Pricing
Input
$0.50/M
Cached
/M
Output
$1.50/M
Capabilities
Streaming
Try in Playground

DeepSeek R1 (0528)

deepseek
deepseek-r1-0528

Providers

Nebius AI
nebius/deepseek-r1-0528
Context Size
64k
Stability
unstable
Pricing
Input
$0.80/M
Cached
/M
Output
$2.40/M
Capabilities
Streaming
Try in Playground

Kimi K2

moonshot
kimi-k2

Providers

Nebius AI
nebius/kimi-k2
Context Size
131.1k
Stability
STABLE
Pricing
Input
$0.50/M
Cached
/M
Output
$2.40/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen QwQ 32B

alibaba
qwen-qwq-32b

Providers

Nebius AI
nebius/qwen-qwq-32b
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.15/M
Cached
/M
Output
$0.45/M
Capabilities
Streaming
JSON Output
Try in Playground

Qwen3 235B A22B Instruct 2507

alibaba
qwen3-235b-a22b-instruct-2507

Providers

Nebius AI
nebius/qwen3-235b-a22b-instruct-2507
Context Size
262k
Stability
STABLE
Pricing
Input
$0.20/M
Cached
/M
Output
$0.60/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen3 235B A22B Thinking 2507

alibaba
qwen3-235b-a22b-thinking-2507

Providers

Nebius AI
nebius/qwen3-235b-a22b-thinking-2507
Context Size
262k
Stability
unstable
Pricing
Input
$0.20/M
Cached
/M
Output
$0.60/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Try in Playground

Qwen3 14B

alibaba
qwen3-14b

Providers

Nebius AI
nebius/qwen3-14b
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.08/M
Cached
/M
Output
$0.24/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen3 32B

alibaba
qwen3-32b

Providers

Nebius AI
nebius/qwen3-32b
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.10/M
Cached
/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen3 30B A3B

alibaba
qwen3-30b-a3b

Providers

Nebius AI
nebius/qwen3-30b-a3b
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.10/M
Cached
/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen2.5 Coder 7B

alibaba
qwen25-coder-7b

Providers

Nebius AI
nebius/qwen25-coder-7b
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.01/M
Cached
/M
Output
$0.03/M
Capabilities
Streaming
JSON Output
Try in Playground

Qwen2.5 32B Instruct

alibaba
qwen25-32b-instruct

Providers

Nebius AI
nebius/qwen25-32b-instruct
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.06/M
Cached
/M
Output
$0.20/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen2.5 72B Instruct

alibaba
qwen25-72b-instruct

Providers

Nebius AI
nebius/qwen25-72b-instruct
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.13/M
Cached
/M
Output
$0.40/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen2 VL 72B Instruct

alibaba
qwen2-vl-72b-instruct

Providers

Nebius AI
nebius/qwen2-vl-72b-instruct
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.13/M
Cached
/M
Output
$0.40/M
Capabilities
Streaming
Vision
JSON Output
Try in Playground

Qwen2.5 VL 72B Instruct

alibaba
qwen2-5-vl-72b-instruct

Providers

Nebius AI
nebius/qwen2-5-vl-72b-instruct
Context Size
32.8k
Stability
STABLE
Pricing
Input
$0.13/M
Cached
/M
Output
$0.40/M
Capabilities
Streaming
Vision
JSON Output
Try in Playground

Qwen3 Coder 480B A35B Instruct

alibaba
qwen3-coder-480b-a35b-instruct

Providers

Nebius AI
nebius/qwen3-coder-480b-a35b-instruct
Context Size
262k
Stability
STABLE
Pricing
Input
$0.40/M
Cached
/M
Output
$1.80/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen3 Coder 30B A3B Instruct

alibaba
qwen3-coder-30b-a3b-instruct

Providers

Nebius AI
nebius/qwen3-coder-30b-a3b-instruct
Context Size
262k
Stability
STABLE
Pricing
Input
$0.10/M
Cached
/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen3 30B A3B Instruct 2507

alibaba
qwen3-30b-a3b-instruct-2507

Providers

Nebius AI
nebius/qwen3-30b-a3b-instruct-2507
Context Size
262k
Stability
STABLE
Pricing
Input
$0.10/M
Cached
/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Try in Playground

Qwen3 30B A3B Thinking 2507

alibaba
qwen3-30b-a3b-thinking-2507

Providers

Nebius AI
nebius/qwen3-30b-a3b-thinking-2507
Context Size
262k
Stability
STABLE
Pricing
Input
$0.10/M
Cached
/M
Output
$0.30/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Try in Playground

Hermes 3 Llama 405B

nousresearch
hermes-3-llama-405b

Providers

Nebius AI
nebius/hermes-3-llama-405b
Context Size
131.1k
Stability
STABLE
Pricing
Input
$1.00/M
Cached
/M
Output
$3.00/M
Capabilities
Streaming
JSON Output
Try in Playground
    Nebius AI - LLM Gateway