
Deploy AI Models

One-click deployment for the latest open-source AI models. Run DeepSeek, Llama 4, and more with serverless inference or dedicated GPU infrastructure.

All Available Models

Model | Parameters | Category | Context
GPT OSS 120B | 120B | General Purpose | 8K
DeepSeek V3 0324 | 671B MoE | General Purpose | 64K
Llama 4 Maverick 17B 128E Instruct | 17B x 128E | MoE | 128K
Llama 4 Scout 17B 16E Instruct | 17B x 16E | MoE | 128K
DeepSeek V3 | 671B MoE | General Purpose | 64K
DeepSeek R1 | 671B MoE | Reasoning | 64K
Dolphin 2.9.2 Mistral 8x22B | 8x22B MoE | Uncensored | 64K
Sarvam-2B | 2B | Multilingual | 4K
Hermes 3 Llama 3.1 405B | 405B | Function Calling | 128K

Deployment Options

Choose how you want to run your AI models.

Serverless API

Pay-per-token pricing with instant scaling

  • No GPU management
  • Auto-scaling to zero
  • Pay only for usage
  • Sub-second latency
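With pay-per-token billing, the cost of a request is simply the token counts multiplied by the per-token rates. As a rough sketch (the rates below are illustrative placeholders, not the platform's actual pricing):

```python
# Hypothetical pay-per-token cost estimate. The rates are illustrative
# placeholders, not the platform's actual pricing.
INPUT_RATE_PER_M = 0.50   # USD per 1M input tokens (assumed)
OUTPUT_RATE_PER_M = 1.50  # USD per 1M output tokens (assumed)

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for a single request."""
    return (input_tokens / 1_000_000) * INPUT_RATE_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_RATE_PER_M

# e.g. a request with 2,000 prompt tokens and 500 completion tokens
print(round(estimate_cost(2_000, 500), 6))  # 0.00175 at the assumed rates
```

Because billing scales to zero with usage, idle deployments accrue no cost under this model.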

Dedicated Instance

Reserved GPU capacity for consistent performance

  • Guaranteed capacity
  • Custom fine-tuning
  • VPC deployment
  • SLA guarantees

Platform Features

One-Click Deploy

Deploy any model in seconds with pre-optimized configurations.

OpenAI-Compatible API

Drop-in replacement for OpenAI API with minimal code changes.
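An OpenAI-compatible endpoint accepts the same wire format as the OpenAI API, so existing clients only need a different base URL and key. A minimal sketch, using only the standard library; the base URL and model name below are placeholders, not the platform's real values:

```python
import json
import urllib.request

BASE_URL = "https://api.example.com/v1"  # placeholder; use your platform's endpoint
API_KEY = "YOUR_API_KEY"                 # placeholder

def build_chat_request(model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat-completions request in the
    OpenAI wire format, which compatible platforms accept as-is."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request("deepseek-v3", "Hello!")
print(req.full_url)
```

If you already use the official OpenAI SDK, the equivalent change is passing `base_url` and `api_key` when constructing the client; the rest of the code stays the same.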

Auto-Scaling

Scale from zero to thousands of requests automatically.

Fine-Tuning Ready

Customize models on your data with built-in fine-tuning.

Private Deployment

Deploy in your VPC for data privacy and compliance.

Usage Analytics

Monitor costs, latency, and usage with detailed dashboards.

Start Deploying AI Models

Get started with our free tier. No credit card required.