High-Performance LLM
Inference at Low Cost

Serve models like Llama 4 and DeepSeek 3.1 with our high-performance GPU platform. Our Inference APIs make it simple to scale your projects and pay only for what you need.

Run Inference View Docs

DeepSeek-V3.1

$0.15 in | $0.50 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

Get API Key

Kimi-K2-Instruct

$0.30 in | $1.75 out131.07K Context

Kimi K2 is a mixture-of-experts (MoE) language model with 1 trillion total parameters. Kimi K2 excels at knowledge, reasoning, and coding tasks, with strong agentic capabilities.

Get API Key

DeepSeek-R1-0528

$0.25 in | $1.00 out163.84K Context

May 28th update to the original DeepSeek R1 Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens.

Get API Key

Quick Start

import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://inference.cloudrift.ai/v1",
  apiKey: "YOUR_RIFT_API_KEY",
});

const completion = await openai.chat.completions.create({
  model: "llama4:maverick",
  messages: [
    {
      role: "user",
      content: "What is the meaning of life?"
    }
  ],
  stream: true,
});

for await (const chunk of completion) {
  process.stdout.write(chunk.choices[0]?.delta.content as string);
}

Models

All Available Models

Cost-effective access to high-performance models — no queues, no GPUs to reserve. Just straightforward model options you can build on.

DeepSeek-V3.1

$0.15 in | $0.50 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

Get API Key

Kimi-K2-Instruct

$0.30 in | $1.75 out131.07K Context

Kimi K2 is a mixture-of-experts (MoE) language model with 1 trillion total parameters. Kimi K2 excels at knowledge, reasoning, and coding tasks, with strong agentic capabilities.

Get API Key

DeepSeek-R1-0528

$0.25 in | $1.00 out163.84K Context

May 28th update to the original DeepSeek R1 Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens.

Get API Key

DeepSeek-V3

$0.15 in | $0.40 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

Get API Key

Get in touch

Ready to get started?

Get in touch with our team to discuss your requirements and find the right solution for your infrastructure.

High-Performance LLMInference at Low Cost

DeepSeek-V3.1

Kimi-K2-Instruct

DeepSeek-R1-0528

Quick Start

All Available Models

DeepSeek-V3.1

Kimi-K2-Instruct

DeepSeek-R1-0528

DeepSeek-V3

Ready to get started?

High-Performance LLM
Inference at Low Cost