Gateway API reference: Anthropic and OpenAI endpoints

The Raziel gateway exposes two protocol surfaces: a drop-in replacement for the Anthropic Messages API and an OpenAI-compatible chat completions endpoint. Both use your rz_live_… key for authentication and support streaming.

Anthropic-compatible endpoint

Send requests to https://raziel.fun/gateway/v1/messages in the same format you would send them to https://api.anthropic.com/v1/messages. The gateway forwards the request body unchanged to a live provider and returns the response in the same Anthropic shape.

When using the razi claude command, requests go through a local proxy at localhost:3099. The raziel.fun domain never appears in Claude Code’s network traffic.

Request

POST https://raziel.fun/gateway/v1/messages
Authorization: Bearer rz_live_xxxxxxxxxxxx
Content-Type: application/json
anthropic-version: 2023-06-01

{
  "model": "claude-sonnet-4-6",
  "max_tokens": 1024,
  "messages": [
    { "role": "user", "content": "Hello" }
  ]
}

Parameters

model

string

required

The model to use. See available models below.

messages

array

required

Array of message objects with role (user or assistant) and content.

max_tokens

integer

required

Maximum number of tokens to generate in the response.

stream

boolean

Set to true to receive a server-sent events stream. The gateway passes streaming through transparently.

system

string

System prompt passed directly to the model.

temperature

number

Sampling temperature between 0 and 1.

tools

array

Tool definitions for function calling. Passed through to the provider unchanged.

Example: streaming request

curl https://raziel.fun/gateway/v1/messages \
  -H "Authorization: Bearer rz_live_xxxxxxxxxxxx" \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-sonnet-4-6",
    "max_tokens": 512,
    "stream": true,
    "messages": [{"role": "user", "content": "Hello"}]
  }'

OpenAI-compatible endpoint

The gateway also speaks the OpenAI chat completions format at https://raziel.fun/gateway/openai/v1. Use it with any client that accepts a custom base URL — the OpenAI SDK, Hermes, Continue, LibreChat, and others.

Endpoints

Method	Path	Description
`POST`	`/gateway/openai/v1/chat/completions`	Create a chat completion
`GET`	`/gateway/openai/v1/models`	List available models

Request

POST https://raziel.fun/gateway/openai/v1/chat/completions
Authorization: Bearer rz_live_xxxxxxxxxxxx
Content-Type: application/json

{
  "model": "claude-sonnet-4-6",
  "messages": [
    { "role": "user", "content": "Hello" }
  ],
  "max_tokens": 512
}

Parameters

model

string

required

The model identifier. See available models below.

messages

array

required

Array of message objects with role and content.

max_tokens

integer

Maximum number of tokens to generate.

temperature

number

Sampling temperature between 0 and 2.

stream

boolean

Set to true to receive a server-sent events stream.

tools

array

OpenAI-format tool definitions for function calling.

tool_choice

string | object

Tool selection strategy: "auto", "none", "required", or a specific tool object.

stop

string | array

Up to 4 sequences where the API will stop generating.

top_p

number

Nucleus sampling parameter.

The following fields are accepted but silently ignored: logprobs, n, response_format, presence_penalty, frequency_penalty, seed. Requests that include them will succeed, but these parameters have no effect.

Example

from openai import OpenAI

client = OpenAI(
    base_url="https://raziel.fun/gateway/openai/v1",
    api_key="rz_live_xxxxxxxxxxxx",
)

resp = client.chat.completions.create(
    model="claude-sonnet-4-6",
    messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)

Available models

Model ID	Description
`claude-opus-4-7`	Claude Opus 4.7 — most capable
`claude-sonnet-4-6`	Claude Sonnet 4.6 — balanced performance and speed
`claude-haiku-4-5-20251001`	Claude Haiku 4.5 — fastest responses

Use GET /gateway/openai/v1/models to retrieve the current model list programmatically.

Authentication

Both endpoints use the same Authorization: Bearer scheme.

Authorization: Bearer rz_live_xxxxxxxxxxxx

Alternatively, for the Anthropic-compatible endpoint you can pass your key as x-api-key:

x-api-key: rz_live_xxxxxxxxxxxx

Issue and manage keys from Customer → API Keys in the dashboard. Keys are shown in full only once at creation — store them immediately.

Get Started

Router

Teams

Billing

Reference

Gateway API reference: Anthropic and OpenAI endpoints

Anthropic-compatible endpoint

Request

Parameters

Example: streaming request

OpenAI-compatible endpoint

Endpoints

Request

Parameters

Example

Available models

Authentication

Get Started

Router

Teams

Billing

Reference

Documentation Index

​Anthropic-compatible endpoint

​Request

​Parameters

​Example: streaming request

​OpenAI-compatible endpoint

​Endpoints

​Request

​Parameters

​Example

​Available models

​Authentication

Anthropic-compatible endpoint

Request

Parameters

Example: streaming request

OpenAI-compatible endpoint

Endpoints

Request

Parameters

Example

Available models

Authentication