The Raziel gateway exposes two protocol surfaces: a drop-in replacement for the Anthropic Messages API and an OpenAI-compatible chat completions endpoint. Both use yourDocumentation Index
Fetch the complete documentation index at: https://docs.raziel.fun/llms.txt
Use this file to discover all available pages before exploring further.
rz_live_… key for authentication and support streaming.
Anthropic-compatible endpoint
Send requests tohttps://raziel.fun/gateway/v1/messages in the same format you would send them to https://api.anthropic.com/v1/messages. The gateway forwards the request body unchanged to a live provider and returns the response in the same Anthropic shape.
When using the
razi claude command, requests go through a local proxy at localhost:3099. The raziel.fun domain never appears in Claude Code’s network traffic.Request
Parameters
The model to use. See available models below.
Array of message objects with
role (user or assistant) and content.Maximum number of tokens to generate in the response.
Set to
true to receive a server-sent events stream. The gateway passes streaming through transparently.System prompt passed directly to the model.
Sampling temperature between 0 and 1.
Tool definitions for function calling. Passed through to the provider unchanged.
Example: streaming request
OpenAI-compatible endpoint
The gateway also speaks the OpenAI chat completions format athttps://raziel.fun/gateway/openai/v1. Use it with any client that accepts a custom base URL — the OpenAI SDK, Hermes, Continue, LibreChat, and others.
Endpoints
| Method | Path | Description |
|---|---|---|
POST | /gateway/openai/v1/chat/completions | Create a chat completion |
GET | /gateway/openai/v1/models | List available models |
Request
Parameters
The model identifier. See available models below.
Array of message objects with
role and content.Maximum number of tokens to generate.
Sampling temperature between 0 and 2.
Set to
true to receive a server-sent events stream.OpenAI-format tool definitions for function calling.
Tool selection strategy:
"auto", "none", "required", or a specific tool object.Up to 4 sequences where the API will stop generating.
Nucleus sampling parameter.
Example
Available models
| Model ID | Description |
|---|---|
claude-opus-4-7 | Claude Opus 4.7 — most capable |
claude-sonnet-4-6 | Claude Sonnet 4.6 — balanced performance and speed |
claude-haiku-4-5-20251001 | Claude Haiku 4.5 — fastest responses |
GET /gateway/openai/v1/models to retrieve the current model list programmatically.
Authentication
Both endpoints use the sameAuthorization: Bearer scheme.
x-api-key:
