Introduction
#Sogni Intelligence
Sogni Intelligence is an OpenAI-compatible LLM inference service powered by the Sogni Supernet, a decentralized network of GPU workers.
The service is frequently updated with the latest open-source LLMs, including Qwen3.6-35B-A3B, released in April 2026. Sogni Intelligence supports multi-turn conversations, media analysis, streaming, and agentic decision making with tool calling.
Sogni Platform Tools: Sogni Intelligence includes access to a suite of generative media services for media creation and editing, surfaced as Sogni Platform Tools. These tools make the latest image, video, and music models available via Sogni's distributed GPU network, which keeps cold starts to a minimum and task concurrency high.
Sogni Intelligence can be accessed through the Sogni Node / JavaScript Client SDK or through the `/chat/completions` REST API by both humans and agents.
Because Sogni Intelligence follows the OpenAI Chat Completions API format, any application or tool that supports a custom OpenAI-compatible endpoint can use the Sogni Supernet as a distributed backend. Just change the base URL and API key; everything else stays the same.
From prompt-expansion to media analysis to agentic multi-tool, multi-turn generative content creation, you can see Sogni Intelligence in action in Sogni SuperApps including Sogni Chat, Sogni 360, Sogni Makeover, Sogni Image Restore, Sogni Photobooth, and Sogni Create.
#OpenAI API Compatibility
Sogni Intelligence implements the following OpenAI-compatible endpoints:
| Endpoint | Method | Description |
|---|---|---|
| `/v1/chat/completions` | POST | Create a chat completion (streaming and non-streaming) |
| `/v1/models` | GET | List available models |
| `/v1/models/:model_id` | GET | Get details for a specific model |
You can use these endpoints with the OpenAI Python SDK, the OpenAI Node.js SDK, curl, or any OpenAI-compatible client.
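The chat completions endpoint can stream its output. Below is a minimal sketch of reading such a stream, assuming Sogni emits the same server-sent event framing as the OpenAI API (`data: <json>` lines, terminated by `data: [DONE]`). The function name `iter_stream_text` and the sample events are illustrative and hand-written, not captured from the live service.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines (assumed format)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and SSE comments
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # end-of-stream sentinel used by the OpenAI format
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


# Hand-written sample events for illustration only.
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "data: [DONE]",
]
print("".join(iter_stream_text(sample)))  # prints "Hello"
```

In a real client, `lines` would be the decoded lines of the HTTP response body for a request sent with `"stream": true`.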
#Base URL
Base URL: https://api.sogni.ai
All API requests go to https://api.sogni.ai/v1/...
#Available Models
| Model ID | Display Name | Context Window | Notes |
|---|---|---|---|
| `qwen3.6-35b-a3b-gguf-iq4xs` | Qwen 3.6 35B-A3B | 262K tokens | Default model. Qwen 3.6 with reasoning and tool calling. |
| `qwen3.5-35b-a3b-abliterated-gguf-q4km` | Qwen 3.5 35B-A3B Abliterated | 64K tokens | Uncensored variant. |
Use GET /v1/models to see the current list of available models at any time.
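As a rough illustration, the model list could be fetched from Python with only the standard library. This sketch assumes the response follows the OpenAI list shape (`{"data": [{"id": ...}, ...]}`); the helper names are hypothetical, and `fetch_model_ids` is the only function that touches the network.

```python
import json
import urllib.request

BASE_URL = "https://api.sogni.ai"


def models_request(api_key: str) -> urllib.request.Request:
    """Build the authenticated GET /v1/models request."""
    return urllib.request.Request(
        f"{BASE_URL}/v1/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def model_ids(listing: dict) -> list[str]:
    """Extract model IDs, assuming the OpenAI-style list response shape."""
    return [entry["id"] for entry in listing.get("data", [])]


def fetch_model_ids(api_key: str) -> list[str]:
    """Perform the network call; requires a valid API key."""
    with urllib.request.urlopen(models_request(api_key), timeout=30) as resp:
        return model_ids(json.load(resp))
```

Calling `fetch_model_ids("YOUR_API_KEY")` should then return the IDs currently served, if the response shape matches the assumption.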
#Pricing
Sogni Intelligence usage is billed in Spark or SOGNI tokens, not USD.
| Token type | Approximate rate |
|---|---|
| Input tokens | ~$0.30 per 1M tokens |
| Output tokens | ~$0.90 per 1M tokens |
You receive free starter tokens when you create an account, and you can claim a daily bonus in the app. See Supernet & Rewards for more details on acquiring tokens.
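To get a feel for these rates, here is a small worked estimate. It treats the approximate dollar figures in the table as exact, so the result is only a ballpark; actual charges are made in Spark or SOGNI tokens. The function name is illustrative.

```python
INPUT_USD_PER_MILLION = 0.30   # approximate rate from the pricing table
OUTPUT_USD_PER_MILLION = 0.90  # approximate rate from the pricing table


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough dollar-equivalent cost of one request."""
    return (input_tokens * INPUT_USD_PER_MILLION
            + output_tokens * OUTPUT_USD_PER_MILLION) / 1_000_000


# 2,000 prompt tokens and 500 completion tokens:
# 2000 * 0.30 / 1e6 + 500 * 0.90 / 1e6 = 0.0006 + 0.00045 = 0.00105
print(f"{estimate_cost_usd(2_000, 500):.5f}")  # prints 0.00105
```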
#Getting Your API Key
- Create a free account at app.sogni.ai if you haven't already.
- Open your Dashboard: go to dashboard.sogni.ai and log in. You'll land on the Account & Wallet tab.
- Go to the Developer section: on the right-hand side of the dashboard, locate the Developer section and click API Key.
- Copy your API key: a dialog will appear showing your API key (masked by default). Use the eye icon to reveal the full key, or the copy icon to copy it to your clipboard.
You can Regenerate your API key if it has been compromised, or Disable it if you no longer need access. Regenerating will invalidate your previous key, so make sure to update it in all your applications.
Your API key can also be used with the Sogni SDK or to run a Sogni Fast Worker.
Include the key as a Bearer token in all requests:
Authorization: Bearer YOUR_API_KEY
Make sure your account has sufficient SOGNI or Spark Point balance before making API calls. You can check your balance on the Account & Wallet tab and top up by purchasing SOGNI tokens directly from the dashboard.
#API Reference Documentation for Agents
For the complete Sogni LLM API reference, see: https://api.sogni.ai/docs/llm-api.md
#Quick Test
Verify your API key works with a simple curl request:
curl https://api.sogni.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "qwen3.6-35b-a3b-gguf-iq4xs",
"messages": [
{"role": "user", "content": "Hello! What model are you?"}
]
}'
You should receive a JSON response with the model's reply in choices[0].message.content.
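The same check can be sketched in Python using only the standard library. The helper names (`chat_request`, `extract_reply`, `ask`) are illustrative, not part of any Sogni SDK; the request body mirrors the curl example, and `ask` is the only function that performs a network call.

```python
import json
import urllib.request

API_URL = "https://api.sogni.ai/v1/chat/completions"


def chat_request(api_key: str, prompt: str,
                 model: str = "qwen3.6-35b-a3b-gguf-iq4xs") -> urllib.request.Request:
    """Build a POST request equivalent to the curl example."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def extract_reply(response: dict) -> str:
    """Return the reply text from choices[0].message.content."""
    return response["choices"][0]["message"]["content"]


def ask(api_key: str, prompt: str) -> str:
    """Send the request and return the reply; requires a valid API key."""
    with urllib.request.urlopen(chat_request(api_key, prompt), timeout=120) as resp:
        return extract_reply(json.load(resp))
```

For example, `ask("YOUR_API_KEY", "Hello! What model are you?")` would return the reply text as a string.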
#Supported top-level fields
- model: model ID to use
- messages: conversation history
- stream: set true for SSE streaming of result tokens in real-time instead of a single response payload.
- max_tokens: max output tokens
- temperature, top_p, frequency_penalty, presence_penalty, stop: standard OpenAI-style sampling controls
- tools, tool_choice: custom tools/function calling
- sogni_tools: set false to disable auto-injected Sogni media tools
- sogni_tool_execution: set false to disable automatic Sogni tool execution
- task_profile: optional preset hint, one of `general`, `coding`, or `reasoning`. This can be used to automatically set inference properties like `top_p` and `frequency_penalty` to optimized presets for your task.
- chat_template_kwargs: optional backend-specific chat template settings, such as turning thinking mode off or on: {"enable_thinking": false}
- token_type: billing token preference, one of `auto`, `spark`, or `sogni`
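The fields above could be assembled with a small helper that rejects unknown `task_profile` and `token_type` values before the request is sent. This is a client-side sketch only: `build_payload` is a hypothetical helper, and the validation reflects the value lists in this section rather than any documented server behavior.

```python
from typing import Any, Optional

TASK_PROFILES = {"general", "coding", "reasoning"}
TOKEN_TYPES = {"auto", "spark", "sogni"}


def build_payload(
    messages: list[dict],
    model: str = "qwen3.6-35b-a3b-gguf-iq4xs",
    *,
    task_profile: Optional[str] = None,
    token_type: Optional[str] = None,
    enable_thinking: Optional[bool] = None,
    sogni_tools: Optional[bool] = None,
    **sampling: Any,  # e.g. temperature, top_p, max_tokens, stream
) -> dict:
    """Assemble a chat completions body, omitting fields left unset."""
    if task_profile is not None and task_profile not in TASK_PROFILES:
        raise ValueError(f"task_profile must be one of {sorted(TASK_PROFILES)}")
    if token_type is not None and token_type not in TOKEN_TYPES:
        raise ValueError(f"token_type must be one of {sorted(TOKEN_TYPES)}")

    payload: dict = {"model": model, "messages": messages}
    optional = {
        "task_profile": task_profile,
        "token_type": token_type,
        "sogni_tools": sogni_tools,
    }
    payload.update({k: v for k, v in optional.items() if v is not None})
    if enable_thinking is not None:
        payload["chat_template_kwargs"] = {"enable_thinking": enable_thinking}
    payload.update(sampling)
    return payload


payload = build_payload(
    [{"role": "user", "content": "Summarize SSE in one sentence."}],
    task_profile="general",
    enable_thinking=False,
    sogni_tools=False,   # plain text only, no injected media tools
    temperature=0.2,
    max_tokens=200,
)
```

Leaving a field unset keeps it out of the body entirely, so the service applies whatever default it uses for that field.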
#Sogni Tools
Sogni chat completions can act as a media-generation agent, not just a text model.
By default, POST /v1/chat/completions automatically injects six built-in Sogni tools so the model can decide when to generate media:
- sogni_generate_image: text-to-image generation
- sogni_edit_image: image editing, likeness preservation, and multi-reference image generation
- sogni_generate_video: text-to-video and image-to-video generation
- sogni_sound_to_video: audio-driven video, music-video, and lip-sync style generation
- sogni_video_to_video: restyling or transforming an existing video
- sogni_generate_music: songs, beats, ambience, and instrumental music generation
These tools let the model generate structured tool_calls for Sogni media workflows.
If you want plain text only, disable tool injection by setting sogni_tools to false.
Example: automatic image generation
curl https://api.sogni.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "qwen3.6-35b-a3b-gguf-iq4xs",
"messages": [
{
"role": "user",
"content": "Generate a cinematic image of a neon-lit alley in Tokyo during rain."
}
]
}'
You should receive a JSON response with the model's reply in choices[0].message.content.
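When a response carries `tool_calls` rather than plain text, a client might unpack them as sketched below. This assumes the calls use the OpenAI layout (`message.tool_calls[].function.name` plus a JSON-encoded `arguments` string). Whether a given response contains them may depend on settings such as sogni_tool_execution, which is not verified here. The sample response is hand-written, and its `prompt` argument name is hypothetical.

```python
import json


def extract_tool_calls(response: dict) -> list[tuple[str, dict]]:
    """Return (tool_name, arguments) pairs from an OpenAI-style response."""
    message = response["choices"][0]["message"]
    calls = []
    for call in message.get("tool_calls") or []:
        fn = call["function"]
        # In the OpenAI format, arguments arrive as a JSON-encoded string.
        calls.append((fn["name"], json.loads(fn["arguments"])))
    return calls


# Hand-written sample; the "prompt" argument name is an assumption.
sample_response = {
    "choices": [{
        "message": {
            "role": "assistant",
            "content": None,
            "tool_calls": [{
                "id": "call_1",
                "type": "function",
                "function": {
                    "name": "sogni_generate_image",
                    "arguments": '{"prompt": "neon-lit alley in Tokyo during rain"}',
                },
            }],
        },
    }],
}

for name, args in extract_tool_calls(sample_response):
    print(name, args)
```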
#Developer role support
The API accepts developer messages for compatibility with agentic coding tools such as Codex, OpenCode, and similar clients. If you send a developer message and do not set task_profile, the API defaults it to coding.
Example:
curl https://api.sogni.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "qwen3.6-35b-a3b-gguf-iq4xs",
"messages": [
{
"role": "developer",
"content": "You are a careful coding assistant. Explain changes before making risky suggestions."
},
{
"role": "user",
"content": "Write a React hook called useDebouncedValue in TypeScript."
}
]
}'
You should receive a JSON response with the model's reply in choices[0].message.content.
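The defaulting rule for developer messages can be mirrored on the client to predict which profile a request will get. This is a sketch of the behavior as described in this section, not Sogni's server code; the function name is hypothetical, and `None` here means "no profile implied by this rule".

```python
from typing import Optional


def effective_task_profile(payload: dict) -> Optional[str]:
    """Predict the task_profile implied by the documented defaulting rule."""
    explicit = payload.get("task_profile")
    if explicit is not None:
        return explicit  # an explicit setting always wins
    roles = (m.get("role") for m in payload.get("messages", []))
    if "developer" in roles:
        return "coding"  # documented default when a developer message is sent
    return None  # the rule says nothing about this case


request_body = {
    "model": "qwen3.6-35b-a3b-gguf-iq4xs",
    "messages": [
        {"role": "developer", "content": "You are a careful coding assistant."},
        {"role": "user", "content": "Write a React hook called useDebouncedValue."},
    ],
}
print(effective_task_profile(request_body))  # prints coding
```

To keep a developer message without the coding preset, set task_profile explicitly, for example to `general`.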
#What's Next
- Open WebUI Integration: Connect Sogni Intelligence to Open WebUI
- OpenClaw Integration: Use Sogni Intelligence as a provider in OpenClaw