Server-Sent Events stream (when stream=true)
Create chat completion
Main endpoint for AI chat interactions. Supports:
- Multi-turn conversations with memory
- RAG document retrieval
- Streaming responses
- Tool/function calling
- Response caching
- Web grounding
Plimver AI Models
- PV-TURBO: Ultra-fast responses (<1s) - 1 credit/request
- PV-STANDARD: Balanced performance - 2 credits/request
- PV-ADVANCED: Deep reasoning - 5 credits/request
- PV-CODEX: Specialized coding - 3 credits/request
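The per-request pricing above can be captured in a small lookup table for budgeting. This is a minimal sketch that assumes every request is billed at the flat rate listed, whether or not it is served from the response cache. The names `CREDITS_PER_REQUEST` and `estimate_credits` are illustrative and not part of the Plimver API.

```python
# Credits charged per request, copied from the model list above.
CREDITS_PER_REQUEST = {
    "PV-TURBO": 1,
    "PV-STANDARD": 2,
    "PV-ADVANCED": 5,
    "PV-CODEX": 3,
}


def estimate_credits(model: str, request_count: int) -> int:
    """Return the credits consumed by `request_count` requests to `model`.

    Assumption: flat per-request billing with no discounts.
    """
    if model not in CREDITS_PER_REQUEST:
        raise ValueError(f"Unknown model: {model!r}")
    if request_count < 0:
        raise ValueError("request_count must be non-negative")
    return CREDITS_PER_REQUEST[model] * request_count
```

For example, `estimate_credits("PV-ADVANCED", 10)` evaluates to 50 under this assumption.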
Body
Required
- model: Plimver AI model to use. Values are PV-TURBO, PV-STANDARD, PV-ADVANCED, or PV-CODEX.
- messages: Array of conversation messages.
- user_id: Unique user identifier for memory isolation.
- Sampling temperature. Minimum value is 0, maximum value is 2. Default value is 0.7.
- Maximum tokens in response. Minimum value is 1, maximum value is 4096. Default value is 2000.
- stream: Enable Server-Sent Events streaming. Default value is false.
- use_rag: Enable RAG document retrieval. Default value is false.
- rag_k: Number of RAG documents to retrieve (tier-dependent). Minimum value is 1, maximum value is 100. Default value is 5.
- Bypass response cache. Default value is false.
- Function calling tools (OpenAI format).
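The documented value ranges can be checked on the client before a request is sent. The sketch below covers only the fields whose names appear in the request examples (`model`, `user_id`, `messages`, `stream`, `use_rag`, `rag_k`). The helper name `build_chat_payload` is hypothetical, and the rule that `messages` must be non-empty is an assumption rather than a documented constraint.

```python
VALID_MODELS = {"PV-TURBO", "PV-STANDARD", "PV-ADVANCED", "PV-CODEX"}


def build_chat_payload(model, user_id, messages,
                       stream=False, use_rag=False, rag_k=5):
    """Validate arguments against the documented limits and return the body."""
    if model not in VALID_MODELS:
        raise ValueError(f"model must be one of {sorted(VALID_MODELS)}")
    # Assumption: an empty conversation is not a meaningful request.
    if not messages:
        raise ValueError("messages must contain at least one message")
    # Documented range for the number of RAG documents: 1 to 100.
    if not 1 <= rag_k <= 100:
        raise ValueError("rag_k must be between 1 and 100")

    payload = {"model": model, "user_id": user_id, "messages": list(messages)}
    # Send optional fields only when they differ from the documented defaults.
    if stream:
        payload["stream"] = True
    if use_rag:
        payload["use_rag"] = True
        payload["rag_k"] = rag_k
    return payload
```

Omitting fields that match their defaults keeps the request body small; sending them explicitly would be equally valid.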
POST /z1/chat/completions
curl \
--request POST 'https://zenux-api.redglacier-fb4abe56.southafricanorth.azurecontainerapps.io/z1/chat/completions' \
--header "Authorization: Bearer $ACCESS_TOKEN" \
--header "Content-Type: application/json" \
--data '{"model":"PV-TURBO","user_id":"user_123","messages":[{"role":"user","content":"What is the capital of France?"}]}'
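The same call can be made from Python using only the standard library. This is a sketch rather than an official client: `make_chat_request` and `send_chat_request` are hypothetical helper names, and the 30-second timeout is an arbitrary assumption.

```python
import json
import urllib.request

BASE_URL = "https://zenux-api.redglacier-fb4abe56.southafricanorth.azurecontainerapps.io"


def make_chat_request(payload: dict, access_token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for /z1/chat/completions."""
    return urllib.request.Request(
        url=f"{BASE_URL}/z1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(payload: dict, access_token: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON body of a successful response."""
    request = make_chat_request(payload, access_token)
    # urlopen raises urllib.error.HTTPError for 4xx and 5xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `send_chat_request(payload, os.environ["ACCESS_TOKEN"])` mirrors the curl command, reading the token from the same environment variable.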
Request examples
Simple chat request
{
  "model": "PV-TURBO",
  "user_id": "user_123",
  "messages": [
    {
      "role": "user",
      "content": "What is the capital of France?"
    }
  ]
}
Chat request with RAG retrieval
{
  "model": "PV-STANDARD",
  "rag_k": 5,
  "use_rag": true,
  "user_id": "user_123",
  "messages": [
    {
      "role": "user",
      "content": "What does our privacy policy say about data retention?"
    }
  ]
}
Streaming chat request
{
  "model": "PV-TURBO",
  "stream": true,
  "user_id": "user_123",
  "messages": [
    {
      "role": "user",
      "content": "Write a short story"
    }
  ]
}
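When `stream` is true, the response arrives as Server-Sent Events instead of a single JSON body. The sketch below parses the generic SSE wire format (`data:` lines, with a blank line ending each event). The content of each event is not described here, so the parser returns raw payload strings and leaves decoding to the caller. OpenAI-style APIs often end the stream with a `[DONE]` payload, but whether this endpoint does so is an assumption to verify. `iter_sse_data` is a hypothetical helper name.

```python
from typing import Iterable, Iterator


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each Server-Sent Event found in `lines`."""
    data_parts = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if data_parts:
                yield "\n".join(data_parts)
                data_parts = []
            continue
        if line.startswith(":"):
            continue  # comment line, commonly used as a keep-alive
        field, _, value = line.partition(":")
        if field == "data":
            # One leading space after the colon is not part of the value.
            data_parts.append(value[1:] if value.startswith(" ") else value)
    # Lenient choice: flush a trailing event that lacks a final blank line
    # (a strict SSE parser would discard it).
    if data_parts:
        yield "\n".join(data_parts)
```

The function accepts any iterable of decoded text lines, so it can be fed from a list in tests or from a line-by-line read of the HTTP response body.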
Response examples (200)
{
  "id": "chatcmpl-123",
  "object": "chat.completion",
  "created": 1699564800,
  "model": "PV-TURBO",
  "choices": [
    {
      "index": 42,
      "message": {
        "role": "system",
        "content": "string",
        "name": "string"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 42,
    "completion_tokens": 42,
    "total_tokens": 42
  },
  "citations": [
    {}
  ],
  "memory_context": {
    "items_retrieved": 42,
    "retrieval_time_ms": 42
  }
}
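A client typically needs only a few fields from this body. The following sketch extracts the reply text, finish reason, and token usage, and treats `usage` and `citations` as optional, which is an assumption since the example does not say which fields are always present. `summarize_completion` is a hypothetical helper name.

```python
def summarize_completion(response: dict) -> dict:
    """Pull the reply text, finish reason, and token usage out of a response."""
    # The example shows a single entry in `choices`; read the first one.
    choice = response["choices"][0]
    usage = response.get("usage", {})
    return {
        "content": choice["message"]["content"],
        "finish_reason": choice.get("finish_reason"),
        "total_tokens": usage.get("total_tokens"),
        "citation_count": len(response.get("citations", [])),
    }
```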
Response examples (400)
{
  "code": "INVALID_MODEL",
  "status": "error",
  "message": "Invalid model specified"
}
Response examples (401)
{
  "code": "UNAUTHORIZED",
  "status": "error",
  "message": "Invalid API key"
}
Response examples (429)
{
  "code": "RATE_LIMIT_EXCEEDED",
  "status": "error",
  "message": "Rate limit exceeded: 1000 requests per minute"
}
Response examples (500)
{
  "code": "INTERNAL_ERROR",
  "status": "error",
  "message": "An unexpected error occurred"
}
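All four error bodies share the same `code`, `status`, and `message` shape, so they can be mapped onto a single exception type. The sketch below does that. Treating 429 and 500 as retryable is an assumption about sensible client behavior, not something the API documents, and `ChatAPIError` and `parse_error` are hypothetical names.

```python
import json

# Assumption: rate-limit and server errors are worth retrying; 400 and 401 are not.
RETRYABLE_STATUSES = {429, 500}


class ChatAPIError(Exception):
    """Error response from the chat completions endpoint."""

    def __init__(self, http_status: int, code: str, message: str):
        super().__init__(f"{http_status} {code}: {message}")
        self.http_status = http_status
        self.code = code
        self.message = message

    @property
    def retryable(self) -> bool:
        return self.http_status in RETRYABLE_STATUSES


def parse_error(http_status: int, body: str) -> ChatAPIError:
    """Convert an error response body into a ChatAPIError."""
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        payload = {}
    if not isinstance(payload, dict):
        payload = {}
    # Fall back to the raw body when the response is not the documented JSON shape.
    return ChatAPIError(
        http_status,
        payload.get("code", "UNKNOWN"),
        payload.get("message", body),
    )
```

A caller can catch the HTTP error from its client library, pass the status and body to `parse_error`, and decide whether to back off and retry based on `retryable`.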