OpenAI chat completions
Text
/v1/chat/completions
OpenAI-compatible chat completion endpoint for multi-turn conversations, tool calling, and streaming responses.
POST
OpenAI chat completions
Creates a model response for a chat conversation. This endpoint is still the safest default when you need compatibility with existing OpenAI SDKs, chat clients, or legacy chat-completion workflows. Supported fields vary by model, especially for reasoning, tool use, and multimodal inputs.Documentation Index
Fetch the complete documentation index at: https://docs.ent.apigo.ai/llms.txt
Use this file to discover all available pages before exploring further.
Integration guidance
- Authenticate with
Authorization: Bearer {API_KEY} - Use this as the default entry point for existing OpenAI-style chat integrations
- If you want a more unified interface for structured output, multimodal input, and tools, prefer
/v1/responses - Streaming clients should handle SSE chunks incrementally instead of waiting for one final JSON response
Request highlights
messagesis required and carries the conversation historymodelis required and selects the target modeltemperatureandtop_pboth affect sampling, but most integrations should tune only one of them- If you need token-level probabilities, combine
logprobswithtop_logprobs - For caching and safety attribution, prefer
prompt_cache_keyandsafety_identifier
Response highlights
- Plain text is usually read from
choices[0].message.content - Tool calls can be read from
message.tool_calls - Streaming responses arrive as SSE chunks and must be merged incrementally
- Usage accounting is exposed through
usage, including more detailed token breakdowns
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json
