Routescope APIRoutescope API
Chat Completions

OpenAI

OpenAI-style Chat Completions, reasoning, and code model overview

OpenAI-series models use the OpenAI-compatible Chat Completions endpoint. This page describes common parameters and model differences for GPT, Codex, and o reasoning models.

Endpoint Paths

MethodPathPurpose
POST/v1/chat/completionsCreate a chat completion, reasoning, or code-generation request
GET/v1/modelsList models available to the current token
GET/v1/models/{model}Retrieve one model's details

Request Structure

{
  "model": "gpt-5.4",
  "messages": [
    {
      "role": "user",
      "content": "Introduce Routescope API in one sentence."
    }
  ],
  "temperature": 0.7
}
POST
/v1/chat/completions
curl -X POST "https://api.routescope.ai/v1/chat/completions" \  -H "Content-Type: application/json" \  -d '{    "model": "gpt-4o-mini",    "messages": [      {        "role": "user",        "content": "Hello, please introduce yourself in one sentence."      }    ]  }'
{
  "id": "task_01JZ8M9Q4R7V2K8N9P0Q",
  "object": "string",
  "created": 1,
  "model": "gpt-4o-mini",
  "choices": [],
  "usage": {
    "prompt_tokens": 1,
    "completion_tokens": 1,
    "total_tokens": 1,
    "input_tokens": 1,
    "output_tokens": 1
  }
}
{
  "error": null,
  "message": "success"
}

Authorization

BearerAuth

AuthorizationBearer <token>

Model relay interface recognition. Request heading: Autoration: Bearer .

In: header

Request Body

application/json

model*string

The model name to call.

messages*

Can not open message Can not open message Scope: At least 1 message.

temperature?number

Sampling temperatures spread more and more. Sample temperature. Range: 0 to 2; the larger the value, the more random.

Range0 <= value <= 2
top_p?number

Nuclear sampling parameters. Nuclear sampling parameters. Range: 0 to 1; usually no large adjustments with temperature.

Range0 <= value <= 1
max_tokens?integer

Maximum output number of Tokens. Maximum output number of Tokens. Scope: 1 to the maximum of the context of the model.

Range1 <= value
stream?boolean

Whether to enable SSE flow output. Whether or not to enable flow output. Scope: True or false.

stream_options?

Stream extension options. Upstream support varied.

enable_thinking?boolean

Whether or not to start deep thinking mode. Qwen/Ariyuncrery OpenAI compatible extension parameters: _FD_PROTECT_0 _ Start thinking, _FD_PROTECT_1 _ Close thinking; part of the thinking model is always open and does not support closure. Python OpenAI SDK can be imported through FD_PROTECT_2.

tools?

Tool definitions for which models can be called. . Scope: The length of arrays and the complexity of schema are based on upstream limits.

tool_choice?string|

Tools call policies, such as _FD_PROTEC_0, FD_PROTEC_1 or visible specifying functions. Tool call policy. Scope: auto, none, required or visible tool objects.

response_format?object

Structured output constraints, such as JSON Schema.

user?string

End-user identification for audit and control.

Response Body

application/json

application/json

Model Selection

Model IDCapabilityTypical Use
gpt-5, gpt-5-2025-08-07Chat completionsGeneral text generation and multi-turn conversation
gpt-5.1, gpt-5.1-2025-11-13, gpt-5.1-chat-latestChat completionsGeneral conversation, pinned versions, or latest entry
gpt-5.2-2025-12-11, gpt-5.2-chat-latestChat completionsGeneral conversation, pinned versions, or latest entry
gpt-5.3-chat-latestChat completionsLatest entry
gpt-5.4, gpt-5.4-2026-03-05, gpt-5.4-mini, gpt-5.4-mini-2026-03-17, gpt-5.4-nano, gpt-5.4-pro, gpt-5.4-pro-2026-03-05Chat completionsGeneral, mini, nano, and pro tiers
gpt-5-pro, gpt-5-pro-2025-10-06Chat completionsAdvanced text tasks
gpt-5.1-codex-mini, gpt-5.1-codex, gpt-5.1-codex-max, gpt-5.2-codex, gpt-5.3-codexCode generationCodex and code-focused tasks
o3, o3-2025-04-16ReasoningReasoning-heavy tasks

Common Parameters

FieldTypeRequiredDescription
modelstringYesModel ID to call.
messagesarrayYesConversation messages. Roles include system, user, assistant, and tool.
temperaturenumberNoSampling temperature. OpenAPI schema marks the range as 0 to 2.
top_pnumberNoNucleus sampling parameter. OpenAPI schema marks the range as 0 to 1.
max_tokensintegerNoMaximum output tokens, limited by the selected model context.
streambooleanNoWhether to use SSE streaming.
toolsarrayNoTool definitions.
tool_choicestring/objectNoTool-calling strategy.
response_formatobjectNoStructured output constraint, such as JSON Schema.
userstringNoEnd-user identifier for auditing and risk control.

Model-Specific Notes

FieldApplicable ModelsDescription
modelAllCopy a model ID available to the current token from the model selection table or /v1/models.
stream_optionsOpenAI-compatible models that support streamingMarked in the OpenAPI schema as streaming extensions. Upstream support may vary.
enable_thinkingQwen / Alibaba Cloud OpenAI-compatible extensionThis field exists in the OpenAPI schema but is not a generic OpenAI GPT-series field, so it is not included in GPT examples.
Reasoning / code model capabilityo3 and Codex seriesThere is no unified extra field listed here. Follow API responses or the actual console display.

Response Structure

OpenAI-style responses include id, object, created, model, choices, and usage. When stream=true, the response is pushed as SSE chunks.

How is this guide?

Last updated on