Skip to main content
Effective Date — May 7, 2026
As part of the IAM hierarchy simplification, a new format of the TIR APIs is now available. All endpoints that previously included /teams/:team_id segment will now follow a service-centric structure, with project_id now passed as a query parameter instead of a path segment. You can read more about this change

Chat Completions

Generate chat completions using a deployed model endpoint. Supports any model deployed via TIR Inference Service, following the OpenAI-compatible chat completions API format.

POST/project/p-{Project_Id}/endpoint/is-{Endpoint_Id}/v1/chat/completions

Path parameters

  • Endpoint_IdPathstringrequired

Query parameters

  • project_idQueryintegerrequired

    Project ID

Request body

application/json

object
modelstringrequired

The model identifier to use for completion.

exampleyour-model-name
messagesarrayrequired

A list of messages comprising the conversation so far.

max_tokensinteger

Maximum number of tokens to generate in the response.

example512
temperaturenumber

Controls randomness. Lower values make output more deterministic, higher values more creative.

example0.7
top_pnumber

Nucleus sampling probability mass. Only tokens with cumulative probability up to top_p are considered.

example1
top_kinteger

Limits the number of highest-probability tokens considered at each step.

example50
frequency_penaltynumber

Penalizes tokens based on how frequently they have appeared in the text so far.

example0
streamboolean

If true, responses are streamed as server-sent events.

examplefalse

Responses

200Successful chat completion response.
object
idstring

Unique identifier for the completion.

examplechatcmpl-ab36a36c-2315-472f-9e58-0f68f19b8bf8
objectstring

Object type, always chat.completion.

examplechat.completion
createdinteger

Unix timestamp of when the completion was created.

example1776750085
modelstring

The model used for the completion.

examplemeta-llama/Llama-3.2-1B-Instruct
choicesarray

List of generated completion choices.

service_tierstring
examplenull
system_fingerprintstring
examplenull
usageobject
prompt_logprobsstring
examplenull
prompt_token_idsstring
examplenull
kv_transfer_paramsstring
examplenull