Chat Completion (Open AI) — API

Path parameters

Conversation_IdPathstringrequired

Request body

application/json

object

modelstringrequired

Model to use for chat completion. Available models:

mistral_7b_instruct (Mistral-7B-Instruct-v0.3)
gemma_7b (Gemma-7B-It)
llama3_1_8b_instruct (Llama-3.1-8B-Instruct)
llama_3_2_11b_vision_instruct (Llama-3.2-11B-Vision-Instruct)
llama3_2_3b_instruct (Llama-3.2-3B-Instruct)
llama_3_3_70b_instruct_fp8 (Llama-3.3-70B-Instruct)
phi_3_medium_128k_instruct (Phi-3-Medium-128K-Instruct)

messagesarrayrequired

List of messages in the conversation

streamboolean

Whether to stream the response

Responses

200Streamed chat response with multiple chunks

object

idstring

Unique identifier for the response

objectstringchat.completion.chunk

Type of response object

createdinteger

Timestamp of response creation (milliseconds)

modelstring

Model used for the response

choicesarray

usageobject

Request

curl -X POST 'https://api.e2enetworks.com/myaccount/api/v1/gpu/conversation/{Conversation_Id}/v1/chat/completions?apikey=%3Capikey%3E' \
  -H 'Authorization: Bearer <JWT>' \
  -H 'Content-Type: application/json' \
  -d '{
  "model": "string",
  "messages": [
    {
      "role": "string",
      "content": "string"
    }
  ],
  "stream": true
}'

Response · 200

{
  "id": "string",
  "object": "string",
  "created": 0,
  "model": "string",
  "choices": [
    {
      "index": 0,
      "delta": {
        "role": "string",
        "content": "string",
        "tool_calls": [
          {}
        ]
      },
      "logprobs": {},
      "finish_reason": "string",
      "stop_reason": "string"
    }
  ],
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 0,
    "total_tokens": 0
  }
}