Beta
ChatKit
Domain Types
ChatKit Workflow
-
chatkit_workflow: object { id, state_variables, tracing, version }Workflow metadata and state returned for the session.
-
id: stringIdentifier of the workflow backing the session.
-
state_variables: map[string or boolean or number]State variable key-value pairs applied when invoking the workflow. Defaults to null when no overrides were provided.
-
union_member_0: string -
union_member_1: boolean -
union_member_2: number
-
-
tracing: object { enabled }Tracing settings applied to the workflow.
-
enabled: booleanIndicates whether tracing is enabled.
-
-
version: stringSpecific workflow version used for the session. Defaults to null when using the latest deployment.
-
Sessions
Cancel chat session
$ openai beta:chatkit:sessions cancel
post /chatkit/sessions/{session_id}/cancel
Cancel an active ChatKit session and return its most recent metadata.
Cancelling prevents new requests from using the issued client secret.
Parameters
-
--session-id: stringUnique identifier for the ChatKit session to cancel.
Returns
-
chat_session: object { id, chatkit_configuration, client_secret, 7 more }Represents a ChatKit session and its resolved configuration.
-
id: stringIdentifier for the ChatKit session.
-
chatkit_configuration: object { automatic_thread_titling, file_upload, history }Resolved ChatKit feature configuration for the session.
-
automatic_thread_titling: object { enabled }Automatic thread titling preferences.
-
enabled: booleanWhether automatic thread titling is enabled.
-
-
file_upload: object { enabled, max_file_size, max_files }Upload settings for the session.
-
enabled: booleanIndicates if uploads are enabled for the session.
-
max_file_size: numberMaximum upload size in megabytes.
-
max_files: numberMaximum number of uploads allowed during the session.
-
-
history: object { enabled, recent_threads }History retention configuration.
-
enabled: booleanIndicates if chat history is persisted for the session.
-
recent_threads: numberNumber of prior threads surfaced in history views. Defaults to null when all history is retained.
-
-
-
client_secret: stringEphemeral client secret that authenticates session requests.
-
expires_at: numberUnix timestamp (in seconds) for when the session expires.
-
max_requests_per_1_minute: numberConvenience copy of the per-minute request limit.
-
object: "chatkit.session"Type discriminator that is always
chatkit.session. -
rate_limits: object { max_requests_per_1_minute }Resolved rate limit values.
-
max_requests_per_1_minute: numberMaximum allowed requests per one-minute window.
-
-
status: "active" or "expired" or "cancelled"Current lifecycle state of the session.
-
"active" -
"expired" -
"cancelled"
-
-
user: stringUser identifier associated with the session.
-
workflow: object { id, state_variables, tracing, version }Workflow metadata for the session.
-
id: stringIdentifier of the workflow backing the session.
-
state_variables: map[string or boolean or number]State variable key-value pairs applied when invoking the workflow. Defaults to null when no overrides were provided.
-
union_member_0: string -
union_member_1: boolean -
union_member_2: number
-
-
tracing: object { enabled }Tracing settings applied to the workflow.
-
enabled: booleanIndicates whether tracing is enabled.
-
-
version: stringSpecific workflow version used for the session. Defaults to null when using the latest deployment.
-
-
Example
openai beta:chatkit:sessions cancel \
--api-key 'My API Key' \
--session-id cksess_123
Response
{
"id": "id",
"chatkit_configuration": {
"automatic_thread_titling": {
"enabled": true
},
"file_upload": {
"enabled": true,
"max_file_size": 0,
"max_files": 0
},
"history": {
"enabled": true,
"recent_threads": 0
}
},
"client_secret": "client_secret",
"expires_at": 0,
"max_requests_per_1_minute": 0,
"object": "chatkit.session",
"rate_limits": {
"max_requests_per_1_minute": 0
},
"status": "active",
"user": "user",
"workflow": {
"id": "id",
"state_variables": {
"foo": "string"
},
"tracing": {
"enabled": true
},
"version": "version"
}
}
Create ChatKit session
$ openai beta:chatkit:sessions create
post /chatkit/sessions
Create a ChatKit session.
Parameters
-
--user: stringA free-form string that identifies your end user; ensures this Session can access other objects that have the same
userscope. -
--workflow: object { id, state_variables, tracing, version }Workflow that powers the session.
-
--chatkit-configuration: optional object { automatic_thread_titling, file_upload, history }Optional overrides for ChatKit runtime configuration features
-
--expires-after: optional object { anchor, seconds }Optional override for session expiration timing in seconds from creation. Defaults to 10 minutes.
-
--rate-limits: optional object { max_requests_per_1_minute }Optional override for per-minute request limits. When omitted, defaults to 10.
Returns
-
chat_session: object { id, chatkit_configuration, client_secret, 7 more }Represents a ChatKit session and its resolved configuration.
-
id: stringIdentifier for the ChatKit session.
-
chatkit_configuration: object { automatic_thread_titling, file_upload, history }Resolved ChatKit feature configuration for the session.
-
automatic_thread_titling: object { enabled }Automatic thread titling preferences.
-
enabled: booleanWhether automatic thread titling is enabled.
-
-
file_upload: object { enabled, max_file_size, max_files }Upload settings for the session.
-
enabled: booleanIndicates if uploads are enabled for the session.
-
max_file_size: numberMaximum upload size in megabytes.
-
max_files: numberMaximum number of uploads allowed during the session.
-
-
history: object { enabled, recent_threads }History retention configuration.
-
enabled: booleanIndicates if chat history is persisted for the session.
-
recent_threads: numberNumber of prior threads surfaced in history views. Defaults to null when all history is retained.
-
-
-
client_secret: stringEphemeral client secret that authenticates session requests.
-
expires_at: numberUnix timestamp (in seconds) for when the session expires.
-
max_requests_per_1_minute: numberConvenience copy of the per-minute request limit.
-
object: "chatkit.session"Type discriminator that is always
chatkit.session. -
rate_limits: object { max_requests_per_1_minute }Resolved rate limit values.
-
max_requests_per_1_minute: numberMaximum allowed requests per one-minute window.
-
-
status: "active" or "expired" or "cancelled"Current lifecycle state of the session.
-
"active" -
"expired" -
"cancelled"
-
-
user: stringUser identifier associated with the session.
-
workflow: object { id, state_variables, tracing, version }Workflow metadata for the session.
-
id: stringIdentifier of the workflow backing the session.
-
state_variables: map[string or boolean or number]State variable key-value pairs applied when invoking the workflow. Defaults to null when no overrides were provided.
-
union_member_0: string -
union_member_1: boolean -
union_member_2: number
-
-
tracing: object { enabled }Tracing settings applied to the workflow.
-
enabled: booleanIndicates whether tracing is enabled.
-
-
version: stringSpecific workflow version used for the session. Defaults to null when using the latest deployment.
-
-
Example
openai beta:chatkit:sessions create \
--api-key 'My API Key' \
--user x \
--workflow '{id: id}'
Response
{
"id": "id",
"chatkit_configuration": {
"automatic_thread_titling": {
"enabled": true
},
"file_upload": {
"enabled": true,
"max_file_size": 0,
"max_files": 0
},
"history": {
"enabled": true,
"recent_threads": 0
}
},
"client_secret": "client_secret",
"expires_at": 0,
"max_requests_per_1_minute": 0,
"object": "chatkit.session",
"rate_limits": {
"max_requests_per_1_minute": 0
},
"status": "active",
"user": "user",
"workflow": {
"id": "id",
"state_variables": {
"foo": "string"
},
"tracing": {
"enabled": true
},
"version": "version"
}
}
Threads
List ChatKit thread items
$ openai beta:chatkit:threads list-items
get /chatkit/threads/{thread_id}/items
List items that belong to a ChatKit thread.
Parameters
-
--thread-id: stringIdentifier of the ChatKit thread whose items are requested.
-
--after: optional stringList items created after this thread item ID. Defaults to null for the first page.
-
--before: optional stringList items created before this thread item ID. Defaults to null for the newest results.
-
--limit: optional numberMaximum number of thread items to return. Defaults to 20.
-
--order: optional "asc" or "desc"Sort order for results by creation time. Defaults to
desc.
Returns
-
chatkit_thread_item_list: object { data, first_id, has_more, 2 more }A paginated list of thread items rendered for the ChatKit API.
-
data: array of ChatKitThreadUserMessageItem or ChatKitThreadAssistantMessageItem or ChatKitWidgetItem or 3 moreA list of items
-
chatkit_thread_user_message_item: object { id, attachments, content, 5 more }User-authored messages within a thread.
-
id: stringIdentifier of the thread item.
-
attachments: array of ChatKitAttachmentAttachments associated with the user message. Defaults to an empty list.
-
id: stringIdentifier for the attachment.
-
mime_type: stringMIME type of the attachment.
-
name: stringOriginal display name for the attachment.
-
preview_url: stringPreview URL for rendering the attachment inline.
-
type: "image" or "file"Attachment discriminator.
-
"image" -
"file"
-
-
-
content: array of object { text, type } or object { text, type }Ordered content elements supplied by the user.
-
input_text: object { text, type }Text block that a user contributed to the thread.
-
text: stringPlain-text content supplied by the user.
-
type: "input_text"Type discriminator that is always
input_text.
-
-
quoted_text: object { text, type }Quoted snippet that the user referenced in their message.
-
text: stringQuoted text content.
-
type: "quoted_text"Type discriminator that is always
quoted_text.
-
-
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
inference_options: object { model, tool_choice }Inference overrides applied to the message. Defaults to null when unset.
-
model: stringModel name that generated the response. Defaults to null when using the session default.
-
tool_choice: object { id }Preferred tool to invoke. Defaults to null when ChatKit should auto-select.
-
id: stringIdentifier of the requested tool.
-
-
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.user_message"
-
-
chatkit_thread_assistant_message_item: object { id, content, created_at, 3 more }Assistant-authored message within a thread.
-
id: stringIdentifier of the thread item.
-
content: array of ChatKitResponseOutputTextOrdered assistant response segments.
-
annotations: array of object { source, type } or object { source, type }Ordered list of annotations attached to the response text.
-
file: object { source, type }Annotation that references an uploaded file.
-
source: object { filename, type }File attachment referenced by the annotation.
-
filename: stringFilename referenced by the annotation.
-
type: "file"Type discriminator that is always
file.
-
-
type: "file"Type discriminator that is always
filefor this annotation.
-
-
url: object { source, type }Annotation that references a URL.
-
source: object { type, url }URL referenced by the annotation.
-
type: "url"Type discriminator that is always
url. -
url: stringURL referenced by the annotation.
-
-
type: "url"Type discriminator that is always
urlfor this annotation.
-
-
-
text: stringAssistant generated text.
-
type: "output_text"Type discriminator that is always
output_text.
-
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.assistant_message"Type discriminator that is always
chatkit.assistant_message.
-
-
chatkit_widget_item: object { id, created_at, object, 3 more }Thread item that renders a widget payload.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.widget"Type discriminator that is always
chatkit.widget. -
widget: stringSerialized widget payload rendered in the UI.
-
-
chatkit.client_tool_call: object { id, arguments, call_id, 7 more }Record of a client side tool invocation initiated by the assistant.
-
id: stringIdentifier of the thread item.
-
arguments: stringJSON-encoded arguments that were sent to the tool.
-
call_id: stringIdentifier for the client tool call.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
name: stringTool name that was invoked.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
output: stringJSON-encoded output captured from the tool. Defaults to null while execution is in progress.
-
status: "in_progress" or "completed"Execution status for the tool call.
-
"in_progress" -
"completed"
-
-
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.client_tool_call"Type discriminator that is always
chatkit.client_tool_call.
-
-
chatkit.task: object { id, created_at, heading, 5 more }Task emitted by the workflow to show progress and status updates.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
heading: stringOptional heading for the task. Defaults to null when not provided.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
summary: stringOptional summary that describes the task. Defaults to null when omitted.
-
task_type: "custom" or "thought"Subtype for the task.
-
"custom" -
"thought"
-
-
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.task"Type discriminator that is always
chatkit.task.
-
-
chatkit.task_group: object { id, created_at, object, 3 more }Collection of workflow tasks grouped together in the thread.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
tasks: array of object { heading, summary, type }Tasks included in the group.
-
heading: stringOptional heading for the grouped task. Defaults to null when not provided.
-
summary: stringOptional summary that describes the grouped task. Defaults to null when omitted.
-
type: "custom" or "thought"Subtype for the grouped task.
-
"custom" -
"thought"
-
-
-
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.task_group"Type discriminator that is always
chatkit.task_group.
-
-
-
first_id: stringThe ID of the first item in the list.
-
has_more: booleanWhether there are more items available.
-
last_id: stringThe ID of the last item in the list.
-
object: "list"The type of object returned, must be
list.
-
Example
openai beta:chatkit:threads list-items \
--api-key 'My API Key' \
--thread-id cthr_123
Response
{
"data": [
{
"id": "id",
"attachments": [
{
"id": "id",
"mime_type": "mime_type",
"name": "name",
"preview_url": "https://example.com",
"type": "image"
}
],
"content": [
{
"text": "text",
"type": "input_text"
}
],
"created_at": 0,
"inference_options": {
"model": "model",
"tool_choice": {
"id": "id"
}
},
"object": "chatkit.thread_item",
"thread_id": "thread_id",
"type": "chatkit.user_message"
}
],
"first_id": "first_id",
"has_more": true,
"last_id": "last_id",
"object": "list"
}
Retrieve ChatKit thread
$ openai beta:chatkit:threads retrieve
get /chatkit/threads/{thread_id}
Retrieve a ChatKit thread by its identifier.
Parameters
-
--thread-id: stringIdentifier of the ChatKit thread to retrieve.
Returns
-
chatkit_thread: object { id, created_at, object, 3 more }Represents a ChatKit thread and its current status.
-
id: stringIdentifier of the thread.
-
created_at: numberUnix timestamp (in seconds) for when the thread was created.
-
object: "chatkit.thread"Type discriminator that is always
chatkit.thread. -
status: object { type } or object { reason, type } or object { reason, type }Current status for the thread. Defaults to
activefor newly created threads.-
active: object { type }Indicates that a thread is active.
-
locked: object { reason, type }Indicates that a thread is locked and cannot accept new input.
-
reason: stringReason that the thread was locked. Defaults to null when no reason is recorded.
-
type: "locked"Status discriminator that is always
locked.
-
-
closed: object { reason, type }Indicates that a thread has been closed.
-
reason: stringReason that the thread was closed. Defaults to null when no reason is recorded.
-
type: "closed"Status discriminator that is always
closed.
-
-
-
title: stringOptional human-readable title for the thread. Defaults to null when no title has been generated.
-
user: stringFree-form string that identifies your end user who owns the thread.
-
Example
openai beta:chatkit:threads retrieve \
--api-key 'My API Key' \
--thread-id cthr_123
Response
{
"id": "cthr_def456",
"created_at": 1712345600,
"object": "chatkit.thread",
"status": {
"type": "active"
},
"title": "Demo feedback",
"user": "user_456"
}
Delete ChatKit thread
$ openai beta:chatkit:threads delete
delete /chatkit/threads/{thread_id}
Delete a ChatKit thread along with its items and stored attachments.
Parameters
-
--thread-id: stringIdentifier of the ChatKit thread to delete.
Returns
-
BetaChatKitThreadDeleteResponse: object { id, deleted, object }Confirmation payload returned after deleting a thread.
-
id: stringIdentifier of the deleted thread.
-
deleted: booleanIndicates that the thread has been deleted.
-
object: "chatkit.thread.deleted"Type discriminator that is always
chatkit.thread.deleted.
-
Example
openai beta:chatkit:threads delete \
--api-key 'My API Key' \
--thread-id cthr_123
Response
{
"id": "id",
"deleted": true,
"object": "chatkit.thread.deleted"
}
List ChatKit threads
$ openai beta:chatkit:threads list
get /chatkit/threads
List ChatKit threads with optional pagination and user filters.
Parameters
-
--after: optional stringList items created after this thread item ID. Defaults to null for the first page.
-
--before: optional stringList items created before this thread item ID. Defaults to null for the newest results.
-
--limit: optional numberMaximum number of thread items to return. Defaults to 20.
-
--order: optional "asc" or "desc"Sort order for results by creation time. Defaults to
desc. -
--user: optional stringFilter threads that belong to this user identifier. Defaults to null to return all users.
Returns
-
Threads: object { data, first_id, has_more, 2 more }A paginated list of ChatKit threads.
-
data: array of ChatKitThreadA list of items
-
id: stringIdentifier of the thread.
-
created_at: numberUnix timestamp (in seconds) for when the thread was created.
-
object: "chatkit.thread"Type discriminator that is always
chatkit.thread. -
status: object { type } or object { reason, type } or object { reason, type }Current status for the thread. Defaults to
activefor newly created threads.-
active: object { type }Indicates that a thread is active.
-
locked: object { reason, type }Indicates that a thread is locked and cannot accept new input.
-
reason: stringReason that the thread was locked. Defaults to null when no reason is recorded.
-
type: "locked"Status discriminator that is always
locked.
-
-
closed: object { reason, type }Indicates that a thread has been closed.
-
reason: stringReason that the thread was closed. Defaults to null when no reason is recorded.
-
type: "closed"Status discriminator that is always
closed.
-
-
-
title: stringOptional human-readable title for the thread. Defaults to null when no title has been generated.
-
user: stringFree-form string that identifies your end user who owns the thread.
-
-
first_id: stringThe ID of the first item in the list.
-
has_more: booleanWhether there are more items available.
-
last_id: stringThe ID of the last item in the list.
-
object: "list"The type of object returned, must be
list.
-
Example
openai beta:chatkit:threads list \
--api-key 'My API Key'
Response
{
"data": [
{
"id": "cthr_def456",
"created_at": 1712345600,
"object": "chatkit.thread",
"status": {
"type": "active"
},
"title": "Demo feedback",
"user": "user_456"
}
],
"first_id": "first_id",
"has_more": true,
"last_id": "last_id",
"object": "list"
}
Domain Types
Chat Session
-
chat_session: object { id, chatkit_configuration, client_secret, 7 more }Represents a ChatKit session and its resolved configuration.
-
id: stringIdentifier for the ChatKit session.
-
chatkit_configuration: object { automatic_thread_titling, file_upload, history }Resolved ChatKit feature configuration for the session.
-
automatic_thread_titling: object { enabled }Automatic thread titling preferences.
-
enabled: booleanWhether automatic thread titling is enabled.
-
-
file_upload: object { enabled, max_file_size, max_files }Upload settings for the session.
-
enabled: booleanIndicates if uploads are enabled for the session.
-
max_file_size: numberMaximum upload size in megabytes.
-
max_files: numberMaximum number of uploads allowed during the session.
-
-
history: object { enabled, recent_threads }History retention configuration.
-
enabled: booleanIndicates if chat history is persisted for the session.
-
recent_threads: numberNumber of prior threads surfaced in history views. Defaults to null when all history is retained.
-
-
-
client_secret: stringEphemeral client secret that authenticates session requests.
-
expires_at: numberUnix timestamp (in seconds) for when the session expires.
-
max_requests_per_1_minute: numberConvenience copy of the per-minute request limit.
-
object: "chatkit.session"Type discriminator that is always
chatkit.session. -
rate_limits: object { max_requests_per_1_minute }Resolved rate limit values.
-
max_requests_per_1_minute: numberMaximum allowed requests per one-minute window.
-
-
status: "active" or "expired" or "cancelled"Current lifecycle state of the session.
-
"active" -
"expired" -
"cancelled"
-
-
user: stringUser identifier associated with the session.
-
workflow: object { id, state_variables, tracing, version }Workflow metadata for the session.
-
id: stringIdentifier of the workflow backing the session.
-
state_variables: map[string or boolean or number]State variable key-value pairs applied when invoking the workflow. Defaults to null when no overrides were provided.
-
union_member_0: string -
union_member_1: boolean -
union_member_2: number
-
-
tracing: object { enabled }Tracing settings applied to the workflow.
-
enabled: booleanIndicates whether tracing is enabled.
-
-
version: stringSpecific workflow version used for the session. Defaults to null when using the latest deployment.
-
-
Chat Session Automatic Thread Titling
-
chat_session_automatic_thread_titling: object { enabled }Automatic thread title preferences for the session.
-
enabled: booleanWhether automatic thread titling is enabled.
-
Chat Session ChatKit Configuration
-
chat_session_chatkit_configuration: object { automatic_thread_titling, file_upload, history }ChatKit configuration for the session.
-
automatic_thread_titling: object { enabled }Automatic thread titling preferences.
-
enabled: booleanWhether automatic thread titling is enabled.
-
-
file_upload: object { enabled, max_file_size, max_files }Upload settings for the session.
-
enabled: booleanIndicates if uploads are enabled for the session.
-
max_file_size: numberMaximum upload size in megabytes.
-
max_files: numberMaximum number of uploads allowed during the session.
-
-
history: object { enabled, recent_threads }History retention configuration.
-
enabled: booleanIndicates if chat history is persisted for the session.
-
recent_threads: numberNumber of prior threads surfaced in history views. Defaults to null when all history is retained.
-
-
Chat Session ChatKit Configuration Param
-
chat_session_chatkit_configuration_param: object { automatic_thread_titling, file_upload, history }Optional per-session configuration settings for ChatKit behavior.
-
automatic_thread_titling: optional object { enabled }Configuration for automatic thread titling. When omitted, automatic thread titling is enabled by default.
-
enabled: optional booleanEnable automatic thread title generation. Defaults to true.
-
-
file_upload: optional object { enabled, max_file_size, max_files }Configuration for upload enablement and limits. When omitted, uploads are disabled by default (max_files 10, max_file_size 512 MB).
-
enabled: optional booleanEnable uploads for this session. Defaults to false.
-
max_file_size: optional numberMaximum size in megabytes for each uploaded file. Defaults to 512 MB, which is the maximum allowable size.
-
max_files: optional numberMaximum number of files that can be uploaded to the session. Defaults to 10.
-
-
history: optional object { enabled, recent_threads }Configuration for chat history retention. When omitted, history is enabled by default with no limit on recent_threads (null).
-
enabled: optional booleanEnables chat users to access previous ChatKit threads. Defaults to true.
-
recent_threads: optional numberNumber of recent ChatKit threads users have access to. Defaults to unlimited when unset.
-
-
Chat Session Expires After Param
-
chat_session_expires_after_param: object { anchor, seconds }Controls when the session expires relative to an anchor timestamp.
-
anchor: "created_at"Base timestamp used to calculate expiration. Currently fixed to
created_at. -
seconds: numberNumber of seconds after the anchor when the session expires.
-
Chat Session File Upload
-
chat_session_file_upload: object { enabled, max_file_size, max_files }Upload permissions and limits applied to the session.
-
enabled: booleanIndicates if uploads are enabled for the session.
-
max_file_size: numberMaximum upload size in megabytes.
-
max_files: numberMaximum number of uploads allowed during the session.
-
Chat Session History
-
chat_session_history: object { enabled, recent_threads }History retention preferences returned for the session.
-
enabled: booleanIndicates if chat history is persisted for the session.
-
recent_threads: numberNumber of prior threads surfaced in history views. Defaults to null when all history is retained.
-
Chat Session Rate Limits
-
chat_session_rate_limits: object { max_requests_per_1_minute }Active per-minute request limit for the session.
-
max_requests_per_1_minute: numberMaximum allowed requests per one-minute window.
-
Chat Session Rate Limits Param
-
chat_session_rate_limits_param: object { max_requests_per_1_minute }Controls request rate limits for the session.
-
max_requests_per_1_minute: optional numberMaximum number of requests allowed per minute for the session. Defaults to 10.
-
Chat Session Status
-
chat_session_status: "active" or "expired" or "cancelled"-
"active" -
"expired" -
"cancelled"
-
Chat Session Workflow Param
-
chat_session_workflow_param: object { id, state_variables, tracing, version }Workflow reference and overrides applied to the chat session.
-
id: stringIdentifier for the workflow invoked by the session.
-
state_variables: optional map[string or boolean or number]State variables forwarded to the workflow. Keys may be up to 64 characters, values must be primitive types, and the map defaults to an empty object.
-
union_member_0: string -
union_member_1: boolean -
union_member_2: number
-
-
tracing: optional object { enabled }Optional tracing overrides for the workflow invocation. When omitted, tracing is enabled by default.
-
enabled: optional booleanWhether tracing is enabled during the session. Defaults to true.
-
-
version: optional stringSpecific workflow version to run. Defaults to the latest deployed version.
-
ChatKit Attachment
-
chatkit_attachment: object { id, mime_type, name, 2 more }Attachment metadata included on thread items.
-
id: stringIdentifier for the attachment.
-
mime_type: stringMIME type of the attachment.
-
name: stringOriginal display name for the attachment.
-
preview_url: stringPreview URL for rendering the attachment inline.
-
type: "image" or "file"Attachment discriminator.
-
"image" -
"file"
-
-
ChatKit Response Output Text
-
chatkit_response_output_text: object { annotations, text, type }Assistant response text accompanied by optional annotations.
-
annotations: array of object { source, type } or object { source, type }Ordered list of annotations attached to the response text.
-
file: object { source, type }Annotation that references an uploaded file.
-
source: object { filename, type }File attachment referenced by the annotation.
-
filename: stringFilename referenced by the annotation.
-
type: "file"Type discriminator that is always
file.
-
-
type: "file"Type discriminator that is always
filefor this annotation.
-
-
url: object { source, type }Annotation that references a URL.
-
source: object { type, url }URL referenced by the annotation.
-
type: "url"Type discriminator that is always
url. -
url: stringURL referenced by the annotation.
-
-
type: "url"Type discriminator that is always
urlfor this annotation.
-
-
-
text: stringAssistant generated text.
-
type: "output_text"Type discriminator that is always
output_text.
-
ChatKit Thread
-
chatkit_thread: object { id, created_at, object, 3 more }Represents a ChatKit thread and its current status.
-
id: stringIdentifier of the thread.
-
created_at: numberUnix timestamp (in seconds) for when the thread was created.
-
object: "chatkit.thread"Type discriminator that is always
chatkit.thread. -
status: object { type } or object { reason, type } or object { reason, type }Current status for the thread. Defaults to
activefor newly created threads.-
active: object { type }Indicates that a thread is active.
-
locked: object { reason, type }Indicates that a thread is locked and cannot accept new input.
-
reason: stringReason that the thread was locked. Defaults to null when no reason is recorded.
-
type: "locked"Status discriminator that is always
locked.
-
-
closed: object { reason, type }Indicates that a thread has been closed.
-
reason: stringReason that the thread was closed. Defaults to null when no reason is recorded.
-
type: "closed"Status discriminator that is always
closed.
-
-
-
title: stringOptional human-readable title for the thread. Defaults to null when no title has been generated.
-
user: stringFree-form string that identifies your end user who owns the thread.
-
ChatKit Thread Assistant Message Item
-
chatkit_thread_assistant_message_item: object { id, content, created_at, 3 more }Assistant-authored message within a thread.
-
id: stringIdentifier of the thread item.
-
content: array of ChatKitResponseOutputTextOrdered assistant response segments.
-
annotations: array of object { source, type } or object { source, type }Ordered list of annotations attached to the response text.
-
file: object { source, type }Annotation that references an uploaded file.
-
source: object { filename, type }File attachment referenced by the annotation.
-
filename: stringFilename referenced by the annotation.
-
type: "file"Type discriminator that is always
file.
-
-
type: "file"Type discriminator that is always
filefor this annotation.
-
-
url: object { source, type }Annotation that references a URL.
-
source: object { type, url }URL referenced by the annotation.
-
type: "url"Type discriminator that is always
url. -
url: stringURL referenced by the annotation.
-
-
type: "url"Type discriminator that is always
urlfor this annotation.
-
-
-
text: stringAssistant generated text.
-
type: "output_text"Type discriminator that is always
output_text.
-
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.assistant_message"Type discriminator that is always
chatkit.assistant_message.
-
ChatKit Thread Item List
-
chatkit_thread_item_list: object { data, first_id, has_more, 2 more }A paginated list of thread items rendered for the ChatKit API.
-
data: array of ChatKitThreadUserMessageItem or ChatKitThreadAssistantMessageItem or ChatKitWidgetItem or 3 moreA list of items
-
chatkit_thread_user_message_item: object { id, attachments, content, 5 more }User-authored messages within a thread.
-
id: stringIdentifier of the thread item.
-
attachments: array of ChatKitAttachmentAttachments associated with the user message. Defaults to an empty list.
-
id: stringIdentifier for the attachment.
-
mime_type: stringMIME type of the attachment.
-
name: stringOriginal display name for the attachment.
-
preview_url: stringPreview URL for rendering the attachment inline.
-
type: "image" or "file"Attachment discriminator.
-
"image" -
"file"
-
-
-
content: array of object { text, type } or object { text, type }Ordered content elements supplied by the user.
-
input_text: object { text, type }Text block that a user contributed to the thread.
-
text: stringPlain-text content supplied by the user.
-
type: "input_text"Type discriminator that is always
input_text.
-
-
quoted_text: object { text, type }Quoted snippet that the user referenced in their message.
-
text: stringQuoted text content.
-
type: "quoted_text"Type discriminator that is always
quoted_text.
-
-
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
inference_options: object { model, tool_choice }Inference overrides applied to the message. Defaults to null when unset.
-
model: stringModel name that generated the response. Defaults to null when using the session default.
-
tool_choice: object { id }Preferred tool to invoke. Defaults to null when ChatKit should auto-select.
-
id: stringIdentifier of the requested tool.
-
-
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.user_message"
-
-
chatkit_thread_assistant_message_item: object { id, content, created_at, 3 more }Assistant-authored message within a thread.
-
id: stringIdentifier of the thread item.
-
content: array of ChatKitResponseOutputTextOrdered assistant response segments.
-
annotations: array of object { source, type } or object { source, type }Ordered list of annotations attached to the response text.
-
file: object { source, type }Annotation that references an uploaded file.
-
source: object { filename, type }File attachment referenced by the annotation.
-
filename: stringFilename referenced by the annotation.
-
type: "file"Type discriminator that is always
file.
-
-
type: "file"Type discriminator that is always
filefor this annotation.
-
-
url: object { source, type }Annotation that references a URL.
-
source: object { type, url }URL referenced by the annotation.
-
type: "url"Type discriminator that is always
url. -
url: stringURL referenced by the annotation.
-
-
type: "url"Type discriminator that is always
urlfor this annotation.
-
-
-
text: stringAssistant generated text.
-
type: "output_text"Type discriminator that is always
output_text.
-
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.assistant_message"Type discriminator that is always
chatkit.assistant_message.
-
-
chatkit_widget_item: object { id, created_at, object, 3 more }Thread item that renders a widget payload.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.widget"Type discriminator that is always
chatkit.widget. -
widget: stringSerialized widget payload rendered in the UI.
-
-
chatkit.client_tool_call: object { id, arguments, call_id, 7 more }Record of a client side tool invocation initiated by the assistant.
-
id: stringIdentifier of the thread item.
-
arguments: stringJSON-encoded arguments that were sent to the tool.
-
call_id: stringIdentifier for the client tool call.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
name: stringTool name that was invoked.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
output: stringJSON-encoded output captured from the tool. Defaults to null while execution is in progress.
-
status: "in_progress" or "completed"Execution status for the tool call.
-
"in_progress" -
"completed"
-
-
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.client_tool_call"Type discriminator that is always
chatkit.client_tool_call.
-
-
chatkit.task: object { id, created_at, heading, 5 more }Task emitted by the workflow to show progress and status updates.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
heading: stringOptional heading for the task. Defaults to null when not provided.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
summary: stringOptional summary that describes the task. Defaults to null when omitted.
-
task_type: "custom" or "thought"Subtype for the task.
-
"custom" -
"thought"
-
-
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.task"Type discriminator that is always
chatkit.task.
-
-
chatkit.task_group: object { id, created_at, object, 3 more }Collection of workflow tasks grouped together in the thread.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
tasks: array of object { heading, summary, type }Tasks included in the group.
-
heading: stringOptional heading for the grouped task. Defaults to null when not provided.
-
summary: stringOptional summary that describes the grouped task. Defaults to null when omitted.
-
type: "custom" or "thought"Subtype for the grouped task.
-
"custom" -
"thought"
-
-
-
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.task_group"Type discriminator that is always
chatkit.task_group.
-
-
-
first_id: stringThe ID of the first item in the list.
-
has_more: booleanWhether there are more items available.
-
last_id: stringThe ID of the last item in the list.
-
object: "list"The type of object returned, must be
list.
-
ChatKit Thread User Message Item
-
chatkit_thread_user_message_item: object { id, attachments, content, 5 more }User-authored messages within a thread.
-
id: stringIdentifier of the thread item.
-
attachments: array of ChatKitAttachmentAttachments associated with the user message. Defaults to an empty list.
-
id: stringIdentifier for the attachment.
-
mime_type: stringMIME type of the attachment.
-
name: stringOriginal display name for the attachment.
-
preview_url: stringPreview URL for rendering the attachment inline.
-
type: "image" or "file"Attachment discriminator.
-
"image" -
"file"
-
-
-
content: array of object { text, type } or object { text, type }Ordered content elements supplied by the user.
-
input_text: object { text, type }Text block that a user contributed to the thread.
-
text: stringPlain-text content supplied by the user.
-
type: "input_text"Type discriminator that is always
input_text.
-
-
quoted_text: object { text, type }Quoted snippet that the user referenced in their message.
-
text: stringQuoted text content.
-
type: "quoted_text"Type discriminator that is always
quoted_text.
-
-
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
inference_options: object { model, tool_choice }Inference overrides applied to the message. Defaults to null when unset.
-
model: stringModel name that generated the response. Defaults to null when using the session default.
-
tool_choice: object { id }Preferred tool to invoke. Defaults to null when ChatKit should auto-select.
-
id: stringIdentifier of the requested tool.
-
-
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.user_message"
-
ChatKit Widget Item
-
chatkit_widget_item: object { id, created_at, object, 3 more }Thread item that renders a widget payload.
-
id: stringIdentifier of the thread item.
-
created_at: numberUnix timestamp (in seconds) for when the item was created.
-
object: "chatkit.thread_item"Type discriminator that is always
chatkit.thread_item. -
thread_id: stringIdentifier of the parent thread.
-
type: "chatkit.widget"Type discriminator that is always
chatkit.widget. -
widget: stringSerialized widget payload rendered in the UI.
-
Assistants
List assistants
$ openai beta:assistants list
get /assistants
Returns a list of assistants.
Parameters
-
--after: optional stringA cursor for use in pagination.
afteris an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, ending with obj_foo, your subsequent call can include after=obj_foo in order to fetch the next page of the list. -
--before: optional stringA cursor for use in pagination.
beforeis an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, starting with obj_foo, your subsequent call can include before=obj_foo in order to fetch the previous page of the list. -
--limit: optional numberA limit on the number of objects to be returned. Limit can range between 1 and 100, and the default is 20.
-
--order: optional "asc" or "desc"Sort order by the
created_attimestamp of the objects.ascfor ascending order anddescfor descending order.
Returns
-
ListAssistantsResponse: object { data, first_id, has_more, 2 more }-
data: array of Assistant-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the assistant was created.
-
description: stringThe description of the assistant. The maximum length is 512 characters.
-
instructions: stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
name: stringThe name of the assistant. The maximum length is 256 characters.
-
object: "assistant"The object type, which is always
assistant. -
tools: array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction.-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
response_format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
tool_resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the `code_interpreter`` tool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe ID of the vector store attached to this assistant. There can be a maximum of 1 vector store attached to the assistant.
-
-
-
top_p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
-
first_id: string -
has_more: boolean -
last_id: string -
object: string
-
Example
openai beta:assistants list \
--api-key 'My API Key'
Response
{
"data": [
{
"id": "id",
"created_at": 0,
"description": "description",
"instructions": "instructions",
"metadata": {
"foo": "string"
},
"model": "model",
"name": "name",
"object": "assistant",
"tools": [
{
"type": "code_interpreter"
}
],
"response_format": "auto",
"temperature": 1,
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
},
"top_p": 1
}
],
"first_id": "asst_abc123",
"has_more": false,
"last_id": "asst_abc456",
"object": "list"
}
Create assistant
$ openai beta:assistants create
post /assistants
Create an assistant with a model and instructions.
Parameters
-
--model: string or ChatModelID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
--description: optional stringThe description of the assistant. The maximum length is 512 characters.
-
--instructions: optional stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
--name: optional stringThe name of the assistant. The maximum length is 256 characters.
-
--reasoning-effort: optional "none" or "minimal" or "low" or 3 moreConstrains effort on reasoning for reasoning models. Currently supported values are
none,minimal,low,medium,high, andxhigh. Reducing reasoning effort can result in faster responses and fewer tokens used on reasoning in a response.gpt-5.1defaults tonone, which does not perform reasoning. The supported reasoning values forgpt-5.1arenone,low,medium, andhigh. Tool calls are supported for all reasoning values in gpt-5.1.- All models before
gpt-5.1default tomediumreasoning effort, and do not supportnone. - The
gpt-5-promodel defaults to (and only supports)highreasoning effort. xhighis supported for all models aftergpt-5.1-codex-max.
-
--response-format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
--temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
--tool-resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs. -
--tool: optional array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction. -
--top-p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
Returns
-
assistant: object { id, created_at, description, 10 more }Represents an
assistantthat can call the model and use tools.-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the assistant was created.
-
description: stringThe description of the assistant. The maximum length is 512 characters.
-
instructions: stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
name: stringThe name of the assistant. The maximum length is 256 characters.
-
object: "assistant"The object type, which is always
assistant. -
tools: array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction.-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
response_format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
tool_resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the `code_interpreter`` tool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe ID of the vector store attached to this assistant. There can be a maximum of 1 vector store attached to the assistant.
-
-
-
top_p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
Example
openai beta:assistants create \
--api-key 'My API Key' \
--model gpt-4o
Response
{
"id": "id",
"created_at": 0,
"description": "description",
"instructions": "instructions",
"metadata": {
"foo": "string"
},
"model": "model",
"name": "name",
"object": "assistant",
"tools": [
{
"type": "code_interpreter"
}
],
"response_format": "auto",
"temperature": 1,
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
},
"top_p": 1
}
Retrieve assistant
$ openai beta:assistants retrieve
get /assistants/{assistant_id}
Retrieves an assistant.
Parameters
-
--assistant-id: stringThe ID of the assistant to retrieve.
Returns
-
assistant: object { id, created_at, description, 10 more }Represents an
assistantthat can call the model and use tools.-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the assistant was created.
-
description: stringThe description of the assistant. The maximum length is 512 characters.
-
instructions: stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
name: stringThe name of the assistant. The maximum length is 256 characters.
-
object: "assistant"The object type, which is always
assistant. -
tools: array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction.-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
response_format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
tool_resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the `code_interpreter`` tool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe ID of the vector store attached to this assistant. There can be a maximum of 1 vector store attached to the assistant.
-
-
-
top_p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
Example
openai beta:assistants retrieve \
--api-key 'My API Key' \
--assistant-id assistant_id
Response
{
"id": "id",
"created_at": 0,
"description": "description",
"instructions": "instructions",
"metadata": {
"foo": "string"
},
"model": "model",
"name": "name",
"object": "assistant",
"tools": [
{
"type": "code_interpreter"
}
],
"response_format": "auto",
"temperature": 1,
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
},
"top_p": 1
}
Modify assistant
$ openai beta:assistants update
post /assistants/{assistant_id}
Modifies an assistant.
Parameters
-
--assistant-id: stringThe ID of the assistant to modify.
-
--description: optional stringThe description of the assistant. The maximum length is 512 characters.
-
--instructions: optional stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
--model: optional string or "gpt-5" or "gpt-5-mini" or "gpt-5-nano" or 39 moreID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
--name: optional stringThe name of the assistant. The maximum length is 256 characters.
-
--reasoning-effort: optional "none" or "minimal" or "low" or 3 moreConstrains effort on reasoning for reasoning models. Currently supported values are
none,minimal,low,medium,high, andxhigh. Reducing reasoning effort can result in faster responses and fewer tokens used on reasoning in a response.gpt-5.1defaults tonone, which does not perform reasoning. The supported reasoning values forgpt-5.1arenone,low,medium, andhigh. Tool calls are supported for all reasoning values in gpt-5.1.- All models before
gpt-5.1default tomediumreasoning effort, and do not supportnone. - The
gpt-5-promodel defaults to (and only supports)highreasoning effort. xhighis supported for all models aftergpt-5.1-codex-max.
-
--response-format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
--temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
--tool-resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs. -
--tool: optional array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction. -
--top-p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
Returns
-
assistant: object { id, created_at, description, 10 more }Represents an
assistantthat can call the model and use tools.-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the assistant was created.
-
description: stringThe description of the assistant. The maximum length is 512 characters.
-
instructions: stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
name: stringThe name of the assistant. The maximum length is 256 characters.
-
object: "assistant"The object type, which is always
assistant. -
tools: array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction.-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
response_format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
tool_resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the `code_interpreter`` tool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe ID of the vector store attached to this assistant. There can be a maximum of 1 vector store attached to the assistant.
-
-
-
top_p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
Example
openai beta:assistants update \
--api-key 'My API Key' \
--assistant-id assistant_id
Response
{
"id": "id",
"created_at": 0,
"description": "description",
"instructions": "instructions",
"metadata": {
"foo": "string"
},
"model": "model",
"name": "name",
"object": "assistant",
"tools": [
{
"type": "code_interpreter"
}
],
"response_format": "auto",
"temperature": 1,
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
},
"top_p": 1
}
Delete assistant
$ openai beta:assistants delete
delete /assistants/{assistant_id}
Delete an assistant.
Parameters
-
--assistant-id: stringThe ID of the assistant to delete.
Returns
-
assistant_deleted: object { id, deleted, object }-
id: string -
deleted: boolean -
object: "assistant.deleted"
-
Example
openai beta:assistants delete \
--api-key 'My API Key' \
--assistant-id assistant_id
Response
{
"id": "id",
"deleted": true,
"object": "assistant.deleted"
}
Domain Types
Assistant
-
assistant: object { id, created_at, description, 10 more }Represents an
assistantthat can call the model and use tools.-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the assistant was created.
-
description: stringThe description of the assistant. The maximum length is 512 characters.
-
instructions: stringThe system instructions that the assistant uses. The maximum length is 256,000 characters.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
name: stringThe name of the assistant. The maximum length is 256 characters.
-
object: "assistant"The object type, which is always
assistant. -
tools: array of AssistantToolA list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types
code_interpreter,file_search, orfunction.-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
response_format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
tool_resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the `code_interpreter`` tool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe ID of the vector store attached to this assistant. There can be a maximum of 1 vector store attached to the assistant.
-
-
-
top_p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
Assistant Deleted
-
assistant_deleted: object { id, deleted, object }-
id: string -
deleted: boolean -
object: "assistant.deleted"
-
Assistant Stream Event
-
assistant_stream_event: object { data, event, enabled } or object { data, event } or object { data, event } or 21 moreRepresents an event emitted when streaming a Run.
Each event in a server-sent events stream has an
eventanddataproperty:event: thread.created data: {"id": "thread_123", "object": "thread", ...}We emit events whenever a new object is created, transitions to a new state, or is being streamed in parts (deltas). For example, we emit
thread.run.createdwhen a new run is created,thread.run.completedwhen a run completes, and so on. When an Assistant chooses to create a message during a run, we emit athread.message.created event, athread.message.in_progressevent, manythread.message.deltaevents, and finally athread.message.completedevent.We may add additional events over time, so we recommend handling unknown events gracefully in your code. See the Assistants API quickstart to learn how to integrate the Assistants API with streaming.
-
thread.created: object { data, event, enabled }Occurs when a new thread is created.
-
data: object { id, created_at, metadata, 2 more }Represents a thread that contains messages.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the thread was created.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread"The object type, which is always
thread. -
tool_resources: object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the
code_interpretertool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe vector store attached to this thread. There can be a maximum of 1 vector store attached to the thread.
-
-
-
-
event: "thread.created" -
enabled: optional booleanWhether to enable input audio transcription.
-
-
thread.run.created: object { data, event }Occurs when a new run is created.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.created"
-
-
thread.run.queued: object { data, event }Occurs when a run moves to a
queuedstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.queued"
-
-
thread.run.in_progress: object { data, event }Occurs when a run moves to an
in_progressstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.in_progress"
-
-
thread.run.requires_action: object { data, event }Occurs when a run moves to a
requires_actionstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.requires_action"
-
-
thread.run.completed: object { data, event }Occurs when a run is completed.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.completed"
-
-
thread.run.incomplete: object { data, event }Occurs when a run ends with status
incomplete.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.incomplete"
-
-
thread.run.failed: object { data, event }Occurs when a run fails.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.failed"
-
-
thread.run.cancelling: object { data, event }Occurs when a run moves to a
cancellingstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.cancelling"
-
-
thread.run.cancelled: object { data, event }Occurs when a run is cancelled.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.cancelled"
-
-
thread.run.expired: object { data, event }Occurs when a run expires.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.expired"
-
-
thread.run.step.created: object { data, event }Occurs when a run step is created.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded"One of
server_errororrate_limit_exceeded.-
"server_error" -
"rate_limit_exceeded"
-
-
message: stringA human-readable description of the error.
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired.-
"in_progress" -
"cancelled" -
"failed" -
"completed" -
"expired"
-
-
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
message_creation_step_details: object { message_creation, type }Details of the message creation by the run step.
-
message_creation: object { message_id }-
message_id: stringThe ID of the message that was created by this run step.
-
-
type: "message_creation"Always
message_creation.
-
-
tool_calls_step_details: object { tool_calls, type }Details of the tool call.
-
tool_calls: array of ToolCallAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
-
type: "tool_calls"Always
tool_calls.
-
-
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls.-
"message_creation" -
"tool_calls"
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.-
completion_tokens: numberNumber of completion tokens used over the course of the run step.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run step.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
-
event: "thread.run.step.created"
-
-
thread.run.step.in_progress: object { data, event }Occurs when a run step moves to an
in_progressstate.-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.in_progress"
-
-
thread.run.step.delta: object { data, event }Occurs when parts of a run step are being streamed.
-
data: object { id, delta, object }Represents a run step delta i.e. any changed fields on a run step during streaming.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
delta: object { step_details }The delta containing the fields that have changed on the run step.
-
step_details: optional RunStepDeltaMessageDelta or ToolCallDeltaObjectThe details of the run step.
-
run_step_delta_message_delta: object { type, message_creation }Details of the message creation by the run step.
-
type: "message_creation"Always
message_creation. -
message_creation: optional object { message_id }-
message_id: optional stringThe ID of the message that was created by this run step.
-
-
-
tool_call_delta_object: object { type, tool_calls }Details of the tool call.
-
type: "tool_calls"Always
tool_calls. -
tool_calls: optional array of ToolCallDeltaAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
-
-
-
-
-
object: "thread.run.step.delta"The object type, which is always
thread.run.step.delta.
-
-
event: "thread.run.step.delta"
-
-
thread.run.step.completed: object { data, event }Occurs when a run step is completed.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.completed"
-
-
thread.run.step.failed: object { data, event }Occurs when a run step fails.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.failed"
-
-
thread.run.step.cancelled: object { data, event }Occurs when a run step is cancelled.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.cancelled"
-
-
thread.run.step.expired: object { data, event }Occurs when a run step expires.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.expired"
-
-
thread.message.created: object { data, event }Occurs when a message is created.
-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.created"
-
-
thread.message.in_progress: object { data, event }Occurs when a message moves to an
in_progressstate.-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant. -
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted. -
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.in_progress"
-
-
thread.message.delta: object { data, event }Occurs when parts of a Message are being streamed.
-
data: object { id, delta, object }Represents a message delta i.e. any changed fields on a message during streaming.
-
id: stringThe identifier of the message, which can be referenced in API endpoints.
-
delta: object { content, role }The delta containing the fields that have changed on the Message.
-
content: optional array of MessageContentDeltaThe content of the message in array of text and/or images.
-
image_file_delta_block: object { index, type, image_file }References an image File in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_file"Always
image_file. -
image_file: optional object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
-
-
text_delta_block: object { index, type, text }The text content that is part of a message.
-
index: numberThe index of the content part in the message.
-
type: "text"Always
text. -
text: optional object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
-
-
refusal_delta_block: object { index, type, refusal }The refusal content that is part of a message.
-
index: numberThe index of the refusal part in the message.
-
type: "refusal"Always
refusal. -
refusal: optional string
-
-
image_url_delta_block: object { index, type, image_url }References an image URL in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_url"Always
image_url. -
image_url: optional object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
-
-
-
role: optional "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
-
object: "thread.message.delta"The object type, which is always
thread.message.delta.
-
-
event: "thread.message.delta"
-
-
thread.message.completed: object { data, event }Occurs when a message is completed.
-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant. -
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted. -
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.completed"
-
-
thread.message.incomplete: object { data, event }Occurs when a message ends before it is completed.
-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant. -
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted. -
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.incomplete"
-
-
error_event: object { data, event }Occurs when an error occurs. This can happen due to an internal server error or a timeout.
-
data: object { code, message, param, type }-
code: string -
message: string -
param: string -
type: string
-
-
event: "error"
-
-
Assistant Tool
-
assistant_tool: CodeInterpreterTool or FileSearchTool or FunctionTool-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
Code Interpreter Tool
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
File Search Tool
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
Function Tool
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
Message Stream Event
-
message_stream_event: object { data, event } or object { data, event } or object { data, event } or 2 moreOccurs when a message is created.
-
thread.message.created: object { data, event }Occurs when a message is created.
-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.created"
-
-
thread.message.in_progress: object { data, event }Occurs when a message moves to an
in_progressstate.-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant. -
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted. -
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.in_progress"
-
-
thread.message.delta: object { data, event }Occurs when parts of a Message are being streamed.
-
data: object { id, delta, object }Represents a message delta i.e. any changed fields on a message during streaming.
-
id: stringThe identifier of the message, which can be referenced in API endpoints.
-
delta: object { content, role }The delta containing the fields that have changed on the Message.
-
content: optional array of MessageContentDeltaThe content of the message in array of text and/or images.
-
image_file_delta_block: object { index, type, image_file }References an image File in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_file"Always
image_file. -
image_file: optional object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
-
-
text_delta_block: object { index, type, text }The text content that is part of a message.
-
index: numberThe index of the content part in the message.
-
type: "text"Always
text. -
text: optional object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
-
-
refusal_delta_block: object { index, type, refusal }The refusal content that is part of a message.
-
index: numberThe index of the refusal part in the message.
-
type: "refusal"Always
refusal. -
refusal: optional string
-
-
image_url_delta_block: object { index, type, image_url }References an image URL in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_url"Always
image_url. -
image_url: optional object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
-
-
-
role: optional "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
-
object: "thread.message.delta"The object type, which is always
thread.message.delta.
-
-
event: "thread.message.delta"
-
-
thread.message.completed: object { data, event }Occurs when a message is completed.
-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant. -
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted. -
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.completed"
-
-
thread.message.incomplete: object { data, event }Occurs when a message ends before it is completed.
-
data: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant. -
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted. -
thread_id: stringThe thread ID that this message belongs to.
-
-
event: "thread.message.incomplete"
-
-
Run Step Stream Event
-
run_step_stream_event: object { data, event } or object { data, event } or object { data, event } or 4 moreOccurs when a run step is created.
-
thread.run.step.created: object { data, event }Occurs when a run step is created.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded"One of
server_errororrate_limit_exceeded.-
"server_error" -
"rate_limit_exceeded"
-
-
message: stringA human-readable description of the error.
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired.-
"in_progress" -
"cancelled" -
"failed" -
"completed" -
"expired"
-
-
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
message_creation_step_details: object { message_creation, type }Details of the message creation by the run step.
-
message_creation: object { message_id }-
message_id: stringThe ID of the message that was created by this run step.
-
-
type: "message_creation"Always
message_creation.
-
-
tool_calls_step_details: object { tool_calls, type }Details of the tool call.
-
tool_calls: array of ToolCallAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
-
type: "tool_calls"Always
tool_calls.
-
-
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls.-
"message_creation" -
"tool_calls"
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.-
completion_tokens: numberNumber of completion tokens used over the course of the run step.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run step.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
-
event: "thread.run.step.created"
-
-
thread.run.step.in_progress: object { data, event }Occurs when a run step moves to an
in_progressstate.-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.in_progress"
-
-
thread.run.step.delta: object { data, event }Occurs when parts of a run step are being streamed.
-
data: object { id, delta, object }Represents a run step delta i.e. any changed fields on a run step during streaming.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
delta: object { step_details }The delta containing the fields that have changed on the run step.
-
step_details: optional RunStepDeltaMessageDelta or ToolCallDeltaObjectThe details of the run step.
-
run_step_delta_message_delta: object { type, message_creation }Details of the message creation by the run step.
-
type: "message_creation"Always
message_creation. -
message_creation: optional object { message_id }-
message_id: optional stringThe ID of the message that was created by this run step.
-
-
-
tool_call_delta_object: object { type, tool_calls }Details of the tool call.
-
type: "tool_calls"Always
tool_calls. -
tool_calls: optional array of ToolCallDeltaAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
-
-
-
-
-
object: "thread.run.step.delta"The object type, which is always
thread.run.step.delta.
-
-
event: "thread.run.step.delta"
-
-
thread.run.step.completed: object { data, event }Occurs when a run step is completed.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.completed"
-
-
thread.run.step.failed: object { data, event }Occurs when a run step fails.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.failed"
-
-
thread.run.step.cancelled: object { data, event }Occurs when a run step is cancelled.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.cancelled"
-
-
thread.run.step.expired: object { data, event }Occurs when a run step expires.
-
data: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors. -
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired. -
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls. -
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.
-
-
event: "thread.run.step.expired"
-
-
Run Stream Event
-
run_stream_event: object { data, event } or object { data, event } or object { data, event } or 7 moreOccurs when a new run is created.
-
thread.run.created: object { data, event }Occurs when a new run is created.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.created"
-
-
thread.run.queued: object { data, event }Occurs when a run moves to a
queuedstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.queued"
-
-
thread.run.in_progress: object { data, event }Occurs when a run moves to an
in_progressstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.in_progress"
-
-
thread.run.requires_action: object { data, event }Occurs when a run moves to a
requires_actionstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.requires_action"
-
-
thread.run.completed: object { data, event }Occurs when a run is completed.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.completed"
-
-
thread.run.incomplete: object { data, event }Occurs when a run ends with status
incomplete.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.incomplete"
-
-
thread.run.failed: object { data, event }Occurs when a run fails.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.failed"
-
-
thread.run.cancelling: object { data, event }Occurs when a run moves to a
cancellingstatus.-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.cancelling"
-
-
thread.run.cancelled: object { data, event }Occurs when a run is cancelled.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.cancelled"
-
-
thread.run.expired: object { data, event }Occurs when a run expires.
-
data: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete. -
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors. -
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required. -
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired. -
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.). -
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
event: "thread.run.expired"
-
-
Thread Stream Event
-
thread_stream_event: object { data, event, enabled }Occurs when a new thread is created.
-
data: object { id, created_at, metadata, 2 more }Represents a thread that contains messages.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the thread was created.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread"The object type, which is always
thread. -
tool_resources: object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the
code_interpretertool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe vector store attached to this thread. There can be a maximum of 1 vector store attached to the thread.
-
-
-
-
event: "thread.created" -
enabled: optional booleanWhether to enable input audio transcription.
-
Threads
Create thread
$ openai beta:threads create
post /threads
Create a thread.
Parameters
-
--message: optional array of object { content, role, attachments, metadata }A list of messages to start the thread with.
-
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
--tool-resources: optional object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.
Returns
-
thread: object { id, created_at, metadata, 2 more }Represents a thread that contains messages.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the thread was created.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread"The object type, which is always
thread. -
tool_resources: object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the
code_interpretertool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe vector store attached to this thread. There can be a maximum of 1 vector store attached to the thread.
-
-
-
Example
openai beta:threads create \
--api-key 'My API Key'
Response
{
"id": "id",
"created_at": 0,
"metadata": {
"foo": "string"
},
"object": "thread",
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
}
}
Create thread and run
$ openai beta:threads create-and-run
post /threads/runs
Create a thread and run it in one request.
Parameters
-
--assistant-id: stringThe ID of the assistant to use to execute this run.
-
--instructions: optional stringOverride the default system message of the assistant. This is useful for modifying the behavior on a per-run basis.
-
--max-completion-tokens: optional numberThe maximum number of completion tokens that may be used over the course of the run. The run will make a best effort to use only the number of completion tokens specified, across multiple turns of the run. If the run exceeds the number of completion tokens specified, the run will end with status
incomplete. Seeincomplete_detailsfor more info. -
--max-prompt-tokens: optional numberThe maximum number of prompt tokens that may be used over the course of the run. The run will make a best effort to use only the number of prompt tokens specified, across multiple turns of the run. If the run exceeds the number of prompt tokens specified, the run will end with status
incomplete. Seeincomplete_detailsfor more info. -
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
--model: optional string or ChatModelThe ID of the Model to be used to execute this run. If a value is provided here, it will override the model associated with the assistant. If not, the model associated with the assistant will be used.
-
--parallel-tool-calls: optional booleanWhether to enable parallel function calling during tool use.
-
--response-format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
--temperature: optional numberWhat sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
--thread: optional object { messages, metadata, tool_resources }Options to create a new thread. If no thread is provided when running a request, an empty thread will be created.
-
--tool-choice: optional "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
--tool-resources: optional object { code_interpreter, file_search }A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs. -
--tool: optional array of AssistantToolOverride the tools the assistant can use for this run. This is useful for modifying the behavior on a per-run basis.
-
--top-p: optional numberAn alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
--truncation-strategy: optional object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
Returns
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Example
openai beta:threads create-and-run \
--api-key 'My API Key' \
--assistant-id assistant_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
Retrieve thread
$ openai beta:threads retrieve
get /threads/{thread_id}
Retrieves a thread.
Parameters
-
--thread-id: stringThe ID of the thread to retrieve.
Returns
-
thread: object { id, created_at, metadata, 2 more }Represents a thread that contains messages.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the thread was created.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread"The object type, which is always
thread. -
tool_resources: object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the
code_interpretertool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe vector store attached to this thread. There can be a maximum of 1 vector store attached to the thread.
-
-
-
Example
openai beta:threads retrieve \
--api-key 'My API Key' \
--thread-id thread_id
Response
{
"id": "id",
"created_at": 0,
"metadata": {
"foo": "string"
},
"object": "thread",
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
}
}
Modify thread
$ openai beta:threads update
post /threads/{thread_id}
Modifies a thread.
Parameters
-
--thread-id: stringThe ID of the thread to modify. Only the
metadatacan be modified. -
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
--tool-resources: optional object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.
Returns
-
thread: object { id, created_at, metadata, 2 more }Represents a thread that contains messages.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the thread was created.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread"The object type, which is always
thread. -
tool_resources: object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the
code_interpretertool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe vector store attached to this thread. There can be a maximum of 1 vector store attached to the thread.
-
-
-
Example
openai beta:threads update \
--api-key 'My API Key' \
--thread-id thread_id
Response
{
"id": "id",
"created_at": 0,
"metadata": {
"foo": "string"
},
"object": "thread",
"tool_resources": {
"code_interpreter": {
"file_ids": [
"string"
]
},
"file_search": {
"vector_store_ids": [
"string"
]
}
}
}
Delete thread
$ openai beta:threads delete
delete /threads/{thread_id}
Delete a thread.
Parameters
-
--thread-id: stringThe ID of the thread to delete.
Returns
-
thread_deleted: object { id, deleted, object }-
id: string -
deleted: boolean -
object: "thread.deleted"
-
Example
openai beta:threads delete \
--api-key 'My API Key' \
--thread-id thread_id
Response
{
"id": "id",
"deleted": true,
"object": "thread.deleted"
}
Domain Types
Assistant Response Format Option
-
assistant_response_format_option: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
Assistant Tool Choice
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
Assistant Tool Choice Function
-
assistant_tool_choice_function: object { name }-
name: stringThe name of the function to call.
-
Assistant Tool Choice Option
-
assistant_tool_choice_option: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
Thread
-
thread: object { id, created_at, metadata, 2 more }Represents a thread that contains messages.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
created_at: numberThe Unix timestamp (in seconds) for when the thread was created.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread"The object type, which is always
thread. -
tool_resources: object { code_interpreter, file_search }A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpretertool requires a list of file IDs, while thefile_searchtool requires a list of vector store IDs.-
code_interpreter: optional object { file_ids }-
file_ids: optional array of stringA list of file IDs made available to the
code_interpretertool. There can be a maximum of 20 files associated with the tool.
-
-
file_search: optional object { vector_store_ids }-
vector_store_ids: optional array of stringThe vector store attached to this thread. There can be a maximum of 1 vector store attached to the thread.
-
-
-
Thread Deleted
-
thread_deleted: object { id, deleted, object }-
id: string -
deleted: boolean -
object: "thread.deleted"
-
Runs
List runs
$ openai beta:threads:runs list
get /threads/{thread_id}/runs
Returns a list of runs belonging to a thread.
Parameters
-
--thread-id: stringThe ID of the thread the run belongs to.
-
--after: optional stringA cursor for use in pagination.
afteris an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, ending with obj_foo, your subsequent call can include after=obj_foo in order to fetch the next page of the list. -
--before: optional stringA cursor for use in pagination.
beforeis an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, starting with obj_foo, your subsequent call can include before=obj_foo in order to fetch the previous page of the list. -
--limit: optional numberA limit on the number of objects to be returned. Limit can range between 1 and 100, and the default is 20.
-
--order: optional "asc" or "desc"Sort order by the
created_attimestamp of the objects.ascfor ascending order anddescfor descending order.
Returns
-
ListRunsResponse: object { data, first_id, has_more, 2 more }-
data: array of Run-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
-
first_id: string -
has_more: boolean -
last_id: string -
object: string
-
Example
openai beta:threads:runs list \
--api-key 'My API Key' \
--thread-id thread_id
Response
{
"data": [
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
],
"first_id": "run_abc123",
"has_more": false,
"last_id": "run_abc456",
"object": "list"
}
Create run
$ openai beta:threads:runs create
post /threads/{thread_id}/runs
Create a run.
Parameters
-
--thread-id: stringPath param: The ID of the thread to run.
-
--assistant-id: stringBody param: The ID of the assistant to use to execute this run.
-
--include: optional array of RunStepIncludeQuery param: A list of additional fields to include in the response. Currently the only supported value is
step_details.tool_calls[*].file_search.results[*].contentto fetch the file search result content.See the file search tool documentation for more information.
-
--additional-instructions: optional stringBody param: Appends additional instructions at the end of the instructions for the run. This is useful for modifying the behavior on a per-run basis without overriding other instructions.
-
--additional-message: optional array of object { content, role, attachments, metadata }Body param: Adds additional messages to the thread before creating the run.
-
--instructions: optional stringBody param: Overrides the instructions of the assistant. This is useful for modifying the behavior on a per-run basis.
-
--max-completion-tokens: optional numberBody param: The maximum number of completion tokens that may be used over the course of the run. The run will make a best effort to use only the number of completion tokens specified, across multiple turns of the run. If the run exceeds the number of completion tokens specified, the run will end with status
incomplete. Seeincomplete_detailsfor more info. -
--max-prompt-tokens: optional numberBody param: The maximum number of prompt tokens that may be used over the course of the run. The run will make a best effort to use only the number of prompt tokens specified, across multiple turns of the run. If the run exceeds the number of prompt tokens specified, the run will end with status
incomplete. Seeincomplete_detailsfor more info. -
--metadata: optional map[string]Body param: Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
--model: optional string or ChatModelBody param: The ID of the Model to be used to execute this run. If a value is provided here, it will override the model associated with the assistant. If not, the model associated with the assistant will be used.
-
--parallel-tool-calls: optional booleanBody param: Whether to enable parallel function calling during tool use.
-
--reasoning-effort: optional "none" or "minimal" or "low" or 3 moreBody param: Constrains effort on reasoning for reasoning models. Currently supported values are
none,minimal,low,medium,high, andxhigh. Reducing reasoning effort can result in faster responses and fewer tokens used on reasoning in a response.gpt-5.1defaults tonone, which does not perform reasoning. The supported reasoning values forgpt-5.1arenone,low,medium, andhigh. Tool calls are supported for all reasoning values in gpt-5.1.- All models before
gpt-5.1default tomediumreasoning effort, and do not supportnone. - The
gpt-5-promodel defaults to (and only supports)highreasoning effort. xhighis supported for all models aftergpt-5.1-codex-max.
-
--response-format: optional "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaBody param: Specifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length. -
--temperature: optional numberBody param: What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
-
--tool-choice: optional "none" or "auto" or "required" or AssistantToolChoiceBody param: Controls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool. -
--tool: optional array of AssistantToolBody param: Override the tools the assistant can use for this run. This is useful for modifying the behavior on a per-run basis.
-
--top-p: optional numberBody param: An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
-
--truncation-strategy: optional object { type, last_messages }Body param: Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
Returns
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Example
openai beta:threads:runs create \
--api-key 'My API Key' \
--thread-id thread_id \
--assistant-id assistant_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
Retrieve run
$ openai beta:threads:runs retrieve
get /threads/{thread_id}/runs/{run_id}
Retrieves a run.
Parameters
-
--thread-id: stringThe ID of the thread that was run.
-
--run-id: stringThe ID of the run to retrieve.
Returns
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Example
openai beta:threads:runs retrieve \
--api-key 'My API Key' \
--thread-id thread_id \
--run-id run_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
Modify run
$ openai beta:threads:runs update
post /threads/{thread_id}/runs/{run_id}
Modifies a run.
Parameters
-
--thread-id: stringThe ID of the thread that was run.
-
--run-id: stringThe ID of the run to modify.
-
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
Returns
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Example
openai beta:threads:runs update \
--api-key 'My API Key' \
--thread-id thread_id \
--run-id run_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
Submit tool outputs to run
$ openai beta:threads:runs submit-tool-outputs
post /threads/{thread_id}/runs/{run_id}/submit_tool_outputs
When a run has the status: "requires_action" and required_action.type is submit_tool_outputs, this endpoint can be used to submit the outputs from the tool calls once they're all completed. All outputs must be submitted in a single request.
Parameters
-
--thread-id: stringThe ID of the thread to which this run belongs.
-
--run-id: stringThe ID of the run that requires the tool output submission.
-
--tool-output: array of object { output, tool_call_id }A list of tools for which the outputs are being submitted.
Returns
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Example
openai beta:threads:runs submit-tool-outputs \
--api-key 'My API Key' \
--thread-id thread_id \
--run-id run_id \
--tool-output '{}'
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
Cancel a run
$ openai beta:threads:runs cancel
post /threads/{thread_id}/runs/{run_id}/cancel
Cancels a run that is in_progress.
Parameters
-
--thread-id: stringThe ID of the thread to which this run belongs.
-
--run-id: stringThe ID of the run to cancel.
Returns
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Example
openai beta:threads:runs cancel \
--api-key 'My API Key' \
--thread-id thread_id \
--run-id run_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expires_at": 0,
"failed_at": 0,
"incomplete_details": {
"reason": "max_completion_tokens"
},
"instructions": "instructions",
"last_error": {
"code": "server_error",
"message": "message"
},
"max_completion_tokens": 256,
"max_prompt_tokens": 256,
"metadata": {
"foo": "string"
},
"model": "model",
"object": "thread.run",
"parallel_tool_calls": true,
"required_action": {
"submit_tool_outputs": {
"tool_calls": [
{
"id": "id",
"function": {
"arguments": "arguments",
"name": "name"
},
"type": "function"
}
]
},
"type": "submit_tool_outputs"
},
"response_format": "auto",
"started_at": 0,
"status": "queued",
"thread_id": "thread_id",
"tool_choice": "none",
"tools": [
{
"type": "code_interpreter"
}
],
"truncation_strategy": {
"type": "auto",
"last_messages": 1
},
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
},
"temperature": 0,
"top_p": 0
}
Domain Types
Required Action Function Tool Call
-
required_action_function_tool_call: object { id, function, type }Tool call objects
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
Run
-
run: object { id, assistant_id, cancelled_at, 24 more }Represents an execution run on a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant used for execution of this run.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run was completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run was created.
-
expires_at: numberThe Unix timestamp (in seconds) for when the run will expire.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run failed.
-
incomplete_details: object { reason }Details on why the run is incomplete. Will be
nullif the run is not incomplete.-
reason: optional "max_completion_tokens" or "max_prompt_tokens"The reason why the run is incomplete. This will point to which specific token limit was reached over the course of the run.
-
"max_completion_tokens" -
"max_prompt_tokens"
-
-
-
instructions: stringThe instructions that the assistant used for this run.
-
last_error: object { code, message }The last error associated with this run. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded" or "invalid_prompt"One of
server_error,rate_limit_exceeded, orinvalid_prompt.-
"server_error" -
"rate_limit_exceeded" -
"invalid_prompt"
-
-
message: stringA human-readable description of the error.
-
-
max_completion_tokens: numberThe maximum number of completion tokens specified to have been used over the course of the run.
-
max_prompt_tokens: numberThe maximum number of prompt tokens specified to have been used over the course of the run.
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
model: stringThe model that the assistant used for this run.
-
object: "thread.run"The object type, which is always
thread.run. -
parallel_tool_calls: booleanWhether to enable parallel function calling during tool use.
-
required_action: object { submit_tool_outputs, type }Details on the action required to continue the run. Will be
nullif no action is required.-
submit_tool_outputs: object { tool_calls }Details on the tool outputs needed for this run to continue.
-
tool_calls: array of RequiredActionFunctionToolCallA list of the relevant tool calls.
-
id: stringThe ID of the tool call. This ID must be referenced when you submit the tool outputs in using the Submit tool outputs to run endpoint.
-
function: object { arguments, name }The function definition.
-
arguments: stringThe arguments that the model expects you to pass to the function.
-
name: stringThe name of the function.
-
-
type: "function"The type of tool call the output is required for. For now, this is always
function.
-
-
-
type: "submit_tool_outputs"For now, this is always
submit_tool_outputs.
-
-
response_format: "auto" or ResponseFormatText or ResponseFormatJSONObject or ResponseFormatJSONSchemaSpecifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106.Setting to
{ "type": "json_schema", "json_schema": {...} }enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.Setting to
{ "type": "json_object" }enables JSON mode, which ensures the message the model generates is valid JSON.Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length", which indicates the generation exceededmax_tokensor the conversation exceeded the max context length.-
union_member_0: "auto"autois the default value -
response_format_text: object { type }Default response format. Used to generate text responses.
-
type: "text"The type of response format being defined. Always
text.
-
-
response_format_json_object: object { type }JSON object response format. An older method of generating JSON responses. Using
json_schemais recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.-
type: "json_object"The type of response format being defined. Always
json_object.
-
-
response_format_json_schema: object { json_schema, type }JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
-
json_schema: object { name, description, schema, strict }Structured Outputs configuration options, including a JSON Schema.
-
name: stringThe name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the response format is for, used by the model to determine how to respond in the format.
-
schema: optional map[unknown]The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
-
strict: optional booleanWhether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schemafield. Only a subset of JSON Schema is supported whenstrictistrue. To learn more, read the Structured Outputs guide.
-
-
type: "json_schema"The type of response format being defined. Always
json_schema.
-
-
-
started_at: numberThe Unix timestamp (in seconds) for when the run was started.
-
status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
-
thread_id: stringThe ID of the thread that was executed on as a part of this run.
-
tool_choice: "none" or "auto" or "required" or AssistantToolChoiceControls which (if any) tool is called by the model.
nonemeans the model will not call any tools and instead generates a message.autois the default value and means the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user. Specifying a particular tool like{"type": "file_search"}or{"type": "function", "function": {"name": "my_function"}}forces the model to call that tool.-
Auto: "none" or "auto" or "required"nonemeans the model will not call any tools and instead generates a message.automeans the model can pick between generating a message or calling one or more tools.requiredmeans the model must call one or more tools before responding to the user.-
"none" -
"auto" -
"required"
-
-
assistant_tool_choice: object { type, function }Specifies a tool the model should use. Use to force the model to call a specific tool.
-
type: "function" or "code_interpreter" or "file_search"The type of the tool. If type is
function, the function name must be set-
"function" -
"code_interpreter" -
"file_search"
-
-
function: optional object { name }-
name: stringThe name of the function to call.
-
-
-
-
tools: array of AssistantToolThe list of tools that the assistant used for this run.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
file_search_tool: object { type, file_search }-
type: "file_search"The type of tool being defined:
file_search -
file_search: optional object { max_num_results, ranking_options }Overrides for the file search tool.
-
max_num_results: optional numberThe maximum number of results the file search tool should output. The default is 20 for
gpt-4*models and 5 forgpt-3.5-turbo. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_resultsresults. See the file search tool documentation for more information. -
ranking_options: optional object { score_threshold, ranker }The ranking options for the file search. If not specified, the file search tool will use the
autoranker and a score_threshold of 0.See the file search tool documentation for more information.
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
ranker: optional "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
-
-
-
function_tool: object { function, type }-
function: object { name, description, parameters, strict }-
name: stringThe name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.
-
description: optional stringA description of what the function does, used by the model to choose when and how to call the function.
-
parameters: optional map[unknown]The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parametersdefines a function with an empty parameter list. -
strict: optional booleanWhether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parametersfield. Only a subset of JSON Schema is supported whenstrictistrue. Learn more about Structured Outputs in the function calling guide.
-
-
type: "function"The type of tool being defined:
function
-
-
-
truncation_strategy: object { type, last_messages }Controls for how a thread will be truncated prior to the run. Use this to control the initial context window of the run.
-
type: "auto" or "last_messages"The truncation strategy to use for the thread. The default is
auto. If set tolast_messages, the thread will be truncated to the n most recent messages in the thread. When set toauto, messages in the middle of the thread will be dropped to fit the context length of the model,max_prompt_tokens.-
"auto" -
"last_messages"
-
-
last_messages: optional numberThe number of most recent messages from the thread when constructing the context for the run.
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run. This value will be
nullif the run is not in a terminal state (i.e.in_progress,queued, etc.).-
completion_tokens: numberNumber of completion tokens used over the course of the run.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
temperature: optional numberThe sampling temperature used for this run. If not set, defaults to 1.
-
top_p: optional numberThe nucleus sampling value used for this run. If not set, defaults to 1.
-
Run Status
-
run_status: "queued" or "in_progress" or "requires_action" or 6 moreThe status of the run, which can be either
queued,in_progress,requires_action,cancelling,cancelled,failed,completed,incomplete, orexpired.-
"queued" -
"in_progress" -
"requires_action" -
"cancelling" -
"cancelled" -
"failed" -
"completed" -
"incomplete" -
"expired"
-
Steps
List run steps
$ openai beta:threads:runs:steps list
get /threads/{thread_id}/runs/{run_id}/steps
Returns a list of run steps belonging to a run.
Parameters
-
--thread-id: stringThe ID of the thread the run and run steps belong to.
-
--run-id: stringThe ID of the run the run steps belong to.
-
--after: optional stringA cursor for use in pagination.
afteris an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, ending with obj_foo, your subsequent call can include after=obj_foo in order to fetch the next page of the list. -
--before: optional stringA cursor for use in pagination.
beforeis an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, starting with obj_foo, your subsequent call can include before=obj_foo in order to fetch the previous page of the list. -
--include: optional array of RunStepIncludeA list of additional fields to include in the response. Currently the only supported value is
step_details.tool_calls[*].file_search.results[*].contentto fetch the file search result content.See the file search tool documentation for more information.
-
--limit: optional numberA limit on the number of objects to be returned. Limit can range between 1 and 100, and the default is 20.
-
--order: optional "asc" or "desc"Sort order by the
created_attimestamp of the objects.ascfor ascending order anddescfor descending order.
Returns
-
ListRunStepsResponse: object { data, first_id, has_more, 2 more }-
data: array of RunStep-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded"One of
server_errororrate_limit_exceeded.-
"server_error" -
"rate_limit_exceeded"
-
-
message: stringA human-readable description of the error.
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired.-
"in_progress" -
"cancelled" -
"failed" -
"completed" -
"expired"
-
-
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
message_creation_step_details: object { message_creation, type }Details of the message creation by the run step.
-
message_creation: object { message_id }-
message_id: stringThe ID of the message that was created by this run step.
-
-
type: "message_creation"Always
message_creation.
-
-
tool_calls_step_details: object { tool_calls, type }Details of the tool call.
-
tool_calls: array of ToolCallAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
-
type: "tool_calls"Always
tool_calls.
-
-
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls.-
"message_creation" -
"tool_calls"
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.-
completion_tokens: numberNumber of completion tokens used over the course of the run step.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run step.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
-
first_id: string -
has_more: boolean -
last_id: string -
object: string
-
Example
openai beta:threads:runs:steps list \
--api-key 'My API Key' \
--thread-id thread_id \
--run-id run_id
Response
{
"data": [
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expired_at": 0,
"failed_at": 0,
"last_error": {
"code": "server_error",
"message": "message"
},
"metadata": {
"foo": "string"
},
"object": "thread.run.step",
"run_id": "run_id",
"status": "in_progress",
"step_details": {
"message_creation": {
"message_id": "message_id"
},
"type": "message_creation"
},
"thread_id": "thread_id",
"type": "message_creation",
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
}
}
],
"first_id": "step_abc123",
"has_more": false,
"last_id": "step_abc456",
"object": "list"
}
Retrieve run step
$ openai beta:threads:runs:steps retrieve
get /threads/{thread_id}/runs/{run_id}/steps/{step_id}
Retrieves a run step.
Parameters
-
--thread-id: stringThe ID of the thread to which the run and run step belongs.
-
--run-id: stringThe ID of the run to which the run step belongs.
-
--step-id: stringThe ID of the run step to retrieve.
-
--include: optional array of RunStepIncludeA list of additional fields to include in the response. Currently the only supported value is
step_details.tool_calls[*].file_search.results[*].contentto fetch the file search result content.See the file search tool documentation for more information.
Returns
-
run_step: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded"One of
server_errororrate_limit_exceeded.-
"server_error" -
"rate_limit_exceeded"
-
-
message: stringA human-readable description of the error.
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired.-
"in_progress" -
"cancelled" -
"failed" -
"completed" -
"expired"
-
-
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
message_creation_step_details: object { message_creation, type }Details of the message creation by the run step.
-
message_creation: object { message_id }-
message_id: stringThe ID of the message that was created by this run step.
-
-
type: "message_creation"Always
message_creation.
-
-
tool_calls_step_details: object { tool_calls, type }Details of the tool call.
-
tool_calls: array of ToolCallAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
-
type: "tool_calls"Always
tool_calls.
-
-
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls.-
"message_creation" -
"tool_calls"
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.-
completion_tokens: numberNumber of completion tokens used over the course of the run step.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run step.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
Example
openai beta:threads:runs:steps retrieve \
--api-key 'My API Key' \
--thread-id thread_id \
--run-id run_id \
--step-id step_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"cancelled_at": 0,
"completed_at": 0,
"created_at": 0,
"expired_at": 0,
"failed_at": 0,
"last_error": {
"code": "server_error",
"message": "message"
},
"metadata": {
"foo": "string"
},
"object": "thread.run.step",
"run_id": "run_id",
"status": "in_progress",
"step_details": {
"message_creation": {
"message_id": "message_id"
},
"type": "message_creation"
},
"thread_id": "thread_id",
"type": "message_creation",
"usage": {
"completion_tokens": 0,
"prompt_tokens": 0,
"total_tokens": 0
}
}
Domain Types
Code Interpreter Logs
-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
Code Interpreter Output Image
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
Code Interpreter Tool Call
-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
Code Interpreter Tool Call Delta
-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
File Search Tool Call
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
File Search Tool Call Delta
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
Function Tool Call
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
Function Tool Call Delta
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
Message Creation Step Details
-
message_creation_step_details: object { message_creation, type }Details of the message creation by the run step.
-
message_creation: object { message_id }-
message_id: stringThe ID of the message that was created by this run step.
-
-
type: "message_creation"Always
message_creation.
-
Run Step
-
run_step: object { id, assistant_id, cancelled_at, 13 more }Represents a step in execution of a run.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
assistant_id: stringThe ID of the assistant associated with the run step.
-
cancelled_at: numberThe Unix timestamp (in seconds) for when the run step was cancelled.
-
completed_at: numberThe Unix timestamp (in seconds) for when the run step completed.
-
created_at: numberThe Unix timestamp (in seconds) for when the run step was created.
-
expired_at: numberThe Unix timestamp (in seconds) for when the run step expired. A step is considered expired if the parent run is expired.
-
failed_at: numberThe Unix timestamp (in seconds) for when the run step failed.
-
last_error: object { code, message }The last error associated with this run step. Will be
nullif there are no errors.-
code: "server_error" or "rate_limit_exceeded"One of
server_errororrate_limit_exceeded.-
"server_error" -
"rate_limit_exceeded"
-
-
message: stringA human-readable description of the error.
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.run.step"The object type, which is always
thread.run.step. -
run_id: stringThe ID of the run that this run step is a part of.
-
status: "in_progress" or "cancelled" or "failed" or 2 moreThe status of the run step, which can be either
in_progress,cancelled,failed,completed, orexpired.-
"in_progress" -
"cancelled" -
"failed" -
"completed" -
"expired"
-
-
step_details: MessageCreationStepDetails or ToolCallsStepDetailsThe details of the run step.
-
message_creation_step_details: object { message_creation, type }Details of the message creation by the run step.
-
message_creation: object { message_id }-
message_id: stringThe ID of the message that was created by this run step.
-
-
type: "message_creation"Always
message_creation.
-
-
tool_calls_step_details: object { tool_calls, type }Details of the tool call.
-
tool_calls: array of ToolCallAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
-
type: "tool_calls"Always
tool_calls.
-
-
-
thread_id: stringThe ID of the thread that was run.
-
type: "message_creation" or "tool_calls"The type of run step, which can be either
message_creationortool_calls.-
"message_creation" -
"tool_calls"
-
-
usage: object { completion_tokens, prompt_tokens, total_tokens }Usage statistics related to the run step. This value will be
nullwhile the run step's status isin_progress.-
completion_tokens: numberNumber of completion tokens used over the course of the run step.
-
prompt_tokens: numberNumber of prompt tokens used over the course of the run step.
-
total_tokens: numberTotal number of tokens used (prompt + completion).
-
-
Run Step Delta
-
run_step_delta: object { step_details }The delta containing the fields that have changed on the run step.
-
step_details: optional RunStepDeltaMessageDelta or ToolCallDeltaObjectThe details of the run step.
-
run_step_delta_message_delta: object { type, message_creation }Details of the message creation by the run step.
-
type: "message_creation"Always
message_creation. -
message_creation: optional object { message_id }-
message_id: optional stringThe ID of the message that was created by this run step.
-
-
-
tool_call_delta_object: object { type, tool_calls }Details of the tool call.
-
type: "tool_calls"Always
tool_calls. -
tool_calls: optional array of ToolCallDeltaAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
-
-
-
-
Run Step Delta Event
-
run_step_delta_event: object { id, delta, object }Represents a run step delta i.e. any changed fields on a run step during streaming.
-
id: stringThe identifier of the run step, which can be referenced in API endpoints.
-
delta: object { step_details }The delta containing the fields that have changed on the run step.
-
step_details: optional RunStepDeltaMessageDelta or ToolCallDeltaObjectThe details of the run step.
-
run_step_delta_message_delta: object { type, message_creation }Details of the message creation by the run step.
-
type: "message_creation"Always
message_creation. -
message_creation: optional object { message_id }-
message_id: optional stringThe ID of the message that was created by this run step.
-
-
-
tool_call_delta_object: object { type, tool_calls }Details of the tool call.
-
type: "tool_calls"Always
tool_calls. -
tool_calls: optional array of ToolCallDeltaAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
-
-
-
-
-
object: "thread.run.step.delta"The object type, which is always
thread.run.step.delta.
-
Run Step Delta Message Delta
-
run_step_delta_message_delta: object { type, message_creation }Details of the message creation by the run step.
-
type: "message_creation"Always
message_creation. -
message_creation: optional object { message_id }-
message_id: optional stringThe ID of the message that was created by this run step.
-
-
Run Step Include
-
run_step_include: "step_details.tool_calls[*].file_search.results[*].content""step_details.tool_calls[*].file_search.results[*].content"
Tool Call
-
tool_call: CodeInterpreterToolCall or FileSearchToolCall or FunctionToolCallDetails of the Code Interpreter tool call the run step was involved in.
-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
Tool Call Delta
-
tool_call_delta: CodeInterpreterToolCallDelta or FileSearchToolCallDelta or FunctionToolCallDeltaDetails of the Code Interpreter tool call the run step was involved in.
-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
-
Tool Call Delta Object
-
tool_call_delta_object: object { type, tool_calls }Details of the tool call.
-
type: "tool_calls"Always
tool_calls. -
tool_calls: optional array of ToolCallDeltaAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call_delta: object { index, type, id, code_interpreter }Details of the Code Interpreter tool call the run step was involved in.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call. -
id: optional stringThe ID of the tool call.
-
code_interpreter: optional object { input, outputs }The Code Interpreter tool call definition.
-
input: optional stringThe input to the Code Interpreter tool call.
-
outputs: optional array of CodeInterpreterLogs or CodeInterpreterOutputImageThe outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
code_interpreter_logs: object { index, type, logs }Text output from the Code Interpreter tool call as part of a run step.
-
index: numberThe index of the output in the outputs array.
-
type: "logs"Always
logs. -
logs: optional stringThe text output from the Code Interpreter tool call.
-
-
code_interpreter_output_image: object { index, type, image }-
index: numberThe index of the output in the outputs array.
-
type: "image"Always
image. -
image: optional object { file_id }-
file_id: optional stringThe file ID of the image.
-
-
-
-
-
-
file_search_tool_call_delta: object { file_search, index, type, id }-
file_search: unknownFor now, this is always going to be an empty object.
-
index: numberThe index of the tool call in the tool calls array.
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
-
function_tool_call_delta: object { index, type, id, function }-
index: numberThe index of the tool call in the tool calls array.
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call. -
id: optional stringThe ID of the tool call object.
-
function: optional object { arguments, name, output }The definition of the function that was called.
-
arguments: optional stringThe arguments passed to the function.
-
name: optional stringThe name of the function.
-
output: optional stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
-
-
Tool Calls Step Details
-
tool_calls_step_details: object { tool_calls, type }Details of the tool call.
-
tool_calls: array of ToolCallAn array of tool calls the run step was involved in. These can be associated with one of three types of tools:
code_interpreter,file_search, orfunction.-
code_interpreter_tool_call: object { id, code_interpreter, type }Details of the Code Interpreter tool call the run step was involved in.
-
id: stringThe ID of the tool call.
-
code_interpreter: object { input, outputs }The Code Interpreter tool call definition.
-
input: stringThe input to the Code Interpreter tool call.
-
outputs: array of object { logs, type } or object { image, type }The outputs from the Code Interpreter tool call. Code Interpreter can output one or more items, including text (
logs) or images (image). Each of these are represented by a different object type.-
logs: object { logs, type }Text output from the Code Interpreter tool call as part of a run step.
-
logs: stringThe text output from the Code Interpreter tool call.
-
type: "logs"Always
logs.
-
-
image: object { image, type }-
image: object { file_id }-
file_id: stringThe file ID of the image.
-
-
type: "image"Always
image.
-
-
-
-
type: "code_interpreter"The type of tool call. This is always going to be
code_interpreterfor this type of tool call.
-
-
file_search_tool_call: object { id, file_search, type }-
id: stringThe ID of the tool call object.
-
file_search: object { ranking_options, results }For now, this is always going to be an empty object.
-
ranking_options: optional object { ranker, score_threshold }The ranking options for the file search.
-
ranker: "auto" or "default_2024_08_21"The ranker to use for the file search. If not specified will use the
autoranker.-
"auto" -
"default_2024_08_21"
-
-
score_threshold: numberThe score threshold for the file search. All values must be a floating point number between 0 and 1.
-
-
results: optional array of object { file_id, file_name, score, content }The results of the file search.
-
file_id: stringThe ID of the file that result was found in.
-
file_name: stringThe name of the file that result was found in.
-
score: numberThe score of the result. All values must be a floating point number between 0 and 1.
-
content: optional array of object { text, type }The content of the result that was found. The content is only included if requested via the include query parameter.
-
text: optional stringThe text content of the file.
-
type: optional "text"The type of the content.
"text"
-
-
-
-
type: "file_search"The type of tool call. This is always going to be
file_searchfor this type of tool call.
-
-
function_tool_call: object { id, function, type }-
id: stringThe ID of the tool call object.
-
function: object { arguments, name, output }The definition of the function that was called.
-
arguments: stringThe arguments passed to the function.
-
name: stringThe name of the function.
-
output: stringThe output of the function. This will be
nullif the outputs have not been submitted yet.
-
-
type: "function"The type of tool call. This is always going to be
functionfor this type of tool call.
-
-
-
type: "tool_calls"Always
tool_calls.
-
Messages
List messages
$ openai beta:threads:messages list
get /threads/{thread_id}/messages
Returns a list of messages for a given thread.
Parameters
-
--thread-id: stringThe ID of the thread the messages belong to.
-
--after: optional stringA cursor for use in pagination.
afteris an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, ending with obj_foo, your subsequent call can include after=obj_foo in order to fetch the next page of the list. -
--before: optional stringA cursor for use in pagination.
beforeis an object ID that defines your place in the list. For instance, if you make a list request and receive 100 objects, starting with obj_foo, your subsequent call can include before=obj_foo in order to fetch the previous page of the list. -
--limit: optional numberA limit on the number of objects to be returned. Limit can range between 1 and 100, and the default is 20.
-
--order: optional "asc" or "desc"Sort order by the
created_attimestamp of the objects.ascfor ascending order anddescfor descending order. -
--run-id: optional stringFilter messages by the run ID that generated them.
Returns
-
ListMessagesResponse: object { data, first_id, has_more, 2 more }-
data: array of Message-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
-
first_id: string -
has_more: boolean -
last_id: string -
object: string
-
Example
openai beta:threads:messages list \
--api-key 'My API Key' \
--thread-id thread_id
Response
{
"data": [
{
"id": "id",
"assistant_id": "assistant_id",
"attachments": [
{
"file_id": "file_id",
"tools": [
{
"type": "code_interpreter"
}
]
}
],
"completed_at": 0,
"content": [
{
"image_file": {
"file_id": "file_id",
"detail": "auto"
},
"type": "image_file"
}
],
"created_at": 0,
"incomplete_at": 0,
"incomplete_details": {
"reason": "content_filter"
},
"metadata": {
"foo": "string"
},
"object": "thread.message",
"role": "user",
"run_id": "run_id",
"status": "in_progress",
"thread_id": "thread_id"
}
],
"first_id": "msg_abc123",
"has_more": false,
"last_id": "msg_abc123",
"object": "list"
}
Create message
$ openai beta:threads:messages create
post /threads/{thread_id}/messages
Create a message.
Parameters
-
--thread-id: stringThe ID of the thread to create a message for.
-
--content: string or array of MessageContentPartParamThe text contents of the message.
-
--role: "user" or "assistant"The role of the entity that is creating the message. Allowed values include:
user: Indicates the message is sent by an actual user and should be used in most cases to represent user-generated messages.assistant: Indicates the message is generated by the assistant. Use this value to insert messages from the assistant into the conversation.
-
--attachment: optional array of object { file_id, tools }A list of files attached to the message, and the tools they should be added to.
-
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
Returns
-
message: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
Example
openai beta:threads:messages create \
--api-key 'My API Key' \
--thread-id thread_id \
--content string \
--role user
Response
{
"id": "id",
"assistant_id": "assistant_id",
"attachments": [
{
"file_id": "file_id",
"tools": [
{
"type": "code_interpreter"
}
]
}
],
"completed_at": 0,
"content": [
{
"image_file": {
"file_id": "file_id",
"detail": "auto"
},
"type": "image_file"
}
],
"created_at": 0,
"incomplete_at": 0,
"incomplete_details": {
"reason": "content_filter"
},
"metadata": {
"foo": "string"
},
"object": "thread.message",
"role": "user",
"run_id": "run_id",
"status": "in_progress",
"thread_id": "thread_id"
}
Modify message
$ openai beta:threads:messages update
post /threads/{thread_id}/messages/{message_id}
Modifies a message.
Parameters
-
--thread-id: stringThe ID of the thread to which this message belongs.
-
--message-id: stringThe ID of the message to modify.
-
--metadata: optional map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
Returns
-
message: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
Example
openai beta:threads:messages update \
--api-key 'My API Key' \
--thread-id thread_id \
--message-id message_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"attachments": [
{
"file_id": "file_id",
"tools": [
{
"type": "code_interpreter"
}
]
}
],
"completed_at": 0,
"content": [
{
"image_file": {
"file_id": "file_id",
"detail": "auto"
},
"type": "image_file"
}
],
"created_at": 0,
"incomplete_at": 0,
"incomplete_details": {
"reason": "content_filter"
},
"metadata": {
"foo": "string"
},
"object": "thread.message",
"role": "user",
"run_id": "run_id",
"status": "in_progress",
"thread_id": "thread_id"
}
Retrieve message
$ openai beta:threads:messages retrieve
get /threads/{thread_id}/messages/{message_id}
Retrieve a message.
Parameters
-
--thread-id: stringThe ID of the thread to which this message belongs.
-
--message-id: stringThe ID of the message to retrieve.
Returns
-
message: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
Example
openai beta:threads:messages retrieve \
--api-key 'My API Key' \
--thread-id thread_id \
--message-id message_id
Response
{
"id": "id",
"assistant_id": "assistant_id",
"attachments": [
{
"file_id": "file_id",
"tools": [
{
"type": "code_interpreter"
}
]
}
],
"completed_at": 0,
"content": [
{
"image_file": {
"file_id": "file_id",
"detail": "auto"
},
"type": "image_file"
}
],
"created_at": 0,
"incomplete_at": 0,
"incomplete_details": {
"reason": "content_filter"
},
"metadata": {
"foo": "string"
},
"object": "thread.message",
"role": "user",
"run_id": "run_id",
"status": "in_progress",
"thread_id": "thread_id"
}
Delete message
$ openai beta:threads:messages delete
delete /threads/{thread_id}/messages/{message_id}
Deletes a message.
Parameters
-
--thread-id: stringThe ID of the thread to which this message belongs.
-
--message-id: stringThe ID of the message to delete.
Returns
-
message_deleted: object { id, deleted, object }-
id: string -
deleted: boolean -
object: "thread.message.deleted"
-
Example
openai beta:threads:messages delete \
--api-key 'My API Key' \
--thread-id thread_id \
--message-id message_id
Response
{
"id": "id",
"deleted": true,
"object": "thread.message.deleted"
}
Domain Types
Annotation
-
annotation: FileCitationAnnotation or FilePathAnnotationA citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
Annotation Delta
-
annotation_delta: FileCitationDeltaAnnotation or FilePathDeltaAnnotationA citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
File Citation Annotation
-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
File Citation Delta Annotation
-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
File Path Annotation
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
File Path Delta Annotation
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
Image File
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
Image File Content Block
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
Image File Delta
-
image_file_delta: object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
Image File Delta Block
-
image_file_delta_block: object { index, type, image_file }References an image File in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_file"Always
image_file. -
image_file: optional object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
-
Image URL
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
Image URL Content Block
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
Image URL Delta
-
image_url_delta: object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
Image URL Delta Block
-
image_url_delta_block: object { index, type, image_url }References an image URL in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_url"Always
image_url. -
image_url: optional object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
-
Message
-
message: object { id, assistant_id, attachments, 11 more }Represents a message within a thread.
-
id: stringThe identifier, which can be referenced in API endpoints.
-
assistant_id: stringIf applicable, the ID of the assistant that authored this message.
-
attachments: array of object { file_id, tools }A list of files attached to the message, and the tools they were added to.
-
file_id: optional stringThe ID of the file to attach to the message.
-
tools: optional array of CodeInterpreterTool or object { type }The tools to add this file to.
-
code_interpreter_tool: object { type }-
type: "code_interpreter"The type of tool being defined:
code_interpreter
-
-
AssistantToolsFileSearchTypeOnly: object { type }
-
-
-
completed_at: numberThe Unix timestamp (in seconds) for when the message was completed.
-
content: array of MessageContentThe content of the message in array of text and/or images.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
-
created_at: numberThe Unix timestamp (in seconds) for when the message was created.
-
incomplete_at: numberThe Unix timestamp (in seconds) for when the message was marked as incomplete.
-
incomplete_details: object { reason }On an incomplete message, details about why the message is incomplete.
-
reason: "content_filter" or "max_tokens" or "run_cancelled" or 2 moreThe reason the message is incomplete.
-
"content_filter" -
"max_tokens" -
"run_cancelled" -
"run_expired" -
"run_failed"
-
-
-
metadata: map[string]Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.
Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
-
object: "thread.message"The object type, which is always
thread.message. -
role: "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
run_id: stringThe ID of the run associated with the creation of this message. Value is
nullwhen messages are created manually using the create message or create thread endpoints. -
status: "in_progress" or "incomplete" or "completed"The status of the message, which can be either
in_progress,incomplete, orcompleted.-
"in_progress" -
"incomplete" -
"completed"
-
-
thread_id: stringThe thread ID that this message belongs to.
-
Message Content
-
message_content: ImageFileContentBlock or ImageURLContentBlock or TextContentBlock or RefusalContentBlockReferences an image File in the content of a message.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
-
Message Content Delta
-
message_content_delta: ImageFileDeltaBlock or TextDeltaBlock or RefusalDeltaBlock or ImageURLDeltaBlockReferences an image File in the content of a message.
-
image_file_delta_block: object { index, type, image_file }References an image File in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_file"Always
image_file. -
image_file: optional object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
-
-
text_delta_block: object { index, type, text }The text content that is part of a message.
-
index: numberThe index of the content part in the message.
-
type: "text"Always
text. -
text: optional object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
-
-
refusal_delta_block: object { index, type, refusal }The refusal content that is part of a message.
-
index: numberThe index of the refusal part in the message.
-
type: "refusal"Always
refusal. -
refusal: optional string
-
-
image_url_delta_block: object { index, type, image_url }References an image URL in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_url"Always
image_url. -
image_url: optional object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
-
-
Message Content Part Param
-
message_content_part_param: ImageFileContentBlock or ImageURLContentBlock or TextContentBlockParamReferences an image File in the content of a message.
-
image_file_content_block: object { image_file, type }References an image File in the content of a message.
-
image_file: object { file_id, detail }-
file_id: stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content. -
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
-
type: "image_file"Always
image_file.
-
-
image_url_content_block: object { image_url, type }References an image URL in the content of a message.
-
image_url: object { url, detail }-
url: stringThe external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh. Default value isauto-
"auto" -
"low" -
"high"
-
-
-
type: "image_url"The type of the content part.
-
-
text_content_block_param: object { text, type }The text content that is part of a message.
-
text: stringText content to be sent to the model
-
type: "text"Always
text.
-
-
Message Deleted
-
message_deleted: object { id, deleted, object }-
id: string -
deleted: boolean -
object: "thread.message.deleted"
-
Message Delta
-
message_delta: object { content, role }The delta containing the fields that have changed on the Message.
-
content: optional array of MessageContentDeltaThe content of the message in array of text and/or images.
-
image_file_delta_block: object { index, type, image_file }References an image File in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_file"Always
image_file. -
image_file: optional object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
-
-
text_delta_block: object { index, type, text }The text content that is part of a message.
-
index: numberThe index of the content part in the message.
-
type: "text"Always
text. -
text: optional object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
-
-
refusal_delta_block: object { index, type, refusal }The refusal content that is part of a message.
-
index: numberThe index of the refusal part in the message.
-
type: "refusal"Always
refusal. -
refusal: optional string
-
-
image_url_delta_block: object { index, type, image_url }References an image URL in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_url"Always
image_url. -
image_url: optional object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
-
-
-
role: optional "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
Message Delta Event
-
message_delta_event: object { id, delta, object }Represents a message delta i.e. any changed fields on a message during streaming.
-
id: stringThe identifier of the message, which can be referenced in API endpoints.
-
delta: object { content, role }The delta containing the fields that have changed on the Message.
-
content: optional array of MessageContentDeltaThe content of the message in array of text and/or images.
-
image_file_delta_block: object { index, type, image_file }References an image File in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_file"Always
image_file. -
image_file: optional object { detail, file_id }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image if specified by the user.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
file_id: optional stringThe File ID of the image in the message content. Set
purpose="vision"when uploading the File if you need to later display the file content.
-
-
-
text_delta_block: object { index, type, text }The text content that is part of a message.
-
index: numberThe index of the content part in the message.
-
type: "text"Always
text. -
text: optional object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
-
-
refusal_delta_block: object { index, type, refusal }The refusal content that is part of a message.
-
index: numberThe index of the refusal part in the message.
-
type: "refusal"Always
refusal. -
refusal: optional string
-
-
image_url_delta_block: object { index, type, image_url }References an image URL in the content of a message.
-
index: numberThe index of the content part in the message.
-
type: "image_url"Always
image_url. -
image_url: optional object { detail, url }-
detail: optional "auto" or "low" or "high"Specifies the detail level of the image.
lowuses fewer tokens, you can opt in to high resolution usinghigh.-
"auto" -
"low" -
"high"
-
-
url: optional stringThe URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.
-
-
-
-
role: optional "user" or "assistant"The entity that produced the message. One of
userorassistant.-
"user" -
"assistant"
-
-
-
object: "thread.message.delta"The object type, which is always
thread.message.delta.
-
Refusal Content Block
-
refusal_content_block: object { refusal, type }The refusal content generated by the assistant.
-
refusal: string -
type: "refusal"Always
refusal.
-
Refusal Delta Block
-
refusal_delta_block: object { index, type, refusal }The refusal content that is part of a message.
-
index: numberThe index of the refusal part in the message.
-
type: "refusal"Always
refusal. -
refusal: optional string
-
Text
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
Text Content Block
-
text_content_block: object { text, type }The text content that is part of a message.
-
text: object { annotations, value }-
annotations: array of Annotation-
file_citation_annotation: object { end_index, file_citation, start_index, 2 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
end_index: number -
file_citation: object { file_id }-
file_id: stringThe ID of the specific File the citation is from.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_citation"Always
file_citation.
-
-
file_path_annotation: object { end_index, file_path, start_index, 2 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
end_index: number -
file_path: object { file_id }-
file_id: stringThe ID of the file that was generated.
-
-
start_index: number -
text: stringThe text in the message content that needs to be replaced.
-
type: "file_path"Always
file_path.
-
-
-
value: stringThe data that makes up the text.
-
-
type: "text"Always
text.
-
Text Content Block Param
-
text_content_block_param: object { text, type }The text content that is part of a message.
-
text: stringText content to be sent to the model
-
type: "text"Always
text.
-
Text Delta
-
text_delta: object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
Text Delta Block
-
text_delta_block: object { index, type, text }The text content that is part of a message.
-
index: numberThe index of the content part in the message.
-
type: "text"Always
text. -
text: optional object { annotations, value }-
annotations: optional array of AnnotationDelta-
file_citation_delta_annotation: object { index, type, end_index, 3 more }A citation within the message that points to a specific quote from a specific File associated with the assistant or the message. Generated when the assistant uses the "file_search" tool to search files.
-
index: numberThe index of the annotation in the text content part.
-
type: "file_citation"Always
file_citation. -
end_index: optional number -
file_citation: optional object { file_id, quote }-
file_id: optional stringThe ID of the specific File the citation is from.
-
quote: optional stringThe specific quote in the file.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
file_path_delta_annotation: object { index, type, end_index, 3 more }A URL for the file that's generated when the assistant used the
code_interpretertool to generate a file.-
index: numberThe index of the annotation in the text content part.
-
type: "file_path"Always
file_path. -
end_index: optional number -
file_path: optional object { file_id }-
file_id: optional stringThe ID of the file that was generated.
-
-
start_index: optional number -
text: optional stringThe text in the message content that needs to be replaced.
-
-
-
value: optional stringThe data that makes up the text.
-
-