Create image
images.generate(**kwargs) -> ImagesResponse
post /images/generations
Creates an image given a prompt. Learn more.
Parameters
-
prompt: StringA text description of the desired image(s). The maximum length is 32000 characters for the GPT image models, 1000 characters for
dall-e-2and 4000 characters fordall-e-3. -
background: :transparent | :opaque | :autoAllows to set transparency for the background of the generated image(s). This parameter is only supported for GPT image models that support transparent backgrounds. Must be one of
transparent,opaque, orauto(default value). Whenautois used, the model will automatically determine the best background for the image.gpt-image-2andgpt-image-2-2026-04-21do not support transparent backgrounds. Requests withbackgroundset totransparentwill return an error for these models; useopaqueorautoinstead.If
transparent, the output format needs to support transparency, so it should be set to eitherpng(default value) orwebp.-
:transparent -
:opaque -
:auto
-
-
model: String | ImageModelThe model to use for image generation. One of
dall-e-2,dall-e-3, or a GPT image model (gpt-image-1,gpt-image-1-mini,gpt-image-1.5,gpt-image-2, orgpt-image-2-2026-04-21). Defaults todall-e-2unless a parameter specific to the GPT image models is used.-
String = String -
ImageModel = :"gpt-image-1" | :"gpt-image-1-mini" | :"gpt-image-2" | 5 more-
:"gpt-image-1" -
:"gpt-image-1-mini" -
:"gpt-image-2" -
:"gpt-image-2-2026-04-21" -
:"gpt-image-1.5" -
:"chatgpt-image-latest" -
:"dall-e-2" -
:"dall-e-3"
-
-
-
moderation: :low | :autoControl the content-moderation level for images generated by the GPT image models. Must be either
lowfor less restrictive filtering orauto(default value).-
:low -
:auto
-
-
n: IntegerThe number of images to generate. Must be between 1 and 10. For
dall-e-3, onlyn=1is supported. -
output_compression: IntegerThe compression level (0-100%) for the generated images. This parameter is only supported for the GPT image models with the
webporjpegoutput formats, and defaults to 100. -
output_format: :png | :jpeg | :webpThe format in which the generated images are returned. This parameter is only supported for the GPT image models. Must be one of
png,jpeg, orwebp.-
:png -
:jpeg -
:webp
-
-
partial_images: IntegerThe number of partial images to generate. This parameter is used for streaming responses that return partial images. Value must be between 0 and 3. When set to 0, the response will be a single image sent in one streaming event.
Note that the final image may be sent before the full number of partial images are generated if the full image is generated more quickly.
-
quality: :standard | :hd | :low | 3 moreThe quality of the image that will be generated.
-
auto(default value) will automatically select the best quality for the given model. -
high,mediumandloware supported for the GPT image models. -
hdandstandardare supported fordall-e-3. -
standardis the only option fordall-e-2. -
:standard -
:hd -
:low -
:medium -
:high -
:auto
-
-
response_format: :url | :b64_jsonThe format in which generated images with
dall-e-2anddall-e-3are returned. Must be one ofurlorb64_json. URLs are only valid for 60 minutes after the image has been generated. This parameter isn't supported for the GPT image models, which always return base64-encoded images.-
:url -
:b64_json
-
-
size: String | :auto | :"1024x1024" | :"1536x1024" | 5 moreThe size of the generated images. For
gpt-image-2andgpt-image-2-2026-04-21, arbitrary resolutions are supported asWIDTHxHEIGHTstrings, for example1536x864. Width and height must both be divisible by 16 and the requested aspect ratio must be between 1:3 and 3:1. Resolutions above2560x1440are experimental, and the maximum supported resolution is3840x2160. The requested size must also satisfy the model's current pixel and edge limits. The standard sizes1024x1024,1536x1024, and1024x1536are supported by the GPT image models;autois supported for models that allow automatic sizing. Fordall-e-2, use one of256x256,512x512, or1024x1024. Fordall-e-3, use one of1024x1024,1792x1024, or1024x1792.-
String = String -
Size = :auto | :"1024x1024" | :"1536x1024" | 5 moreThe size of the generated images. For
gpt-image-2andgpt-image-2-2026-04-21, arbitrary resolutions are supported asWIDTHxHEIGHTstrings, for example1536x864. Width and height must both be divisible by 16 and the requested aspect ratio must be between 1:3 and 3:1. Resolutions above2560x1440are experimental, and the maximum supported resolution is3840x2160. The requested size must also satisfy the model's current pixel and edge limits. The standard sizes1024x1024,1536x1024, and1024x1536are supported by the GPT image models;autois supported for models that allow automatic sizing. Fordall-e-2, use one of256x256,512x512, or1024x1024. Fordall-e-3, use one of1024x1024,1792x1024, or1024x1792.-
:auto -
:"1024x1024" -
:"1536x1024" -
:"1024x1536" -
:"256x256" -
:"512x512" -
:"1792x1024" -
:"1024x1792"
-
-
-
stream: boolGenerate the image in streaming mode. Defaults to
false. See the Image generation guide for more information. This parameter is only supported for the GPT image models. -
style: :vivid | :naturalThe style of the generated images. This parameter is only supported for
dall-e-3. Must be one ofvividornatural. Vivid causes the model to lean towards generating hyper-real and dramatic images. Natural causes the model to produce more natural, less hyper-real looking images.-
:vivid -
:natural
-
-
user: StringA unique identifier representing your end-user, which can help OpenAI to monitor and detect abuse. Learn more.
Returns
-
class ImagesResponseThe response from the image generation endpoint.
-
created: IntegerThe Unix timestamp (in seconds) of when the image was created.
-
background: :transparent | :opaqueThe background parameter used for the image generation. Either
transparentoropaque.-
:transparent -
:opaque
-
-
data: Array[Image]The list of generated images.
-
b64_json: StringThe base64-encoded JSON of the generated image. Returned by default for the GPT image models, and only present if
response_formatis set tob64_jsonfordall-e-2anddall-e-3. -
revised_prompt: StringFor
dall-e-3only, the revised prompt that was used to generate the image. -
url: StringWhen using
dall-e-2ordall-e-3, the URL of the generated image ifresponse_formatis set tourl(default value). Unsupported for the GPT image models.
-
-
output_format: :png | :webp | :jpegThe output format of the image generation. Either
png,webp, orjpeg.-
:png -
:webp -
:jpeg
-
-
quality: :low | :medium | :highThe quality of the image generated. Either
low,medium, orhigh.-
:low -
:medium -
:high
-
-
size: :"1024x1024" | :"1024x1536" | :"1536x1024"The size of the image generated. Either
1024x1024,1024x1536, or1536x1024.-
:"1024x1024" -
:"1024x1536" -
:"1536x1024"
-
-
usage: Usage{ input_tokens, input_tokens_details, output_tokens, 2 more}For
gpt-image-1only, the token usage information for the image generation.-
input_tokens: IntegerThe number of tokens (images and text) in the input prompt.
-
input_tokens_details: InputTokensDetails{ image_tokens, text_tokens}The input tokens detailed information for the image generation.
-
image_tokens: IntegerThe number of image tokens in the input prompt.
-
text_tokens: IntegerThe number of text tokens in the input prompt.
-
-
output_tokens: IntegerThe number of output tokens generated by the model.
-
total_tokens: IntegerThe total number of tokens (images and text) used for the image generation.
-
output_tokens_details: OutputTokensDetails{ image_tokens, text_tokens}The output token details for the image generation.
-
image_tokens: IntegerThe number of image output tokens generated by the model.
-
text_tokens: IntegerThe number of text output tokens generated by the model.
-
-
-
Example
require "openai"
openai = OpenAI::Client.new(api_key: "My API Key")
images_response = openai.images.generate(prompt: "A cute baby sea otter")
puts(images_response)
Response
{
"created": 0,
"background": "transparent",
"data": [
{
"b64_json": "b64_json",
"revised_prompt": "revised_prompt",
"url": "https://example.com"
}
],
"output_format": "png",
"quality": "low",
"size": "1024x1024",
"usage": {
"input_tokens": 0,
"input_tokens_details": {
"image_tokens": 0,
"text_tokens": 0
},
"output_tokens": 0,
"total_tokens": 0,
"output_tokens_details": {
"image_tokens": 0,
"text_tokens": 0
}
}
}