Speech

Create speech

audio.speech.create(SpeechCreateParams**kwargs) -> BinaryResponseContent

post /audio/speech

Generates audio from the input text.

Returns the audio file content, or a stream of audio events.

Parameters

input: str

The text to generate audio for. The maximum length is 4096 characters.
model: Union[str, SpeechModel]

One of the available TTS models: tts-1, tts-1-hd, gpt-4o-mini-tts, or gpt-4o-mini-tts-2025-12-15.
- str
- Literal["tts-1", "tts-1-hd", "gpt-4o-mini-tts", "gpt-4o-mini-tts-2025-12-15"]
  - "tts-1"
  - "tts-1-hd"
  - "gpt-4o-mini-tts"
  - "gpt-4o-mini-tts-2025-12-15"
voice: Voice

The voice to use when generating the audio. Supported built-in voices are alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, verse, marin, and cedar. You may also provide a custom voice object with an id, for example { "id": "voice_1234" }. Previews of the voices are available in the Text to speech guide.
- str
- Literal["alloy", "ash", "ballad", 7 more]
  - "alloy"
  - "ash"
  - "ballad"
  - "coral"
  - "echo"
  - "sage"
  - "shimmer"
  - "verse"
  - "marin"
  - "cedar"
- class VoiceID: …
  
  Custom voice reference.
  - id: str
    
    The custom voice ID, e.g. voice_1234.
instructions: Optional[str]

Control the voice of your generated audio with additional instructions. Does not work with tts-1 or tts-1-hd.
response_format: Optional[Literal["mp3", "opus", "aac", 3 more]]

The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.
- "mp3"
- "opus"
- "aac"
- "flac"
- "wav"
- "pcm"
speed: Optional[float]

The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
stream_format: Optional[Literal["sse", "audio"]]

The format to stream the audio in. Supported formats are sse and audio. sse is not supported for tts-1 or tts-1-hd.
- "sse"
- "audio"

Returns

BinaryResponseContent

Example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("OPENAI_API_KEY"),  # This is the default and can be omitted
)
speech = client.audio.speech.create(
    input="input",
    model="string",
    voice="string",
)
print(speech)
content = speech.read()
print(content)

Example

from pathlib import Path
import openai

speech_file_path = Path(__file__).parent / "speech.mp3"
with openai.audio.speech.with_streaming_response.create(
  model="gpt-4o-mini-tts",
  voice="alloy",
  input="The quick brown fox jumped over the lazy dog."
) as response:
  response.stream_to_file(speech_file_path)

Domain Types

Speech Model

Literal["tts-1", "tts-1-hd", "gpt-4o-mini-tts", "gpt-4o-mini-tts-2025-12-15"]
- "tts-1"
- "tts-1-hd"
- "gpt-4o-mini-tts"
- "gpt-4o-mini-tts-2025-12-15"

python/resources/audio/subresources/speech/index.md +153 −0 created

1# Speech

3## Create speech

5`audio.speech.create(SpeechCreateParams**kwargs) -> BinaryResponseContent`

7**post** `/audio/speech`

9Generates audio from the input text.

11Returns the audio file content, or a stream of audio events.

13### Parameters

15- `input: str`

17 The text to generate audio for. The maximum length is 4096 characters.

19- `model: Union[str, SpeechModel]`

21 One of the available [TTS models](https://platform.openai.com/docs/models#tts): `tts-1`, `tts-1-hd`, `gpt-4o-mini-tts`, or `gpt-4o-mini-tts-2025-12-15`.

23 - `str`

25 - `Literal["tts-1", "tts-1-hd", "gpt-4o-mini-tts", "gpt-4o-mini-tts-2025-12-15"]`

27 - `"tts-1"`

29 - `"tts-1-hd"`

31 - `"gpt-4o-mini-tts"`

33 - `"gpt-4o-mini-tts-2025-12-15"`

35- `voice: Voice`

37 The voice to use when generating the audio. Supported built-in voices are `alloy`, `ash`, `ballad`, `coral`, `echo`, `fable`, `onyx`, `nova`, `sage`, `shimmer`, `verse`, `marin`, and `cedar`. You may also provide a custom voice object with an `id`, for example `{ "id": "voice_1234" }`. Previews of the voices are available in the [Text to speech guide](https://platform.openai.com/docs/guides/text-to-speech#voice-options).

39 - `str`

41 - `Literal["alloy", "ash", "ballad", 7 more]`

43 - `"alloy"`

45 - `"ash"`

47 - `"ballad"`

49 - `"coral"`

51 - `"echo"`

53 - `"sage"`

55 - `"shimmer"`

57 - `"verse"`

59 - `"marin"`

61 - `"cedar"`

63 - `class VoiceID: …`

65 Custom voice reference.

67 - `id: str`

69 The custom voice ID, e.g. `voice_1234`.

71- `instructions: Optional[str]`

73 Control the voice of your generated audio with additional instructions. Does not work with `tts-1` or `tts-1-hd`.

75- `response_format: Optional[Literal["mp3", "opus", "aac", 3 more]]`

77 The format to audio in. Supported formats are `mp3`, `opus`, `aac`, `flac`, `wav`, and `pcm`.

79 - `"mp3"`

81 - `"opus"`

83 - `"aac"`

85 - `"flac"`

87 - `"wav"`

89 - `"pcm"`

91- `speed: Optional[float]`

93 The speed of the generated audio. Select a value from `0.25` to `4.0`. `1.0` is the default.

95- `stream_format: Optional[Literal["sse", "audio"]]`

97 The format to stream the audio in. Supported formats are `sse` and `audio`. `sse` is not supported for `tts-1` or `tts-1-hd`.

99 - `"sse"`

100

101 - `"audio"`

102

103### Returns

104

105- `BinaryResponseContent`

106

107### Example

108

109```python

110import os

111from openai import OpenAI

112

113client = OpenAI(

114 api_key=os.environ.get("OPENAI_API_KEY"), # This is the default and can be omitted

115)

116speech = client.audio.speech.create(

117 input="input",

118 model="string",

119 voice="string",

120)

121print(speech)

122content = speech.read()

123print(content)

124```

125

126### Example

127

128```python

129from pathlib import Path

130import openai

131

132speech_file_path = Path(__file__).parent / "speech.mp3"

133with openai.audio.speech.with_streaming_response.create(

134 model="gpt-4o-mini-tts",

135 voice="alloy",

136 input="The quick brown fox jumped over the lazy dog."

137) as response:

138 response.stream_to_file(speech_file_path)

139```

140

141## Domain Types

142

143### Speech Model

144

145- `Literal["tts-1", "tts-1-hd", "gpt-4o-mini-tts", "gpt-4o-mini-tts-2025-12-15"]`

146

147 - `"tts-1"`

148

149 - `"tts-1-hd"`

150

151 - `"gpt-4o-mini-tts"`

152

153 - `"gpt-4o-mini-tts-2025-12-15"`

python/resources/audio/subresources/speech/index.md 2026-05-02 05:57 UTC to 2026-05-05 23:00 UTC

Speech

Create speech

Parameters

Returns

Example

Example

Domain Types

Speech Model