Speech
Create speech
HttpResponse audio().speech().create(SpeechCreateParamsparams, RequestOptionsrequestOptions = RequestOptions.none())
post /audio/speech
Generates audio from the input text.
Returns the audio file content, or a stream of audio events.
Parameters
-
SpeechCreateParams params-
String inputThe text to generate audio for. The maximum length is 4096 characters.
-
SpeechModel modelOne of the available TTS models:
tts-1,tts-1-hd,gpt-4o-mini-tts, orgpt-4o-mini-tts-2025-12-15.-
TTS_1("tts-1") -
TTS_1_HD("tts-1-hd") -
GPT_4O_MINI_TTS("gpt-4o-mini-tts") -
GPT_4O_MINI_TTS_2025_12_15("gpt-4o-mini-tts-2025-12-15")
-
-
Voice voiceThe voice to use when generating the audio. Supported built-in voices are
alloy,ash,ballad,coral,echo,fable,onyx,nova,sage,shimmer,verse,marin, andcedar. You may also provide a custom voice object with anid, for example{ "id": "voice_1234" }. Previews of the voices are available in the Text to speech guide.-
String -
enum UnionMember1:-
ALLOY("alloy") -
ASH("ash") -
BALLAD("ballad") -
CORAL("coral") -
ECHO("echo") -
SAGE("sage") -
SHIMMER("shimmer") -
VERSE("verse") -
MARIN("marin") -
CEDAR("cedar")
-
-
class Id:Custom voice reference.
-
String idThe custom voice ID, e.g.
voice_1234.
-
-
-
Optional<String> instructionsControl the voice of your generated audio with additional instructions. Does not work with
tts-1ortts-1-hd. -
Optional<ResponseFormat> responseFormatThe format to audio in. Supported formats are
mp3,opus,aac,flac,wav, andpcm.-
MP3("mp3") -
OPUS("opus") -
AAC("aac") -
FLAC("flac") -
WAV("wav") -
PCM("pcm")
-
-
Optional<Double> speedThe speed of the generated audio. Select a value from
0.25to4.0.1.0is the default. -
Optional<StreamFormat> streamFormatThe format to stream the audio in. Supported formats are
sseandaudio.sseis not supported fortts-1ortts-1-hd.-
SSE("sse") -
AUDIO("audio")
-
-
Example
package com.openai.example;
import com.openai.client.OpenAIClient;
import com.openai.client.okhttp.OpenAIOkHttpClient;
import com.openai.core.http.HttpResponse;
import com.openai.models.audio.speech.SpeechCreateParams;
import com.openai.models.audio.speech.SpeechModel;
public final class Main {
private Main() {}
public static void main(String[] args) {
OpenAIClient client = OpenAIOkHttpClient.fromEnv();
SpeechCreateParams params = SpeechCreateParams.builder()
.input("input")
.model(SpeechModel.TTS_1)
.voice("string")
.build();
HttpResponse speech = client.audio().speech().create(params);
}
}
Domain Types
Speech Model
-
enum SpeechModel:-
TTS_1("tts-1") -
TTS_1_HD("tts-1-hd") -
GPT_4O_MINI_TTS("gpt-4o-mini-tts") -
GPT_4O_MINI_TTS_2025_12_15("gpt-4o-mini-tts-2025-12-15")
-