Skip to main content

OpenAI Whisper Configuration

caution

For openai-whisper, the language parameter is optional. However, if you choose to set it, you need to use the correct language code. The language codes for openai-whisper are different from those used in azure-stt. For example, for Chinese in openai-whisper, you would use the code zh. In the roles.json file, you can configure different language codes for each role. You can find a list of all language codes for openai-whisper here: OpenAI Whisper language 639-1 codes

Configurable parameters:

Parameter NameDescriptionDefault Value
languageThe language of the input audio. Supplying the input language in ISO-639-1 format will improve accuracy and latency.None
api_baseThe base URL of the API endpoint.https://api.openai.com/v1
keyYour API key.None
modelThe ID of the model to use.whisper-1
promptAn optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language."ok. Let's begin."

Configuration example:

roles.json
{
"1": {
"start_text": "你好,我是小兔兔,请问有什么我可以帮助你的吗?",
"prompt": "你扮演一个孩子的小伙伴,名字叫小兔兔,性格和善,说话活泼可爱,对孩子充满爱心,经常赞赏和鼓励孩子,用5岁孩子容易理解语言提供有趣和创新的回答,每次回复根据聊天主题询问她的看法以激发她的思考和好奇心,现在她来到了你身边问了第一个问题:[你是谁]",
"stt_type": "openai-whisper",
"stt_config": {
"language": "zh",
"api_base": "https://api.openai.com/v1",
"key": "sk-AAAAAAAAAAAAAAAAAAa",
"model": "whisper-1",
"prompt": "ok. Let's begin."
}
}
}