OpenAI Whisper Configuration

caution

For openai-whisper, the language parameter is optional. However, if you choose to set it, you need to use the correct language code. The language codes for openai-whisper are different from those used in azure-stt. For example, for Chinese in openai-whisper, you would use the code zh. In the roles.json file, you can configure different language codes for each role. You can find a list of all language codes for openai-whisper here: OpenAI Whisper language 639-1 codes

Configurable parameters:

Parameter Name	Description	Default Value
language	The language of the input audio. Supplying the input language in ISO-639-1 format will improve accuracy and latency.	None
api_base	The base URL of the API endpoint.	https://api.openai.com/v1
key	Your API key.	None
model	The ID of the model to use.	whisper-1
prompt	An optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.	"ok. Let's begin."

Configuration example:

roles.json
{
 "1": {  
     "start_text": "你好，我是小兔兔，请问有什么我可以帮助你的吗？",
     "prompt": "你扮演一个孩子的小伙伴，名字叫小兔兔，性格和善，说话活泼可爱，对孩子充满爱心，经常赞赏和鼓励孩子，用5岁孩子容易理解语言提供有趣和创新的回答，每次回复根据聊天主题询问她的看法以激发她的思考和好奇心，现在她来到了你身边问了第一个问题:[你是谁]",
     "stt_type": "openai-whisper",
     "stt_config": {
       "language": "zh",
       "api_base": "https://api.openai.com/v1",
       "key": "sk-AAAAAAAAAAAAAAAAAAa",
       "model": "whisper-1",
       "prompt": "ok. Let's begin."
     }
 }
}