Commit 3fabd95

author
feizi.wx
committed
update md
1 parent dc1a813 commit 3fabd95

1 file changed

+9
-0
lines changed

docs/source/en/model_doc/qwen2_5_omni.md

+9
@@ -175,6 +175,15 @@ print(text)

### Usage Tips

#### Prompt for audio output
If users need audio output, the system prompt must be set to "You are Qwen, a virtual human developed by the Qwen Team, Alibaba Group, capable of perceiving auditory and visual inputs, as well as generating text and speech."; otherwise, the audio output may not work as expected.
```
{
    "role": "system",
    "content": "You are Qwen, a virtual human developed by the Qwen Team, Alibaba Group, capable of perceiving auditory and visual inputs, as well as generating text and speech.",
}
```
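
To show where this system message sits in a full conversation, here is a minimal sketch. `AUDIO_SYSTEM_PROMPT` and `build_conversation` are hypothetical names introduced for illustration, and passing the user turn as a plain string is an assumption; multimodal inputs may need a different content structure.

```python
# System prompt quoted verbatim from the tip above; needed when audio output is wanted.
AUDIO_SYSTEM_PROMPT = (
    "You are Qwen, a virtual human developed by the Qwen Team, Alibaba Group, "
    "capable of perceiving auditory and visual inputs, as well as generating "
    "text and speech."
)


def build_conversation(user_text):
    """Hypothetical helper: put the audio-output system message before one user turn."""
    return [
        {"role": "system", "content": AUDIO_SYSTEM_PROMPT},
        # Assumption: a plain-string user message, for illustration only.
        {"role": "user", "content": user_text},
    ]


conversation = build_conversation("Please introduce yourself.")
```

Keeping the prompt in a single constant avoids accidental edits to its wording, which the tip above says can affect audio output.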

#### Use audio output or not

The model supports both text and audio outputs. If users do not need audio output, they can set `enable_audio_output=False` in the `from_pretrained` function. This option saves about 2GB of GPU memory, but the `return_audio` option of the `generate` function can then only be set to `False`.
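
The coupling between the two options can be sketched as a small helper that keeps the load-time and generate-time arguments consistent. `audio_kwargs` is a hypothetical function, not part of the library; only the keyword names `enable_audio_output` and `return_audio` come from the text above, and the model class is left as a placeholder in the comments.

```python
def audio_kwargs(need_audio):
    """Hypothetical helper: return (load_kwargs, generate_kwargs) that agree with each other."""
    # Disabling audio output at load time is what saves the GPU memory.
    load_kwargs = {"enable_audio_output": need_audio}
    # With audio output disabled, return_audio may only be False.
    generate_kwargs = {"return_audio": need_audio}
    return load_kwargs, generate_kwargs


load_kwargs, generate_kwargs = audio_kwargs(need_audio=False)

# Intended use (placeholders, not executed here):
#   model = ModelClass.from_pretrained(checkpoint, **load_kwargs)
#   output = model.generate(**inputs, **generate_kwargs)
```

Deriving both dictionaries from one flag makes it harder to request `return_audio=True` from a model that was loaded without audio output.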
