A suggested feature: adding voice activity detection to audio_input

Right now, when using audio_input to interact with an AI agent, we need to press the record button to start recording audio. It would be great to have a keyword like VAD_threshold that enables voice activity detection, so that recording starts automatically when the input level exceeds the threshold. When the threshold is 0, VAD is disabled and we still need to press the button to start recording.
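
To make the proposed semantics concrete, here is a minimal sketch of how such a threshold might behave. Everything in it is an assumption for illustration: `frame_rms` and `should_record` are hypothetical helper names, the samples are assumed to be floats in [-1.0, 1.0], and a simple RMS energy check stands in for whatever detection method a real implementation would use. It is not how audio_input works today.

```python
import math
from typing import Sequence


def frame_rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def should_record(frame: Sequence[float], vad_threshold: float) -> bool:
    """Decide whether this frame would trigger recording (hypothetical helper).

    A threshold of 0 (or below) means VAD is disabled: this never triggers,
    so the record button is still required.
    """
    if vad_threshold <= 0:
        return False
    return frame_rms(frame) > vad_threshold


# Illustrative frames: pure silence and a loud alternating signal.
silence = [0.0] * 160
speech = [0.5, -0.5] * 80

print(should_record(speech, 0.1))   # loud frame, VAD enabled
print(should_record(silence, 0.1))  # silent frame, VAD enabled
print(should_record(speech, 0))     # VAD disabled
```

A real detector would probably also need some smoothing over several frames so that a single loud click does not start a recording, but the on/off behavior around 0 would stay the same.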
