Skip to content

Automatic Audio Feedback on Silence in VoicePipeline #489

Open
@joowon-dm-snu

Description

@joowon-dm-snu

Please read this first

  • Have you read the docs?Agents SDK docs
  • Have you searched for related issues? Others may have had similar requests

Describe the feature

While working on the VoicePipeline—as with my previous issue #488 —I’ve identified a scenario where the voice agent may become unresponsive due to inadequate speech recognition.
In practice, even if a user speaks, there are instances when the transcription session (transcription_session.transcribe_turns()) does not get triggered, resulting in an extended period of silence.

To mitigate this issue, I propose that when a turn concludes, a separate thread should be initiated. Within this thread, if no new turn is started within 3 seconds, the system should automatically stream a fallback message, such as:

"Excuse me, it seems the audio was interrupted. Could you please repeat that?"

This feature would provide a smoother conversational experience by proactively addressing potential audio dropouts or misrecognitions.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions