Skip to content

单声道和多声道音频识别结果差异比较 #211

Open
@qiutzh

Description

@qiutzh

❓ Questions and Help

What is your question?

您好,请问老师有对比asr模型对单声道音频和多声道音频的识别结果差异吗?
我用客服对话数据做了一些测试,发现有些情况(比如双声道一些音频)识别结果还行,有的情况(比如一些单声道音频)识别结果会比较差。
请问老师,公开small版本asr模型,哪些场景下的asr识别结果效果比较不错呢?edu行业客服对话数据是否适合。
代码中,是否支持设置要识别的声道呢?(如识别所有声道,或者识别第1个声道)。感谢!

Code

no code

What have you tried?

What's your environment?

  • OS (e.g., Linux): ubuntu20.04
  • FunASR Version (e.g., 1.0.0): 1.2.6
  • ModelScope Version (e.g., 1.11.0): 1.24.1
  • PyTorch Version (e.g., 2.0.0): 2.5.1
  • How you installed funasr (pip, source): pip
  • Python version: 3.12
  • GPU (e.g., V100M32): rtx4090d
  • CUDA/cuDNN version (e.g., cuda11.7): cuda11.8
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1): no
  • Any other relevant information:

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions