cs.SD - 2023-10-31

Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation

  • paper_url: http://arxiv.org/abs/2310.20238
  • repo_url: None
  • paper_authors: Yanir Maymon, Israel Nelken, Boaz Rafaely
  • for: 这 paper 是为了研究听话器位置Estimation(SPL)技术,特别是在语音沟通、视频会议和机器人听力方面。
  • methods: 这 paper 提出了一些人类听力的听话器位置Estimation(SPL)技术,包括使用快速 Fourier transform(STFT)和Head Related Transfer Function(HRTF)集来实现方向性搜索。
  • results: 实验和 simulations 表明,提出的方法可以和现有方法相比,并且在两种 binAural DOA estimation 方法上进行了应用。
    Abstract Speaker localization for binaural microphone arrays has been widely studied for applications such as speech communication, video conferencing, and robot audition. Many methods developed for this task, including the direct path dominance (DPD) test, share common stages in their processing, which include transformation using the short-time Fourier transform (STFT), and a direction of arrival (DOA) search that is based on the head related transfer function (HRTF) set. In this paper, alternatives to these processing stages, motivated by human hearing, are proposed. These include incorporating an auditory filter bank to replace the STFT, and a new DOA search based on transformed HRTF as steering vectors. A simulation study and an experimental study are conducted to validate the proposed alternatives, and both are applied to two binaural DOA estimation methods; the results show that the proposed method compares favorably with current methods.
    摘要 喇叭识别技术对双耳麦克频率阵列进行广泛研究,用于语音通信、视频会议和机器听觉等应用。许多这些技术的处理阶段相似,包括使用短时傅立叶变换(STFT)和基于头相关传输函数(HRTF)的方向来源搜索。本文提出了基于人类听觉的代替方案,包括使用听觉滤波器阵列取代STFT,以及基于变换HRTF的新的方向来源搜索方法。我们进行了一个 simulated study 和一个实验研究,以验证提议的方法,并应用于两种双耳DOA估计方法。结果表明,提议的方法与当前方法相比,具有竞争力。