cs.SD - 2023-10-29

Deep Audio Analyzer: a Framework to Industrialize the Research on Audio Forensics

  • paper_url: http://arxiv.org/abs/2310.19081
  • repo_url: None
  • paper_authors: Valerio Francesco Puglisi, Oliver Giudice, Sebastiano Battiato
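  • for: This paper aims to simplify and speed up research and development in audio forensics, letting users quickly create, compare, and share results.
  • methods: The paper describes a core architecture that supports multiple audio analysis tasks, including audio feature visualization, evaluation of pretrained models, and creation of new audio analysis workflows.
  • results: With Deep Audio Analyzer, law enforcement agencies and researchers can easily evaluate the performance of pretrained models, create new audio analysis workflows, and export and share them. These capabilities speed up audio analysis experimentation and make it more reproducible.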
    Abstract Deep Audio Analyzer is an open source speech framework that aims to simplify the research and development of neural speech processing pipelines, allowing users to conceive, compare and share results in a fast and reproducible way. This paper describes the core architecture, designed to support several tasks of common interest in the audio forensics field, and shows that new tasks can be created to customize the framework. By means of Deep Audio Analyzer, forensics examiners (i.e. from Law Enforcement Agencies) and researchers will be able to visualize audio features, easily evaluate the performance of pretrained models, and create, export and share new audio analysis workflows by combining deep neural network models with a few clicks. One advantage of this tool is that it speeds up research and practical experimentation in the field of audio forensics analysis, and it also improves experimental reproducibility by letting users export and share pipelines. All features are developed in modules accessible to the user through a graphical user interface. Index Terms: Speech Processing, Deep Learning Audio, Deep Learning Audio Pipeline creation, Audio Forensics.
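    Summary Deep Audio Analyzer is an open source speech framework that aims to simplify the research and development of speech processing pipelines, letting users implement speech processing tasks quickly and compare and share results easily. The paper describes the core architecture, which supports multiple tasks in the audio forensics field, and shows that new tasks can be created, so the framework can be customized as needed. With Deep Audio Analyzer, law enforcement examiners and researchers can easily view audio features, quickly evaluate the performance of pretrained models, and create, export, and share new audio analysis workflows in just a few clicks. One advantage of the tool is that it speeds up research and practical experimentation and thereby also improves experimental reproducibility. All features are implemented in modules that the user accesses through a graphical user interface. Keywords: speech processing, deep learning audio, deep learning audio pipeline creation, audio forensics.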