Google 學術搜尋

Multisv: Dataset for far-field multi-channel speaker verification

L Mošner, O Plchot, L Burget… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

L Mošner, O Plchot, L Burget, JH Černocký

ICASSP 2022-2022 IEEE international conference on acoustics …, 2022•ieeexplore.ieee.org

Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive corpus designed for training and evaluating text-independent multi-channel speaker verification systems. It can be readily used also for experiments with dereverberation, denoising, and speech enhancement. We tackled the ever-present problem of the lack of multi-channel training data by utilizing data simulation on top of clean parts of the Voxceleb corpus. The development and evaluation trials are based on a retransmitted Voices Obscured in Complex Environmental Settings (VOiCES) corpus, which we modified to provide multi-channel trials. We publish full recipes that create the dataset from public sources as the MultiSV dataset, and we provide results with two of our multi-channel speaker verification systems with neural network-based beamforming based either on predicting ideal binary masks or the more recent Conv-TasNet.

ieeexplore.ieee.org

顯示更多顯示較少

儲存引用被引用 10 次相關文章全部共 3 個版本

引用

進階搜尋

已儲存至「我的圖書館」

Multisv: Dataset for far-field multi-channel speaker verification