Loading…

A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is mo...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2023, Vol.31, p.3000-3014
Main Authors:	Liu, Tongzheng, Lu, Zhihua, da Costa, Joao Paulo J., Fei, Tai
Format:	Article
Language:	English
Subjects:	Analytical models blind source separation Coherence Computational efficiency Computational modeling Covariance matrices dereverberation Linear prediction Mathematical models microphone array multichannel nonnegative matrix factorization Parameters Predictive models Reverberation Reverberation model Reverberation time Separation Spatial coherence Speech speech separation
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.
ISSN:	2329-9290 2329-9304
DOI:	10.1109/TASLP.2023.3301227