A comparison of composite features under degraded speech in speaker recognition
A variety of features and their sensitivity to noise mismatch between the model and test noise conditions are assessed. The authors use speaker identification (SI) for a performance evaluation as it is very sensitive to feature changes, and propose a target for robustness in terms of matched noise c...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | eng |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | A variety of features and their sensitivity to noise mismatch between the model and test noise conditions are assessed. The authors use speaker identification (SI) for a performance evaluation as it is very sensitive to feature changes, and propose a target for robustness in terms of matched noise conditions. Two primary features, mel frequency cepstral coefficients (MFCCs) and PLP, are considered along with their RASTA and first-order regression extensions. PLP-RASTA is found to give the best resilience under cross conditions for a single feature, and the linear discriminant analysis (LDA) combination of MFCC and PLP-RASTA gives the best performance overall. Only in combined training are satisfactory results for any feature found.< > |
---|---|
ISSN: | 1520-6149 2379-190X |