A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification


Ferrer, L., Sönmez, M. K., & Shriberg, E. (2007, December). A smoothing kernel for spatially related features and its application to speaker verification. In INTERSPEECH (pp. 738-741).


Most commonly used kernels are invariant to permutations of the feature vector components. This characteristic may make machine learning methods that use such kernels suboptimal in cases where the feature vector has an underlying structure. In this paper we will consider one such case, where the features are spatially related. We show a way to modify the objective function of the support vector machine (SVM) optimization problem to account for this structure. The new optimization problem can be implemented as a standard SVM using a particular smoothing kernel. Results are shown on a speaker verification task using prosodic features that are transformed using a particular implementation of the Fisher score. The proposed method leads to improvements of as much as 15% in equal error rate (EER).

Index Terms: Support Vector Machines, Kernels, Smoothing, Speaker Recognition, Speaker Verification.

Read more from SRI