S. S. Kajarekar, “Across-phone variability and diagonal term in joint factor analysis,” in Proc. 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 4406–4409.
We investigate usefulness of across-phone variability for speaker recognition in a joint factor analysis (JFA) framework. We estimate the variability as across-phone covariance within a conversation side averaged over all conversations. Note that it is a part of channel variability in the current JFA framework. We independently estimate feature subspaces representing across-phone, speaker and channel variability and perform speaker recognition experiments by either keeping them or removing them. The results show that the across-phone subspace is more correlated with the speaker subspace. We also perform speaker recognition experiments when combining the subspaces. Results show an improvement when phone and speaker subspaces are combined. This shows that across-phone variability is useful for speaker recognition. Further experiments show that the results are affected by a diagonal term from JFA. […]