Modeling Pitch Range Variation Within and Across Speakers: Predicting F0 Targets when “Speaking Up”

Citation

Shriberg, E., Ladd, D. R., Terken, J., & Stolcke, A. (1996). Modeling pitch range variation within and across speakers: Predicting f 0 targets when “speaking up”. In Proceedings of the 4th international conference on spoken language processing (pp. 1-4).

Abstract

We study F0 variation produced by “speaking up”, as part of a larger study of pitch range variation within and across speakers. We provide a function to predict target F0 values in this “raised” mode from F0 values at corresponding locations in speech produced in a neutral mode. Targets were F0 measurements at points of low internal variability in read Dutch sentences produced by 15 speakers. Results showed that the speaker dependent variability was well described by an additive-multiplicative model in the linear frequency domain. Furthermore, across speakers, the additive and multiplicative parameters were negatively correlated. One free speaker-dependent parameter could therefore be eliminated by adding a single speaker-independent linear constraint.


Read more from SRI

  • Banner and attendees at the IEEE Hard Tech Venture Summit

    Cultivating hard tech startups that scale

    IEEE’s Hard Tech Venture Summit convened innovators at SRI to refine strategies and build new networks.

  • Patient going into a MRI

    Bringing surgical tools inside the MRI

    Drawing on SRI’s unique innovation ecosystem, the startup Medical Devices Corner is seeking to improve cancer surgery by advancing MRI-safe teleoperation.

  • Christopher Mims and Susan Patrick

    PARC Forum: How to AI

    The Wall Street Journal tech columnist Christopher Mims and SRI Education’s Susan Patrick discuss how AI can strengthen human agency.