Using Prosodic and Spectral Features in Detecting Depression in Elderly Males

Citation

M. H. Sanchez, D. Vergyri, L. Ferrer, C. Richey, P. Garcia, B. Knoth and W. Jarrold, “Using prosodic and spectral features in  detecting depression in elderly males,” in Proc. Interspeech, 2011, pp. 3001–3004.

Abstract

As research in speech processing has matured, there has been much interest in paralinguistic speech processing problems including the speaker’s mental and psychological health.  In this study, we focus on speech features that can identify the speaker’s emotional health, i.e., whether the speaker is depressed or not.  We use prosodic speech measurements, such as pitch and energy, in addition to spectral features, such as formants and spectral tilt, and compute statistics of these features over different regions of the speech signal.  These statistics are used as input features to a discriminative classifier that predicts the speaker’s depression state.  We find that with an N-fold leave-one-out cross-validation setup, we can achieve a prediction accuracy of 81.3%, where random guess is 50%.


Read more from SRI

  • surgeons around a surgical robot

    The SRI research behind today’s surgical robotics

    Intuitive’s da Vinci 5 system represents a major leap in robotic-assisted medicine. It all started at SRI, which continues to advance teleoperation technologies.

  • a collage of digital graphs

    A banner year for quantum

    SRI-managed QED-C’s annual report on quantum trends captures an industry accelerating rapidly from technical promise toward major global impact.

  • ICE Cube containing SRI’s aerogel experiment, photographed prior to launch. Source: Aerospace Applications North America

    An SRI carbon capture experiment launches into space

    By synthesizing carbon-absorbing aerogels in microgravity, SRI research will give us a rare glimpse into how these materials could be radically improved.