Embedded Speech Recognition Applications in Mobile Phones: Status, Trends, and Challenges

Citation

J. Cohen, “Embedded speech recognition applications in mobile phones: Status, trends, and challenges,” 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, pp. 5352-5355, doi: 10.1109/ICASSP.2008.4518869.

Abstract

Voice centric interfaces are widely available in modern mobile phones, including low-cost versions. The applications have evolved from speaker-dependent name dialing, which require user enrollment of frequently dialed names, to speaker-independent capabilities including continuous digit dialing, command and control of phone functions, and name dialing directly from the phone’s contacts directory. Recently available advances include capabilities like voice-enabled SMS, e-mail, and even mobile search with voice. This evolution has been enabled by advances in speech recognition robustness, network capabilities, and increased computational power in small devices. Systems may now be used in hands-busy/eyes-busy conditions including speakerphone and bluetooth scenarios. In this paper, we will provide an overview of embedded speech recognition centric applications in mobile phones, specifically focusing on current status, industry trends, and challenges in customer acceptance. Although voice interfaces are natural and attractive in theory a majority of users do not use the voice-enabled features available in their mobile phones. We will discuss some of the reasons for this user behavior and recommend actions to be taken.


Read more from SRI

  • Banner and attendees at the IEEE Hard Tech Venture Summit

    Cultivating hard tech startups that scale

    IEEE’s Hard Tech Venture Summit convened innovators at SRI to refine strategies and build new networks.

  • Patient going into a MRI

    Bringing surgical tools inside the MRI

    Drawing on SRI’s unique innovation ecosystem, the startup Medical Devices Corner is seeking to improve cancer surgery by advancing MRI-safe teleoperation.

  • Christopher Mims and Susan Patrick

    PARC Forum: How to AI

    The Wall Street Journal tech columnist Christopher Mims and SRI Education’s Susan Patrick discuss how AI can strengthen human agency.