Improving Automatic Sentence Boundary Detection with Confusion Networks


Hillard, D., Ostendorf, M., Stolcke, A., Liu, Y., & Shriberg, E. (2004). Improving automatic sentence boundary detection with confusion networks. WASHINGTON UNIV SEATTLE.


We extend existing methods for automatic sentence boundary detection by leveraging multiple recognizer hypotheses in order to provide robustness to speech recognition errors. For each hypothesized word sequence, an HMM is used to estimate the posterior probability of a sentence boundary at each word boundary. The hypotheses are combined using confusion networks to determine the overall most likely events. Experiments show improved detection of sentences for conversational telephone speech, though results are mixed for broadcast news.

Read more from SRI