Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks


Hakkani-Tür, D., Tur, G., & Levit, M. (2007). Exploiting information extraction annotations for document retrieval in distillation tasks. In Eighth Annual Conference of the International Speech Communication Association.


Information distillation aims to extract relevant pieces of information related to a given query from massive, possibly multilingual, audio and textual document sources. In this paper, we present our approach for using information extraction annotations to augment document retrieval for distillation. We take advantage of the fact that some of the distillation queries can be associated with annotation elements introduced for the NIST Automatic Content Extraction (ACE) task. We experimentally show that using the ACE events to constrain the document set returned by an information retrieval engine significantly improves the precision at various recall rates for two different query templates.

Index Terms: information distillation, information retrieval, information extraction, document retrieval

Read more from SRI