Feature Derivation for Exploitation of Distant Annotation via Pattern Induction against Dependency Parses

SRI authors: ,

Citation

Freitag Dayne, Niekrasz John. Feature Derivation for Exploitation of Distant Annotation via Pattern Induction against Dependency Parses, in Proceedings of the 15th Workshop on Biomedical Natural Language Processing, Association for Computational Linguistics, pp. 36-45, August 2016.

Abstract

We consider the use of distant supervision for biological information extraction, and introduce two understudied corpora of this form, the Biological Expression Language (BEL) Large Corpus and the Pathway Logic (PL) Datum Corpus. Each resource eschews annotation at the sentence constituent level, and the PL corpus requires synthesis of information across multiple sentences to construct composite knowledge frames. Decomposing this problem into feature induction for slot-level attributes, followed by event assembly over this space of features, we introduce a novel, general-purpose pattern induction procedure, evaluating it against these two corpora, demonstrating its ability to induce effective detection against dependency parses.


Read more from SRI

  • A rendering of the Parker Solar Probe inside the sun's corona.

    Parker Solar Probe: Our closest look at the sun

    SRI imaging technology supports a record-shattering NASA mission.

  • A photo of Mary Wagner

    Recognizing the life and work of Mary Wagner 

    A cherished SRI colleague and globally respected leader in education research, Mary Wagner leaves behind an extraordinary legacy of groundbreaking work supporting children and youth with disabilities and their families.

  • Testing XRGo in a robotics laboratory

    Robots in the cleanroom

    A global health leader is exploring how SRI’s robotic telemanipulation technology can enhance pharmaceutical manufacturing.