Skip to content

Research
Commercialization
About
- People
- News & Stories
- Events
- Our History
- Contact
Innovating In
Careers
日本支社

Research
Commercialization
About
- People
- News & Stories
- Events
- Our History
- Contact
Innovating In
Careers
日本支社

Search sri.com

January 1, 1998

Multistrategy learning for information extraction

Citation

Freitag D. Multistrategy learning for information extraction, in Proceedings of ICML 98, 1998.

Abstract

Information extraction (IE) is the problem of filling out predefined structured summaries from text documents. We are interested in performing IE in nontraditional domains where much of the text is often ungrammatical such as electronic bulletin board posts and Web pages. We suggest that the best approach is one that takes into account many different kinds of information and argue for the suitability of a multistrategy approach We describe learners for IE drawn from three separate machine learning paradigms: rote memorization, termspace text classification and relational rule induction. By building regression models mapping from learner confidence to probability of correctness and combining probabilities appropriately it is possible to improve extraction accuracy over that achieved by any individual learner. We describe three different multistrategy approaches. Experiments on two IE domains a collection of electronic seminar announcements from a university computer science department and a set of newswire articles describing corporate acquisitions from the Reuters collection demonstrate the effectiveness of all three approaches.

↓ Review online

Read more from SRI

July 14, 2026

A thousand qubits in bloom, now let’s scale

Standardization is the way to the first quantum computer.
July 9, 2026

SRI’s AI platform is rewriting rural healthcare

No hospital? No problem. SRI’s data fix could save rural medicine.
July 7, 2026

Tokyu Land Corporation and SRI Advance Japan’s “Global Startup Campus” Initiative

Collaboration pairs SRI’s Silicon Valley venture-building expertise with TLC’s Shibuya real estate network to grow Japan’s deep tech startup ecosystem.

Join Our Team

Build your own legacy

Explore careers

Hire Us

Solutions to your most complex challenges

Send an inquiry

Contact Us

General inquiries

Get the latest news from SRI

Commercialization

Media Inquiries

333 Ravenswood Ave
Menlo Park, CA 94025 USA

+1 (650) 859-2000

© 2026 SRI INTERNATIONAL

Manage Cookie Consent

To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.

Functional Functional Always active

The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.

Preferences Preferences

The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.

Statistics Statistics

The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.

Marketing Marketing

The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.

Manage options
Manage services
Manage {vendor_count} vendors
Read more about these purposes

View preferences

{title}
{title}
{title}