Publication
JCIS 2000
Conference paper
Extracting Information from Text
Abstract
A domain-independent information extraction system can be built if it includes a self-modifying feature that enables it to adapt automatically to new domains. The premise is that the user knows the peculiarities of the domain and can quickly train the system to gather the specific target facts. Training consists of scanning a series of example articles in a special text-processing interface while the user leads the system through the desired retrieval steps. The system then acquires generalized patterns and applies them to any large text database to extract the information of interest.
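The train-then-apply loop described in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not say how patterns are represented or generalized, so this sketch assumes, purely for illustration, that a pattern is a regular expression built from the words immediately surrounding a target the user has marked. The names `learn_pattern`, `extract`, and `window`, and the example sentences, are hypothetical.

```python
import re


def learn_pattern(sentence, target, window=1):
    """Generalize one user-marked example into a regex with a capture slot.

    Assumption (not from the paper): the `window` words on each side of the
    target are kept as literal context, and the target becomes a wildcard.
    """
    start = sentence.index(target)  # raises ValueError if target is absent
    left = sentence[:start].split()[-window:]
    right = sentence[start + len(target):].split()[:window]
    if not left or not right:
        raise ValueError("example needs context on both sides of the target")
    left_re = r"\s+".join(map(re.escape, left))
    right_re = r"\s+".join(map(re.escape, right))
    # Lookarounds keep the context from matching inside longer words.
    return re.compile(rf"(?<!\w){left_re}\s+(.+?)\s+{right_re}(?!\w)")


def extract(patterns, texts):
    """Apply every learned pattern to every text and collect the captures."""
    found = []
    for text in texts:
        for pattern in patterns:
            found.extend(m.group(1) for m in pattern.finditer(text))
    return found


# "Training": the user marks the target fact in an example article.
examples = [("Acme Corp. named Jane Doe as chief executive.", "Jane Doe")]
patterns = [learn_pattern(sentence, target) for sentence, target in examples]

# "Extraction": the learned patterns are run over unseen text.
corpus = [
    "Globex named John Smith as president.",
    "The weather was mild.",
]
print(extract(patterns, corpus))  # -> ['John Smith']
```

A single marked example already transfers to a new sentence here because only the surrounding context is kept literal. A real system would presumably need richer generalization (for instance over word classes or syntax) than this literal-context regex provides.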