About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
ICEIS 2003
Conference paper
TME: An knowledge-based information extraction system
Abstract
Information extraction is a form of shallow text processing that locates a specified set of relevant information in a natural-language document. In this paper, a system-Template Match Engine (TME) is developed to extract useful information from unlabelled texts. The main feature of this system is that it improves and refines the initial extraction pattern by the concept knowledge which is incrementally acquired from the corpus. The system first builds an initial pattern by utilizing domain knowledge. Then the initial pattern is used to extract information from electronic documents. This step produces some feedback words by enlarging and analyzing the extracted information. Next, this pattern is refined by the feedback words and concept knowledge related to them. Finally, the refined pattern is used to extract specified information from electronic documents. The experiment results show that TME system increases recall without loss of precision.