Publication
SDM 2007
Conference paper
A system for keyword search on textual streams
Abstract
An increasing amount of data is produced in the form of text streams - these can be RSS news feeds, TV closed captions, emails, etc. We study the problem of answering keyword queries on multiple textual streams. We define the result of a keyword query inspired by previous work on keyword search on static databases. A result to a query is a combination of streams "sufficiently correlated" to each other that collectively contain all query keywords within a specified time span. On the algorithmic side, in this paper we focus on the component of continuously monitoring the streams and outputting results as soon as they are available.