About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
Computer Vision and Image Understanding
Paper
Data- and model-driven multiresolution processing
Abstract
We introduce a technique for multiresolution processing which elegantly fits in our framework for visual recognition, described in earlier papers. The input is processed simultaneously at a coarse resolution throughout the image and at finer resolution within a small window (fovea). We introduce an approach for controlling the movement of the high-resolution window which allows for both data- and model-driven selection of fixation points. Three fixation modes have been implemented, one based on large unexplained areas in the data, one on conflicts in the object-model database, and one on a 2D "space filling" algorithm. We argue that this kind of multiresolution processing is not only useful in limiting the computational time, as has been widely recognized, but also can be a deciding factor in making the entire vision problem a tractable and stable one. To demonstrate the approach, we introduce a class of 3D surface textures as a feature for recognition in our system. Surface texture recognition typically requires higher-resolution processing than that required for the extraction of the underlying surface. As examples, surface texture is used to discriminate between a ping-pong ball and a golf ball, and "curve texture" is used to recognize different types of gears. Other experimental results also are included to show the advantages and the implications of our approach. © 1996 Academic Press, Inc.