Using extended feature objects for partial similarity retrieval

被引:15
作者
Berchtold S. [1 ]
Keim D.A. [1 ]
Kriegel H.-P. [1 ]
机构
[1] Institute for Computer Science, University of Munich, D-80538 Munich
关键词
CAD databases; Fourier transformation; Indexing and query processing of spatial objects; Partial similarity retrieval;
D O I
10.1007/s007780050049
中图分类号
学科分类号
摘要
In this paper, we introduce the concept of extended feature objects for similarity retrieval. Conventional approaches for similarity search in databases map each object in the database to a point in some high-dimensional feature space and define similarity as some distance measure in this space. For many similarity search problems, this feature-based approach is not sufficient. When retrieving partially similar polygons, for example, the search cannot be restricted to edge sequences, since similar polygon sections may start and end anywhere on the edges of the polygons. In general, inherently continuous problems such as the partial similarity search cannot be solved by using point objects in feature space. In our solution, we therefore introduce extended feature objects consisting of an infinite set of feature points. For an efficient storage and retrieval of the extended feature objects, we determine the minimal bounding boxes of the feature objects in multidimensional space and store these boxes using a spatial access structure. In our concrete polygon problem, sets of polygon sections are mapped to 2D feature objects in high-dimensional space which are then approximated by minimal bounding boxes and stored in an R* -tree. The selectivity of the index is improved by using an adaptive decomposition of very large feature objects and a dynamic joining of small feature objects. For the polygon problem, translation, rotation, and scaling invariance is achieved by using the Fourier-transformed curvature of the normalized polygon sections. In contrast to vertex-based algorithms, our algorithm guarantees that no false dismissals may occur and additionally provides fast search times for realistic database sizes. We evaluate our method using real polygon data of a supplier for the car manufacturing industry.
引用
收藏
页码:333 / 348
页数:15
相关论文
共 31 条
  • [1] Alt H., Blomer J., Resemblance and symmetries of geometric patterns. Data structures and efficient algorithms, LNCS, 594, pp. 1-24, (1992)
  • [2] Agrawal R., Faloutsos C., Swami A., Efficient similarity search in sequence databases, Proc. 4th Int. Conf. on Foundations of Data Organization and Algorithms, LNCS, 730, pp. 69-84, (1993)
  • [3] Agrawal R., Lin K., Sawhney H., Shim K., Fast similarity search in the presence of noise, scaling, and translation in time-series databases, Proc. 21st Conf. on Very Large Databases, pp. 490-501, (1995)
  • [4] Berchtold S., Geometry Based Search of Similar Parts. (In German), (1997)
  • [5] Brinkhoff T., Kriegel H.-P., Schneider R., Comparison of approximations of complex objects used for approximation-bused query processing in spatial database systems, Proc. 9th Int. Conf. on Data Engineering, pp. 40-49, (1993)
  • [6] Beckmann N., Kriegel H.-P., Schneider R., Sceger B., The R* -tree: An efficient and robust access method for points and rectangles, Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 322-331, (1990)
  • [7] Faloutsos C., Barber R., Flickner M., Hafner J., Et al., Efficient and effective querying by image content, J Intel Inf Syst, 3, pp. 231-262, (1994)
  • [8] Freeston M., The BANG file: A new kind of grid file, Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 260-269, (1987)
  • [9] Faloutsos C., Ranganathan M., Manolopoulos Y., Fast subsequence matching in time-scries databases, Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 419-429, (1994)
  • [10] Gargantini I., An Effective way to represent quadtrees, Commun ACM, 25, 12, pp. 905-910, (1982)