Approximation-Based Similarity Search for 3-D Surface Segments

被引:23
作者
Kriegel H.-P. [1 ,2 ,3 ,4 ,6 ]
Seidl T. [1 ,4 ,5 ,6 ]
机构
[1] University of Munich, Institute for Computer Science, Oettingenstr. 67
[2] Dept. of Database and Info. Systems, Institute for Computer Science, University of Munich
[3] Institute for Computer Science, University of Munich
关键词
3-D spatial database systems; Approximation-based similarity search; Ellipsoid queries on multidimensional index structures; Multi-step similarity query processing;
D O I
10.1023/A:1009760031965
中图分类号
学科分类号
摘要
The issue of finding similar 3-D surface segments arises in many recent applications of spatial database systems, such as molecular biology, medical imaging, CAD, and geographic information systems. Surface segments being similar in shape to a given query segment are to be retrieved from the database. The two main questions are how to define shape similarity and how to efficiently execute similarity search queries. We propose a new similarity model based on shape approximation by multi-parametric surface functions that are adaptable to specific application domains. We then define shape similarity of two 3-D surface segments in terms of their mutual approximation errors. Applying the multi-step query processing paradigm, we propose algorithms to efficiently support complex similarity search queries in large spatial databases. A new query type, called the ellipsoid query, is utilized in the filter step. Ellipsoid queries, being specified by quadratic forms, represent a general concept for similarity search. Our major contribution is the introduction of efficient algorithms to perform ellipsoid queries on multidimensional index structures. Experimental results on a large 3-D protein database containing 94,000 surface segments demonstrate the successful application and the high performance of our method.
引用
收藏
页码:113 / 147
页数:34
相关论文
共 36 条
[1]  
Agrawal, R., Faloutsos, C., Swami, A., Efficient Similarity Search in Sequence Databases (1993) Lecture Notes in Computer Science, 730, pp. 69-84. , Proc. 4th Int. Conf. on Foundations of Data Organization and Algorithms (FODO '93), Evanston, IL, USA, Springer
[2]  
Agrawal, R., Lin, K.-I., Sawhney, H.S., Shim, K., Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases (1995) Proc. 21st Int. Conf. on Very Large Databases (VLDB '95), pp. 490-501. , Morgan Kaufmann
[3]  
Aehle, W., Sobek, H., Schomburg, D., Evaluation of Protein 3D-Structure Prediction: Comparison of Modelled and X-Ray Structure of an Alkaline Serine Protease (1995) Journal of Biotechnology, 41, pp. 211-220
[4]  
Beckmann, N., Kriegel, H.-P., Schneider, R., Seeger, B., The R*-tree: An Efficient and Robust Access Method for Points and Rectangles (1990) Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 322-331. , Atlantic City, NJ, USA
[5]  
Berchtold, S., Böhm, C., Braunmüller, B., Keim, D., Kriegel, H.-P., Fast Parallel Similarity Search in Multimedia Databases (1997) Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 1-12
[6]  
Berchtold, S., Keim, D., Kriegel, H.-P., The X-tree: An Index Structure for High-Dimensional Data (1996) Proc. 22nd Int. Conf. on Very Large Databases (VLDB '96), pp. 28-39. , Mumbai, India
[7]  
Berchtold, S., Keim, D., Kriegel, H.-P., Using Extended Feature Objects for Partial Similarity Retrieval (1997) VLDB Journal, 6 (4), pp. 333-348
[8]  
Berchtold, S., Kriegel, H.-P., S3: Similarity Search in CAD Database Systems (1997) Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 564-567
[9]  
Bernstein, F.C., Koetzle, T.F., Williams, G.J., Meyer, E.F., Brice, M.D., Rogers, J.R., Kennard, O., Tasumi, M., The Protein Data Bank: A Computer-based Archival File for Macromolecular Structures (1977) Journal of Molecular Biology, 112, pp. 535-542
[10]  
Best, M.J., Ritter, K., (1985) Linear Programming. Active Set Analysis and Computer Programs, , Prentice Hall: Englewood Cliffs, NJ