EFFICIENT DETECTION OF 3-DIMENSIONAL STRUCTURAL MOTIFS IN BIOLOGICAL MACROMOLECULES BY COMPUTER VISION TECHNIQUES

被引:210
作者
NUSSINOV, R
WOLFSON, HJ
机构
[1] TEL AVIV UNIV,SCH MATH SCI,DEPT COMP SCI,IL-69978 TEL AVIV,ISRAEL
[2] NCI,FREDERICK CANC RES FACIL,MATH BIOL LAB,FREDERICK,MD 21702
[3] NYU,COURANT INST MATH SCI,ROBOT RES LAB,NEW YORK,NY 10003
关键词
3-DIMENSIONAL STRUCTURAL COMPARISON; CRYSTALLOGRAPHIC COORDINATES; EFFICIENT COMPUTER VISION ALGORITHM; MACROMOLECULAR STRUCTURE ANALYSIS;
D O I
10.1073/pnas.88.23.10495
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Macromolecules carrying biological information often consist of independent modules containing recurring structural motifs. Detection of a specific structural motif within a protein (or DNA) aids in elucidating the role played by the protein (DNA element) and the mechanism of its operation. The number of crystallographically known structures at high resolution is increasing very rapidly. Yet, comparison of three-dimensional structures is a laborious time-consuming procedure that typically requires a manual phase. To date, there is no fast automated procedure for structural comparisons. We present an efficient O(n3) worst case time complexity algorithm for achieving such a goal (where n is the number of atoms in the examined structure). The method is truly three-dimensional, sequence-order-independent, and thus insensitive to gaps, insertions, or deletions. This algorithm is based on the geometric hashing paradigm, which was originally developed for object recognition problems in computer vision. It introduces an indexing approach based on transformation invariant representations and is especially geared toward efficient recognition of partial structures in rigid objects belonging to large data bases. This algorithm is suitable for quick scanning of structural data bases and will detect a recurring structural motif that is a priori unknown. The algorithm uses protein (or DNA) structures, atomic labels, and their three-dimensional coordinates. Additional information pertaining to the structure speeds the comparisons. The algorithm is straightforwardly parallelizable, and several versions of it for computer vision applications have been implemented on the massively parallel connection machine. A prototype version of the algorithm has been implemented and applied to the detection of substructures in proteins.
引用
收藏
页码:10495 / 10499
页数:5
相关论文
共 43 条
[1]   A SIMPLE QUALITATIVE REPRESENTATION OF POLYPEPTIDE-CHAIN FOLDS - COMPARISON OF PROTEIN TERTIARY STRUCTURES [J].
ABAGYAN, RA ;
MAIOROV, VN .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1988, 5 (06) :1267-1279
[2]   GENE-REGULATION - ACTION OF LEUCINE ZIPPERS [J].
ABEL, T ;
MANIATIS, T .
NATURE, 1989, 341 (6237) :24-25
[3]   3DPO - A 3-DIMENSIONAL PART ORIENTATION SYSTEM [J].
BOLLES, RC ;
HORAUD, P .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1986, 5 (03) :3-26
[4]  
Bourdon O., 1990, Proceedings. 10th International Conference on Pattern Recognition (Cat. No.90CH2898-5), P596, DOI 10.1109/ICPR.1990.119438
[5]   MODEL-BASED RECOGNITION IN ROBOT VISION. [J].
Chin, Roland T. ;
Dyer, Charles R. .
Computing surveys, 1986, 18 (01) :67-108
[6]   ANALYSIS OF SP1 INVIVO REVEALS MULTIPLE TRANSCRIPTIONAL DOMAINS, INCLUDING A NOVEL GLUTAMINE-RICH ACTIVATION MOTIF [J].
COUREY, AJ ;
TJIAN, R .
CELL, 1988, 55 (05) :887-898
[7]  
FISCHER D, 1991, IN PRESS J BIOMOL ST
[8]   HOMEO BOXES IN THE STUDY OF DEVELOPMENT [J].
GEHRING, WJ .
SCIENCE, 1987, 236 (4806) :1245-1252
[9]   LOCALIZING OVERLAPPING PARTS BY SEARCHING THE INTERPRETATION TREE [J].
GRIMSON, WEL ;
LOZANOPEREZ, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (04) :469-482
[10]  
HONG J, 1988, NOV P ICPR ROM, P72