PQL: Protein Query Language

被引:0
|
作者
Elfayoumy, Sherif [1 ]
Bathen, Paul [1 ]
机构
[1] Univ North Florida, Sch Comp, Jacksonville, FL 32224 USA
来源
2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2 | 2012年
关键词
Proteomics; Bioinformatics; Query Languages; Protein Query Language;
D O I
10.1109/ICMLA.2012.217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a Protein Query Language (PQL) for querying protein structures in an expressive yet concise manner. One of the objectives of the paper is to demonstrate how such a language would be beneficial to protein researchers to obtain in-depth protein data from a relational database without extensive SQL knowledge. The language features options such as limiting query results by key protein characteristics such as methyl donated hydrogen bond interactions, minimum and maximum phi and psi angles, repulsive forces, CH/Pi calculations, and other pertinent factors. A backend data model was designed to support storage and retrieval of protein primary and secondary sequences, atomic level data, as well as calculations on said data. A relational DBMS is used as the persistent storage backend, with every effort made to ensure transparent portability to most relational database systems. In addition, front end applications can be developed to support retrieving, transforming, and preprocessing of information from the Research Collaboratory for Structural Bioinformatics (RCSB) into the backend data repository. The new language and associated architecture allow users to load additional protein files from RCSB into the database, issue standard queries to download pertinent data in user-friendly formats including CSV files, issue non-standard queries against secondary structures via the protein query language, and run error detection routines against data in the database. Query results may include normalized or denormalized data, model and chain data, residue data, atom detail data, and primary as well as secondary structure data.
引用
收藏
页码:127 / 132
页数:6
相关论文
共 50 条
  • [41] A Simpler and Semantic Multidimensional Database Query Language to Facilitate Access to Information in Decision-making
    Palominos, F.
    Cordova, F.
    Duran, C.
    Nunez, B.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2020, 15 (04)
  • [42] SQL/SDA: A query language for supporting spatial data analysis and its Web-based implementation
    Lin, H
    Huang, B
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2001, 13 (04) : 671 - 682
  • [43] Transformer Encoder with Protein Language Model for Protein Secondary Structure Prediction
    Kazm, Ammar
    Ali, Aida
    Hashim, Haslina
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (02) : 13124 - 13132
  • [44] The language of proteins: NLP, machine learning & protein sequences
    Ofer, Dan
    Brandes, Nadav
    Linial, Michal
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 1750 - 1758
  • [45] Protein sequence classification using natural language processing techniques
    Huma Perveen
    Julie Weeds
    Discover Artificial Intelligence, 5 (1):
  • [46] Query exhaustivity, relevance feedback and search success in automatic and interactive query expansion
    Vakkari, P
    Jones, S
    MacFarlane, A
    Sormunen, E
    JOURNAL OF DOCUMENTATION, 2004, 60 (02) : 109 - 127
  • [47] DeepPTM: Protein Post-translational Modification Prediction from Protein Sequences by Combining Deep Protein Language Model with Vision Transformers
    Soylu, Necla Nisa
    Sefer, Emre
    CURRENT BIOINFORMATICS, 2024, 19 (09) : 810 - 824
  • [48] Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling
    Yang, Yiming
    Harpale, Abhay
    Ganapathy, Subramaniam
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 554 - 569
  • [49] Paraconsistent database query languages
    Bagai, Rajiv
    Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006, : 248 - 253
  • [50] Parallel query processing in a polystore
    Pavlos Kranas
    Boyan Kolev
    Oleksandra Levchenko
    Esther Pacitti
    Patrick Valduriez
    Ricardo Jiménez-Peris
    Marta Patiño-Martinez
    Distributed and Parallel Databases, 2021, 39 : 939 - 977