CANCER PATHWAYS: AUTOMATIC EXTRACTION, REPRESENTATION, AND REASONING IN THE 'BIG DATA' ERA

被引:0
|
作者
Gonzalez, Graciela [1 ]
Baral, Chitta [2 ]
Kiefer, Jeff [3 ]
Kim, Seungchan [4 ]
Ye, Jieping [2 ]
机构
[1] Arizona State Univ, Dept Biomed Informat, Scottsdale, AZ 85259 USA
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA
[3] Translat Genom Res Inst TGen, Knowledge Min Lab, Scottsdale, AZ 85259 USA
[4] Translat Genom Res Inst TGen, Integrated Canc Genom Div, Biocomp Unit, Phoenix, AZ 85004 USA
关键词
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
There has been great interest and research initiatives in the biomedical community around harnessing "big data", including data from the literature, high-throughput gene expression experiments, array CGH and high-throughput siRNA and many other types of data to generate novel hypothesis to address the most crucial biomedical questions and aid in the discovery of more effective and improved therapeutic options for the treatment of complex and pervasive diseases such as cancer. Cancer research has progressed rapidly in the last decade with the implementation of high-dimensional genomic technologies. The large amount of data generated over the years has enabled a systems-based approach to uncovering and elucidating the complex signaling networks associated with cancer. However, even though new technologies have advanced our understanding of cancer biology beyond what could be imagined even a decade ago, there still exist unique challenges associated precisely with the amount of data that is now routinely generated from even a single patient. The data must be stored and processed, with novel analysis strategies called for to uncover new insights into cancer biology that are literally hidden in 'big data'. Interest in taming 'big data' through methods and systems to extract, represent, and transform it into knowledge that can effectively be used for reasoning and question answering will only increase over time, enabling scientists to finally use the data for personalized treatment, discovery and validation. Work presented in this session includes novel approaches to explore cancer gene expression data, applying algebraic topology (Lockwood and Krishnamoorthy) and Denoising autoencoders (Tan et al) to identify significant properties of genomic data that cannot be found by traditional algorithms. There is also a novel methodology for leveraging somatic mutation data for predicting survival in cancer samples (Kim et al), a computational system for automated gene expression pattern annotation on mouse brain images that could prove to be key to understanding the pathogenesis of brain tumors and their early detection (Yang et al). With respect to knowledge extraction, this session includes work on a weakly supervised machine learning approach for automatic pathway extraction from PubMed abstracts (Poon et al), and on the use protein interaction data from multiple sources to investigate mutations in 125 genes that were earlier identified as driving tumorigenesis when mutated (Engin et al).
引用
收藏
页码:80 / 83
页数:4
相关论文
共 50 条
  • [1] Scalable automatic sleep staging in the era of Big Data
    Nakamura, Takashi
    Davies, Harry J.
    Mandic, Danilo P.
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 2265 - 2268
  • [2] TreeWrapper: Automatic data extraction based on tree representation
    Gao, Xiaoying
    Zhang, Mengjie
    Cao, Minh Duc
    AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 566 - +
  • [3] Remotely Sensed Big Data Era and Intelligent Information Extraction
    Zhang B.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2018, 43 (12): : 1861 - 1871
  • [4] Knowledge Entity Extraction and Text Mining in the Era of Big Data
    Zhang, Chengzhi
    Mayr, Philipp
    Lu, Wei
    Zhang, Yi
    Data and Information Management, 2021, 5 (03): : 309 - 311
  • [5] Extraction and Representation of Big Data Based on Iterative Operation of Chaotic Functions
    Yu, Wanbo
    Wang, Xiangxiang
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 110 - 110
  • [6] Automatic Extraction of POIs in Smart Cities: Big Data Processing in ParticipAct
    Corradi, Antonio
    Curatola, Giovanni
    Foschini, Luca
    Ianniello, Raffaele
    De Rolt, Carlos Roberto
    PROCEEDINGS OF THE 2015 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM), 2015, : 1059 - 1064
  • [7] Temporal data representation, normalization, extraction, and reasoning: A review from clinical domain
    Madkour, Mohcine
    Benhaddou, Driss
    Tao, Cui
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2016, 128 : 52 - 68
  • [8] Editorial overview: Functionalizing cancer genomes in the era of big data
    Rad, Roland
    Boutros, Michael
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 2019, 54 : III - VI
  • [9] Finding cancer driver mutations in the era of big data research
    Poulos R.C.
    Wong J.W.H.
    Biophysical Reviews, 2019, 11 (1) : 21 - 29
  • [10] Small data in the era of big data
    Kitchin, Rob
    Lauriault, Tracey P.
    GEOJOURNAL, 2015, 80 (04) : 463 - 475