An Integrative Co-localization (INCO) Analysis for SNV and CNV Genomic Features With an Application to Taiwan Biobank Data

被引:3
|
作者
Yu, Qi-You [1 ]
Lu, Tzu-Pin [1 ,2 ]
Hsiao, Tzu-Hung [3 ]
Lin, Ching-Heng [3 ]
Wu, Chi-Yun [4 ,5 ]
Tzeng, Jung-Ying [1 ,6 ,7 ]
Hsiao, Chuhsing Kate [1 ,2 ]
机构
[1] Natl Taiwan Univ, Inst Epidemiol & Prevent Med, Coll Publ Hlth, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Publ Hlth, Taipei, Taiwan
[3] Taichung Vet Gen Hosp, Dept Med Res, Taichung, Taiwan
[4] Univ Penn, Grad Grp Genom & Computat Biol, Philadelphia, PA 19104 USA
[5] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[6] North Carolina State Univ, Dept Stat, Raleigh, NC USA
[7] North Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC USA
基金
美国国家卫生研究院;
关键词
co-localization; gene-level; integrative analysis; Taiwan Biobank; CNV; SNV; CNV-SNV cross-platform interaction; rare variant; COPY NUMBER VARIANTS; WIDE ASSOCIATION; MISSING HERITABILITY; ARCHITECTURE; NUCLEOTIDE; MYSTERY; TESTS; RARE;
D O I
10.3389/fgene.2021.709555
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genomic studies have been a major approach to elucidating disease etiology and to exploring potential targets for treatments of many complex diseases. Statistical analyses in these studies often face the challenges of multiplicity, weak signals, and the nature of dependence among genetic markers. This situation becomes even more complicated when multi-omics data are available. To integrate the data from different platforms, various integrative analyses have been adopted, ranging from the direct union or intersection operation on sets derived from different single-platform analysis to complex hierarchical multi-level models. The former ignores the biological relationship between molecules while the latter can be hard to interpret. We propose in this study an integrative approach that combines both single nucleotide variants (SNVs) and copy number variations (CNVs) in the same genomic unit to co-localize the concurrent effect and to deal with the sparsity due to rare variants. This approach is illustrated with simulation studies to evaluate its performance and is applied to low-density lipoprotein cholesterol and triglyceride measurements from Taiwan Biobank. The results show that the proposed method can more effectively detect the collective effect from both SNVs and CNVs compared to traditional methods. For the biobank analysis, the identified genetic regions including the gene VNN2 could be novel and deserve further investigation.
引用
收藏
页数:12
相关论文
共 16 条
  • [1] Genomic co-localization of variation affecting agronomic and human gut microbiome traits in a meta-analysis of diverse sorghum
    Korth, Nate
    Yang, Qinnan
    Van Haute, Mallory J.
    Tross, Michael C.
    Peng, Bo
    Shrestha, Nikee
    Zwiener-Malcom, Mackenzie
    Mural, Ravi, V
    Schnable, James C.
    Benson, Andrew K.
    G3-GENES GENOMES GENETICS, 2024, 14 (09):
  • [2] Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
    Min, Eun Jeong
    Long, Qi
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [3] Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
    Eun Jeong Min
    Qi Long
    BMC Bioinformatics, 21
  • [4] Co-localization analysis of complex formation among membrane proteins by computerized fluorescence microscopy: application to immunofluorescence co-patching studies
    Lachmanovich, E
    Shvartsman, DE
    Malka, Y
    Botvin, C
    Henis, YI
    Weiss, AM
    JOURNAL OF MICROSCOPY, 2003, 212 : 122 - 131
  • [5] Integrative clustering of multiple genomic data using organoid model with application to subtype analysis in intrahepatic cholangiocarcinoma
    Lee, Hee Seung
    Han, Dai Hoon
    Cho, Kyung Joo
    Park, Soo Been
    Kim, Chanyang
    Leem, Galam
    Kwon, Soon Sung
    Kim, Chul Hoon
    Jo, Jung Hyun
    Lee, HyeWon
    Song, Si Young
    Park, Jun Yong
    JOURNAL OF HEPATOLOGY, 2022, 77 : S653 - S653
  • [6] INTEGRATIVE CLUSTERING OF MULTIPLE GENOMIC DATA USING ORGANOID MODEL WITH APPLICATION TO SUBTYPE ANALYSIS IN INTRAHEPATIC CHOLANGIOCARCINOMA
    Lee, Hee Seung
    Han, Dai Hoon
    Cho, Kyung Joo
    Leem, Galam
    Kim, Chanyang
    Park, Soo Been
    Jung, Dawoon E.
    Jo, Jung Hyun
    Song, Si Young
    Park, Jun Yong
    GASTROENTEROLOGY, 2022, 162 (07) : S1158 - S1158
  • [7] Integrative large-scale causal network analysis of imaging and genomic data and its application in schizophrenia studies
    Lin, N.
    Wang, P.
    Zhu, Y.
    Zhao, J.
    Calhoun, V.
    Xiong, M.
    HUMAN GENOMICS, 2016, 10
  • [8] Coupled co-clustering-based unsupervised transfer learning for the integrative analysis of single-cell genomic data
    Zeng, Pengcheng
    Wangwu, Jiaxuan
    Lin, Zhixiang
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [9] Analysis MicroRNA-Gene Co-Modules in Glioblastoma Multiforme Based on Integrative Two Types of Genomic Data
    Deng, Jin
    Kong, Wei
    Wang, Huimin
    Wang, Shuaiqun
    Mou, Xiaoyang
    2018 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND BIOINFORMATICS (ICBEB 2018), 2018, : 1 - 7
  • [10] Fluorescent co-localization of PTS1 and PTS2 and its application in analysis of the gene function and the peroxisomal dynamic in Magnaporthe oryzae
    Wang, Jiao-yu
    Wu, Xiao-yan
    Zhang, Zhen
    Du, Xin-fa
    Chai, Rong-yao
    Liu, Xiao-hong
    Mao, Xue-qin
    Qiu, Hai-ping
    Wang, Yan-li
    Lin, Fu-cheng
    Sun, Guo-chang
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE B, 2008, 9 (10): : 802 - 810