Using Machine Learning to Identify Biomarkers Affecting Fat Deposition in Pigs by Integrating Multisource Transcriptome Information

被引:14
|
作者
Liu, Huatao [1 ]
Xing, Kai [1 ]
Jiang, Yifan [1 ]
Liu, Yibing [1 ]
Wang, Chuduan [1 ]
Ding, Xiangdong [1 ]
机构
[1] China Agr Univ, Coll Anim Sci & Technol, Natl Engn Lab Anim Breeding, Lab Anim Genet Breeding & Reprod, Beijing 100193, Peoples R China
基金
中国国家自然科学基金;
关键词
fat deposition; pigs; data integration; machine learning; biomarkers; CYTOSCAPE; PROGRAM;
D O I
10.1021/acs.jafc.2c03339
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Fat deposition in pigs is not only closely related to pig production efficiency and pork quality but also an ideal model for human obesity. Transcriptome sequencing is widely used to study fat deposition. However, due to small sample sizes, high false positive rates, and poor consistency of results from different studies, new strategies are urgently needed. Machine learning, a new analysis method, can effectively fit complex data and accurately identify samples and genes. In this study, 36 samples of adipose tissue, muscle tissue, and liver tissue were collected from Songliao black pigs and Landrace pigs, and the mRNA of all the samples was sequenced. In addition, we collected transcriptome data for 64 samples in the GEO database from four different sources. After standardization and imputation of missing values in the data set comprising 100 samples, traditional differential expression analysis was carried out, and different numbers of expressed genes were selected as features for the training model of eight machine learning methods. In the 1000 replications of fourfold cross validation with 100 samples, AdaBoost performed best, with an average prediction accuracy greater than 93% and the highest mean area under the curve in predicting the high- and low-fat content groups among the eight ML methods. According to their performance-based ranks inferred by AdaBoost, 12 genes related to fat deposition were identified; among them, FASN and APOD were specifically expressed in adipose tissue, and APOA1 was specifically expressed in the liver, which could be important candidate biomarkers affecting fat deposition.
引用
收藏
页码:10359 / 10370
页数:12
相关论文
共 40 条
  • [1] Identification of key genes affecting intramuscular fat deposition in pigs using machine learning models
    Shi, Yumei
    Wang, Xini
    Chen, Shaokang
    Zhao, Yanhui
    Wang, Yan
    Sheng, Xihui
    Qi, Xiaolong
    Zhou, Lei
    Feng, Yu
    Liu, Jianfeng
    Wang, Chuduan
    Xing, Kai
    FRONTIERS IN GENETICS, 2025, 15
  • [2] Integration of Transcriptome and Whole Genomic Resequencing Data to Identify Key Genes Affecting Swine Fat Deposition
    Xing, Kai
    Zhu, Feng
    Zhai, Liwei
    Liu, Huijie
    Wang, Yuan
    Wang, Zhijun
    Chen, Shaokang
    Hou, Zhuocheng
    Wang, Chuduan
    PLOS ONE, 2015, 10 (04):
  • [3] Smartphone-Based Indoor Localization Using Machine Learning and Multisource Information Fusion
    Yan, Jun
    Huang, Zheng
    Wu, Xiaohuan
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 2722 - 2734
  • [4] Using Machine Learning to Identify the Most Critical Factors Affecting Maintenance of Tunnels
    Ahmed, Muaz O.
    Khalef, Ramy
    Ali, Gasser G.
    El-Adaway, Islam H.
    COMPUTING IN CIVIL ENGINEERING 2021, 2022, : 1433 - 1440
  • [5] Using Machine Learning to Identify Top Antecedents Affecting Crime in US Communities
    Samara, Kamil
    ADVANCES IN INFORMATION AND COMMUNICATION, FICC, VOL 2, 2023, 652 : 96 - 101
  • [6] Integration of transcriptome and machine learning to identify the potential key genes and regulatory networks affecting drip loss in pork
    Yang, Wen
    Hou, Liming
    Wang, Binbin
    Wu, Jian
    Zha, Chengwan
    Wu, Wangjun
    JOURNAL OF ANIMAL SCIENCE, 2024, 102
  • [7] Integrating single-cell RNA sequencing, WGCNA, and machine learning to identify key biomarkers in hepatocellular carcinoma
    Gang Wang
    Jiaxing Zhang
    Yirong Li
    Yuyu Zhang
    Weiwei Dong
    Hengquan Wu
    Jinglan Wang
    Peiqing Liao
    Ziqiang Yuan
    Tao Liu
    Wenting He
    Scientific Reports, 15 (1)
  • [8] Multi-modal transcriptomics: integrating machine learning and convolutional neural networks to identify immune biomarkers in atherosclerosis
    Chen, Haiqing
    Lai, Haotian
    Chi, Hao
    Fan, Wei
    Huang, Jinbang
    Zhang, Shengke
    Jiang, Chenglu
    Jiang, Lai
    Hu, Qingwen
    Yan, Xiuben
    Chen, Yemeng
    Zhang, Jieying
    Yang, Guanhu
    Liao, Bin
    Wan, Juyi
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2024, 11
  • [9] Identification of Potential Sex-Specific Biomarkers in Pigs with Low and High Intramuscular Fat Content Using Integrated Bioinformatics and Machine Learning
    Yang, Yongli
    Wang, Xiaoyi
    Wang, Shuyan
    Chen, Qiang
    Li, Mingli
    Lu, Shaoxiong
    GENES, 2023, 14 (09)
  • [10] Using a Machine Learning Approach to Identify Key Biomarkers for Renal Clear Cell Carcinoma
    Han, Xiaying
    Song, Dianwen
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2022, 15 : 3541 - 3558