Transforming Big Data into AI-ready data for nutrition and obesity research

被引:1
|
作者
Thomas, Diana M. [1 ]
Knight, Rob [2 ]
Gilbert, Jack A. [3 ,4 ]
Cornelis, Marilyn C. [5 ]
Gantz, Marie G. [6 ]
Burdekin, Kate [6 ]
Cummiskey, Kevin [1 ]
Sumner, Susan C. J. [7 ]
Pathmasiri, Wimal [7 ]
Sazonov, Edward [8 ]
Gabriel, Kelley Pettee [9 ]
Dooley, Erin E. [9 ]
Green, Mark A. [10 ]
Pfluger, Andrew [11 ]
Kleinberg, Samantha [12 ]
机构
[1] United States Mil Acad, Dept Math Sci, West Point, NY 10996 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Pediat, La Jolla, CA USA
[4] Univ Calif San Diego, Scripps Inst Oceanog, La Jolla, CA USA
[5] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Chicago, IL USA
[6] Res Triangle Inst Int, Biostat & Epidemiol Div, Res Triangle Pk, NC USA
[7] Univ North Carolina Chapel Hill, Nutr Res Inst, Dept Nutr, Kannapolis, NC USA
[8] Univ Alabama, Elect & Comp Engn Dept, Tuscaloosa, AL USA
[9] Univ Alabama Birmingham, Dept Epidemiol, Birmingham, AL USA
[10] Univ Liverpool, Dept Geog & Planning, Liverpool, England
[11] United States Mil Acad, Dept Geog & Environm Engn, West Point, NY USA
[12] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ USA
基金
美国国家卫生研究院;
关键词
ENERGY-EXPENDITURE; METABOLOMICS; FOOD; MICROBIOME; CHALLENGES; SEQUENCES; COUNT; WRIST;
D O I
10.1002/oby.23989
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions. Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed. Results: Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented. Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.
引用
收藏
页码:857 / 870
页数:14
相关论文
共 50 条
  • [31] Big Data Challenges Impact, potential responses and research needs
    Bachlechner, Daniel
    Leimbach, Timo
    2016 IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND INNOVATIVE BUSINESS PRACTICES FOR THE TRANSFORMATION OF SOCIETIES (EMERGITECH), 2016, : 257 - 264
  • [32] Ethical Issues in Social Science Research Employing Big Data
    Hosseini, Mohammad
    Wieczorek, Michal
    Gordijn, Bert
    SCIENCE AND ENGINEERING ETHICS, 2022, 28 (03)
  • [33] On a Certain Research Gap in Big Data Mining for Customer Insights
    Mach-Krol, Maria
    Hadasik, Bartlomiej
    APPLIED SCIENCES-BASEL, 2021, 11 (15):
  • [34] Problems with research methods in medical device big data analytics
    Strang, Kenneth David
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (02) : 229 - 240
  • [35] Intellectual capital in the age of Big Data: establishing a research agenda
    Secundo, Giustina
    Del Vecchio, Pasquale
    Dumay, John
    Passiante, Giuseppina
    JOURNAL OF INTELLECTUAL CAPITAL, 2017, 18 (02) : 242 - 261
  • [36] Survey Methodology for Data Collection and Analysis in Nutrition and Dietetics Research
    Metallinos-Katsaras, Elizabeth
    Beto, Judith
    JOURNAL OF THE ACADEMY OF NUTRITION AND DIETETICS, 2025, 125 (05) : 615 - 629
  • [37] Big Data, AI and Machine Learning for Precision Psychiatry: How are they changing the clinical practice?
    Winter, Nils Ralf
    Hahn, Tim
    FORTSCHRITTE DER NEUROLOGIE PSYCHIATRIE, 2020, 88 (12) : 786 - 793
  • [38] Big Data-Led Cancer Research, Application, and Insights
    Brown, James A. L.
    Chonghaile, Triona Ni
    Matchett, Kyle B.
    Lynam-Lennon, Niamh
    Kiely, Patrick A.
    CANCER RESEARCH, 2016, 76 (21) : 6167 - 6170
  • [39] The Confluence of AI and Big Data Analytics in Industry 4.0: Fostering Sustainable Strategic Development
    Zheng, Mengze
    Li, Te
    Ye, Jing
    JOURNAL OF THE KNOWLEDGE ECONOMY, 2024, : 5479 - 5515
  • [40] Nutritional Culturomics and Big Data; Macroscopic Patterns of Change in Food, Nutrition and Diet Choices
    Troumbis, Andreas Y.
    Hatziantoniou, Maria
    Vasios, Georgios K.
    CURRENT PHARMACEUTICAL BIOTECHNOLOGY, 2019, 20 (10) : 895 - 908