Transforming Big Data into AI-ready data for nutrition and obesity research

被引:1
|
作者
Thomas, Diana M. [1 ]
Knight, Rob [2 ]
Gilbert, Jack A. [3 ,4 ]
Cornelis, Marilyn C. [5 ]
Gantz, Marie G. [6 ]
Burdekin, Kate [6 ]
Cummiskey, Kevin [1 ]
Sumner, Susan C. J. [7 ]
Pathmasiri, Wimal [7 ]
Sazonov, Edward [8 ]
Gabriel, Kelley Pettee [9 ]
Dooley, Erin E. [9 ]
Green, Mark A. [10 ]
Pfluger, Andrew [11 ]
Kleinberg, Samantha [12 ]
机构
[1] United States Mil Acad, Dept Math Sci, West Point, NY 10996 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Pediat, La Jolla, CA USA
[4] Univ Calif San Diego, Scripps Inst Oceanog, La Jolla, CA USA
[5] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Chicago, IL USA
[6] Res Triangle Inst Int, Biostat & Epidemiol Div, Res Triangle Pk, NC USA
[7] Univ North Carolina Chapel Hill, Nutr Res Inst, Dept Nutr, Kannapolis, NC USA
[8] Univ Alabama, Elect & Comp Engn Dept, Tuscaloosa, AL USA
[9] Univ Alabama Birmingham, Dept Epidemiol, Birmingham, AL USA
[10] Univ Liverpool, Dept Geog & Planning, Liverpool, England
[11] United States Mil Acad, Dept Geog & Environm Engn, West Point, NY USA
[12] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ USA
基金
美国国家卫生研究院;
关键词
ENERGY-EXPENDITURE; METABOLOMICS; FOOD; MICROBIOME; CHALLENGES; SEQUENCES; COUNT; WRIST;
D O I
10.1002/oby.23989
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions. Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed. Results: Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented. Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.
引用
收藏
页码:857 / 870
页数:14
相关论文
共 50 条
  • [1] How has big data contributed to obesity research? A review of the literature
    Timmins, Kate A.
    Green, Mark A.
    Radley, Duncan
    Morris, Michelle A.
    Pearce, Jamie
    INTERNATIONAL JOURNAL OF OBESITY, 2018, 42 (12) : 1951 - 1962
  • [2] Big Data and AI - A transformational shift for government: So, what next for research?
    Pencheva, Irina
    Esteve, Marc
    Mikhaylov, Slava Jankin
    PUBLIC POLICY AND ADMINISTRATION, 2020, 35 (01) : 24 - 44
  • [3] Big Data in Gastroenterology Research
    Alizadeh, Madeline
    Sampaio Moura, Natalia
    Schledwitz, Alyssa
    Patil, Seema A.
    Ravel, Jacques
    Raufman, Jean-Pierre
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (03)
  • [4] Speaking Sociologically with Big Data: Symphonic Social Science and the Future for Big Data Research
    Halford, Susan
    Savage, Mike
    SOCIOLOGY-THE JOURNAL OF THE BRITISH SOCIOLOGICAL ASSOCIATION, 2017, 51 (06): : 1132 - 1148
  • [5] A Survey on Data Collection for Machine Learning: A Big Data-AI Integration Perspective
    Roh, Yuji
    Heo, Geon
    Whang, Steven Euijong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1328 - 1347
  • [6] Big Data and AI Revolution in Precision Agriculture: Survey and Challenges
    Bhat, Showkat Ahmad
    Huang, Nen-Fu
    IEEE ACCESS, 2021, 9 : 110209 - 110222
  • [7] Transforming big data into smart data: An insight on the use of the k-nearest neighbors algorithm to obtain quality data
    Triguero, Isaac
    Garcia-Gil, Diego
    Maillo, Jesus
    Luengo, Julian
    Garcia, Salvador
    Herrera, Francisco
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (02)
  • [8] Big Data and data science: A critical review of issues for educational research
    Daniel, Ben Kei
    BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2019, 50 (01) : 101 - 113
  • [9] Application Research: Big Data in Food Industry
    Tao, Qi
    Ding, Hongwei
    Wang, Huixia
    Cui, Xiaohui
    FOODS, 2021, 10 (09)
  • [10] Privacy with Big Data: A Framework Completed Research
    Hoffman, David
    AMCIS 2018 PROCEEDINGS, 2018,