Transforming Big Data into AI-ready data for nutrition and obesity research

被引:1
|
作者
Thomas, Diana M. [1 ]
Knight, Rob [2 ]
Gilbert, Jack A. [3 ,4 ]
Cornelis, Marilyn C. [5 ]
Gantz, Marie G. [6 ]
Burdekin, Kate [6 ]
Cummiskey, Kevin [1 ]
Sumner, Susan C. J. [7 ]
Pathmasiri, Wimal [7 ]
Sazonov, Edward [8 ]
Gabriel, Kelley Pettee [9 ]
Dooley, Erin E. [9 ]
Green, Mark A. [10 ]
Pfluger, Andrew [11 ]
Kleinberg, Samantha [12 ]
机构
[1] United States Mil Acad, Dept Math Sci, West Point, NY 10996 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Pediat, La Jolla, CA USA
[4] Univ Calif San Diego, Scripps Inst Oceanog, La Jolla, CA USA
[5] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Chicago, IL USA
[6] Res Triangle Inst Int, Biostat & Epidemiol Div, Res Triangle Pk, NC USA
[7] Univ North Carolina Chapel Hill, Nutr Res Inst, Dept Nutr, Kannapolis, NC USA
[8] Univ Alabama, Elect & Comp Engn Dept, Tuscaloosa, AL USA
[9] Univ Alabama Birmingham, Dept Epidemiol, Birmingham, AL USA
[10] Univ Liverpool, Dept Geog & Planning, Liverpool, England
[11] United States Mil Acad, Dept Geog & Environm Engn, West Point, NY USA
[12] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ USA
基金
美国国家卫生研究院;
关键词
ENERGY-EXPENDITURE; METABOLOMICS; FOOD; MICROBIOME; CHALLENGES; SEQUENCES; COUNT; WRIST;
D O I
10.1002/oby.23989
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions. Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed. Results: Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented. Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.
引用
收藏
页码:857 / 870
页数:14
相关论文
共 50 条
  • [41] A bibliometric review of a decade of research: Big data in business research-Setting a research agenda
    Zhang, Yucheng
    Zhang, Meng
    Li, Jing
    Liu, Guangjian
    Yang, Miles M.
    Liu, Siqi
    JOURNAL OF BUSINESS RESEARCH, 2021, 131 : 374 - 390
  • [42] Applications of Big Data Analytics in Financial Auditing- A Study on The Big Four Completed Research
    Haddara, Moutaz
    Su, Kuan Lin
    Alkayid, Kholoud
    Ali, Maged
    AMCIS 2018 PROCEEDINGS, 2018,
  • [43] Symptom Science Research in the Era of Big Data: Leveraging Interdisciplinary Resources and Partners to Make It Happen
    Corwin, Elizabeth J.
    Jones, Dean P.
    Dunlop, Anne L.
    JOURNAL OF NURSING SCHOLARSHIP, 2019, 51 (01) : 4 - 8
  • [44] Big data in education: a state of the art, limitations, and future research directions
    Baig, Maria Ijaz
    Shuib, Liyana
    Yadegaridehkordi, Elaheh
    INTERNATIONAL JOURNAL OF EDUCATIONAL TECHNOLOGY IN HIGHER EDUCATION, 2020, 17 (01)
  • [45] Access Control in the Era of Big Data: State of the Art and Research Directions
    Colombo, Pietro
    Ferrari, Elena
    SACMAT'18: PROCEEDINGS OF THE 23RD ACM SYMPOSIUM ON ACCESS CONTROL MODELS & TECHNOLOGIES, 2018, : 185 - 192
  • [46] A big-data approach to understanding metabolic rate and response to obesity in laboratory mice
    Corrigan, June K.
    Ramachandran, Deepti
    He, Yuchen
    Palmer, Colin J.
    Jurczak, Michael J.
    Chen, Rui
    Li, Bingshan
    Friedline, Randall H.
    Kim, Jason K.
    Ramsey, Jon J.
    Lantier, Louise
    McGuinness, Owen P.
    Banks, Alexander S.
    ELIFE, 2020, 9 : 1 - 34
  • [47] Research on Enterprise Financial Decision-making in Big Data Era
    Fang, Liu
    Yang Shuyuan
    2019 INTERNATIONAL CONFERENCE ON ARTS, MANAGEMENT, EDUCATION AND INNOVATION (ICAMEI 2019), 2019, : 1220 - 1223
  • [48] Challenges and Solutions in Qualitative Big Data Research: A Methodological Literature Review
    Suvivuo, Sampsa
    Tuunainen, Virpi K.
    COMMUNICATIONS OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 55 : 37 - 76
  • [49] A Descriptive Literature Review and Classification of Business Intelligence and Big Data Research
    Rashid, Ammar
    Khurshid, Muhammad Mahboob
    INTELLIGENT COMPUTING, VOL 1, 2022, 506 : 865 - 879
  • [50] Big data and performance measurement research: trends, evolution and future opportunities
    Sardi, Alberto
    Sorano, Enrico
    Cantino, Valter
    Garengo, Patrizia
    MEASURING BUSINESS EXCELLENCE, 2023, 27 (04) : 531 - 548