MVApp-Multivariate Analysis Application for Streamlined Data Analysis and Curation

被引:59
作者
Julkowska, Magdalena M. [1 ]
Saade, Stephanie [1 ]
Agarwal, Gaurav [2 ]
Gao, Ge [1 ]
Pailles, Yveline [1 ]
Morton, Mitchell [1 ]
Awlia, Mariam [1 ]
Tester, Mark [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Biol & Environm Sci & Engn Div, Thuwal 239556900, Saudi Arabia
[2] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 239556900, Saudi Arabia
关键词
R-PACKAGE; VISUALIZATION; OUTLIERS;
D O I
10.1104/pp.19.00235
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Modern phenotyping techniques yield vast amounts of data that are challenging to manage and analyze. When thoroughly examined, this type of data can reveal genotype-to-phenotype relationships and meaningful connections among individual traits. However, efficient data mining is challenging for experimental biologists with limited training in curating, integrating, and exploring complex datasets. Additionally, data transparency, accessibility, and reproducibility are important considerations for scientific publication. The need for a streamlined, user-friendly pipeline for advanced phenotypic data analysis is pressing. In this article we present an open-source, online platform for multivariate analysis (MVApp), which serves as an interactive pipeline for data curation, in-depth analysis, and customized visualization. MVApp builds on the available R-packages and adds extra functionalities to enhance the interpretability of the results. The modular design of the MVApp allows for flexible analysis of various data structures and includes tools underexplored in phenotypic data analysis, such as clustering and quantile regression. MVApp aims to enhance findable, accessible, interoperable, and reproducible data transparency, streamline data curation and analysis, and increase statistical literacy among the scientific community.
引用
收藏
页码:1261 / 1276
页数:16
相关论文
共 41 条
  • [1] Agarwal G, 2018, QUANTILE FUNCTION MO
  • [2] Analyzing outliers: influential or nuisance?
    Altman, Naomi
    Krzywinski, Martin
    [J]. NATURE METHODS, 2016, 13 (04) : 281 - 282
  • [3] High-Throughput Non-destructive Phenotyping of Traits that Contribute to Salinity Tolerance in Arabidopsis thaliana
    Awlia, Mariam
    Nigro, Arianna
    Faikus, Jirl
    Schmoeckel, Sandra M.
    Negrao, Sonia
    Santelia, Diana
    Trtilek, Martin
    Tester, Mark
    Julkowska, Magdalena M.
    Panzarova, Klara
    [J]. FRONTIERS IN PLANT SCIENCE, 2016, 7
  • [4] Fast and Elegant Numerical Linear Algebra Using the RcppEigen Package
    Bates, Douglas
    Eddelbuettel, Dirk
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2013, 52 (05): : 1 - 24
  • [5] Principal component analysis
    Bro, Rasmus
    Smilde, Age K.
    [J]. ANALYTICAL METHODS, 2014, 6 (09) : 2812 - 2831
  • [6] CHALONER K, 1988, BIOMETRIKA, V75, P651
  • [7] Charrad M, 2014, J STAT SOFTW, V61, P1
  • [8] The HTPmod Shiny application enables modeling and visualization of large-scale biological data
    Chen, Dijun
    Fu, Liang-Yu
    Hu, Dahui
    Klukas, Christian
    Chen, Ming
    Kaufmann, Kerstin
    [J]. COMMUNICATIONS BIOLOGY, 2018, 1
  • [9] A note on the Cook's distance
    Díaz-García, JA
    González-Farías, G
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2004, 120 (1-2) : 119 - 136
  • [10] Lights, camera, action: high-throughput plant phenotyping is ready for a close-up
    Fahlgren, Noah
    Gehan, Malia A.
    Baxter, Ivan
    [J]. CURRENT OPINION IN PLANT BIOLOGY, 2015, 24 : 93 - 99