Missing value imputation in a data matrix using the regularised singular value decomposition

被引：1

作者：

Arciniegas-Alarcon, Sergio ^{[1
]}

Garcia-Pena, Marisol ^{[2
]}

Krzanowski, Wojtek J. ^{[3
]}

Rengifo, Camilo ^{[1
]}

机构：

[1] Univ Sabana, Fac Ingn, Chia, Colombia

[2] Pontificia Univ Javeriana, Dept Matemat, Bogota, Colombia

[3] Univ Exeter, Coll Engn Math & Phys Sci, Exeter, England

来源：

METHODSX | 2023年 / 11卷

关键词：

Eigenvalues; Eigenvectors; Iterative computational scheme; Cross-validation; Genotype-by-environment interaction; Overfitting; GGE BIPLOT;

D O I：

10.1016/j.mex.2023.102289

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Some statistical analysis techniques may require complete data matrices, but a frequent problem in the construction of databases is the incomplete collection of information for different reasons. One option to tackle the problem is to estimate and impute the missing data. This paper describes a form of imputation that mixes regression with lower rank approximations. To improve the qual-ity of the imputations, a generalisation is proposed that replaces the singular value decomposition (SVD) of the matrix with a regularised SVD in which the regularisation parameter is estimated by cross-validation. To evaluate the performance of the proposal, ten sets of real data from mul-tienvironment trials were used. Missing values were created in each set at four percentages of missing not at random, and three criteria were then considered to investigate the effectiveness of the proposal. The results show that the regularised method proves very competitive when com-pared to the original method, beating it in several of the considered scenarios. As it is a very general system, its application can be extended to all multivariate data matrices. & BULL; The imputation method is modified through the inclusion of a stable and efficient compu-tational algorithm that replaces the classical SVD least squares criterion by a penalised cri-terion. This penalty produces smoothed eigenvectors and eigenvalues that avoid overfitting problems, improving the performance of the method when the penalty is necessary. The size of the penalty can be determined by minimising one of the following criteria: the prediction errors, the Procrustes similarity statistic or the critical angles between subspaces of principal components.

引用

页数：8

共 32 条

[21] HYPERCOMPLEX TENSOR COMPLETION WITH CAYLEY-DICKSON SINGULAR VALUE DECOMPOSITION
Mizoguchi, Takehiko
Yamada, Isao
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3979 - 3983
[22] SINGULAR-VALUE-DECOMPOSITION APPROACH TO MULTIVARIABLE GENERALIZED PREDICTIVE CONTROL
KOUVARITAKIS, B
ROSSITER, JA
CHANG, AOT
IEE PROCEEDINGS-D CONTROL THEORY AND APPLICATIONS, 1993, 140 (03): : 145 - 154
[23] Reduced Complexity Power Allocation Strategies for MIMO Systems With Singular Value Decomposition
Zanella, Alberto
Chiani, Marco
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2012, 61 (09) : 4031 - 4041
[24] A fast singular value decomposition algorithm of general k-tridiagonal matrices
Tanasescu, Andrei
Popescu, Pantelimon George
JOURNAL OF COMPUTATIONAL SCIENCE, 2019, 31 : 1 - 5
[25] Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention
Xia, Yan
Havan, Selim
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2024, 84 (03) : 577 - 593
[26] Fast computation of error bounds for all eigenpairs of a Hermitian and all singular pairs of a rectangular matrix with emphasis on eigen- and singular value clusters
Rump, Siegfried M.
Lange, Marko
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2023, 434
[27] Face Recognition System by using Eigen Value Decomposition
Tunio, Irfan Ali
Soomro, Shafiullah
Soomro, Toufique Ahmed
Bhatti, Mohammad Tarique
Shaikh, Mohsin
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (05): : 8 - 12
[28] Weighted singular value decomposition and determinantal representations of the quaternion weighted Moore-Penrose inverse
Kyrchei, Ivan
APPLIED MATHEMATICS AND COMPUTATION, 2017, 309 : 1 - 16
[29] MISSING DATA IMPUTATION USING SPATIAL STATISTICS TECHNIQUES APPLIED TO URUGUAY CENSUS OF POPULATION AND HOUSING
Eugenia Riano, Maria
SABERES, 2019, 11 (02) : 153 - 169
[30] Defining a bulk-edge correspondence for non-Hermitian Hamiltonians via singular-value decomposition
Herviou, Loic
Bardarson, Jens H.
Regnault, Nicolas
PHYSICAL REVIEW A, 2019, 99 (05)

← 1 2 3 4 →