TY - JOUR
T1 - Conditional Uncorrelation and Efficient Subset Selection in Sparse Regression
AU - Wang, Jianji
AU - Zhang, Shupei
AU - Liu, Qi
AU - Du, Shaoyi
AU - Guo, Yu Cheng
AU - Zheng, Nanning
AU - Wang, Fei Yue
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2022/10/1
Y1 - 2022/10/1
N2 - Given md -dimensional responsors and nd -dimensional predictors, sparse regression finds at most k predictors for each responsor for linear approximation, 1≤q k ≤q d-1. The key problem in sparse regression is subset selection, which usually suffers from high computational cost. In recent years, many improved approximate methods of subset selection have been published. However, less attention has been paid to the nonapproximate method of subset selection, which is very necessary for many questions in data analysis. Here, we consider sparse regression from the view of correlation and propose the formula of conditional uncorrelation. Then, an efficient nonapproximate method of subset selection is proposed in which we do not need to calculate any coefficients in the regression equation for candidate predictors. By the proposed method, the computational complexity is reduced from O([1/6]k3+(m+1)k2+mkd) to O([1/6]k3+[1/2](m+1)k2) for each candidate subset in sparse regression. Because the dimension d is generally the number of observations or experiments and large enough, the proposed method can greatly improve the efficiency of nonapproximate subset selection. We also apply the proposed method in real scenarios of dental age assessment and sparse coding to validate the efficiency of the proposed method.
AB - Given md -dimensional responsors and nd -dimensional predictors, sparse regression finds at most k predictors for each responsor for linear approximation, 1≤q k ≤q d-1. The key problem in sparse regression is subset selection, which usually suffers from high computational cost. In recent years, many improved approximate methods of subset selection have been published. However, less attention has been paid to the nonapproximate method of subset selection, which is very necessary for many questions in data analysis. Here, we consider sparse regression from the view of correlation and propose the formula of conditional uncorrelation. Then, an efficient nonapproximate method of subset selection is proposed in which we do not need to calculate any coefficients in the regression equation for candidate predictors. By the proposed method, the computational complexity is reduced from O([1/6]k3+(m+1)k2+mkd) to O([1/6]k3+[1/2](m+1)k2) for each candidate subset in sparse regression. Because the dimension d is generally the number of observations or experiments and large enough, the proposed method can greatly improve the efficiency of nonapproximate subset selection. We also apply the proposed method in real scenarios of dental age assessment and sparse coding to validate the efficiency of the proposed method.
KW - Conditional uncorrelation
KW - dental age assessment
KW - multivariate correlation
KW - sparse coding
KW - sparse regression
KW - subset selection
UR - https://www.scopus.com/pages/publications/85104619954
U2 - 10.1109/TCYB.2021.3062842
DO - 10.1109/TCYB.2021.3062842
M3 - 文章
C2 - 33882011
AN - SCOPUS:85104619954
SN - 2168-2267
VL - 52
SP - 10458
EP - 10467
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 10
ER -