TY - JOUR
T1 - Divide and conquer local average regression
AU - Chang, Xiangyu
AU - Lin, Shao Bo
AU - Wang, Yao
N1 - Publisher Copyright:
© 2017, Institute of Mathematical Statistics. All rights reserved.
PY - 2017
Y1 - 2017
N2 - The divide and conquer strategy, which breaks a massive data set into a series of manageable data blocks, and combines the independent results of data blocks to obtain a final decision, has been recognized as a state-of-the-art method to overcome challenges of massive data analysis. In this paper, we equip the classical local average regression with some divide and conquer strategies to infer the regressive relationship of input-output pairs from a massive data set. When the average mixture, a widely used divide and conquer approach, is adopted, we prove that the optimal learning rate can be achieved under some restrictive conditions on the number of data blocks. We then propose two variants to relax (or remove) these conditions and derive the same optimal learning rates as that for the average mixture local average regression. Our theoretical assertions are verified by a series of experimental studies.
AB - The divide and conquer strategy, which breaks a massive data set into a series of manageable data blocks, and combines the independent results of data blocks to obtain a final decision, has been recognized as a state-of-the-art method to overcome challenges of massive data analysis. In this paper, we equip the classical local average regression with some divide and conquer strategies to infer the regressive relationship of input-output pairs from a massive data set. When the average mixture, a widely used divide and conquer approach, is adopted, we prove that the optimal learning rate can be achieved under some restrictive conditions on the number of data blocks. We then propose two variants to relax (or remove) these conditions and derive the same optimal learning rates as that for the average mixture local average regression. Our theoretical assertions are verified by a series of experimental studies.
KW - Divide and conquer strategy
KW - K nearest neighbor estimate
KW - Local average regression
KW - Nadaraya-Watson estimate
UR - https://www.scopus.com/pages/publications/85018519999
U2 - 10.1214/17-EJS1265
DO - 10.1214/17-EJS1265
M3 - 文章
AN - SCOPUS:85018519999
SN - 1935-7524
VL - 11
SP - 1326
EP - 1350
JO - Electronic Journal of Statistics
JF - Electronic Journal of Statistics
IS - 1
ER -