TY - GEN
T1 - Challenges and recent advances in estimation performance evaluation
AU - Li, X. Rong
PY - 2008
Y1 - 2008
N2 - Existing methods and metrics for estimation performance evaluation have serious limitations. They are narrowly focused, overly pessimistic, hyper sensitive to evaluation scenario, totally reliant on knowing the ground truth, and may ignore crucial coupling between the estimate and estimatee. Three challenges have been articulated: development of a comprehensive metric of estimation performance, evaluation of estimation performance without the need to know the ground truth, and development of alternative methods and measures for estimation performance ranking. A general approach to comprehensive evaluation of estimation performance without the need to know the ground truth is possible. Such an approach has been developed and is outlined here. Various metrics in three classes have been proposed recently: absolute metrics, relative metrics, and frequency counts. Having distinctive pros and cons, they should take over the widely used RMSE in many cases. Three other classes of metrics have been developed: error spectrum, desirability metrics, and concentration measures. They are much more comprehensive than existing ones. The use of a common performance measure is not the only yardstick for performance ranking. Alternatives do exist and may prove superior. A viable one is to rank by outcomes of competitions among estimators. Two measures in this category have been proposed and a third class is under development.
AB - Existing methods and metrics for estimation performance evaluation have serious limitations. They are narrowly focused, overly pessimistic, hyper sensitive to evaluation scenario, totally reliant on knowing the ground truth, and may ignore crucial coupling between the estimate and estimatee. Three challenges have been articulated: development of a comprehensive metric of estimation performance, evaluation of estimation performance without the need to know the ground truth, and development of alternative methods and measures for estimation performance ranking. A general approach to comprehensive evaluation of estimation performance without the need to know the ground truth is possible. Such an approach has been developed and is outlined here. Various metrics in three classes have been proposed recently: absolute metrics, relative metrics, and frequency counts. Having distinctive pros and cons, they should take over the widely used RMSE in many cases. Three other classes of metrics have been developed: error spectrum, desirability metrics, and concentration measures. They are much more comprehensive than existing ones. The use of a common performance measure is not the only yardstick for performance ranking. Alternatives do exist and may prove superior. A viable one is to rank by outcomes of competitions among estimators. Two measures in this category have been proposed and a third class is under development.
UR - https://www.scopus.com/pages/publications/44949232325
U2 - 10.1117/12.802000
DO - 10.1117/12.802000
M3 - 会议稿件
AN - SCOPUS:44949232325
SN - 9780819471598
T3 - Proceedings of SPIE - The International Society for Optical Engineering
SP - lii-lvii
BT - Signal Processing, Sensor Fusion, and Target Recognition XVII
T2 - The International Society for Optical Engineering (SPIE)
Y2 - 17 March 2008 through 19 March 2008
ER -