Likelihood Scores for Sparse Signal and Change-Point Detection

被引:2
作者
Hu, Shouri [1 ]
Huang, Jingyan [2 ]
Chen, Hao [3 ]
Chan, Hock Peng [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Math Sci, Chengdu 611731, Sichuan, Peoples R China
[2] Natl Univ Singapore, Dept Stat & Data Sci, Singapore 117546, Singapore
[3] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
关键词
Testing; Estimation; Data models; Standards; Signal detection; Ranking (statistics); Optimization; Asymptotic optimality; change-point; sequence segmentation; signal detection; BINARY SEGMENTATION; HIGHER CRITICISM; NUMBER; STATISTICS; BAYES; RARE;
D O I
10.1109/TIT.2023.3242297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider here the identification of change-points on large-scale data streams. The objective is to find the most efficient way of combining information across data stream so that detection is possible under the smallest detectable change magnitude. The challenge comes from the sparsity of change-points when only a small fraction of data streams undergo change at any point in time. The most successful approach to the sparsity issue so far has been the application of hard thresholding such that only local scores from data streams exhibiting significant changes are considered and added. However the identification of an optimal threshold is a difficult one. In particular it is unlikely that the same threshold is optimal for different levels of sparsity. We propose here a sparse likelihood score for identifying a sparse signal. The score is a likelihood ratio for testing between the null hypothesis of no change against an alternative hypothesis in which the change-points or signals are barely detectable. By the Neyman-Pearson Lemma this score has maximum detection power at the given alternative. The outcome is that we have a scoring of data streams that is successful in detecting at the boundary of the detectable region of signals and change-points. The likelihood score can be seen as a soft thresholding approach to sparse signal and change-point detection in which local scores that indicate small changes are down-weighted much more than local scores indicating large changes. We are able to show sharp optimality of the sparsity likelihood score in the sense of achieving successful detection at the minimum detectable order of change magnitude as well as the best constant with respect this order of change.
引用
收藏
页码:4065 / 4080
页数:16
相关论文
共 43 条