On Optimal Data Compression in Multiterminal Statistical Inference

被引:14
作者
Amari, Shun-ichi [1 ]
机构
[1] RIKEN Brain Sci Inst, Wako, Saitama 3510198, Japan
关键词
Data compression; Fisher information; linear-threshold encoding; multiterminal source; multiterminal statistical inference; INFORMATION;
D O I
10.1109/TIT.2011.2162270
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The multiterminal theory of statistical inference deals with the problem of estimating or testing the correlation of letters generated from two (or many) correlated information sources under the restriction of a certain transmission rate for each source. A typical example is two binary sources with joint probability p(x, y) where the correlation of x and y is to be tested or estimated. Given n iid observations x(n) = x(1) ... x(n) and y(n) = y(1) ... y(n), only k = rn (0 < r < 1) bits each can be transmitted to a common destination. What is the optimal data compression for statistical inference? A simple idea is to send the first k letters of x(n) and y(n). A simpler problem is the helper case where the optimal data compression of x(n) is searched for under the condition that all of y(n) are transmitted. It is a long standing problem to determine if there is a better data compression scheme than this simple scheme of sending first k letters. The present paper searches for the optimal data compression under the framework of linear-threshold encoding and shows that there is a better data compression scheme depending on the value of correlation. To this end, we evaluate the Fisher information in the class of linear-threshold compression schemes. It is also proved that the simple scheme is optimal when x and y are independent or their correlation is not too large.
引用
收藏
页码:5577 / 5587
页数:11
相关论文
共 12 条
[1]   HYPOTHESIS-TESTING WITH COMMUNICATION CONSTRAINTS [J].
AHLSWEDE, R ;
CSISZAR, I .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1986, 32 (04) :533-542
[2]   ON MINIMAX ESTIMATION IN THE PRESENCE OF SIDE INFORMATION ABOUT REMOTE DATA [J].
AHLSWEDE, R ;
BURNASHEV, MV .
ANNALS OF STATISTICS, 1990, 18 (01) :141-171
[4]   STATISTICAL-INFERENCE UNDER MULTITERMINAL RATE RESTRICTIONS - A DIFFERENTIAL GEOMETRIC APPROACH [J].
AMARI, SI ;
HAN, TS .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1989, 35 (02) :217-227
[5]  
Berger T., 1979, P IEEE 7 SPRING WORK
[6]  
HAN TS, 1987, IEEE T INFORM THEORY, V33, P759, DOI 10.1109/TIT.1987.1057383
[7]   EXPONENTIAL-TYPE ERROR PROBABILITIES FOR MULTITERMINAL HYPOTHESIS-TESTING [J].
HAN, TS ;
KOBAYASHI, K .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1989, 35 (01) :2-14
[8]  
Han TS, 1995, IEEE T INFORM THEORY, V41, P1802, DOI 10.1109/18.476308
[9]  
Han TS, 1998, IEEE T INFORM THEORY, V44, P2300, DOI 10.1109/18.720540
[10]   MULTITERMINAL DETECTION WITH ZERO-RATE DATA-COMPRESSION [J].
SHALABY, HMH ;
PAPAMARCOU, A .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :254-267