Preserving quality of information by using semantic relationships

Cited by: 38
Authors
Basu, P. [1]
Bao, J. [2]
Dean, M. [1]
Hendler, J. [3]
Affiliations
[1] Raytheon BBN Technol, Cambridge, MA 02138 USA
[2] Samsung US R&D Ctr, San Jose, CA USA
[3] Rensselaer Polytech Inst, Troy, NY USA
Keywords
Semantic information theory; Inference; Semantic compression; Logic
DOI
10.1016/j.pmcj.2013.07.013
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In pervasive computing and sensing applications, a multitude of devices such as sensors and processors (that perform fusion) serve as rich sources of data and information over long periods of time. It is often the case that the information streams generated inside an application are not independent of each other; instead, they have certain semantic relationships between them. In order to deal with high volumes of information generated over time, it is sometimes necessary to compress these information streams. However, it is often the case that the underlying meaning or semantics of the information is what is critical for maintaining an acceptable level of information quality, rather than the actual data in its entirety. In this paper, we show how semantic redundancy and ambiguity within a semantically-aware source can be exploited to achieve compression with a goal of being able to recover the meaning underlying its messages. We take the preliminary steps to extend the source coding principles of classical information theory and show that by utilizing semantic inference relations between probabilistically expressed messages and underlying models at the source, a higher rate of compression, albeit lossy, may be achieved compared to traditional syntactic compression methods. We define a "semantic entropy" measure for a source and show that it is bounded from above by the mutual information between its models and the syntactic messages it generates. We also consider some simple graph-based semantic inference relationships derived from propositional logic and give practical algorithms that exploit the graph structure of a shared knowledge base to facilitate lossless semantic compression. © 2013 Elsevier B.V. All rights reserved.
Pages: 188-202
Page count: 15