You Are What You Watch and When You Watch: Inferring Household Structures From IPTV Viewing Data

被引:17
作者
Luo, Dixin [1 ]
Xu, Hongteng [2 ]
Zha, Hongyuan [3 ]
Du, Jun [4 ]
Xie, Rong [1 ]
Yang, Xiaokang [1 ]
Zhang, Wenjun [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Media Proc & Commun, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Georgia Inst Technol, Dept Elect & Comp Engn, Atlanta, GA 30332 USA
[3] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
[4] China Telecom, Shanghai Branch, Shanghai 200240, Peoples R China
基金
美国国家科学基金会;
关键词
Viewing feature; low-rank model; semisupervised learning; behavior analysis; household structure; census; system simulations;
D O I
10.1109/TBC.2013.2295894
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
What you watch and when you watch say a lot about you, and such information at the aggregated level across a user population obviously provides significant insights for social and commercial applications. In this paper, we propose a model for inferring household structures based on analyzing users' viewing behaviors in Internet Protocol Television (IPTV) systems. We emphasize extracting features of viewing behaviors based on the dynamic of watching time and TV programs and training a classifier for inferring household structures according to the features. In the training phase, instead of merely using the limited labeled samples, we apply semisupervised learning strategy to obtain a graph-based model for classifying household structures from users' features. We test the proposed model on China Telecom IPTV data and demonstrate its utility in census research and system simulation. The demographic characteristics inferred by our approach match well with the population census data of Shanghai, and the inference of household structures of IPTV users gives encouraging results compared with the ground truth obtained by surveys, which opens the door for leveraging IPTV viewing data as a complementary way for time-and resource-consuming census tracking. On the other hand, the proposed model can also synthesize trace data for the simulations of IPTV systems, which provides us with a new strategy for system simulation.
引用
收藏
页码:61 / 72
页数:12
相关论文
共 32 条
[1]   Planning and managing the IPTV service deployment [J].
Agrawal, Dakshi ;
Beigi, Mandis S. ;
Bisdikian, Chatschik ;
Lee, Kang-Won .
2007 10TH IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2009), VOLS 1 AND 2, 2007, :353-+
[2]  
[Anonymous], 2006, ACM SIGOPS OPER SYST
[3]  
[Anonymous], 2003, P 20 INT C MACH LEAR
[4]  
[Anonymous], 2009, Proceedings of the 18th international conference on World wide web, DOI [10.1145/1526709.1526802, 10.1145/1526709, DOI 10.1145/1526709, DOI 10.1145/1526709.1526802]
[5]  
[Anonymous], 2005, SEMISUPERVISED LEARN
[6]  
[Anonymous], 2006, PATTERN RECOGN
[7]   An unsupervised approach to modeling personalized contexts of mobile users [J].
Bao, Tengfei ;
Cao, Huanhuan ;
Chen, Enhong ;
Tian, Jilei ;
Xiong, Hui .
KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (02) :345-370
[8]  
Barford P., 1998, Performance Evaluation Review, V26, P151, DOI 10.1145/277858.277897
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]   Exact Matrix Completion via Convex Optimization [J].
Candes, Emmanuel J. ;
Recht, Benjamin .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2009, 9 (06) :717-772