Wrapper Induction of News Information for Feeding to Social Networking Service on Smartphone

被引:0
作者
Xiang, Zhong-Liang [1 ]
Yu, Xiang-Ru [1 ]
Kang, Dae-Ki [2 ]
机构
[1] Weifang Univ Sci & Technol, Comp Software Inst, Shouguang 262700, Shandong, Peoples R China
[2] Dongseo Univ, Div Comp & Informat Engn, Busan 617716, South Korea
来源
2015 17TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT) | 2015年
关键词
NewsFeedAndroid; Minimum description length; Smartphone; Cellphone; Social network service; Wrapper;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose NewsFeedAndroid, a novel system that interconnects a social networking service and online newspaper sites in order to extracts news articles from the online news sites and to perform feeding of news articles to social network service (SNS) users. In NewsFeedAndroid, news information agents extract news article information from the news and portal sites using Minimum Description Length (MDL) wrapper induction algorithm. The news document collecting module regularly gathers news list information from news list page in the news sites and portals. In the collected documents, the document preprocessing module removes tags that are unnecessary for news information extraction. Lexical analyzer converts the rest text information and tags to a sequence of tokens, and news information is obtained by matching token patterns to the sequence. Those extracted news information from the various sites are integrated in the system and supplied to the end users through the social networking service on a smartphone. NewsFeedAndroid demonstrates a novel usage of integrating social networking services and online newspaper sites.
引用
收藏
页码:292 / 295
页数:4
相关论文
共 21 条
  • [1] Abramsky M., 2009, SMARTPHONES OUTNUMBE
  • [2] Agüero J, 2009, ADV SOFT COMP, V50, P194
  • [3] [Anonymous], 2010, ECONOMIST
  • [4] [Anonymous], 2008, 10 WIDM
  • [5] [Anonymous], 2007, P 16, DOI DOI 10.1145/1242572.1242685
  • [6] The smart phone: A ubiquitous input device
    Ballagas, R
    Borchers, J
    Rohs, M
    Sheridan, JG
    [J]. IEEE PERVASIVE COMPUTING, 2006, 5 (01) : 70 - 77
  • [7] Cohen W. W., 1998, Proceedings of the Second International Conference on Autonomous Agents, P400, DOI 10.1145/280765.280870
  • [8] DOORENBOS R, 1997, P 1 INT C AUT AG
  • [9] XTRACT: Learning Document Type Descriptors from XML document collections
    Garofalakis, M
    Gionis, A
    Rastogi, R
    Seshadri, S
    Shim, K
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2003, 7 (01) : 23 - 56
  • [10] Gibson J., 2007, Proceedings of the 9th Annual ACM International Workshop on Web Information and Data Management, P105, DOI [10.1145/1316902.1316920, DOI 10.1145/1316902.1316920]