The role of news title for linking during preservation process in digital archives

Cited by: 5
Authors
Khan, Muzammil [1]
Khan, Sarwar Shah [2]
Ahmad, Arshad [3, 4]
Rahman, Arif Ur [5]
Affiliations
[1] Univ Swat, Dept Comp & Software Technol, Swat, Pakistan
[2] Zhengzhou Univ, Sch Informat Engn, Zhengzhou, Peoples R China
[3] Pak Austria Fachhsch, Dept IT & Comp Sci, Haripur, Pakistan
[4] Inst Appl Sci & Technol, Haripur, Pakistan
[5] Bahria Univ, Dept Comp Sci, Islamabad, Pakistan
Keywords
News archiving; News preservation; Linking news; Similarity measure; Digital libraries; Recommendation
DOI
10.1108/LHT-07-2020-0157
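Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract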
Purpose: The World Wide Web has become an essential platform for news publication and, in the past few years, one of the primary channels of information dissemination. Electronic media, i.e. television channels, magazines and newspapers, have started publishing news online. This online information is prone to disappear because of its short life span, so it is imperative to archive it for the long term and for future generations. This paper presents a content-based similarity measure, based on the headings of news articles, for linking digital news stories published in various newspapers during the preservation process, which helps to ensure future accessibility.

Design/methodology/approach: To evaluate the accuracy and assess the effectiveness of the proposed measure for linking news articles in the Digital News Story Archive (DNSA), we adopted both system-centric and user-centric (human judgment) evaluation over different datasets of news articles.

Findings: The proposed similarity measure is evaluated on datasets of different sizes, and the results are compared against both a user-centric technique, i.e. expert judgment, and system-centric techniques, i.e. the cosine similarity measure, the extended Jaccard measure and the common ratio measure for stories (CRMS). The comparison gives a broader view of the measure's behaviour and can help generalize it to different categories of news articles. Multiple experiments were conducted, and the findings showed that the measure produced viable results for national and international news, and the best results for linking sports news articles during preservation based on headings.

Originality/value: The DNSA preserves a huge number of news articles from multiple news sources, and each article has to be linked against this vast collection, which motivates an efficient linking mechanism that has only a few terms to manipulate. The CRMS is modified to deal with the headings of news articles as part of the digital news stories preservation framework and is comprehensively analysed.
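
As an illustration of the two standard baselines named in the abstract, the following minimal sketch computes the cosine similarity and the extended Jaccard (Tanimoto) measure between two news headings represented as term-frequency vectors. It is not the paper's proposed measure, and it does not implement CRMS, which the abstract does not define. The tokenizer, the function names and the sample headings are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(title):
    """Lowercase a heading and split it into alphanumeric terms.

    Assumption: a deliberately simple tokenizer with no stop-word
    removal or stemming; the paper's preprocessing may differ.
    """
    return re.findall(r"[a-z0-9]+", title.lower())


def _dot_and_squared_norms(title_a, title_b):
    """Return the dot product and squared norms of the two term-frequency vectors."""
    va, vb = Counter(tokenize(title_a)), Counter(tokenize(title_b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    sq_a = sum(c * c for c in va.values())
    sq_b = sum(c * c for c in vb.values())
    return dot, sq_a, sq_b


def cosine_similarity(title_a, title_b):
    """Cosine similarity: dot(a, b) / (|a| * |b|); 0.0 if either heading has no terms."""
    dot, sq_a, sq_b = _dot_and_squared_norms(title_a, title_b)
    if sq_a == 0 or sq_b == 0:
        return 0.0
    return dot / (math.sqrt(sq_a) * math.sqrt(sq_b))


def extended_jaccard(title_a, title_b):
    """Extended Jaccard (Tanimoto): dot(a, b) / (|a|^2 + |b|^2 - dot(a, b))."""
    dot, sq_a, sq_b = _dot_and_squared_norms(title_a, title_b)
    denominator = sq_a + sq_b - dot
    if denominator == 0:
        return 0.0
    return dot / denominator


if __name__ == "__main__":
    # Hypothetical headings, not taken from the paper's datasets.
    a = "Pakistan win cricket series"
    b = "Pakistan win series against Australia"
    print(round(cosine_similarity(a, b), 4))
    print(extended_jaccard(a, b))
```

Both functions operate only on the few terms in a heading, which is the property the abstract highlights as making heading-based linking cheap relative to full-text comparison.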
Pages: 1359-1383
Page count: 25
Related papers
8 items in total
  • [1] The Role of Transliterated Words in Linking Bilingual News Articles in an Archive
    Khan, Muzammil
    Khan, Sarwar Shah
    Alharbi, Yasser
    Alferaidi, Ali
    Alharbi, Talal Saad
    Yadav, Kusum
    APPLIED SCIENCES-BASEL, 2023, 13 (07)
  • [2] Normalizing Digital News-Stories for Preservation
    Khan, Muzammil
    Rahman, Arif Ur
    Awan, M. Daud
    Alam, Syed Mehtab
    2016 ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM 2016), 2016: 85-90
  • [3] Term-Based Approach for Linking Digital News Stories
    Khan, Muzammil
    Rahman, Arif Ur
    Awan, Muhammad Daud
    DIGITAL LIBRARIES AND MULTIMEDIA ARCHIVES, IRCDL 2018, 2018, 806: 127-138
  • [4] Digital archives for historians: research during the pandemic
    Palma, Patricia
    HISTORIA CIENCIAS SAUDE-MANGUINHOS, 2021, 28 (01): 293-300
  • [5] Provision of digital preservation metadata: a role for ONIX?
    Brindley, G
    Muir, A
    Probets, S
    PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2004, 38 (04): 240-250
  • [6] A Comprehensive Metadata Framework for Preservation and Accessibility of Digital News and Educational Resource Management
    Muzammil Khan
    Huma Rani
    Sana Ullah
    Arif Ur Rahman
    SN Computer Science, 6 (5)
  • [7] Evaluation of Role of Traditional Knowledge Digital Library and Traditional Chinese Medicine Database in Preservation of Traditional Medicinal Knowledge
    Ansari, Mohd Shoaib
    DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2016, 36 (02): 73-78
  • [8] Role of policies in collaborative design process for digital libraries within African higher education
    Ngimwa, Pauline
    Adams, Anne
    LIBRARY HI TECH, 2011, 29 (04): 678-696