Automatic summarization of Instagram social network posts by combining semantic and statistical approaches

被引:0
作者
Tabanmehr, Zainab [1 ]
Akhtarkavan, Ehsan [1 ]
机构
[1] Khatam Univ, Dept Comp Engn, Tehran, Iran
来源
2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA | 2023年
关键词
text summarization; extractive approach; abstract approach; natural language processing; social networks;
D O I
10.1109/IPRIA59240.2023.10147186
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing spread of data and text documents such as articles, web pages, books, posts on social networks, etc. on the Internet, creates a fundamental challenge in various fields of text processing under the title of "automatic text summarization". Manual processing and summarization of large volumes of textual data is a very difficult, expensive, time-consuming, and impossible process for human users. Text summarization systems are divided into extractive and abstract categories. In the extractive summarization method, the final summary of a text document is extracted from the important sentences of the same document without any kind of change. In this method, it is possible to repeat a series of sentences repeatedly and interfere with pronouns. But in the abstract summarization method, the final summary of a textual document is extracted from the meaning of the sentences and words of the same document or other documents. Many of the performed works have used extraction methods or abstracts to summarize the collection of web documents, each of which has advantages and disadvantages in the results obtained in terms of similarity or size. In this research, by developing a crawler, extracting the popular text posts from the Instagram social network, suitable pre-processing, and combining the set of extractive and abstract algorithms, the researcher showed how to use each of the abstract algorithms. and used extraction as a supplement to increase the accuracy and accuracy of another algorithm. Observations made on 820 popular text posts on the Instagram social network show the accuracy (80%) of the proposed system.
引用
收藏
页数:6
相关论文
共 16 条
  • [1] Aggarwal CharuC., 2015, DATA MINING, P429
  • [2] [Anonymous], 1999, AUTOMATIC CREATION L, P15
  • [3] Aria M., 2022, P 16 INT C STAT ANAL, V1, P67
  • [4] Automatic text summarization: A comprehensive survey
    El-Kassas, Wafaa S.
    Salama, Cherif R.
    Rafea, Ahmed A.
    Mohamed, Hoda K.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
  • [5] Recent automatic text summarization techniques: a survey
    Gambhir, Mahak
    Gupta, Vishal
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2017, 47 (01) : 1 - 66
  • [6] Gupta S., 2022, FREQUENT ITEM SET MI, P1
  • [7] The challenges of automatic summarization
    Hahn, U
    Mani, I
    [J]. COMPUTER, 2000, 33 (11) : 29 - +
  • [8] Khan A., 2020, SUMMARIZING ONLINE M, V2020
  • [9] Mallinson Jonathan, 2022, Edit5: Semi-autoregressive text-editing with t5 warm-start
  • [10] Moratanch N, 2016, PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT 2016)