Semantic Network Analysis Pipeline-Interactive Text Mining Framework for Exploration of Semantic Flows in Large Corpus of Text

被引:1
|
作者
Cenek, Martin [1 ,4 ]
Bulkow, Rowan [2 ]
Pak, Eric [3 ]
Oyster, Levi [3 ]
Ching, Boyd [3 ]
Mulagada, Ashika [1 ]
机构
[1] Univ Portland, Comp Sci, Portland, OR 90203 USA
[2] Resource Data Inc, Anchorage, AK 99503 USA
[3] Univ Alaska Anchorage, Comp Sci, Anchorage, AK 99508 USA
[4] 5000 N Willamette Blvd, Portland, OR 97203 USA
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 24期
关键词
semantic concept; text mining; computational linguistics; language processing; natural language processing; interactive visualization; MODEL;
D O I
10.3390/app9245302
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Historical topic modeling and semantic concepts exploration in a large corpus of unstructured text remains a hard, opened problem. Despite advancements in natural languages processing tools, statistical linguistics models, graph theory and visualization, there is no framework that combines these piece-wise tools under one roof. We designed and constructed a Semantic Network Analysis Pipeline (SNAP) that is available as an open-source web-service that implements work-flow needed by a data scientist to explore historical semantic concepts in a text corpus. We define a graph theoretic notion of a semantic concept as a flow of closely related tokens through the corpus of text. The modular work-flow pipeline processes text using natural language processing tools, statistical content narrowing, creates semantic networks from lexical token chaining, performs social network analysis of token networks and creates a 3D visualization of the semantic concept flows through corpus for interactive concept exploration. Finally, we illustrate the framework's utility to extract the information from a text corpus of Herman Melville's novel Moby Dick, the transcript of the 2015-2016 United States (U.S.) Senate Hearings on Environment and Public Works, and the Australian Broadcast Corporation's short news articles on rural and science topics.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Visual text analysis: Interactive exploration of semantic contents
    Rohrdantz C.
    Koch S.
    Jochim C.
    Heyer G.
    Scheuermann G.
    Ertl T.
    Schütze H.
    Keim D.A.
    Informatik-Spektrum, 2010, 33 (06) : 601 - 611
  • [2] A Text Mining Framework for Accelerating the Semantic Curation of Literature
    Batista-Navarro, Riza
    Hammock, Jennifer
    Ulate, William
    Ananiadou, Sophia
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, TPDL 2016, 2016, 9819 : 459 - 462
  • [3] An Adaptive Latent Semantic Analysis for Text mining
    Hong T. Tu
    Tuoi T. Phan
    Khu P. Nguyen
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 588 - 593
  • [4] A Semantic Framework for Extracting Taxonomic Relations from Text Corpus
    Phuoc Thi Hong Doan
    Arch-int, Ngamnij
    Arch-int, Somjit
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (03) : 325 - 337
  • [5] A framework for integrating deep and shallow semantic structures in text mining
    Collier, N
    Takeuchi, K
    Kawazoe, A
    Mullen, T
    Wattarujeekrit, T
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2003, 2773 : 824 - 834
  • [6] A Framework of Chinese Semantic Text Mining Based on Ontology Learning
    Zhang, Yu-feng
    Hu, Feng
    FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): MACHINE VISION, IMAGE PROCESSING, AND PATTERN ANALYSIS, 2012, 8349
  • [7] The Multidimensional Semantic Model of Text Objects (MSMTO): A Framework for Text Data Analysis
    Attaf, Sarah
    Benblidia, Nadjia
    Boussaid, Omar
    MODEL AND DATA ENGINEERING, MEDI 2014, 2014, 8748 : 113 - 124
  • [8] Deep Analysis of Power Equipment Defects Based on Semantic Framework Text Mining Technology
    Wang, Huifang
    Cao, Jing
    Lin, Dongyang
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2022, 8 (04): : 1157 - 1164
  • [9] TRENDS IN OVERTOURISM RESEARCH FROM 2018 TO 2021: TEXT MINING AND SEMANTIC NETWORK ANALYSIS
    Tang, Ruohan
    Lee, Won Seok
    Moon, Joonho
    Shim, Ji Min
    TOURISM REVIEW INTERNATIONAL, 2023, 27 (3-4): : 187 - 200
  • [10] Approaching fashion design trend applications using text mining and semantic network analysis
    Hyosun An
    Minjung Park
    Fashion and Textiles, 7