Scene Classification, Data Cleaning, and Comment Summarization for Large-Scale Location Databases

被引:0
|
作者
Cheng, Hsu-Yung [1 ]
Yu, Chih-Chang [2 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 320, Taiwan
[2] Chun Yuan Christian Univ, Dept Informat & Comp Engn, Taoyuan 320, Taiwan
关键词
image analysis; image classification; deep learning; natural language processing;
D O I
10.3390/electronics11131947
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a framework that can automatically analyze the images and comments in user-uploaded location databases. The proposed framework integrates image processing and natural language processing techniques to perform scene classification, data cleaning, and comment summarization so that the cluttered information in user-uploaded databases can be presented in an organized way to users. For scene classification, RGB image features, segmentation features, and the features of discriminative objects are fused with an attention module to improve classification accuracy. For data cleaning, incorrect images are detected using a multilevel feature extractor and a multiresolution distance calculation scheme. Finally, a comment summarization scheme is proposed to overcome the problems of unstructured sentences and the improper usage of punctuation marks, which are commonly found in customer reviews. To validate the proposed framework, a system that can classify and organize scenes and comments for hotels is implemented and evaluated. Comparisons with existing related studies are also performed. The experimental results validate the effectiveness and superiority of the proposed framework.
引用
收藏
页数:18
相关论文
共 50 条
  • [2] Large scale data based audio scene classification
    Sophiya E.
    Jothilakshmi S.
    International Journal of Speech Technology, 2018, 21 (04) : 825 - 836
  • [3] Learning From Large-Scale Noisy Web Data With Ubiquitous Reweighting for Image Classification
    Li, Jia
    Song, Yafei
    Zhu, Jianfeng
    Cheng, Lele
    Su, Ying
    Ye, Lin
    Yuan, Pengcheng
    Han, Shumin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1808 - 1814
  • [4] Classification of Cancer Pathology Reports: A Large-Scale Comparative Study
    Martina, Stefano
    Ventura, Leonardo
    Frasconi, Paolo
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (11) : 3085 - 3094
  • [5] Adaptively Placed Multi-Grid Scene Representation Networks for Large-Scale Data Visualization
    Wurster, Skylar W.
    Xiong, Tianyu
    Shen, Han-Wei
    Guo, Hanqi
    Peterka, Tom
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 965 - 974
  • [6] Large-scale image classification and nutrient estimation for Chinese dishes
    Feng, Yihang
    Wang, Yi
    Wang, Xinhao
    Bi, Jinbo
    Xiao, Zhenlei
    Luo, Yangchao
    JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2025, 19
  • [7] Classification of Large-Scale Mobile Laser Scanning Data in Urban Area with LightGBM
    Sevgen, Eray
    Abdikan, Saygin
    REMOTE SENSING, 2023, 15 (15)
  • [8] Optimisation of classification algorithm of associated data features of large-scale network system
    Cao, Yu
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2020, 13 (02) : 55 - 60
  • [9] BertLoc: Duplicate Location Record Detection in a Large-Scale Location Dataset
    Park, Sujin
    Lee, Sangwon
    Woo, Simon S.
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 942 - 951
  • [10] Large-Scale Synthetic Urban Dataset for Aerial Scene Understanding
    Gao, Qian
    Shen, Xukun
    Niu, Wensheng
    IEEE ACCESS, 2020, 8 (08): : 42131 - 42140