Image features for machine learning based web image classification

被引:0
|
作者
Cho, SS [1 ]
Hwang, CJ [1 ]
机构
[1] ETRI, Comp Software Lab, Taejon 305360, South Korea
来源
INTERNET IMAGING IV | 2003年 / 5018卷
关键词
image classification; machine learning; analysis of web documents; Bayes classifier; decision tree;
D O I
10.1117/12.479719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ubiquity of the Internet has brought about an increasing amount of multi-formatted Web documents. Although image occupies a large part of importance on these increasing Web documents, there have not been many researches for analyzing and understanding it. Many Web images are used for carrying important information but others are not used for it. If images in a Web document can be classified by which have particular information or not, then it would be very useful for analysis and multi-formatting of Web documents. In this paper we introduce the machine learning based methods of classifying Web images as either eliminable or non-eliminable. For this research, we have detected 16 special and rich features for Web images and experimented by using the Bayesian and decision tree methods. As the results, F-measures of 87.09%, 82.72% were achieved for each method and particularly, from the experiments to compare the effects of feature groups, it has proved that the selected features on this study are very useful for Web image classification.
引用
收藏
页码:328 / 335
页数:8
相关论文
共 50 条
  • [31] A Novel Image Classification Algorithm Based on Extreme Learning Machine
    Yu Jing
    Song Wei
    Li Ming
    Hou Jianjun
    Wang Nan
    CHINA COMMUNICATIONS, 2015, 12 (02) : 48 - 54
  • [32] Research on Image Classification and Recognition Technology Based on Machine Learning
    Wang Y.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [33] Image-based Candlestick Pattern Classification with Machine Learning
    Xu, Chenghan
    PROCEEDINGS OF 2021 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES (ICMLT 2021), 2021, : 26 - 33
  • [34] Phishing Web Sites Features Classification Based on Extreme Learning Machine
    Sonmez, Yasin
    Tuncer, Turker
    Gokal, Huseyin
    Avci, Engin
    2018 6TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSIC AND SECURITY (ISDFS), 2018, : 155 - 159
  • [35] An Efficient Floor Plan Classification with Optimized Image Features using Machine Learning
    Karthik, K.
    Safvan, C. K.
    Samuel, Vinu Abraham
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [36] Hyperspectral Image Classification using Spatial Spectral Features and Machine Learning Approach
    Dhandhalya, Jignesh K.
    Parmar, S. K.
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 1161 - 1165
  • [37] Web Page Classification Using Image Analysis Features
    de Boer, Viktor
    van Someren, Maarten W.
    Lupascu, Tiberiu
    WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2011, 75 : 272 - +
  • [38] WEB image classification based on the fusion of image and text classifiers
    Kalva, Pedro R.
    Enembreck, Fabricio
    Koerich, Alessandro L.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 561 - 565
  • [39] LEARNING DEEP FEATURES FOR IMAGE EMOTION CLASSIFICATION
    Chen, Ming
    Zhang, Lu
    Allebach, Jan P.
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4491 - 4495
  • [40] Unsupervised Learning of Quaternion Features for Image Classification
    Risojevic, Vladimir
    Babic, Zdenka
    2013 11TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS IN MODERN SATELLITE, CABLE AND BROADCASTING SERVICES (TELSIKS), VOLS 1 AND 2, 2013, : 345 - 348