Support Vector Machine-Based Focused Crawler

被引:1
作者
Baweja, Vanshita R. [1 ]
Bhatia, Rajesh [1 ]
Kumar, Manish [1 ]
机构
[1] PEC Univ Technol, Comp Sci, Chandigarh, India
来源
INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019 | 2020年 / 89卷
关键词
Support vector machine; Focused crawler; Feature extraction; Web page classification; Uniform resource locator; DISTANCE;
D O I
10.1007/978-981-15-0146-3_63
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet is an immense source of information. People use search engines to find desired web pages. All these web pages are gathered from the search engine by using web crawler. In traditional crawler, the information retrieval was based on the occurrence of keywords in a document due to which many irrelevant web pages were also retrieved. For the effective classification of web pages, support vector machine (SVM)-based crawler model is proposed in this paper. Various features of URL and web page are used for effective classification. SVM is trained by using these features and further tested. The proposed model is analyzed using precision and recall metrics. The experimental results exhibit optimized results by using this proposed approach.
引用
收藏
页码:673 / 686
页数:14
相关论文
共 50 条
[31]   Support vector machine-based stuttering dysfluency classification using GMM supervectors [J].
Mahesha, P. ;
Vinod, D. S. .
INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2015, 6 (3-4) :143-149
[32]   Support vector machine-based QSPR for the prediction of glass transition temperatures of polymers [J].
Xinliang Yu .
Fibers and Polymers, 2010, 11 :757-766
[33]   Support Vector Machine-based Image Segmentation Approach for Automatic Agriculture Vehicle [J].
Han, Yonghua ;
Wang, Yaming ;
Zhao, Yun .
PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2012, :251-255
[34]   A focused crawler based on semantic disambiguation vector space model [J].
Liu, Wenjun ;
He, Yu ;
Wu, Jing ;
Du, Yajun ;
Liu, Xing ;
Xi, Tiejun ;
Gan, Zurui ;
Jiang, Pengjun ;
Huang, Xiaoping .
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) :345-366
[35]   A focused crawler based on semantic disambiguation vector space model [J].
Wenjun Liu ;
Yu He ;
Jing Wu ;
Yajun Du ;
Xing Liu ;
Tiejun Xi ;
Zurui Gan ;
Pengjun Jiang ;
Xiaoping Huang .
Complex & Intelligent Systems, 2023, 9 :345-366
[36]   Support Vector Machine-Based Phase Prediction of Multi-Principal Element Alloys [J].
Nguyen Hai Chau ;
Kubo, Masatoshi ;
Le Viet Hai ;
Yamamoto, Tomoyuki .
VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (01) :101-116
[37]   Performance improvement method of support vector machine-based model monitoring dam safety [J].
Su, Huaizhi ;
Chen, Zhexin ;
Wen, Zhiping .
STRUCTURAL CONTROL & HEALTH MONITORING, 2016, 23 (02) :252-266
[38]   Improvement of the Support Vector Machine-based Monte Carlo Simulation for Calculating Failure Probability [J].
Lee, Seunggyu ;
Kim, Jae Hoon .
TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2020, 44 (04) :269-279
[39]   mACPpred: A Support Vector Machine-Based Meta-Predictor for Identification of Anticancer Peptides [J].
Boopathi, Vinothini ;
Subramaniyam, Sathiyamoorthy ;
Malik, Adeel ;
Lee, Gwang ;
Manavalan, Balachandran ;
Yang, Deok-Chun .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (08)
[40]   Support Vector Machine-based Automatic Music Transcription for Transcribing Polyphonic Music into MusicXML [J].
Fathurahman, Krisna ;
Lestari, Dessi Puji .
5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, :535-539