Support Vector Machine-Based Focused Crawler

被引:1
作者
Baweja, Vanshita R. [1 ]
Bhatia, Rajesh [1 ]
Kumar, Manish [1 ]
机构
[1] PEC Univ Technol, Comp Sci, Chandigarh, India
来源
INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019 | 2020年 / 89卷
关键词
Support vector machine; Focused crawler; Feature extraction; Web page classification; Uniform resource locator; DISTANCE;
D O I
10.1007/978-981-15-0146-3_63
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet is an immense source of information. People use search engines to find desired web pages. All these web pages are gathered from the search engine by using web crawler. In traditional crawler, the information retrieval was based on the occurrence of keywords in a document due to which many irrelevant web pages were also retrieved. For the effective classification of web pages, support vector machine (SVM)-based crawler model is proposed in this paper. Various features of URL and web page are used for effective classification. SVM is trained by using these features and further tested. The proposed model is analyzed using precision and recall metrics. The experimental results exhibit optimized results by using this proposed approach.
引用
收藏
页码:673 / 686
页数:14
相关论文
共 50 条
[41]   Support Vector Machine-based approach for Recognizing Bonsai Species using Leaf Image [J].
Aminuddin, Raihah ;
Maskan, Farizul Azlan ;
Jalil, Ummu Mardhiah Abdul ;
Fesol, Siti Feirusz Ahmad ;
Ibrahim, Shafaf .
2022 IEEE 18TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & APPLICATIONS (CSPA 2022), 2022, :266-271
[42]   A support vector machine-based voice activity detection employing effective feature vectors [J].
Jo, Q-Haing ;
Park, Yun-Sik ;
Lee, Kye-Hwan ;
Chang, Joon-Hyuk .
IEICE TRANSACTIONS ON COMMUNICATIONS, 2008, E91B (06) :2090-2093
[43]   Support Vector Machine-based Automatic Music Transcription for Transcribing Polyphonic Music into MusicXML [J].
Fathurahman, Krisna ;
Lestari, Dessi Puji .
5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, :535-539
[44]   Development and validation of a support vector machine-based nomogram for diagnosis of obstetric antiphospholipid syndrome [J].
Qi, Xuan ;
Han, Yan ;
Zhang, Yue ;
Ma, Nianqiang ;
Liu, Zhifeng ;
Zhai, Jiajia ;
Guo, Huifang .
CLINICA CHIMICA ACTA, 2025, 568
[45]   RESEARCH ON SUPPORT VECTOR MACHINE-BASED URBAN RECREATIONAL PUBLIC TRANSPORTATION LINE DISCRIMINATION [J].
Ji, Xiaoxi ;
Huang, Anmin .
JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2025, 26 (06) :1615-1628
[46]   Support vector machine-based similarity selection method for structural transient reliability analysis [J].
Chen, Jun-Yu ;
Feng, Yun-Wen ;
Teng, Da ;
Lu, Cheng ;
Fei, Cheng-Wei .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 223
[47]   Support vector machine-based active fault-tolerant control for wind turbine [J].
El Bakri, Ayoub ;
Boumhidi, Jaouad ;
Boumhidi, Ismail .
2019 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS 2019), 2019,
[48]   An improved focused crawler based on Semantic Similarity Vector Space Model [J].
Du, Yajun ;
Liu, Wenjun ;
Lv, Xianjing ;
Peng, Guoli .
APPLIED SOFT COMPUTING, 2015, 36 :392-407
[49]   A novel hybrid adaptive framework for support vector machine-based reliability analysis: A comparative study [J].
Yang, Shiyuan ;
He, Zhenyu ;
Chai, Jiangbo ;
Meng, Debiao ;
Macek, Wojciech ;
Branco, Ricardo ;
Zhu, Shun-Peng .
STRUCTURES, 2023, 58
[50]   Support vector machine-based optimized decision threshold adjustment strategy for classifying imbalanced data [J].
Yu, Hualong ;
Mu, Chaoxu ;
Sun, Changyin ;
Yang, Wankou ;
Yang, Xibei ;
Zuo, Xin .
KNOWLEDGE-BASED SYSTEMS, 2015, 76 :67-78