Online Recruitment Fraud (ORF) Detection Using Deep Learning Approaches

被引:0
|
作者
Akram, Natasha [1 ]
Irfan, Rabia [1 ]
Al-Shamayleh, Ahmad Sami [2 ]
Kousar, Adila [1 ]
Qaddos, Abdul [3 ]
Imran, Muhammad [3 ]
Akhunzada, Adnan [4 ]
机构
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci SEECS, Islamabad 44000, Pakistan
[2] Al Ahliyya Amman Univ, Fac Informat Technol, Dept Data Sci & Artificial Intelligence, Amman 19328, Jordan
[3] COMSATS Univ Islamabad CUI, Dept Comp Sci, Islamabad 45550, Pakistan
[4] Univ Doha Sci & Technol, Coll Comp & Informat Technol, Dept Data & Cybersecur, Doha 24449, Qatar
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Accuracy; Classification tree analysis; Recruitment; Transformers; Deep learning; Nearest neighbor methods; Employment; Machine learning; Data augmentation; Personnel; Online services; Class imbalance; data augmentation; deep learning; employment scam; fraud detection; machine learning; online recruitment; SMOTE; transformer-based models; CLASSIFICATION; SMOTE;
D O I
10.1109/ACCESS.2024.3435670
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most companies nowadays are using digital platforms for the recruitment of new employees to make the hiring process easier. The rapid increase in the use of online platforms for job posting has resulted in fraudulent advertising. The scammers are making money through fraudulent job postings. Online recruitment fraud has emerged as an important issue in cybercrime. Therefore, it is necessary to detect fake job postings to get rid of online job scams. In recent studies, traditional machine learning and deep learning algorithms have been implemented to detect fake job postings; this research aims to use two transformer-based deep learning models, i.e., Bidirectional Encoder Representations from Transformers (BERT) and Robustly Optimized BERT-Pretraining Approach (RoBERTa) to detect fake job postings precisely. In this research, a novel dataset of fake job postings is proposed, formed by the combination of job postings from three different sources. Existing benchmark datasets are outdated and limited due to knowledge of specific job postings, which limits the existing models' capability in detecting fraudulent jobs. Hence, we extend it with the latest job postings. Exploratory Data Analysis (EDA) highlights the class imbalance problem in detecting fake jobs, which tends the model to act aggressively toward the minority class. Responding to overcome this problem, the work at hand implements ten top-performing Synthetic Minority Oversampling Technique (SMOTE) variants. The models' performances balanced by each SMOTE variant are analyzed and compared. All implemented approaches are performed competitively. However, BERT+SMOBD SMOTE achieved the highest balanced accuracy and recall of about 90%.
引用
收藏
页码:109388 / 109408
页数:21
相关论文
共 50 条
  • [41] A systematic review of literature on credit card cyber fraud detection using machine and deep learning
    Btoush, Eyad Abdel Latif Marazqah
    Zhou, Xujuan
    Gururajan, Raj
    Chan, Ka Ching
    Genrich, Rohan
    Sankaran, Prema
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [42] A systematic review of literature on credit card cyber fraud detection using machine and deep learning
    Btoush E.A.L.M.
    Zhou X.
    Gururajan R.
    Chan K.C.
    Genrich R.
    Sankaran P.
    PeerJ Computer Science, 2023, 9
  • [43] Skin Cancer Detection Using Deep Learning Approaches
    Haque, Shafiul
    Ahmad, Faraz
    Singh, Vineeta
    Mathkor, Darin Mansor
    Babegi, Ashjan
    CANCER BIOTHERAPY AND RADIOPHARMACEUTICALS, 2025,
  • [44] Click fraud detection for online advertising using machine learning
    Aljabri, Malak
    Mohammad, Rami Mustafa A.
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (02) : 341 - 350
  • [45] An Ensemble Architecture Based on Deep Learning Model for Click Fraud Detection in Pay-Per-Click Advertisement Campaign
    Batool, Amreen
    Byun, Yung-Cheol
    IEEE ACCESS, 2022, 10 : 113410 - 113426
  • [46] Handling Class Imbalance in Online Transaction Fraud Detection
    Kanika
    Singla, Jimmy
    Bashir, Ali Kashif
    Nam, Yunyoung
    Hasan, Najam U., I
    Tariq, Usman
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 2861 - 2877
  • [47] Machine and deep learning approaches for alzheimer disease detection using magnetic resonance images: An updated review
    Menagadevi, M.
    Devaraj, Somasundaram
    Madian, Nirmala
    Thiyagarajan, D.
    MEASUREMENT, 2024, 226
  • [48] Leveraging graph-based learning for credit card fraud detection: a comparative study of classical, deep learning and graph-based approaches
    Harish, Sunisha
    Lakhanpal, Chirag
    Jafari, Amir Hossein
    Neural Computing and Applications, 2024, 36 (34) : 21873 - 21883
  • [49] Deep learning approaches for detection and removal of ghosting artifacts in MR spectroscopy
    Kyathanahally, Sreenath P.
    Doering, Andre
    Kreis, Roland
    MAGNETIC RESONANCE IN MEDICINE, 2018, 80 (03) : 851 - 863
  • [50] Predicting fraud in MD&A sections using deep learning
    Sivasubramanian, Sachin Velloor
    Skillicorn, David
    JOURNAL OF BUSINESS ANALYTICS, 2024, 7 (03) : 197 - 206