Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

被引：0

作者：

Soni, Vishnu Kant ^{[1
]}

Shukla, Vivek ^{[1
]}

Tandan, S. R. ^{[2
]}

Pimpalkar, Amit ^{[3
]}

Nema, Neetesh Kumar ^{[1
]}

Naik, Muskan ^{[4
]}

机构：

[1] Dr CV Raman Univ, Dept Comp Sci & Engn, Bilaspur, CG, India

[2] Govt RVRS Kanya Mahavidyalaya, Dept Comp Sci, Kawardha, CG, India

[3] Ramdeobaba Univ, Engn AIML Shri Ramdeobaba Coll Engn & Management, Dept Comp Sci, Nagpur, India

[4] Lakhmi Chand Inst Technol, Dept Comp Sci & Engn, Bilaspur, CG, India

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2025年 / 16卷 / 01期

关键词：

Scene text recognition; optical character recognition; deep learning; feature extraction; scene text detection; ATTENTION;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

texts refer to arbitrary text found in images captured by cameras in real-world settings. The tasks of text detection and recognition are critical components of computer vision, with applications spanning scene understanding, information retrieval, robotics, and autonomous driving. Despite significant advancements in deep learning methods, achieving accurate text detection and recognition in complex images remains a formidable challenge for robust real-world applications. Several factors contribute to these challenges. First, the diversity of text shapes, fonts, colors, and styles complicates detection efforts. Second, the myriad combinations of characters, often with unstable attributes, make complete detection difficult, especially when background interruptions obscure character strokes and shapes. Finally, effective coordination of multiple sub-tasks in end-to-end learning is essential for success. This research aimed to tackle these challenges by enhancing text discriminative representation. This study focused on two interconnected problems: Scene Text Recognition (STR), which involves recognizing text from scene images, and Scene Text Detection (STD), which entails simultaneously detecting and recognizing multiple texts within those images. This research focuses on implementing and evaluating the Efficient and Accurate Scene Text Detector (EAST) algorithm for text detection and recognition in natural scene images. The study aims to compare the performance of three prominent Optical Character Recognition (OCR) techniques-TesseractOCR, PaddleOCR, and EasyOCR. The EAST model was applied to a series of sample test images, and the results were visually represented with bounding boxes highlighting the detected text regions. The inference times for each image were recorded, highlighting the algorithm's efficiency, with average times of 0.446, 0.439, and 0.440 seconds for the respective test images. These results indicate that the EAST algorithm is accurate and operates in real-time, making it suitable for applications requiring immediate text recognition.

引用

页码：445 / 453

页数：9