An Automatic System to Detect and Extract Text in Medical Images for De-identification

被引:2
|
作者
Zhu, Yingxuan [1 ]
Singh, P. D. [2 ]
Siddiqui, Khan [2 ]
Gillam, Michael [2 ]
机构
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244 USA
[2] Microsoft Corp, Hlth Solut Grp, Chevy Chase, MD USA
来源
MEDICAL IMAGING 2010: ADVANCED PACS-BASED IMAGING INFORMATICS AND THERAPEUTIC APPLICATIONS | 2010年 / 7628卷
关键词
VIDEO;
D O I
10.1117/12.855588
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recently, there is an increasing need to share medical images for research purpose. In order to respect and preserve patient privacy, most of the medical images are de-identified with protected health information ( PHI) before research sharing. Since manual de-identification is time-consuming and tedious, so an automatic de-identification system is necessary and helpful for the doctors to remove text from medical images. A lot of papers have been written about algorithms of text detection and extraction, however, little has been applied to de-identification of medical images. Since the de-identification system is designed for end-users, it should be effective, accurate and fast. This paper proposes an automatic system to detect and extract text from medical images for de-identification purposes, while keeping the anatomic structures intact. First, considering the text have a remarkable contrast with the background, a region variance based algorithm is used to detect the text regions. In post processing, geometric constraints are applied to the detected text regions to eliminate over-segmentation, e. g., lines and anatomic structures. After that, a region based level set method is used to extract text from the detected text regions. A GUI for the prototype application of the text detection and extraction system is implemented, which shows that our method can detect most of the text in the images. Experimental results validate that our method can detect and extract text in medical images with a 99% recall rate. Future research of this system includes algorithm improvement, performance evaluation, and computation optimization.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text
    Menger, Vincent
    Scheepers, Floor
    van Wijk, Lisette Maria
    Spruit, Marco
    TELEMATICS AND INFORMATICS, 2018, 35 (04) : 727 - 736
  • [2] Is Multiclass Automatic Text De-Identification Worth the Effort?
    Duy Duc An Bui
    Redden, David T.
    Cimino, James J.
    METHODS OF INFORMATION IN MEDICINE, 2018, 57 (04) : 177 - 184
  • [3] A rule-based system for automatic de-identification of medical narrative texts
    Jaćimović, Jelena
    Krstev, Cvetana
    Jelovac, Drago
    Informatica (Slovenia), 2015, 39 (01): : 45 - 53
  • [4] A Rule-Based System for Automatic De-identification of Medical Narrative Texts
    Jacimovic, Jelena
    Krstev, Cvetana
    Jelovac, Drago
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01): : 43 - 51
  • [5] Leveraging text skeleton for de-identification of electronic medical records
    Zhao, Yue-Shu
    Zhang, Kun-Li
    Ma, Hong-Chao
    Li, Kun
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [6] Leveraging text skeleton for de-identification of electronic medical records
    Yue-Shu Zhao
    Kun-Li Zhang
    Hong-Chao Ma
    Kun Li
    BMC Medical Informatics and Decision Making, 18
  • [7] Automated de-identification of free-text medical records
    Neamatullah, Ishna
    Douglass, Margaret M.
    Lehman, Li-wei H.
    Reisner, Andrew
    Villarroel, Mauricio
    Long, William J.
    Szolovits, Peter
    Moody, George B.
    Mark, Roger G.
    Clifford, Gari D.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2008, 8 (1)
  • [8] Automated de-identification of free-text medical records
    Ishna Neamatullah
    Margaret M Douglass
    Li-wei H Lehman
    Andrew Reisner
    Mauricio Villarroel
    William J Long
    Peter Szolovits
    George B Moody
    Roger G Mark
    Gari D Clifford
    BMC Medical Informatics and Decision Making, 8
  • [9] Automatic Clinical Text De-Identification: Is It Worth It, and Could It Work for Me?
    Meystre, Stephane M.
    Dalianis, Hercules
    Aberdeen, John
    Malin, Brad
    MEDINFO 2013: PROCEEDINGS OF THE 14TH WORLD CONGRESS ON MEDICAL AND HEALTH INFORMATICS, PTS 1 AND 2, 2013, 192 : 1242 - 1242
  • [10] De-identification of Medical Images with Retention of Scientific Research Value
    Moore, Stephen M.
    Maffitt, David R.
    Smith, Kirk E.
    Kirby, Justin S.
    Clark, Kenneth W.
    Freymann, John B.
    Vendt, Bruce A.
    Tarbox, Lawrence R.
    Prior, Fred W.
    RADIOGRAPHICS, 2015, 35 (03) : 727 - 735