Text modeling for real-time document categorization

被引:0
|
作者
Byrnes, John [1 ]
Rohwer, Richard [1 ]
机构
[1] Fair Isaac Corp, NC Software LLC, San Diego, CA 92130 USA
来源
2005 IEEE Aerospace Conference, Vols 1-4 | 2005年
关键词
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
We report on experiments in adapting document categorization techniques to provide for implementation in high-speed hardware.(1,2) Because resources are scarce, it is important to have a small set of robust and maximally informative variables over which learning can occur. We generate variables using information-theoretic clustering. The resulting performance is on par with general-purpose computing implementations which are able to take advantage of large amounts of time and memory. We conclude that custom high-speed hardware for document categorization can be made very accurate. We also believe that some of the strengths of information-theoretic data analysis techniques are brought out.
引用
收藏
页码:3081 / 3091
页数:11
相关论文
共 50 条
  • [1] A Text Categorization System with Soft Real-Time Guarantee
    WANG Hua-yong
    Wuhan University Journal of Natural Sciences, 2006, (01) : 226 - 229
  • [2] A Real-Time Categorization and Clustering Method for Text Data of Laws and Regulations
    Su, Bianping
    Wang, Rong
    Wang, Yiping
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [3] Automated Text Document Categorization
    Yasotha, R.
    Charles, E. Y. A.
    2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS), 2015, : 522 - 528
  • [4] Document indexing in text categorization
    Zhang, QR
    Zhang, L
    Dong, SB
    Tan, JH
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3792 - 3796
  • [5] Improving Performance of Massive Text Real-Time Classification for Document Confidentiality Management
    Tan, Lingling
    Yi, Junkai
    Yang, Fei
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [6] Text document categorization by term association
    Antonie, ML
    Zaïane, OR
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 19 - 26
  • [7] REAL-TIME VOICE TO TEXT WITH STENOGRAPHERS
    OAKEY, JE
    JOURNAL OF MICROCOMPUTER APPLICATIONS, 1993, 16 (03): : 271 - 276
  • [8] A Real-Time Text Analysis System
    Chi Mai Nguyen
    Phat Trien Thai
    Duy Khang Lam
    Van Tuan Nguyen
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 340 - 345
  • [9] Real-time user interest modeling for real-time ranking
    Liu, Xiaozhong
    Turtle, Howard
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2013, 64 (08): : 1557 - 1576
  • [10] CATEGORIZATION OF REAL-TIME SENSATION PATTERNS DURING URODYNAMICS
    Cullingsworth, Zachary
    Klausner, Adam
    Nagle, Anna
    Simmons, William
    Morin, Jacqueline
    Vince, Randy
    Rapp, David
    Speich, John
    JOURNAL OF UROLOGY, 2017, 197 (04): : E837 - E837