Learning to Tag Text from Rules and Examples

被引:0
|
作者
Diligenti, Michelangelo [1 ]
Gori, Marco [1 ]
Maggini, Marco [1 ]
机构
[1] Univ Siena, Dipartimento Ingn Informaz, I-53100 Siena, Italy
来源
AI(STAR)IA 2011: ARTIFICIAL INTELLIGENCE AROUND MAN AND BEYOND | 2011年 / 6934卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tagging has become a popular way to improve the access to resources, especially in social networks and folksonomies. Most of the resource sharing tools allow a manual labeling of the available items by the community members. However, the manual approach can fail to provide a consistent tagging especially when the dimension of the vocabulary of the tags increases and, consequently, the users do not comply to a shared semantic knowledge. Hence, automatic tagging can provide an effective way to complete the manual added tags, especially for dynamic or very large collections of documents like the Web. However, when an automatic text tagger is trained over the tags inserted by the users, it may inherit the inconsistencies of the training data. In this paper, we propose a novel approach where a set of text categorizers, each associated to a tag in the vocabulary, are trained both from examples and a higher level abstract representation consisting of FOL clauses that describe semantic rules constraining the use of the corresponding tags. The FOL clauses are compiled into a set of equivalent continuous constraints, and the integration between logic and learning is implemented in a multi-task learning scheme. In particular, we exploit the kernel machine mathematical apparatus casting the problem as primal optimization of a function composed of the loss on the supervised examples, the regularization term, and a penalty term deriving from forcing the constraints resulting from the conversion of the logic knowledge. The experimental results show that the proposed approach provides a significant accuracy improvement on the tagging of bibtex entries.
引用
收藏
页码:45 / 56
页数:12
相关论文
共 50 条
  • [41] Automatic acquisition of transfer rules from translation examples
    Winiwarter, W
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2004, 3230 : 13 - 24
  • [42] Learning from order examples
    Kamishima, T
    Akaho, S
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 645 - 648
  • [43] CONCEPT-LEARNING FROM EXAMPLES AND COUNTER EXAMPLES
    RALESCU, AL
    BALDWIN, JF
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1989, 30 (03): : 329 - 354
  • [44] An efficient SAT formulation for learning multiple criteria non-compensatory sorting rules from examples
    Belahcene, K.
    Labreuche, C.
    Maudet, N.
    Mousseau, V.
    Ouerdane, W.
    COMPUTERS & OPERATIONS RESEARCH, 2018, 97 : 58 - 71
  • [45] Recommending Model Refactoring Rules from Refactoring Examples
    Mokaddem, Chihab Eddine
    Sahraoui, Houari
    Syriani, Eugene
    21ST ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS 2018), 2018, : 257 - 267
  • [46] Hybrid approach for text categorization based on machine learning and rules
    Villena-Roman, Julio
    Collada-Perez, Sonia
    Lana-Serrano, Sara
    Carlos Gonzalez-Cristobal, Jose
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (46): : 35 - 42
  • [47] Learning from examples: Instructional principles from the worked examples research
    Atkinson, RK
    Derry, SJ
    Renkl, A
    Wortham, D
    REVIEW OF EDUCATIONAL RESEARCH, 2000, 70 (02) : 181 - 214
  • [48] Learning from text
    Kintsch, W
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 5323 - 5323
  • [49] EXAMPLES ILLUSTRATING ANGLO-AMERICAN CATALOGUING RULES, BRITISH TEXT, 1967 - HUNTER,EJ
    CHAN, LM
    LIBRARY RESOURCES & TECHNICAL SERVICES, 1974, 18 (03): : 311 - 312
  • [50] Towards Adversarially Robust Text Classifiers by Learning to Reweight Clean Examples
    Xu, Jianhan
    Zhang, Cenyuan
    Zheng, Xiaoqing
    Li, Linyang
    Hsieh, Cho-Jui
    Chang, Kai-Wei
    Huang, Xuanjing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1694 - 1707