Concept naming vs concept categorisation: a faceted approach to semantic annotation

被引:5
作者
Prasad, A. R. D. [1 ]
Guha, Nabonita [1 ]
机构
[1] Documentat Res & Training Ctr, Indian Stat Inst, Bangalore, Karnataka, India
关键词
Semantics; Resource description languages; Markup languages;
D O I
10.1108/14684520810897377
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - The purpose of this paper is to show that concept naming alone in document annotation is not sufficient to convey the thought content of the information resource. The paper presents an outline of semantic document annotation which combines two major processes: facet analysis and concept categorisation. This is also an effort to show how RDF schema can be designed and implemented so that the properties of the schema are able to express the basic structure of the subject matter of the resource. Design/methodology/approach - This paper presents a methodology for representing the subject matter of a document in terms of RDF. For the purposes of faceted subject annotation, it has developed an extended RDF schema for simple knowledge organisation system (SKOS). The facets and relationships of the faceted subject indexing language postulate-based permuted subject indexing system (POPSI) have been transformed into RDFS classes. The elementary categories of POPSI form the property classes in the POPSI/RDF Schema. These property classes have been used to formulate the subject description of a document. Findings - The subject annotation of a document using this schema expresses all the components of the thought content of an information resource. Practical implications - The examples given in this paper show the applicability of this schema in describing resources in web directories and annotating scholarly documents in digital libraries. In a broader perspective, this provides a methodology for formulating the subject metadata of web resources. This schema helps in formulating the subject string(s) for a resource outlining the skeleton structure of its thought content. Originality/value - SKOS has been developed as an RDF schema representation of the traditional knowledge organisation systems. But the schema has limited room to accommodate subject indexing languages. The present schema extends the SKOS schema to accommodate the representation of faceted subject indexing languages. The faceted subject annotation system has been adopted for the very reason that it has precedence over the enumerated classification systems, controlled vocabulary lists, etc. The potential to describe the specific subject of the document with more accuracy and representation of context gives the faceted subject indexing languages strength to make the subject description explicit and machine processible.
引用
收藏
页码:500 / 510
页数:11
相关论文
共 21 条
[1]  
BHATTACHARYYA G, 1979, LIBR SCI SLANT DOC, V16, P1
[2]  
BHATTACHARYYA G, 1981, DRTC REFR SEM 13 NEW
[3]  
Broughton V, 2004, SIGNUM, V8, P5
[4]   The need for a faceted methods of information retrieval [J].
Broughton, Vanda .
ASLIB PROCEEDINGS, 2006, 58 (1-2) :49-72
[5]  
CIMIANO P, 2003, P ACL 2003 WORKSH LI, V19, P14
[6]  
Cimiano P., 2005, P 14 INT C WORLD WID, P332
[7]  
CIMIANO P, 2004, P 13 INT C WORLD WID, P462, DOI DOI 10.1145/988672.988735
[8]   The Ontology Lookup Service, a lightweight cross-platform tool for controlled vocabulary queries [J].
Côté, RG ;
Jones, P ;
Apweiler, R ;
Hermjakob, H .
BMC BIOINFORMATICS, 2006, 7 (1)
[9]  
DEVADASON FJ, 1985, INT CLASSIF, V12, P87
[10]  
Ellis D., 2000, Journal of Internet Cataloging, V2, P97, DOI 10.1300/J141v02n03_07