Text mining in a digital library

被引:2
作者
Witten I.H. [1 ]
Don K.J. [1 ]
Dewsnip M. [1 ]
Tablan V. [2 ]
机构
[1] Computer Science, University of Waikato, Hamilton
[2] Computer Science, University of Sheffield, Sheffield
关键词
Digital libraries; GATE; Greenstone; Information extraction; Text mining;
D O I
10.1007/s00799-003-0066-4
中图分类号
学科分类号
摘要
[No abstract available]
引用
收藏
页码:56 / 59
页数:3
相关论文
共 8 条
[1]  
Baker P., Hardie A., McEnery A., Cunningham H., Gaizauskas R., EMILLE, a 67-million word corpus of Indic languages: Data collection, mark-up and harmonisation, Proceedings of the Conference On Language Resources and Evaluation, pp. 819-825, (2002)
[2]  
Cunningham H., GATE, a general architecture for text engineering, Comput Humanit, 36, pp. 223-254, (2002)
[3]  
Declerck T., Wittenberg P., Cunningham H., The automatic generation of formal annotations in a multimedia indexing and searching environment, Proceedings of the ACL/EACL Workshop On Human Language Technology and Knowledge Management, pp. 129-136, (2001)
[4]  
Hearst M.A., Untangling text mining, Proceedings of the Annual Meeting of the Association For Computational Linguistics, (1999)
[5]  
Paynter G.W., Witten I.H., A combined phrase and thesaurus browser for large document collections, Proceedings of the European Conference On Digital Libraries, pp. 25-36, (2001)
[6]  
Tablan V., Ursu C., Bontcheva K., Cunningham H., Maynard D., Hamza O., McEnery A., Baker P., Leisher M., A Unicode based environment for creation and use of language resources, Proceedings of the Conference On Language Resources and Evaluation, pp. 66-71, (2002)
[7]  
Witten I.H., Bainbridge D., How to Build a Digital Library, (2003)
[8]  
Witten I.H., Text mining, Practical Handbook of Internet Computing