PARAMETER-ESTIMATION FOR PROBABILISTIC DOCUMENT-RETRIEVAL MODELS

Cited by: 0
Author
LOSEE, RM
Affiliation
[1] Univ of North Carolina, Chapel Hill, NC, USA
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE | 1988 / Vol. 39 / No. 01
Keywords
MATHEMATICAL MODELS - PROBABILITY;
DOI
10.1002/(SICI)1097-4571(198801)39:1<8::AID-ASI3>3.0.CO;2-W
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Probability distributions that may be used to describe the distribution of features include binary and Poisson distributions. Techniques for estimating the parameters of distributions are suggested. A proposal has been tested that parameters of distributions describing the distribution of features in nonrelevant documents be estimated from the parameters of the corresponding distributions of the entire database; the confidence parameter of such an estimate resulting in the highest average precision is given. Tests of several methods for estimating the parameters of distributions describing the distribution of features in relevant documents suggest that small values of the confidence parameter be used in initial estimates of parameters for relevant documents.
Pages: 8-16
Page count: 9