Improving a gold standard: treating human relevance judgments of MEDLINE document pairs

Kim Won; Wilbur W

doi:10.1186/1471-2105-12-S3-S5

BMC Bioinformatics (Jun 2011)

Improving a gold standard: treating human relevance judgments of MEDLINE document pairs

Kim Won,
Wilbur W

Affiliations

Kim Won
Wilbur W

DOI: https://doi.org/10.1186/1471-2105-12-S3-S5
Journal volume & issue: Vol. 12, no. Suppl 3
p. S5

Abstract

Read online

Abstract Given prior human judgments of the condition of an object it is possible to use these judgments to make a maximal likelihood estimate of what future human judgments of the condition of that object will be. However, if one has a reasonably large collection of similar objects and the prior human judgments of a number of judges regarding the condition of each object in the collection, then it is possible to make predictions of future human judgments for the whole collection that are superior to the simple maximal likelihood estimate for each object in isolation. This is possible because the multiple judgments over the collection allow an analysis to determine the relative value of a judge as compared with the other judges in the group and this value can be used to augment or diminish a particular judge’s influence in predicting future judgments. Here we study and compare five different methods for making such improved predictions and show that each is superior to simple maximal likelihood estimates.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal