FindZebra: A search engine for rare diseases

被引:59
作者
Dragusin, Radu [1 ,2 ]
Petcu, Paula [1 ,3 ]
Lioma, Christina [1 ,2 ]
Larsen, Birger [4 ]
Jorgensen, Henrik L. [5 ]
Cox, Ingemar J. [1 ,6 ]
Hansen, Lars Kai [1 ]
Ingwersen, Peter [4 ]
Winther, Ole [1 ]
机构
[1] Tech Univ Denmark, DTU Compute, Lyngby, Denmark
[2] Univ Copenhagen, Dept Comp Sci, DK-1168 Copenhagen, Denmark
[3] Findwise, Copenhagen, Denmark
[4] Royal Sch Lib & Informat Sci, Informat Syst & Interact Design, Copenhagen, Denmark
[5] Bispebjerg Hosp, Dept Clin Biochem, DK-2400 Copenhagen, Denmark
[6] UCL, Dept Comp Sci, London, England
关键词
Rare diseases; Specialised search engine; Information retrieval; Evaluation of web search engines; HEALTH INFORMATION; MUTATION; DEFICIENCY; RETRIEVAL; INTERNET;
D O I
10.1016/j.ijmedinf.2013.01.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: The web has become a primary information resource about illnesses and treatments for both medical and non-medical users. Standard web search is by far the most common interface to this information. It is therefore of interest to find out how well web search engines work for diagnostic queries and what factors contribute to successes and failures. Among diseases, rare (or orphan) diseases represent an especially challenging and thus interesting class to diagnose as each is rare, diverse in symptoms and usually has scattered resources associated with it. Methods: We design an evaluation approach for web search engines for rare disease diagnosis which includes 56 real life diagnostic cases, performance measures, information resources and guidelines for customising Google Search to this task. In addition, we introduce FindZebra, a specialized (vertical) rare disease search engine. FindZebra is powered by open source search technology and uses curated freely available online medical information. Results: FindZebra outperforms Google Search in both default set-up and customised to the resources used by FindZebra. We extend FindZebra with specialized functionalities exploiting medical ontological information and UMLS medical concepts to demonstrate different ways of displaying the retrieved results to medical experts. Conclusions: Our results indicate that a specialized search engine can improve the diagnostic quality without compromising the ease of use of the currently widely popular standard web search. The proposed evaluation approach can be valuable for future development and benchmarking. The FindZebra search engine is available at http://www.findzebra.com/. (c) 2013 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:528 / 538
页数:11
相关论文
共 50 条
  • [1] Hereditary renal adysplasia, pulmonary hypoplasia and Mayer-Rokitansky-Kuster-Hauser (MRKH) syndrome: a case report
    Acien, Pedro
    Galan, Francisco
    Manchon, Irene
    Ruiz, Eva
    Acien, Maribel
    Alcaraz, Luis A.
    [J]. ORPHANET JOURNAL OF RARE DISEASES, 2010, 5
  • [2] The role of standardized data and terminological systems in computerized clinical decision support systems: Literature review and survey
    Ahmadian, Leila
    van Engen-Verheul, Mariette
    Bakhshi-Raiez, Ferishta
    Peek, Niels
    Cornet, Ronald
    de Keizer, Nicolette F.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2011, 80 (02) : 81 - 93
  • [3] Craniocervical junction malformation in a child with Oromandibular-limb hypogenesis-Mobius syndrome
    Al Kaissi, Ali
    Grill, Franz
    Safi, Hatem
    Ben Ghachem, Maher
    Ben Chehida, Farid
    Klaushofer, Klaus
    [J]. ORPHANET JOURNAL OF RARE DISEASES, 2007, 2 (1)
  • [4] A novel mutation and first report of dilated cardiomyopathy in ALG6-CDG (CDG-Ic): a case report
    Al-Owain, Mohammed
    Mohamed, Sarar
    Kaya, Namik
    Zagal, Ahmad
    Matthijs, Gert
    Jaeken, Jaak
    [J]. ORPHANET JOURNAL OF RARE DISEASES, 2010, 5
  • [5] [Anonymous], 2008, Introduction to information retrieval
  • [6] [Anonymous], 2003, Language Modeling for Information Retrieval
  • [7] DXPLAIN - AN EVOLVING DIAGNOSTIC DECISION-SUPPORT SYSTEM
    BARNETT, GO
    CIMINO, JJ
    HUPP, JA
    HOFFER, EP
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1987, 258 (01): : 67 - 74
  • [8] Bhavnani S.K., 2002, EXTENDED ABSTRACTS A, P610
  • [9] Case report of intrafamilial variability in autosomal recessive centronuclear myopathy associated to a novel BIN1 stop mutation
    Bohm, Johann
    Yis, Uluc
    Ortac, Ragip
    Cakmakci, Handan
    Kurul, Semra Hiz
    Dirik, Eray
    Laporte, Jocelyn
    [J]. ORPHANET JOURNAL OF RARE DISEASES, 2010, 5
  • [10] 'Doctor Google' ending the diagnostic odyssey in lysosomal storage disorders: parents using internet search engines as an efficient diagnostic strategy in rare diseases
    Bouwman, Machtelt G.
    Teunissen, Quirine G. A.
    Wijburg, Frits A.
    Linthorst, Gabor E.
    [J]. ARCHIVES OF DISEASE IN CHILDHOOD, 2010, 95 (08) : 642 - U96