Frankenplace: Interactive Thematic Mapping for Ad Hoc Exploratory Search

Cited by: 35
Authors
Adams, Benjamin [1]
McKenzie, Grant [2]
Gahegan, Mark [1]
Affiliations
[1] Univ Auckland, Dept Comp Sci, Ctr eRes, Auckland, New Zealand
[2] Univ Calif Santa Barbara, Dept Geog, Santa Barbara, CA 93106 USA
Source
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015) | 2015
Keywords
Geographic search; interactive search; information retrieval; information visualization; visual analytics; exploratory search;
DOI
10.1145/2736277.2741137
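Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract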
Ad hoc keyword search engines built using modern information retrieval methods do a good job of handling fine-grained queries. However, they perform poorly at facilitating spatial and spatially embedded thematic exploration of the results, even though many queries, e.g., "civil war", refer to different documents and topics in different places. This is not for lack of data: geographic information, such as place names, events, and coordinates, is common in unstructured document collections on the web. The associations between geographic and thematic content in these documents can provide a rich groundwork for organizing information for exploratory research. In this paper we describe the architecture of an interactive thematic map search engine, Frankenplace, designed to facilitate document exploration at the intersection of theme and place. The map interface enables a user to zoom the geographic context of their query in and out and to quickly explore thousands of search results in a meaningful way. Combining topic models with geographically contextualized search results lets users discover related topics based on geographic context. Frankenplace utilizes a novel indexing method called geoboost for boosting terms associated with cells on a discrete global grid. The resulting index factors in the geographic scale of the place or feature mentioned in related text, the relative textual scope of the place reference, and the overall importance of the containing document in the document network. The system is currently indexed with over 5 million documents from the web, including the English Wikipedia and online travel blog entries. We demonstrate that Frankenplace can support four distinct types of exploratory search tasks while being adaptive to the scale and location of interest.
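The abstract names three factors behind geoboost (geographic scale, textual scope, document importance) but this record does not give the formula. The following is only a minimal illustrative sketch of the general idea of boosting terms per grid cell. Everything in it is an assumption: the plain latitude/longitude grid stands in for a discrete global grid, the multiplicative `boost` formula is invented for illustration, and the names `cell_id`, `boost`, `index_mention`, and `search` are hypothetical and not taken from the paper.

```python
import math
from collections import defaultdict


def cell_id(lat, lon, cell_deg=1.0):
    """Map a coordinate to a cell of a plain lat/lon grid.

    Assumption: a simple stand-in for a discrete global grid.
    """
    row = int(math.floor((lat + 90.0) / cell_deg))
    col = int(math.floor((lon + 180.0) / cell_deg))
    return (row, col)


def boost(feature_cells, text_scope, doc_importance):
    """Hypothetical boost: NOT the formula from the paper.

    feature_cells  -- number of grid cells the feature covers (scale)
    text_scope     -- fraction of the document the place reference governs
    doc_importance -- importance score of the containing document
    """
    scale = 1.0 / feature_cells  # smaller features give a more focused boost
    return scale * text_scope * doc_importance


def index_mention(index, terms, lat, lon, feature_cells, text_scope, doc_importance):
    """Add the boosted weight of each term to the cell containing (lat, lon)."""
    weight = boost(feature_cells, text_scope, doc_importance)
    cell = cell_id(lat, lon)
    for term in terms:
        index[cell][term] += weight


def search(index, term):
    """Return (cell, weight) pairs for a term, highest weight first."""
    hits = [(cell, terms[term]) for cell, terms in index.items() if term in terms]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


index = defaultdict(lambda: defaultdict(float))
# Two made-up documents that mention "civil war" at different places.
index_mention(index, ["civil", "war"], 39.8, -77.2,
              feature_cells=1, text_scope=0.5, doc_importance=2.0)
index_mention(index, ["civil", "war"], 12.0, 15.0,
              feature_cells=4, text_scope=1.0, doc_importance=1.0)
results = search(index, "war")
```

In this sketch the same query term ends up with different weights in different cells, which is the property a thematic map of search results would need; how the actual system weights and aggregates these factors is described in the paper itself.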
Pages: 12-22
Page count: 11