Distributed frameworks and parallel algorithms for processing large-scale geographic data

被引:78
|
作者
Hawick, KA [1 ]
Coddington, PD
James, HA
机构
[1] Univ Wales, Sch Informat, Div Comp Sci, Bangor LL57 1UT, Gwynedd, Wales
[2] Univ Adelaide, Dept Comp Sci, Adelaide, SA 5005, Australia
关键词
parallel computing; distributed computing; grid computing; metacomputing; geographic information systems;
D O I
10.1016/j.parco.2003.04.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The number of applications that require parallel and high-performance computing techniques has diminished in recent years due to to the continuing increase in power of PC, workstation and mono-processor systems. However, Geographic information systems (GIS) still provide a resource-hungry application domain that can make good use of parallel techniques. We describe our work with geographical systems for environmental and defence applications and some of the algorithms and techniques we have deployed to deliver high-performance prototype systems that can deal with large data sets. GIS applications are often run operationally as part of decision support systems with both a human interactive component as well as large scale batch or server-based components. Parallel computing technology embedded in a distributed system therefore provides an ideal and practical solution for multi-site organisations and especially government agencies who need to extract the best value from bulk geographic data. We describe the distributed computing approaches we have used to integrate bulk data and metadata sources and the grid computing techniques we have used to embed parallel services in an operational infrastructure. We describe some of the parallel techniques we have used: for data assimilation; for image and map data processing; for data cluster analysis; and for data mining. We also discuss issues related to emerging standards for data exchange and design issues for integrating together data in a distributed ownership system. We include a historical review of our work in this area over the last decade which leads us to believe parallel computing will continue to play an important role in GIS. We speculate on algorithmic and systems issues for the future. (C) 2003 Published by Elsevier B.V.
引用
收藏
页码:1297 / 1333
页数:37
相关论文
共 50 条
  • [1] Processing large-scale mufti-dimensional data in parallel and distributed environments
    Beynon, M
    Chang, CL
    Catalyurek, U
    Kurc, T
    Sussman, A
    Andrade, H
    Ferreira, R
    Saltz, J
    PARALLEL COMPUTING, 2002, 28 (05) : 827 - 859
  • [2] ShuffleBench: A Benchmark for Large-Scale Data Shuffling Operations with Distributed Stream Processing Frameworks
    Henning, Soeren
    Vogel, Adriano
    Leichtfried, Michael
    Ertl, Otmar
    Rabiser, Rick
    PROCEEDINGS OF THE 15TH ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, ICPE 2024, 2024, : 2 - 13
  • [3] THE DUAL BACKBONE NETWORK - DISTRIBUTED AND PARALLEL PROCESSING ON A LARGE-SCALE
    ENDRIZZI, A
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1987, 14 (2-5): : 373 - 381
  • [4] Parallel Strategy for the Large-Scale Data Streams Processing
    Yuan, Ya-Juan
    Ma, Guo-Jie
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INFORMATION SYSTEMS, 2016, 52 : 232 - 234
  • [5] An Analysis of Distributed Programming Models and Frameworks for Large-scale Graph Processing
    Corbellini, Alejandro
    Godoy, Daniela
    Mateos, Cristian
    Schiaffino, Silvia
    Zunino, Alejandro
    IETE JOURNAL OF RESEARCH, 2022, 68 (04) : 3065 - 3073
  • [6] Distributed Data Processing for Large-Scale Simulations on Cloud
    Lu, Tianjian
    Hoyer, Stephan
    Wang, Qing
    Hu, Lily
    Chen, Yi-Fan
    2021 JOINT IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY, SIGNAL & POWER INTEGRITY, AND EMC EUROPE (EMC+SIPI AND EMC EUROPE), 2021, : 53 - 58
  • [7] A Workflow for Parallel and Distributed Computing of Large-Scale Genomic Data
    Choi, Hyun-Hwa
    Kim, Byoung-Seob
    Ahn, Shin-Young
    Bae, Seung-Jo
    2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 215 - 218
  • [8] Extension of Parallel Primitives and Their Applications to Large-Scale Data Processing
    Nakano, Masashi
    Chang, Qiong
    Miyazaki, Jun
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, DEXA 2024, 2024, 14911 : 248 - 253
  • [9] Designing Parallel Data Processing for Large-Scale Sensor Orchestration
    Kabac, Milan
    Consel, Charles
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 57 - 65
  • [10] Recent trends of research and development for large-scale data storing and parallel distributed processing in big data era
    Fujii, Hidekaki
    Haraguchi, Hiroshi
    Hijiya, Makoto
    Iwazume, Michiaki
    Iwase, Takahiro
    Computer Software, 2013, 30 (01) : 130 - 151