On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)

被引:17
|
作者
Mai, Gengchen [1 ]
Huang, Weiming [2 ]
Sun, Jin [3 ]
Song, Suhang [4 ]
Mishra, Deepak [5 ]
Liu, Ninghao [3 ]
Gao, Song [6 ]
Liu, Tianming [3 ]
Cong, Gao [2 ]
Hu, Yingjie [7 ]
Cundy, Chris [8 ]
Li, Ziyuan [9 ]
Zhu, Rui [10 ]
Lao, Ni [11 ]
机构
[1] Univ Georgia, Dept Geog, 210 Field St, Athens, GA USA
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Block N4,50 Nanyang Ave, Singapore, Singapore
[3] Univ Georgia Athens, Sch Comp, 415 Boyd Res & Educ Ctr, Athens, GA 30602 USA
[4] Univ Georgia, Coll Publ Hlth, Rhodes Hall,105 Spear Rd, Athens, GA 30602 USA
[5] Univ Georgia, Dept Geog, 210 Field St, Athens, GA 30602 USA
[6] Univ Wisconsin Madison, Dept Geog, Geospatial Data Sci Lab, Sci Hall,550 N Pk St, Madison, WI 53715 USA
[7] Univ Buffalo, Dept Geog, GeoAI Lab, Ste 105, Buffalo, NY 14261 USA
[8] Stanford Univ, Dept Comp Sci, 353 Jane Stanford Way, Stanford, CA 94305 USA
[9] Univ Connecticut, Sch Business, 2100 Hillside Rd, Storrs, CT 06269 USA
[10] Sch Geog Sci, Univ Rd, Bristol BS81SS, Avon, England
[11] Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
基金
美国国家科学基金会;
关键词
Foundation models; geospatial artificial intelligence; multimodal learning; GEOGRAPHICALLY WEIGHTED REGRESSION; URBAN LAND-USE; GEOSPATIAL SEMANTICS; HEALTH GEOGRAPHY; KNOWLEDGE GRAPH; TRAJECTORIES; LOCATION; CONTEXT; IMPACT; PLACE;
D O I
10.1145/3653070
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have not yet seen an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges of developing multimodal foundation models for GeoAI. We first investigate the potential of many existing FMs by testing their performances on seven tasks across multiple geospatial domains, including Geospatial Semantics, Health Geography, Urban Geography, and Remote Sensing. Our results indicate that on several geospatial tasks that only involve text modality, such as toponym recognition, location description recognition, and US state-level/county-level dementia time series forecasting, the task-agnostic large learning models (LLMs) can outperform task-specific fully supervised models in a zero-shot or few-shot learning setting. However, on other geospatial tasks, especially tasks that involve multiple data modalities (e.g., POI-based urban function classification, street view image-based urban noise intensity classification, and remote sensing image scene classification), existing FMs still underperform task-specific models. Based on these observations, we propose that one of the major challenges of developing an FM for GeoAI is to address the multimodal nature of geospatial tasks. After discussing the distinct challenges of each geospatial data modality, we suggest the possibility of a multimodal FM that can reason over various types of geospatial data through geospatial alignments. We conclude this article by discussing the unique risks and challenges to developing such a model for GeoAI.
引用
收藏
页数:46
相关论文
共 50 条
  • [21] Future Developments in Geographical Agent-Based Models: Challenges and Opportunities
    Heppenstall, Alison
    Crooks, Andrew
    Malleson, Nick
    Manley, Ed
    Ge, Jiaqi
    Batty, Michael
    GEOGRAPHICAL ANALYSIS, 2021, 53 (01) : 76 - 91
  • [22] Remanufacturing Electric Mobility: Challenges and Opportunities in Designing Circular Business Models
    Oleskow-Szlapka, Joanna
    Corum, Adnan
    Hoffa-Dabrowska, Patrycja
    Stachowiak, Agnieszka
    ADVANCES IN MANUFACTURING IV, VOL 3, MANUFACTURING 2024, 2024, : 269 - 281
  • [23] Challenges and opportunities in synthesizing historical geospatial data using statistical models
    Eitzel, M. V.
    Kelly, Maggi
    Dronova, Iryna
    Valachovic, Yana
    Quinn-Davidson, Lenya
    Solera, Jon
    de Valpine, Perry
    ECOLOGICAL INFORMATICS, 2016, 31 : 100 - 111
  • [24] Agent-based models in urban transportation: review, challenges, and opportunities
    Bastarianto, Faza Fawzan
    Hancock, Thomas O.
    Choudhury, Charisma Farheen
    Manley, Ed
    EUROPEAN TRANSPORT RESEARCH REVIEW, 2023, 15 (01)
  • [25] Invited Paper: Security and Privacy in Large Language and Foundation Models: A Survey on GenAI Attacks
    Italiano, Giuseppe Francesco
    Martino, Alessandro
    Piccardo, Giorgio
    DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2025, 2025, 15507 : 1 - 17
  • [26] Tackling societal challenges as opportunities: a case study of a Swiss foundation's value-creating function
    Straus, Nadine
    Mpadanes, Markos
    JOURNAL OF COMMUNICATION MANAGEMENT, 2024, 28 (02) : 328 - 344
  • [27] Learning in professionally 'distant' contexts: opportunities and challenges
    Mausz, Justin
    Tavares, Walter
    ADVANCES IN HEALTH SCIENCES EDUCATION, 2017, 22 (03) : 581 - 600
  • [28] Systematic mapping study on energy models with thermal energy carriers: Opportunities and challenges
    Bustamante-Castano, Samuel
    Bernal-del Rio, Santiago
    Osorio-Gomez, Gilberto
    ENERGY FOR SUSTAINABLE DEVELOPMENT, 2024, 81
  • [29] Ubicomp Systems at 20: Progress, Opportunities, and Challenges
    Caceres, Ramon
    Friday, Adrian
    IEEE PERVASIVE COMPUTING, 2012, 11 (01) : 14 - 21
  • [30] Challenges and opportunities of a-priori and a-posteriori variability in maternal immune activation models
    Weber-Stadlbauer, Ulrike
    Meyer, Urs
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2019, 28 : 119 - 128