Extracting individual trajectories from text by fusing large language models with diverse knowledge

被引:0
作者
Liu, Le [1 ,2 ]
Pei, Tao [1 ,2 ,3 ]
Fang, Zidong [1 ,2 ]
Yan, Xiaorui [1 ,2 ]
Zheng, Chenglong [1 ,2 ]
Wang, Xi [1 ,2 ]
Song, Ci [1 ,2 ]
Luan, Wenfei [1 ,4 ]
Chen, Jie [1 ]
机构
[1] Chinese Acad Sci, State Key Lab Resources & Environm Informat Syst, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Resou, Nanjing 210023, Peoples R China
[4] Henan Polytech Univ, Sch Surveying & Land Informat Engn, Jiaozuo 454000, Peoples R China
基金
中国国家自然科学基金;
关键词
Large language model; Human mobility trajectory; Geographical information retrieval; Prompt learning; GeoAI; RECOGNITION; MOBILE;
D O I
10.1016/j.jag.2025.104654
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Individual trajectories offer insights into human mobility, with data either passively recorded, such as GPS, or actively recorded, such as natural language text. While the former provides detailed movement data, it lacks important context such as personal experiences, which can be obtained from the latter. Extracting trajectories from text can enhance travel experience optimization, historical analysis, and pandemic management. However, existing trajectory extraction methods rely on rule-based frameworks that fail to capture contextual semantics, resulting in limited generalizability and loss of trajectory semantics. While general-purpose large language models (LLMs) demonstrate potential for contextual reasoning capabilities, their deficient domain-specific knowledge pertinent to trajectory patterns hinders efficient and precise trajectory extraction. To address these limitations, we propose T2TrajLLM, a novel framework that fuses LLMs with domain knowledge through three components: (1) a lightweight trajectory model for structured guidance, (2) a text-to-trajectory transformation model enabling multi-step reasoning, and (3) labelled text-trajectory samples for learning domain-adaptive constraint rules. Central to T2TrajLLM is a prompt method that dynamically fuses these components with LLMs while avoids rigid rule dependency. Evaluated across three heterogeneous datasets, T2TrajLLM achieves similar to 8 % higher accuracy than existing methods, demonstrating strong transferability across datasets and extensibility to diverse application requirements. Overall, T2TrajLLM effectively extracts trajectories from diverse textual sources, providing robust support for the analysis and understanding of individual mobility.
引用
收藏
页数:16
相关论文
共 67 条
[1]   Extraction of spatio-temporal data about historical events from text documents [J].
Abraham, Susanna ;
Maes, Stephan ;
Bernard, Lars .
TRANSACTIONS IN GIS, 2018, 22 (03) :677-696
[2]   What human mobility data tell us about COVID-19 spread [J].
Alessandretti, Laura .
NATURE REVIEWS PHYSICS, 2022, 4 (01) :12-13
[3]   The scales of human mobility [J].
Alessandretti, Laura ;
Aslak, Ulf ;
Lehmann, Sune .
NATURE, 2020, 587 (7834) :402-+
[4]  
Amitay E., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P273, DOI 10.1145/1008992.1009040
[5]  
Brown TB, 2020, Arxiv, DOI [arXiv:2005.14165, DOI 10.48550/ARXIV.2005.14165]
[6]  
Baglioni M, 2008, LECT NOTES COMPUT SC, V5232, P344, DOI 10.1007/978-3-540-87991-6_41
[7]   The exciting potential and daunting challenge of using GPS human-mobility data for epidemic modeling [J].
Barreras, Francisco ;
Watts, Duncan J. .
NATURE COMPUTATIONAL SCIENCE, 2024, 4 (06) :398-411
[8]   Transformer based named entity recognition for place name extraction from unstructured text [J].
Berragan, Cillian ;
Singleton, Alex ;
Calafiore, Alessia ;
Morley, Jeremy .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2023, 37 (04) :747-766
[9]  
Budig B., 2017, GIR 17 11 WORKSH GEO, P1
[10]  
[陈洁 Chen Jie], 2023, [地理学报, Acta Geographica Sinica], V78, P1028