Adapted K-Nearest Neighbors for Detecting Anomalies on Spatio-Temporal Traffic Flow

被引:89
作者
Djenouri, Youcef [1 ]
Belhadi, Asma [2 ]
Lin, Jerry Chun-Wei [3 ]
Cano, Alberto [4 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
[2] Univ Sci & Technol Houari Boumediene, RIMA, Algiers, Algeria
[3] Western Norway Univ Appl Sci, Dept Comp Math & Phys, Bergen, Norway
[4] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
关键词
Anomaly detection; kNN; flow distribution probability; OUTLIER DETECTION; PATTERNS; INFORMATION; DYNAMICS; MODEL;
D O I
10.1109/ACCESS.2019.2891933
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection is an extensive research area, which has been intensively studied in several domains such as biological sciences, medical diagnosis, surveillance, and traffic anomaly detection. This paper explores advances in the outlier detection area by finding anomalies in spatio-temporal urban traffic flow. It proposes a new approach by considering the distribution of the flows in a given time interval. The flow distribution probability (FDP) databases are first constructed from the traffic flows by considering both spatial and temporal information. The outlier detection mechanism is then applied to the coming flow distribution probabilities, the inliers are stored to enrich the FDP databases, while the outliers are excluded from the FDP databases. Moreover, a k-nearest neighbor for distance-based outlier detection is investigated and adopted for FDP outlier detection. To validate the proposed framework, real data from Odense traffic flow case are evaluated at ten locations. The results reveal that the proposed framework is able to detect the real distribution of flow outliers. Another experiment has been carried out on Beijing data, the results show that our approach outperforms the baseline algorithms for high-urban traffic flow.
引用
收藏
页码:10015 / 10027
页数:13
相关论文
共 65 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Ahmed A., 2008, P 2008 SIAM INT C DA, P219, DOI DOI 10.1137/1.9781611972788.20
[3]   Outlier mining in large high-dimensional data sets [J].
Angiulli, F ;
Pizzuti, C .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (02) :203-215
[4]  
[Anonymous], 2001, IEEE Trans. Inf. Theory
[5]  
[Anonymous], 2018, P 8 INT C WEB INT MI
[6]   Spatiotemporal Patterns in Large-Scale Traffic Speed Prediction [J].
Asif, Muhammad Tayyab ;
Dauwels, Justin ;
Goh, Chong Yang ;
Oran, Ali ;
Fathi, Esmail ;
Xu, Muye ;
Dhanya, Menoth Mohan ;
Mitrovic, Nikola ;
Jaillet, Patrick .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (02) :794-804
[7]   Spatio-Temporal Data Mining: A Survey of Problems and Methods [J].
Atluri, Gowtham ;
Karpatne, Anuj ;
Kumar, Vipin .
ACM COMPUTING SURVEYS, 2018, 51 (04)
[8]  
Barz B., IEEE T PATTERN ANAL
[9]   A multi-step outlier-based anomaly detection approach to network-wide traffic [J].
Bhuyan, Monowar H. ;
Bhattacharyya, D. K. ;
Kalita, J. K. .
INFORMATION SCIENCES, 2016, 348 :243-271
[10]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104