Learning from streaming data with unsupervised heterogeneous domain adaptation

被引：0

作者：

Moradi, Mona ^{[1
]}

Rahmanimanesh, Mohammad ^{[1
]}

Shahzadi, Ali ^{[1
]}

机构：

[1] Semnan Univ, Fac Elect & Comp Engn, Semnan, Iran

来源：

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS | 2025年 / 19卷 / 01期

关键词：

Concept drift; Heterogeneous domain adaptation; Streaming data; Unsupervised learning; DRIFT DETECTION METHODS; CHANGE-POINT DETECTION; TIME-SERIES DATA; NEURAL-NETWORKS; ONLINE; ENSEMBLE; MACHINE; ALGORITHM; MODEL;

D O I：

10.1007/s41060-023-00463-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many efforts on domain adaptation focus on stationary environments and assume that the target domain samples are available before the learning process. However, real-world applications frequently involve the availability of non-stationary data sequentially. This study develops an unsupervised heterogeneous domain adaptation approach to address non-stationary scenarios where data streams continually feed the learning model. This process employs a fuzzy-based model that has been trained on a different but related domain. Subsequently, a neighborhood-based weight assignment fine-tunes the attraction and repulsion between neighbors based on prior knowledge about their domains and the similarity between class labels. To avoid unnecessary adaptation for each target domain chunk, domain adaptation is triggered only when concept drift is detected. This way, the model gradually adjusts to the evolving data, incorporating the unique characteristics of the new domain. When no drift is detected, existing parameters are reused for feature adaptation. At the end, the source domain is updated by incorporating the drifting data and their predicted labels. The proposed method offers several advantages, including avoidance of excessive alignment, reduction in domain adaptation cost, and a gradual reduction in dependency on the source domain for domain adaptation. To evaluate the method's performance, experiments were conducted on several tasks extracted from two benchmark datasets, considering different types of concept drift. The experimental results demonstrate that the proposed model significantly improves classification accuracy while reducing computational time.

引用

页码：61 / 81

页数：21

共 89 条

[1] Alippi C., 2015, ARXIV
[2] Scalable Detection of Concept Drift: A Learning Technique Based on Support Vector Machines
Altendeitering, Marcel
Dubler, Stephan
[J]. 30TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM2021), 2020, 51 : 400 - 407
[3] Big data directed acyclic graph model for real-time COVID-19 twitter stream detection
Amen, Bakhtiar
Faiz, Syahirul
Do, Thanh-Toan
[J]. PATTERN RECOGNITION, 2022, 123
[4] Arora S, 2017, 5 INT C LEARN REPR I, P1
[5] Ashfahani A, 2019, Data Min, P666
[6] Baena-Garcia M., 2006, 4 INT WORKSH KNOWL D, V6, P77
[7] Bhattacharyya distance based concept drift detection method for evolving data stream
Baidari, Ishwar
Honnikoll, Nagaraj
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
[8] RDDM: Reactive drift detection method
Barros, Roberto S. M.
Cabral, Danilo R. L.
Goncalves, Paulo M., Jr.
Santos, Silas G. T. C.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 : 344 - 355
[9] Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[10] An Adaptive Framework for Multistream Classification
Chandra, Swarup
Hague, Ahsanul
Khan, Latifur
Aggarwal, Charu
[J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1181 - 1190

← 1 2 3 4 5 6 7 8 9 →