Towards Unsupervised Domain-Specific Open-World Recognition

被引：1

作者：

Alfarisy, Gusti Ahmad Fanshuri ^{[1
,2
]}

Malik, Owais Ahmed ^{[1
]}

Hong, Ong Wee ^{[1
]}

机构：

[1] Univ Brunei Darussalam, Sch Digital Sci, Jalan Tungku Link, Gadong BE1410, Brunei

[2] Inst Teknol Kalimantan, Dept Informat, Jalan Soekarno Hatta KM 15, Balikpapan 76127, Indonesia

来源：

NEUROCOMPUTING | 2025年 / 619卷

关键词：

Open-world learning; Open-world recognition; Open-set recognition; Lifelong machine learning; Continual learning; Deep learning; NEURAL-NETWORK; CLASSIFICATION; ENCODER;

D O I：

10.1016/j.neucom.2024.129141

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Open-World Recognition (OWR) is an emerging study that constructs machine-learning models to recognize unknown classes and learn them continually. The classical formalization of OWR relies on three main components: classifier, unknown identification, and continual learning. However, fora model that operates on domain-specific tasks, training rejected unknown classes directly will harm the models in terms of effectivity and efficiency (i.e., a waste collector robot will learn unnecessary classes and will collect novel non-waste objects). Filtering these novel objects manually requires human-in-the-loop which is costly and unable to learn on the job. Therefore, in this study, we introduce and formalize Unsupervised Domain-specific Open-world Recognition (UDOR) that has the potential framework to achieve a fully automated agent in an open-world environment. In addition, we formalize the specific component in UDOR called novelty manager to assist the model to learn on the job. Furthermore, we propose a unified model using Continual Multi-Channel Contrastive Prototype Networks (CMCCPN), Automated Machine learning (AutoML), and class discovery with Hierarchical DBSCAN (HDBSCAN) or First Integer Neighbor Clustering Hierarchy (FINCH) as a step towards UDOR. Our experimentation results suggest that CMCCPN produced the highest performance, AutoML provides almost exemplary capability in differentiating novel classes, and Vision Transformer with HDBSCAN or FINCH shows a good technique to be investigated in discovering classes with a small number of classes. Our source code is available at https://github.com/gusti-alfarisy/udor.

引用

页数：26

共 93 条

[81] DER: Dynamically Expandable Representation for Class Incremental Learning [J].

Yan, Shipeng ;

Xie, Jiangwei ;

He, Xuming .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3013-3022

[82] Convolutional Prototype Network for Open Set Recognition [J].

Yang, Hong-Ming ;

Zhang, Xu-Yao ;

Yin, Fei ;

Yang, Qing ;

Liu, Cheng-Lin .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) :2358-2370

[83] Robust Classification with Convolutional Prototype Learning [J].

Yang, Hong-Ming ;

Zhang, Xu-Yao ;

Yin, Fei ;

Liu, Cheng-Lin .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3474-3482

[84] Classification-Reconstruction Learning for Open-Set Recognition [J].

Yoshihashi, Ryota ;

Shao, Wen ;

Kawakami, Rei ;

You, Shaodi ;

Iida, Makoto ;

Naemura, Takeshi .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4011-4020

[85] Online Deep Clustering for Unsupervised Representation Learning [J].

Zhan, Xiaohang ;

Xie, Jiahao ;

Liu, Ziwei ;

Ong, Yew-Soon ;

Loy, Chen Change .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6687-6696

[86] Few-Shot Incremental Learning with Continually Evolved Classifiers [J].

Zhang, Chi ;

Song, Nan ;

Lin, Guosheng ;

Zheng, Yun ;

Pan, Pan ;

Xu, Yinghui .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12450-12459

[87] Sparse Representation-Based Open Set Recognition [J].

Zhang, He ;

Patel, Vishal M. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) :1690-1696

[88]

Zhang JT, 2020, IEEE WINT CONF APPL, P1120, DOI [10.1109/WACV45572.2020.9093365, 10.1109/wacv45572.2020.9093365]

[89]

Zhou D.-W., 2022, arXiv

[90] TV100: a TV series dataset that pre-trained CLIP has not seen [J].

Zhou, Da-Wei ;

Qi, Zhi-Hong ;

Ye, Han-Jia ;

Zhan, De-Chuan .

FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (05)

← 1 2 3 4 5 6 7 8 9 10 →