Towards Unsupervised Domain-Specific Open-World Recognition

被引:1
作者
Alfarisy, Gusti Ahmad Fanshuri [1 ,2 ]
Malik, Owais Ahmed [1 ]
Hong, Ong Wee [1 ]
机构
[1] Univ Brunei Darussalam, Sch Digital Sci, Jalan Tungku Link, Gadong BE1410, Brunei
[2] Inst Teknol Kalimantan, Dept Informat, Jalan Soekarno Hatta KM 15, Balikpapan 76127, Indonesia
关键词
Open-world learning; Open-world recognition; Open-set recognition; Lifelong machine learning; Continual learning; Deep learning; NEURAL-NETWORK; CLASSIFICATION; ENCODER;
D O I
10.1016/j.neucom.2024.129141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Open-World Recognition (OWR) is an emerging study that constructs machine-learning models to recognize unknown classes and learn them continually. The classical formalization of OWR relies on three main components: classifier, unknown identification, and continual learning. However, fora model that operates on domain-specific tasks, training rejected unknown classes directly will harm the models in terms of effectivity and efficiency (i.e., a waste collector robot will learn unnecessary classes and will collect novel non-waste objects). Filtering these novel objects manually requires human-in-the-loop which is costly and unable to learn on the job. Therefore, in this study, we introduce and formalize Unsupervised Domain-specific Open-world Recognition (UDOR) that has the potential framework to achieve a fully automated agent in an open-world environment. In addition, we formalize the specific component in UDOR called novelty manager to assist the model to learn on the job. Furthermore, we propose a unified model using Continual Multi-Channel Contrastive Prototype Networks (CMCCPN), Automated Machine learning (AutoML), and class discovery with Hierarchical DBSCAN (HDBSCAN) or First Integer Neighbor Clustering Hierarchy (FINCH) as a step towards UDOR. Our experimentation results suggest that CMCCPN produced the highest performance, AutoML provides almost exemplary capability in differentiating novel classes, and Vision Transformer with HDBSCAN or FINCH shows a good technique to be investigated in discovering classes with a small number of classes. Our source code is available at https://github.com/gusti-alfarisy/udor.
引用
收藏
页数:26
相关论文
共 93 条
[81]   DER: Dynamically Expandable Representation for Class Incremental Learning [J].
Yan, Shipeng ;
Xie, Jiangwei ;
He, Xuming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3013-3022
[82]   Convolutional Prototype Network for Open Set Recognition [J].
Yang, Hong-Ming ;
Zhang, Xu-Yao ;
Yin, Fei ;
Yang, Qing ;
Liu, Cheng-Lin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) :2358-2370
[83]   Robust Classification with Convolutional Prototype Learning [J].
Yang, Hong-Ming ;
Zhang, Xu-Yao ;
Yin, Fei ;
Liu, Cheng-Lin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3474-3482
[84]   Classification-Reconstruction Learning for Open-Set Recognition [J].
Yoshihashi, Ryota ;
Shao, Wen ;
Kawakami, Rei ;
You, Shaodi ;
Iida, Makoto ;
Naemura, Takeshi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4011-4020
[85]   Online Deep Clustering for Unsupervised Representation Learning [J].
Zhan, Xiaohang ;
Xie, Jiahao ;
Liu, Ziwei ;
Ong, Yew-Soon ;
Loy, Chen Change .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6687-6696
[86]   Few-Shot Incremental Learning with Continually Evolved Classifiers [J].
Zhang, Chi ;
Song, Nan ;
Lin, Guosheng ;
Zheng, Yun ;
Pan, Pan ;
Xu, Yinghui .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12450-12459
[87]   Sparse Representation-Based Open Set Recognition [J].
Zhang, He ;
Patel, Vishal M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) :1690-1696
[88]  
Zhang JT, 2020, IEEE WINT CONF APPL, P1120, DOI [10.1109/WACV45572.2020.9093365, 10.1109/wacv45572.2020.9093365]
[89]  
Zhou D.-W., 2022, arXiv
[90]   TV100: a TV series dataset that pre-trained CLIP has not seen [J].
Zhou, Da-Wei ;
Qi, Zhi-Hong ;
Ye, Han-Jia ;
Zhan, De-Chuan .
FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (05)