Patent Document Clustering Using Dimensionality Reduction

Cited by: 1
Authors
Girthana, K. [1]
Swamynathan, S. [1]
Affiliations
[1] Anna Univ, Dept Informat Sci & Technol, Madras 600025, Tamil Nadu, India
Source
PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2 | 2018 / Vol. 564
Keywords
Prior art search; Dimensionality reduction; Clustering;
DOI
10.1007/978-981-10-6875-1_17
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Patents are a type of intellectual property right that provides exclusive rights to an invention. Whenever there is a novelty or an invention, a prior art search on patents is carried out to check the degree of innovation. Clustering is used to group the relevant documents of a prior art search to gain insights about the patent documents. Patent documents are represented by hundreds of features (words extracted from the title and abstract fields). The sets of features common to different documents are small. Therefore, the number of features for clustering increases drastically. This leads to the curse of dimensionality. Hence, in this work, dimensionality reduction techniques such as PCA and SVD are employed to compare and analyze the quality of clusters formed from Google patent documents. This comparative analysis was performed by considering the title, abstract, and classification code fields of the patent document. Classification code information was used to decide the number of clusters.
Pages: 167-176
Number of pages: 10
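
The pipeline outlined in the abstract (term features from title and abstract, reduction with PCA or SVD, clustering with the number of clusters taken from the classification codes) can be illustrated with a minimal sketch. It is not the authors' implementation. The toy records, the choice of two components, the use of k-means, the purity score, and every function name below are assumptions made for illustration only. The code is pure Python, and power iteration stands in for a library SVD routine.

```python
import math
import random
from collections import Counter

# Hypothetical toy records: (title/abstract words, classification code).
patents = [
    ("wireless antenna signal receiver", "H04"),
    ("wireless antenna signal transmitter", "H04"),
    ("wireless antenna signal module", "H04"),
    ("surgical implant bone screw", "A61"),
    ("surgical implant bone plate", "A61"),
    ("surgical implant bone clamp", "A61"),
]

def term_count_matrix(texts):
    """One row per document, one column per distinct word."""
    vocab = sorted({w for t in texts for w in t.lower().split()})
    index = {w: j for j, w in enumerate(vocab)}
    rows = []
    for t in texts:
        row = [0.0] * len(vocab)
        for w in t.lower().split():
            row[index[w]] += 1.0
        rows.append(row)
    return rows, vocab

def center_columns(X):
    """Subtract each column mean (the step that turns SVD into PCA)."""
    means = [sum(col) / len(X) for col in zip(*X)]
    return [[x - m for x, m in zip(row, means)] for row in X]

def top_right_singular_vectors(X, k, iters=200, seed=0):
    """Top-k right singular vectors of X via power iteration on X^T X."""
    rng = random.Random(seed)
    n_features = len(X[0])
    comps = []
    for _ in range(k):
        v = [rng.uniform(-1, 1) for _ in range(n_features)]
        for _ in range(iters):
            Xv = [sum(a * b for a, b in zip(row, v)) for row in X]
            w = [sum(row[j] * Xv[i] for i, row in enumerate(X))
                 for j in range(n_features)]
            for c in comps:  # keep w orthogonal to components already found
                dot = sum(a * b for a, b in zip(w, c))
                w = [a - dot * b for a, b in zip(w, c)]
            norm = math.sqrt(sum(a * a for a in w))
            v = [a / norm for a in w]
        comps.append(v)
    return comps

def project(X, comps):
    """Coordinates of each document in the reduced space."""
    return [[sum(a * b for a, b in zip(row, c)) for c in comps] for row in X]

def kmeans(points, k, iters=20):
    """Lloyd's algorithm with deterministic farthest-point initialization."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(max(
            points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                  for p in points]
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def purity(labels, codes):
    """Share of documents whose cluster's majority code matches their own."""
    hits = 0
    for cluster in set(labels):
        members = [c for l, c in zip(labels, codes) if l == cluster]
        hits += Counter(members).most_common(1)[0][1]
    return hits / len(labels)

texts = [t for t, _ in patents]
codes = [c for _, c in patents]
k = len(set(codes))  # number of clusters decided by the classification codes

X, vocab = term_count_matrix(texts)
Xc = center_columns(X)

pca_labels = kmeans(project(Xc, top_right_singular_vectors(Xc, 2)), k)  # PCA
svd_labels = kmeans(project(X, top_right_singular_vectors(X, 2)), k)    # SVD

print("PCA labels:", pca_labels, "purity:", purity(pca_labels, codes))
print("SVD labels:", svd_labels, "purity:", purity(svd_labels, codes))
```

The only difference between the two runs is whether the term matrix is mean-centered before the decomposition. On real patent data the two reductions would generally give different clusters, which is the kind of comparison the abstract describes. The quality measure used in the paper is not stated in this record, so purity here is a stand-in.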