Retrieval and extraction of Unique Patterns from Compressed Text Data using the SVD Technique on Hadoop Apache Mahout Framework

被引:0
作者
Dhumal, Poonam [1 ]
Deshmukh, S. S. [1 ]
机构
[1] Pimpri Chinchwad Coll Engn, Dept Comp Engn, Pune 411044, Maharashtra, India
来源
2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA) | 2016年
关键词
Apache MAHOUT; Hadoop; information retrieval; pattern matching; rule generation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays Designing of an efficient information retrieval system for multimedia dataset is a big task to understand the trend. Searching for the repeated pattern within a particular genetic sequence has become a much required task in the data mining sectors. Due to the increasing size of text and audio data over internet, various techniques are needed to help with the finding and extraction of very specific information relevant to a user's task or finding the new trends. Matching the unique patterns and generate the rules by using different Pattern Matching, Rule Generation Algorithm. In addition to extracting questionnaires and curiosity based sentences patterns from large database some different implementation of the pattern matching algorithms is proposed. Real world data is large in size like images, speech signals holding high dimensions to represent data. Multiple dimensional data are more critical for detecting and developing the associations among terms. Dimensionality reduction is a technique used for reducing complexity and gives the most frequent item from high dimensional data. It reduces the dimensions of the original input data. Singular value decomposition this dimensionality reduction technique is used for large data reduction on Apache MAHOUT of Hadoop framework. Finally, retrieve and extract the curiosity and questionnaires based unique pattern from large data size using the Map Reduce Framework and compress the result using the SVD technique to give curiosity and questionnaires based subject.
引用
收藏
页数:5
相关论文
共 11 条
[1]  
[Anonymous], 2013, INT J ENG SCI TECHNO
[2]  
[Anonymous], 2013, THESIS
[3]  
[Anonymous], IMPROVING SPEAKER ID
[4]  
Fatehpuria Suresh, 2014, 2014 IEEE INT C ADV
[5]  
Ghosh S., INT J ADV RES COMPUT
[6]  
Gupta V., 2009, Journal of Emerging Technologies in Web Intelligence, V1
[7]  
Joshi Snehal K, 2014, EVOLUTION EVALUATION
[8]  
Kim Jarang, 2014, TPE BASED TUPLE EXTR
[9]  
Mutakabbir Kazi Mahbub, 2014, 3 INT C INF EL VIS 2
[10]  
Sari Yunita, 2010, IEEE