Nano language and distribution of article title terms according to power laws

被引:7
作者
Bartol, Tomaz [1 ]
Stopar, Karmen [1 ]
机构
[1] Univ Ljubljana, Biotech Fac, Ljubljana, Slovenia
关键词
Nanoscience; Bibliometrics; Lexical analysis; Power laws; Terminology; Subject categories; Search strategy; Compound words; RESEARCH FIELDS; NANOTECHNOLOGY; NANOSCIENCE; SCIENCE; INTERDISCIPLINARITY; PUBLICATIONS; PATENTS; WEB;
D O I
10.1007/s11192-015-1546-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Scientometric evaluation of nanoscience/nanotechnology requires complex search strategies and lengthy queries which retrieve massive amount of information. In order to offer some insight based on the most frequently occurring terms our research focused on a limited amount of data, collected on uniform principles. The prefix nano comes about in many different compound words thus offering a possibility for such assessment. The aim is to identify the scatter of nanoconcepts, among and within journals, as well as more generally, in the Web of Science (WOS). Ten principal journals were identified along with all unique nanoterms in article titles. Such terms occur on average in half of all titles. Terms were thoroughly investigated and mapped by lemmatization or stemming to the appropriate roots-nanoconcepts. The scatter of concepts follows the characteristics of power laws, especially Zipf's law, exhibiting clear inversely proportional relationship between rank and frequency. The same three nanoconcepts are most frequently occurring in as many as seven journals. Two concepts occupy the first and the second rank in six journals. The same six concepts are the most frequently occurring in ten journals as well as full WOS database, representing almost two thirds of all nanotitled articles, in both instances. Subject categories don't play a decisive role. Frequency falls progressively, quickly producing a long tail of rare concepts. Drop is almost linear on the log scale. The existence of hundreds of different closed-form compound nanoterms has consequences for the retrieval on the Internet search engines (e.g. Google Scholar) which do not permit truncation.
引用
收藏
页码:435 / 451
页数:17
相关论文
共 46 条
[1]  
Adamic L. A., 2000, Zipf, power-laws, and Pareto-a ranking tutorial
[2]  
[Anonymous], 2004, NANOTECH L BUS
[3]  
[Anonymous], CT20020001 ECPPN LEI
[4]  
Baird D., 2004, DISCOVERING NANOSCAL, P1
[5]   Informetrics at the beginning of the 21st century - A review [J].
Bar-Ilan, Judit .
JOURNAL OF INFORMETRICS, 2008, 2 (01) :1-52
[6]   Assessment of research fields in Scopus and Web of Science in the view of national research evaluation in Slovenia [J].
Bartol, Tomaz ;
Budimir, Gordana ;
Dekleva-Smrekar, Doris ;
Pusnik, Miro ;
Juznic, Primoz .
SCIENTOMETRICS, 2014, 98 (02) :1491-1504
[7]   Mapping nanosciences by citation flows: A preliminary analysis [J].
Bassecoulard, Elise ;
Lelu, Alain ;
Zitt, Michel .
SCIENTOMETRICS, 2007, 70 (03) :859-880
[8]   Discovery of power-laws in chemical space [J].
Benz, Ryan W. ;
Swamidass, S. Joshua ;
Baldi, Pierre .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (06) :1138-1151
[9]   Nanoscience and nanotechnology on the balance [J].
Braun, T ;
Schubert, A ;
Zsindely, S .
SCIENTOMETRICS, 1997, 38 (02) :321-325
[10]   How to identify research groups using publication analysis:: an example in the field of nanotechnology [J].
Calero, C ;
Buter, R ;
Valdés, CC ;
Noyons, E .
SCIENTOMETRICS, 2006, 66 (02) :365-376