Agreement of drug discovery data with Benford's law

Cited by: 5
Authors
Orita, Masaya [1]
Hagiwara, Yosuke [1]
Moritomo, Ayako [1]
Tsunoyama, Kazuhisa [2]
Watanabe, Toshihiro [1]
Ohno, Kazuki [1]
Affiliations
[1] Astellas Pharma Inc, Drug Discovery Res, Chem Res Labs, Tsukuba, Ibaraki 3058585, Japan
[2] Astellas Pharma Inc, Drug Discovery Res, Mol Med Res Labs, Tsukuba, Ibaraki 3058585, Japan
Keywords
Benford's law; data quality; scientific misconduct; chi-squared statistic; fraud
DOI
10.1517/17460441.2013.740007
Chinese Library Classification number
R9 [Pharmacy]
Discipline classification code
1007
Abstract
The ever-increasing rate of drug discovery data has complicated data analysis and potentially compromised data quality due to factors such as data handling errors. Parallel to this concern is the rise in blatant scientific misconduct. Combined, these problems highlight the importance of developing a method that can be used to systematically assess data quality. Benford's law has been used to discover data manipulation and data fabrication in various fields. In the authors' previous studies, it was demonstrated that the distribution of the corresponding activity and solubility data followed Benford's law distribution. It was also shown that too intense a selection of training data sets of regression model can disrupt Benford's law. Here, the authors present the application of Benford's law to a wider range of drug discovery data such as microarray and sequence data. They also suggest that Benford's law could also be applied to model building and reliability for structure-activity relationship study. Finally, the authors propose a protocol based on Benford's law which will provide researchers with an efficient method for data quality assessment. However, multifaceted quality control such as combinatorial use with data visualization may also be needed to further improve its reliability.
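The abstract does not spell out the authors' protocol, so the following is only a minimal sketch of the generic idea behind it: compare the observed first-digit frequencies of a data set with the Benford expectation P(d) = log10(1 + 1/d) using a chi-squared statistic. The function names (`first_digit`, `benford_chi2`) and the 5% significance threshold are assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter

# Expected first-digit probabilities under Benford's law: P(d) = log10(1 + 1/d).
BENFORD = {d: math.log10(1 + 1 / d) for d in range(1, 10)}

# Chi-squared critical value for 8 degrees of freedom at the 5% level
# (assumed threshold for this sketch; the paper's choice is not stated here).
CRITICAL_5PCT_DF8 = 15.507


def first_digit(x):
    """Return the leading non-zero digit of a non-zero finite number."""
    # Scientific notation always places the leading significant digit first.
    return int(f"{abs(x):.15e}"[0])


def benford_chi2(values):
    """Return (chi-squared statistic, sample size) for the first digits of values.

    Zeros and non-finite entries are skipped because they have no leading digit.
    """
    digits = [first_digit(v) for v in values if v != 0 and math.isfinite(v)]
    n = len(digits)
    if n == 0:
        raise ValueError("no usable values")
    counts = Counter(digits)
    chi2 = sum(
        (counts.get(d, 0) - n * p) ** 2 / (n * p) for d, p in BENFORD.items()
    )
    return chi2, n


if __name__ == "__main__":
    # Powers of two are a classic sequence that follows Benford's law closely.
    powers = [2 ** k for k in range(100)]
    chi2, n = benford_chi2(powers)
    print(f"n={n}, chi2={chi2:.3f}, consistent={chi2 < CRITICAL_5PCT_DF8}")
```

A statistic below the critical value means the data are not distinguishable from a Benford distribution at that level; it does not by itself prove that the data are sound, which matches the abstract's caveat that additional quality control such as data visualization may be needed.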
Pages: 1-5
Number of pages: 5