A Python']Python package based on robust statistical analysis for serial crystallography data processing

被引:0
作者
Hadian-Jazi, Marjan [1 ,2 ]
Sadri, Alireza [3 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Med Biol, Parkville, Vic 3052, Australia
[3] Monash Univ, Sch Phys & Astron, Clayton, Vic 3800, Australia
来源
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY | 2023年 / 79卷
关键词
RGFlib; robust statistics; serial crystallography; robust peak-finding; robust bad pixel mask making; X-RAY-DIFFRACTION; SOFTWARE;
D O I
10.1107/S2059798323005855
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The term robustness in statistics refers to methods that are generally insensitive to deviations from model assumptions. In other words, robust methods are able to preserve their accuracy even when the data do not perfectly fit the statistical models. Robust statistical analyses are particularly effective when analysing mixtures of probability distributions. Therefore, these methods enable the discretization of X-ray serial crystallography data into two probability distributions: a group comprising true data points (for example the background intensities) and another group comprising outliers (for example Bragg peaks or bad pixels on an X-ray detector). These characteristics of robust statistical analysis are beneficial for the ever-increasing volume of serial crystallography (SX) data sets produced at synchrotron and X-ray free-electron laser (XFEL) sources. The key advantage of the use of robust statistics for some applications in SX data analysis is that it requires minimal parameter tuning because of its insensitivity to the input parameters. In this paper, a software package called Robust Gaussian Fitting library (RGFlib) is introduced that is based on the concept of robust statistics. Two methods are presented based on the concept of robust statistics and RGFlib for two SX data-analysis tasks: (i) a robust peak-finding algorithm and (ii) an automated robust method to detect bad pixels on X-ray pixel detectors.
引用
收藏
页码:820 / 829
页数:10
相关论文
共 27 条
[1]   AGIPD, a high dynamic range fast detector for the European XFEL [J].
Allahgholi, A. ;
Becker, J. ;
Bianco, L. ;
Delfs, A. ;
Dinapoli, R. ;
Goettlicher, P. ;
Graafsma, H. ;
Greiffenberg, D. ;
Hirsemann, H. ;
Jack, S. ;
Klanner, R. ;
Klyuev, A. ;
Krueger, H. ;
Lange, S. ;
Marras, A. ;
Mezza, D. ;
Mozzanica, A. ;
Rah, S. ;
Xia, Q. ;
Schmitt, B. ;
Schwandt, J. ;
Sheviakov, I. ;
Shi, X. ;
Smoljanin, S. ;
Trunk, U. ;
Zhang, J. ;
Zimmer, M. .
JOURNAL OF INSTRUMENTATION, 2015, 10
[2]  
[Anonymous], 1987, Robust Regression and Outlier Detection
[3]   Robust segmentation of visual data using ranked unbiased scale estimate [J].
Bab-Hadiashar, A ;
Suter, D .
ROBOTICA, 1999, 17 :649-660
[4]  
Bab-Hadiashar A., 2008, DIGITAL IMAGE COMPUT, P1
[5]   Cheetah: software for high-throughput reduction and analysis of serial femtosecond X-ray diffraction data [J].
Barty, Anton ;
Kirian, Richard A. ;
Maia, Filipe R. N. C. ;
Hantke, Max ;
Yoon, Chun Hong ;
White, Thomas A. ;
Chapman, Henry .
JOURNAL OF APPLIED CRYSTALLOGRAPHY, 2014, 47 :1118-1131
[6]   The serial millisecond crystallography instrument at the Australian Synchrotron incorporating the "Lipidico" injector [J].
Berntsen, P. ;
Jazi, M. Hadian ;
Kusel, M. ;
Martin, A. V. ;
Ericsson, T. ;
Call, M. J. ;
Trenker, R. ;
Roque, F. G. ;
Darmanin, C. ;
Abbey, B. .
REVIEW OF SCIENTIFIC INSTRUMENTS, 2019, 90 (08)
[7]   Femtosecond X-ray protein nanocrystallography [J].
Chapman, Henry N. ;
Fromme, Petra ;
Barty, Anton ;
White, Thomas A. ;
Kirian, Richard A. ;
Aquila, Andrew ;
Hunter, Mark S. ;
Schulz, Joachim ;
DePonte, Daniel P. ;
Weierstall, Uwe ;
Doak, R. Bruce ;
Maia, Filipe R. N. C. ;
Martin, Andrew V. ;
Schlichting, Ilme ;
Lomb, Lukas ;
Coppola, Nicola ;
Shoeman, Robert L. ;
Epp, Sascha W. ;
Hartmann, Robert ;
Rolles, Daniel ;
Rudenko, Artem ;
Foucar, Lutz ;
Kimmel, Nils ;
Weidenspointner, Georg ;
Holl, Peter ;
Liang, Mengning ;
Barthelmess, Miriam ;
Caleman, Carl ;
Boutet, Sebastien ;
Bogan, Michael J. ;
Krzywinski, Jacek ;
Bostedt, Christoph ;
Bajt, Sasa ;
Gumprecht, Lars ;
Rudek, Benedikt ;
Erk, Benjamin ;
Schmidt, Carlo ;
Hoemke, Andre ;
Reich, Christian ;
Pietschner, Daniel ;
Strueder, Lothar ;
Hauser, Guenter ;
Gorke, Hubert ;
Ullrich, Joachim ;
Herrmann, Sven ;
Schaller, Gerhard ;
Schopper, Florian ;
Soltau, Heike ;
Kuehnel, Kai-Uwe ;
Messerschmidt, Marc .
NATURE, 2011, 470 (7332) :73-U81
[8]   CASS-CFEL-ASG software suite [J].
Foucar, Lutz ;
Barty, Anton ;
Coppola, Nicola ;
Hartmann, Robert ;
Holl, Peter ;
Hoppe, Uwe ;
Kassemeyer, Stephan ;
Kimmel, Nils ;
Kuepper, Jochen ;
Scholz, Mirko ;
Techert, Simone ;
White, Thomas A. ;
Strueder, Lothar ;
Ullrich, Joachim .
COMPUTER PHYSICS COMMUNICATIONS, 2012, 183 (10) :2207-2213
[9]   Femtosecond X-ray diffraction from two-dimensional protein crystals [J].
Frank, Matthias ;
Carlson, David B. ;
Hunter, Mark S. ;
Williams, Garth J. ;
Messerschmidt, Marc ;
Zatsepin, Nadia A. ;
Barty, Anton ;
Benner, W. Henry ;
Chu, Kaiqin ;
Graf, Alexander T. ;
Hau-Riege, Stefan P. ;
Kirian, Richard A. ;
Padeste, Celestino ;
Pardini, Tommaso ;
Pedrini, Bill ;
Segelke, Brent ;
Seibert, M. Marvin ;
Spence, John C. H. ;
Tsai, Ching-Ju ;
Lane, Stephen M. ;
Li, Xiao-Dan ;
Schertler, Gebhard ;
Boutet, Sebastien ;
Coleman, Matthew ;
Evans, James E. .
IUCRJ, 2014, 1 :95-100
[10]  
FUKUNAGA K, 1975, IEEE T INFORM THEORY, V21, P32, DOI 10.1109/TIT.1975.1055330