摘要
Background At present,it is insufficient to understand the basic data characteristics of the correlated X-ray scattering.And there is a great challenge about how to master the nature of the data.So it is difficult to use and analyze the experimental data more effectively.In addition,there are many reasons,for the experimental artifacts such as whether the shutter is on or off,whether there is the beam line or not,the swaying of the nozzle and the shadow of the detector.So it is rather challenging to analyze the scattering patterns.Purpose The purpose of this paper was to develop a method to filter the invalid scattering data and provide the theoretical and experiment fundamentals for studying the X-ray scattering data of the complex biological sample further.Methods The heliummolecules were scattered by the X-ray free-electron laser in Spring8 in Japan.Andmillions of scattering patterns were obtained from the X-ray free-electron laser experiment.Through the analysis of the scattering data,the sum,mean,median and variance of the scattering intensity were obtained.Then different clusters were obtained with the densitybased spatial clustering of applications with noise(DBSCAN)algorithm.Results Based on the DBSCAN,some of the scattering patterns with high artifacts were removed and different clusters were clarified.So the experimental scattering data could be analyzed more effectively.Conclusion The theoretical and experiment fundamentals for comprehensively studying the X-ray scattering data of the complex biological sample were provided.After the data filtering,the angular autocorrelation of different clusters with Kam’s method will be computed and analyzed effectively.
基金
National Institutes of Health research Grant 251 R01-GM097463
Stanford NIH Biotechnology Training Grant No.5T32GM008412-20,US Department of Energy Office of Science under Contract No.DE-AC02-05CH11231
National Nature Science Foundation of China for theoretical physics Grant No.11547238.